T-Gene Tutorial

What T-Gene does

Typical T-Gene applications

How T-Gene works

T-Gene uses a gene annotation file and, optionally histone modification and gene expression data for a panel of 'tissues'. It computes a distance based score---the Distance p-value---for each potential regulatory link between one of your Loci and a transcript in the gene annotation file. Potential links are 1) all locus-transcript pairs where the transcription start site (TSS) is located within a specified maximum distance from the locus (500,000bp by default), or 2) where the transcript is the closest transcript to the locus ("Closest-TSS link"), or 3) where the locus is the closest locus to the transcript ("Closest-Locus link"). If you provide a Tissue Panel, T-Gene also computes the level of each histone modification in the panel at each of the genomic loci (typically ChIP-seq peaks) that you provide, and then it computes the Pearson correlation between a histone modification at the locus the expression of the transcript. T-Gene then estimates the p-value of this correlation, and combines it with the Distance p-value for form the CnD p-value (Correlation aNd Distance p-value). Finally, T-Gene computes the q-value of the CnD p-value, which is the minimum false discovery rate (FDR) required to consider this link statistically significant. T-Gene reports these scores for each locus-TSS-histone trio. Note that if no tissue panel is provided, T-Gene bases the q-value on the Distance p-value rather than the CnD p-value.

Loci

You provide a set of genomic loci to T-Gene via a 'locus file' in BED format. Each line in the locus file specifies a single locus specified as three fields separated by tabs: <chrom> <chromStart> <chromEnd>. Any additional fields on each line are ignored by T-Gene, as are comment lines, which must start with the '#' character.

T-Gene Web Server Genome or Histone/Expression Panels

You must choose a Genome or Tissue Panel for the exact genome release (assembly) used by your loci file for T-Gene to work correctly. If a Genome or Panel does not exist for your genome release, you may be able to use the LiftOver tool to convert your loci to the correct coordinates.

T-Gene Command Line Genome or Histone/Expression Panels

You can use T-Gene on your own computer if you download and install the MEME Suite. (See instructions in the "Download & Install" menu on the left.) You should also download the T-Gene Databases, which include all of the Genomes and Histone/Expression Panels available to the T-Gene Web Server. You can find gene annotation files for other genomes as GTF files at ftp.ensembl.org/pub and ftp.ensemblgenomes.org/pub. For example, you can download a gene annotation file in GTF format for Cow at ftp://ftp.ensembl.org/pub/release-97/gtf/bos_taurus. You can reference this file on the T-Gene command-line as long as your locus file (e.g., ChIP-seq peak file) uses Ensembl format for chromosome names and the genomic coordinates are for Ensembl release 97. (Gene annotation files for older releases are also available at the same FTP site.)

Creating your own Histone/Expression Panels

Under construction...

Interpreting T-Gene's HTML Output

FAQ