I am pretty sure this is correct. The first and last exons of the "earliest starting" and "last ending" transcripts contain the UTRs too and that is by definition the bounds of a gene. What would not be contained are the flanking regions and that sounds right as they are not parts of the genes. A quick way I use to check what coordinates correspond to what regions, I choose the "Sequences" radio button in the attributes and can see the diagramatic representation (red boxes) for each kind of region.
On Wed, May 9, 2012 at 7:13 AM, Yong Li <[email protected]>wrote: > Dear mailing list, > > I was using to Ensembl Biomart web interface to retrieve the chromsome > locations of my genes of interest (gene symbols). I chose Ensembl Genes 66 > and Homo Sapiens genes (GRCh37.p6) as dataset. For filters I chose to limit > to genes with HGNC IDs, and gave a few HGNC genes symbols. For Attributes I > chose Ensembl Gene ID, Ensembl Transcript ID, Chromosome Name, Gene Start > (bp), Gene End (bp), Transcript Start (bp) and Transcript End (bp). In the > results table I have many transcripts with different start and stop > positions corresponding to one gene. This is fine. It looks like the Gene > Start is the minimum of all Transcript Start and the Gene Stop is the > maximum of all Transcript End. Could anyone confirm this? This is important > to me because in the next step I need to find the SNPs in these chromosome > regions. > > Thank you in advance! > > Yong > > -- > Dr. Yong Li > AG PD Dr. Anna Koettgen, MPH > Universitaetsklinikum Freiburg > Innere Medizin IV > > Tel +49 761 270-78210 / Fax -78040 > Berliner Allee 29, 79110 Freiburg > [email protected] > ______________________________**_________________ > Users mailing list > [email protected] > https://lists.biomart.org/**mailman/listinfo/users<https://lists.biomart.org/mailman/listinfo/users> >
_______________________________________________ Users mailing list [email protected] https://lists.biomart.org/mailman/listinfo/users
