Gala database, Human April 2003 data release

Category Number of entries Source Example fields from this category
Genes 39,023 Locus Link and Refseq gene track at UCSC orientation, exons, CDS
Genes with product and function data 21,481 Locus Link product, biological process, cellular component, molecular function
Expression data 1,069,139 UniGene at NCBI tissue, gene ID
Microarray expression data 14,199 GNF tissue, expression level
Genetic Disorders 3,723 OMIM disorder, gene ID
Gene Orthologs mouse 10,642
rat 3,585
Database of evolutionary distances and Ensembl species, gene symbol, coordinates in mouse, percent CDS identity,percent protein identity
Alternate Gene Model: Acembly Genes 195,827 UCSC Golden Path orientation, exons, CDS
Alternate Gene Model: Ensembl Genes 31,296 UCSC Golden Path orientation, exons, CDS
Alternate Gene Model: Geneid Genes 31,600 UCSC Golden Path orientation, exons, CDS
Alternate Gene Model: Genscan Genes 42,143 UCSC Golden Path orientation, exons, CDS
Alternate Gene Model: MGC Genes 14,911 UCSC Golden Path orientation, exons, CDS
Alternate Gene Model: RefSeq Genes 18,664 UCSC Golden Path orientation, exons, CDS
Alternate Gene Model: SGP Genes 42,696 UCSC Golden Path orientation, exons, CDS
Alternate Gene Model: Twinscan Genes 25,434 UCSC Golden Path orientation, exons, CDS
Alternate Gene Model: UCSC Known Genes 34,946 UCSC Golden Path orientation, exons, CDS
Multiple alignments 15,770,095 UCSC Golden Path and Penn State - Bioinformatics Group scoring using humor score
Local alignments human vs. mouse 1,698,637 UCSC Golden Path and Penn State - Bioinformatics Group score, lav file sequence and header fields
Gap free alignments human vs. mouse 35,578,739 UCSC Golden Path and Penn State - Bioinformatics Group percent identity, local alignment ID
3-way conserved 3,351,078 Penn State - Bioinformatics Group repetitive element
Regulatory potential score (based on hmr alignments) 131,086,075 Penn State - Bioinformatics Group score
Mouse Conservation score 225,706,758 UCSC Golden Path score
Transcription factor binding sites 186,792,740 Penn State - Bioinformatics Group and
TRANSFAC (free registration required)
factor, strand, score
Conserved Transcription factor binding sites (hmr) 4,188,229 Penn State - Bioinformatics Group and
TRANSFAC (free registration required)
factor, strand, score
Known regulatory regions 89 Penn State - Bioinformatics Group name
Functional promoters 154 Stanford
Predicted promoters 0 Stanford
Downloaded from UCSC Golden Path
Micro RNA 168 Vertebrate microRNA genes name
CpG islands 26,792 UCSC Golden Path name
SNPs 3,237,433 dbSNP
and UCSC Golden Path
type, allele, frequency
HbVar: A Database of Human Hemoglobin Variants and Thalassemias 1,096 variants HbVar type, HbVar ID
Repeats 4,931,892 UCSC Golden Path name, class, family
Recombination rates 8,424 deCODE Genetics
Marshfield
Genethon
Downloaded from UCSC Golden Path
recombination rate; sex averaged, F, and M
GC percent 153,538 UCSC Golden Path percent
Restriction sites 116,484,996 Penn State - Bioinformatics Group for 128 different enzymes

* All categories include fields of chromosome, start and stop point

Note: see the Alignment builds page to find what builds were used in the alignments.