Gala database, Mouse Oct 2003 data release

Category Number of entries Source Example fields from this category
Genes 27,381 Locus Link orientation, exons, CDS
Genes 790 Refseq gene track at UCSC orientation, exons, CDS
Genes with product and function data 16,880 Locus Link product, biological process, cellular component, molecular function
Expression data 514,777 UniGene at NCBI tissue, gene ID
Microarray expression data 8,370 GNF tissue, expression level
Strain with Mutant Allele 774 The Jackson Laboratory strain, gene ID
Gene Orthologs for human 12191 Database of evolutionary distancesand Ensembl species, gene symbol, coordinates in mouse, percent CDS identity,percent protein identity
Gene Orthologs for rat 3574 Database of evolutionary distancesand Ensembl species, gene symbol, coordinates in mouse, percent CDS identity,percent protein identity
Alternate Gene Model: Geneid Genes 34,456 UCSC Golden Path orientation, exons, CDS
Alternate Gene Model: Genscan Genes 44,289 UCSC Golden Path orientation, exons, CDS
Alternate Gene Model: MGC Genes 13,853 UCSC Golden Path orientation, exons, CDS
Alternate Gene Model: RefSeq Genes 16,614 UCSC Golden Path orientation, exons, CDS
Alternate Gene Model: Twinscan Genes 25,800 UCSC Golden Path orientation, exons, CDS
Alternate Gene Model: UCSC Known Genes 39,574 UCSC Golden Path orientation, exons, CDS
Multiple alignments 0 UCSC Golden Path and Penn State - Bioinformatics Group scoring using humor; hg15,mm3,rn3 score
Local alignments 1,826,412 UCSC Golden Path and Penn State - Bioinformatics Group score, lav file sequence and header fields
Gap free alignments 33,357,532 UCSC Golden Path and Penn State - Bioinformatics Group percent identity, local alignment ID
3 way conserved 0 Penn State - Bioinformatics Group repetitive element
Regulatory potential score 0 Penn State - Bioinformatics Group score
Human Conservation score 0 UCSC Golden Path score
Transcription factor binding sites 160,524,310 Penn State - Bioinformatics Group and
TRANSFAC (free registration required)
factor, strand, score
Conserved Transcription factor binding sites (hmr) 0 Penn State - Bioinformatics Group and
TRANSFAC (free registration required)
factor, strand, score
Micro RNA 204 Sanger Institute name
CpG islands 16,696 UCSC Golden Path name
SNPs 407,494 dbSNP
and UCSC Golden Path
type, allele, frequency
Repeats 4681708 UCSC Golden Path name, class, family
Recombination rates 0 deCODE Genetics
Marshfield
Genethon
Downloaded from UCSC Golden Path
recombination rate; sex averaged, F, and M
GC percent 131,922 UCSC Golden Path percent
Restriction sites 99,374,571 Penn State - Bioinformatics Group for 128 different enzymes

* All categories include fields of chromosome, start and stop point