Statistics R2

Contents

Ratio of Sequences containing AS information

UniGene Human Build #138 and dbEST 2001/08/12

    

Sequence type    Number of sequences    Number of sequences containing AS information
CDS              68,409 (100%)          7,324 (10.7%)
EST              3,735,344 (100%)       94,734 (2.5%)

UniGene Mouse Build #93 and dbEST 2001/08/12

Sequence type    Number of sequences    Number of sequences containing AS information
CDS              38,216 (100%)          1,945 (5.1%)
EST              2,068,128 (100%)       28,866 (1.4%)
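
Each percentage in the two tables above is the number of sequences containing AS information divided by the total number of sequences of that type. A minimal Python sketch of that arithmetic follows, using the counts listed above. The function name `as_ratio` and the dictionary layout are assumptions made for illustration and are not part of the original analysis.

```python
def as_ratio(with_as: int, total: int) -> float:
    """Return the percentage of sequences carrying AS information, rounded to 0.1."""
    if total <= 0:
        raise ValueError("total must be positive")
    return round(100.0 * with_as / total, 1)


# Counts copied from the tables above:
# (sequences containing AS information, all sequences)
counts = {
    ("human", "CDS"): (7_324, 68_409),
    ("human", "EST"): (94_734, 3_735_344),
    ("mouse", "CDS"): (1_945, 38_216),
    ("mouse", "EST"): (28_866, 2_068_128),
}

ratios = {key: as_ratio(*pair) for key, pair in counts.items()}

for (species, seq_type), pct in ratios.items():
    print(f"{species} {seq_type}: {pct}%")
```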

Number of Genes Containing ASSPs (Alternative Splicing Site Pairs)

Human (UniGene Human Build #138 and dbEST 2001/08/12)

Classification of genes                          # UniGene clusters
Genes without ASSP                               9,983 (50.1%)
Genes with ASSPs (all)                           9,953 (49.9%)
  Only mRNA-supported AS (no EST supported)      684 (3.4%)
  Only single EST supported AS                   4,025 (20.2%)
  Multiple ESTs supported AS                     5,244 (26.3%)
Total                                            19,936 (100%)

Mouse (UniGene Mouse Build #93 and dbEST 2001/08/12)

Classification of genes                          # UniGene clusters
Genes without ASSP                               11,396 (68.6%)
Genes with ASSPs (all)                           5,219 (31.4%)
  Only mRNA-confirmed AS (no EST supported)      524 (3.1%)
  Only single EST supported AS                   2,720 (16.4%)
  Multiple ESTs supported AS                     1,975 (11.9%)
Total                                            16,615 (100%)
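
The three support categories (mRNA only, single EST, multiple ESTs) partition the genes with ASSPs, and genes with and without ASSPs together make up the total number of UniGene clusters. The sketch below checks that bookkeeping against the counts listed above. The helper name `check_classification` is hypothetical and serves only as a sanity check on the published numbers.

```python
def check_classification(without_assp, mrna_only, single_est, multi_est, total):
    """Return (genes_with_assps, is_consistent) for one species.

    genes_with_assps is the sum of the three support categories;
    is_consistent is True when it and the ASSP-free genes add up to the total.
    """
    with_assps = mrna_only + single_est + multi_est
    return with_assps, without_assp + with_assps == total


# Counts copied from the human and mouse tables above.
human = check_classification(9_983, 684, 4_025, 5_244, 19_936)
mouse = check_classification(11_396, 524, 2_720, 1_975, 16_615)

print("human:", human)
print("mouse:", mouse)
```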

ASSPs Supported by Different Number of EST Sequences

Number of ESTs supporting an ASSP    ASSP count in human    ASSP count in mouse
1 EST                                16643                  6605
2~3 ESTs                             5434                   1829
4~7 ESTs                             1997                   468
8~15 ESTs                            840                    181
16~31 ESTs                           392                    74
32~63 ESTs                           165                    30
64~127 ESTs                          71                     11
128~255 ESTs                         22                     9
256~511 ESTs                         7                      4
512~1023 ESTs                        5                      3
1024~2047 ESTs                       1                      0
Sum                                  25577                  9214
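
The bins in this table double in width: each one runs from a power of two up to one less than the next power of two. The following Python sketch shows one way to assign an EST support count to such a bin and tally a histogram. The function names and the sample input are assumptions for illustration; the source does not state how the original binning was implemented.

```python
from collections import Counter


def support_bin(n_ests: int) -> str:
    """Map an EST support count to its power-of-two bin label, e.g. 5 -> '4~7 ESTs'."""
    if n_ests < 1:
        raise ValueError("an ASSP needs at least one supporting EST")
    low = 1 << (n_ests.bit_length() - 1)  # largest power of two <= n_ests
    high = 2 * low - 1                    # last value before the next power of two
    if low == 1:
        return "1 EST"
    return f"{low}~{high} ESTs"


def support_histogram(support_counts):
    """Count how many ASSPs fall into each bin."""
    return Counter(support_bin(n) for n in support_counts)


# Hypothetical per-ASSP support counts, not taken from the database.
example = support_histogram([1, 1, 2, 3, 5, 900, 1500])
for label, count in example.items():
    print(label, count)
```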

Genes with at least 20 putative ASSPs

Ug_id Gene # UniGene member # putative ASSPs Descriptions Cytoband
Hs.2186 EEF1G 6743 33 eukaryotic translation elongation factor 1 gamma 7
Hs.14376 ACTG1 9120 38 actin, gamma 1 17q25
Hs.21346 LOC58481 469 20 hypothetical protein LOC58481 Xq28
Hs.22129 DJ1042K10.2 202 21 hypothetical protein 22q13.1-q13.2
Hs.75990 HP 550 45 haptoglobin
Hs.77385 MYL6 2095 20 myosin, light polypeptide 6, alkali, smooth muscle and non-muscle 12
Hs.78601 UROD 439 23 uroporphyrinogen decarboxylase 1p34
Hs.82208 ACADVL 645 20 acyl-Coenzyme A dehydrogenase, very long chain
Hs.84298 CD74 3475 23 CD74 antigen (invariant polypeptide of major histocompatibility complex, class II antigen-associated) 5q32
Hs.166011 CTNND1 510 23 catenin (cadherin-associated protein), delta 1 11q11
Hs.169476 GAPD 11286 44 glyceraldehyde-3-phosphate dehydrogenase 12p13
Hs.178551 RPL8 2138 24 ribosomal protein L8 8q
Hs.179661 FKBP1A 4800 37 FK506-binding protein 1A (12kD) 20p13
Hs.181165 EEF1A1 24710 67 eukaryotic translation elongation factor 1 alpha 1
Hs.182426 RPS2 9350 24 ribosomal protein S2
Hs.182447 HNRPC 1804 24 heterogeneous nuclear ribonucleoprotein C (C1/C2) 2q32
Hs.184411 ALB 6401 35 albumin 4q11-q13
Hs.195464 FLNA 1333 20 filamin A, alpha (actin-binding protein-280) Xq28
Hs.198281 PKM2 3781 24 pyruvate kinase, muscle 15q22
Hs.252259 RPS3 3919 20 ribosomal protein S3 11q13.3-q13.5
Hs.274348 BAT3 481 20 HLA-B associated transcript 3 6p21.3
Hs.278242 MGC12992 5930 25 hypothetical protein MGC12992 9
Hs.287820 FN1 3430 21 fibronectin 1 2q34
Hs.334822 H19 4384 36 H19, imprinted maternally expressed untranslated mRNA 11p15.5
Mm.331 2700054O04Rik 763 23 RIKEN cDNA 2700054O04 gene
Mm.4063 Ndr1 647 35 N-myc downstream regulated 1
Mm.7156 Gpx3 1932 27 glutathione peroxidase 3 11 B3-B5
Mm.16773 Alb1 3427 90 serum albumin variant 5 50.0 cM
Mm.21983 Bhmt 308 22 betaine-homocysteine methyltransferase
Mm.25789 Csna 2190 22 casein alpha 5 44.9 cM
Mm.197554 Ahsg 795 36 alpha-2-HS-glycoprotein