Putative ALternative Splicing Database 
(PALS db)

[Introduction] [Rationale] [History]

[User's Guide] [AS Lists] [Text Info] [All Seq Info] [Hyperlinks] [Related db]

User's Guide

The UPDATED help material of Putative ALternative Splicing Database (PALS db) for Nucleic Acids Res. Database Issue 2002

Contents

  1. Rationale
  2. Audience of PALS db
  3. Steps to use PALS db
  4. Release note
  5. Statistics
  6. Abbreviations
  7. For updates, see User's guide
  8. Disclaimer

Audience of PALS db

This database is designed for biologists, who are interested in

  • Discovering biological phenomenon
  • Solving a biological problem

PALS db is NOT designed to collect splicing site information for gene prediction programs.

Steps to use PALS db

  1. Select search field [next step] [top]

wpe3.jpg (7437 bytes)

  • "Gene description + Gene name": the gene symbol and gene name approved by The Human Genome Organisation (HUGO)
  • "Literature aliases": other gene symbols used in literatures (known as aliases), see HUGO help
  • Ug_id: the UniGene Cluster ID
  • Gb_id: the GenBank accession number
  • Human and Mouse Cytoband: the cytogenetic location of a gene
  1. Enter keyword [next step] [previous step] [top]
  • All the search is case insensitive. 
  • Partial word is also acceptable. 
  • Logic operations "and" and "not" are not available.
  • Logic operation "or", i.e. "|", can ONLY be used in "Gene description + Gene name". For example, "smn|eif4g" will find records that contains "smn", or "eif4g", or "smn, eif4g".
  • Query against Ug_id must be exact, e.g. "hs.2" or "Hs.2" can only retrieve Hs.2, but not Hs.22, Hs.222, Hs.2222, etc.
  • Query against cytogenetic locations
  • Human: use the "n[p|q]n" format (e.g. 1p36, 1q20, etc)
  • Mouse: e.g. "1 40" means position 40 cM on chromosome #1
  1. Set the optional parameters [search results] [previous step] [top]

The default species is human. The search speed will be slowing down by selecting both human and mouse.  In order to perform the AS comparison between human and mouse, users can also use the hyperlinks "homologs" or "similarities" in the graphical output of "AS lists" and "All seq info".

PALS db saves all the predicted AS sites and marked those sites that pass the threshold criteria. The default setting is 95% identity on both ends of alignment at least 50-bp in length (See rationale for details). Hopefully, the users can take advantage of the unmarked information to 

  • to enhance the confidence on prediction 
  • to recover false negatives

[previous step] [top]

Search Results

The results are displayed in an eight-column table (not the six-cloumn table shown below)

  1. Ug_id: The UniGene cluster id will hyperlink to NCBI's UniGene database
    • Green background: UniGene clusters that have at least one predicted AS site at the given threshold in the query interface
    • Gray background: UniGene clusters that don't have any predicted AS site at the given threshold in the query interface
  2. Gene: Will hyperlink to alias list of the given gene name
  3. UniGene member: List the number of all sequences in a given UniGene cluster
  4. AS lists: Will hyperlink to non-redundant AS information in graphics
  5. Text Info: Will hyperlink to text information for those UniGene clusters that pass the filtering criteria
  6. All seq info: Will hyperlink to a graphical display that have ALL the AS sites regardless of their quality
  7. Description: An one-sentence description of a given UniGene cluster
  8. Cytoband: List the cytogenetic location of a given UniGene cluster

¡@

Example

Genes with more than 20 putative ASSPs (Release 2, 2001/08/15)

Ug_id Gene # UniGene member # putative ASSPs Descriptions Cytoband
Hs.2186 EEF1G 6743 33 eukaryotic translation elongation factor 1 gamma 7
Hs.14376 ACTG1 9120 38 actin, gamma 1 17q25
Hs.21346 LOC58481 469 20 hypothetical protein LOC58481 Xq28
Hs.22129 DJ1042K10.2 202 21 hypothetical protein 22q13.1-q13.2
Hs.75990 HP 550 45 haptoglobin
Hs.77385 MYL6 2095 20 myosin, light polypeptide 6, alkali, smooth muscle and non-muscle 12
Hs.78601 UROD 439 23 uroporphyrinogen decarboxylase 1p34
Hs.82208 ACADVL 645 20 acyl-Coenzyme A dehydrogenase, very long chain
Hs.84298 CD74 3475 23 CD74 antigen (invariant polypeptide of major histocompatibility complex, class II antigen-associated) 5q32
Hs.166011 CTNND1 510 23 catenin (cadherin-associated protein), delta 1 11q11
Hs.169476 GAPD 11286 44 glyceraldehyde-3-phosphate dehydrogenase 12p13
Hs.178551 RPL8 2138 24 ribosomal protein L8 8q
Hs.179661 FKBP1A 4800 37 FK506-binding protein 1A (12kD) 20p13
Hs.181165 EEF1A1 24710 67 eukaryotic translation elongation factor 1 alpha 1
Hs.182426 RPS2 9350 24 ribosomal protein S2
Hs.182447 HNRPC 1804 24 heterogeneous nuclear ribonucleoprotein C (C1/C2) 2q32
Hs.184411 ALB 6401 35 albumin 4q11-q13
Hs.195464 FLNA 1333 20 filamin A, alpha (actin-binding protein-280) Xq28
Hs.198281 PKM2 3781 24 pyruvate kinase, muscle 15q22
Hs.252259 RPS3 3919 20 ribosomal protein S3 11q13.3-q13.5
Hs.274348 BAT3 481 20 HLA-B associated transcript 3 6p21.3
Hs.278242 MGC12992 5930 25 hypothetical protein MGC12992 9
Hs.287820 FN1 3430 21 fibronectin 1 2q34
Hs.334822 H19 4384 36 H19, imprinted maternally expressed untranslated mRNA 11p15.5
Mm.331 2700054O04Rik 763 23 RIKEN cDNA 2700054O04 gene
Mm.4063 Ndr1 647 35 N-myc downstream regulated 1
Mm.7156 Gpx3 1932 27 glutathione peroxidase 3 11 B3-B5
Mm.16773 Alb1 3427 90 serum albumin variant 5 50.0 cM
Mm.21983 Bhmt 308 22 betaine-homocysteine methyltransferase
Mm.25789 Csna 2190 22 casein alpha 5 44.9 cM
Mm.197554 Ahsg 795 36 alpha-2-HS-glycoprotein

     

Last updated on 2003-11-01 by Fu, Gloria Chiung-Ling