IdentiCS — Identification of coding sequence and in silico reconstruction of the metabolic network directly from unannotated low-coverage bacterial genome sequence

10/19/2004

Yi-Feng Chang (張益峰)

 

Abstract

Studying metabolic network of an organism is able to understanding its physiology and phenotypic behavior. The functionality of the potential metabolic network of a given organism can also be experimentally studied by system perturbations at physiological and genetic levels.

Usually, metabolic network reconstruction of a new organism required its well-annotated genome sequence. However, the process of annotating a new sequenced genome always time consuming. Therefore, in this work, authors proposed a different but simple approach to reduce the requirement of genome coverage.

First remove the gene finding procedure from conventional metabolic network reconstruction process. Then all of the available public annotated genome sequences are becoming the query sequences; and the low genome coverage sequence is becoming a target database. The reversed step of above mention, successfully reduce the genome coverage requirement from 8 fold to 3.9 fold and also improved the coding sequences recognition rate.

 

From: BMC Bioinformatics Vol. 5, 2004.

 

References

 

Kell, D.B. (2004). Metabolomics and systems biology: making sense of the soup. Current Opinion in Microbiology 7, 296-307.

Mendes. P. (2002). Emerging bioinformatics for the metabolome. Briefings in Bioinformatics 3, 134-145.

Ahren, D.G. and Ouzounis C.A. (2004). Robustness of metabolic map reconstruction. Journal of Bioinformatics and Computational Biology 2, 589-593.