Abstract
The availability of genome-wide gene expression data provides a unique set of genes from which to decipher the mechanisms underlying the common transcriptional response. This study attempts to mine putative binding site on how combinations of the known regulatory sites and over-represented repetitive elements are distributed in the promoter regions of groups of functionally related genes. The enormous number of associations mined makes it extremely difficult for a human user to identify those useful or interesting ones. Chi-square testing is applied to prune those insignificant associations and obtains a set of significant correlation rules. The over-represented repetitive elements appearing in the associations are possible to be transcription factor binding sites. The association rules mined would facilitate to predict putative regulatory elements and identify genes potentially co-regulated by the putative regulatory elements. Our proposed approach is applied to Saccharomyces cerevisiae and the promoter regions of Yeast ORFs.
Associate Professor
Department of Computer Science and Information Engineering National Central University
¡@