Bioinformatics Vol. 19 Suppl. 1 2003
Pages i169-i176
© 2003 Oxford University Press
Identification of functional clusters of transcription factor binding motifs in genome sequences: the MSCAN algorithm
1 Department of Computer Science and
Engineering, University of California, San Diego,
La Jolla, CA 92093-0114, USA
2 Center for Genomics and Bioinformatics,
Karolinska Institutet, SE-171 77 Stockholm, Sweden
3 Centre for Molecular Medicine and
Therapeutics, Department of Medical Genetics,
University of British Columbia, Vancouver, B.C., V5Z 4H4, Canada
4 Stockholm Bioinformatics Center and
Department of Numerical Analysis and Computer Science,
KTH, SE-100 44 Stockholm, Sweden
Received on January 6, 2003
; accepted on February 20, 2003
Motivation:The identification of regulatory control regions within genomes is a major challenge. Studies have demonstrated that regulating regions can be described as locally dense clusters or modules of cis-acting transcription factor binding sites (TFBS). For well-described biological contexts, it is possible to train predictive algorithms to discern novel modules in genome sequences. However, utility of module detection methods has been severely limited by insufficient training data. For only a few tissues can one obtain sufficient numbers of literature-derived regulatory modules.
Results: We present a novel method, MSCAN, that circumvents the training data problem by measuring the statistical significance of any non-overlapping combination of TFBS in a window. Given a set of transcription factor binding profiles, a significance threshold, and a genomic sequence, MSCAN returns putative regulatory regions. We assess performance on two curated collections of regulatory regions; one each for tissue-specific expression in liver and skeletal muscle cells. The efficiency of MSCAN allows for predictive screens of entire genomes.
Availability: http://tfscan.cgb.ki.se/cgi-bin/MSCAN
Contact: wyeth{at}cmmt.ubc.ca
Keywords: transcription, gene networks, modules, motif, promoter.
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
J. Hu, H. Hu, and X. Li MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs Nucleic Acids Res., August 1, 2008; 36(13): 4488 - 4497. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. S.W. Wong and R. Nielsen Finding cis-regulatory modules in Drosophila using phylogenetic hidden Markov models Bioinformatics, August 15, 2007; 23(16): 2031 - 2037. [Abstract] [Full Text] [PDF] |
||||
![]() |
V. Ferretti, C. Poitras, D. Bergeron, B. Coulombe, F. Robert, and M. Blanchette PReMod: a database of genome-wide mammalian cis-regulatory module predictions Nucleic Acids Res., January 12, 2007; 35(suppl_1): D122 - D126. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. GuhaThakurta Computational identification of transcriptional regulatory elements in DNA sequence Nucleic Acids Res., July 19, 2006; 34(12): 3585 - 3598. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. J. Donaldson and B. Gottgens TFBScluster web server for the identification of mammalian composite regulatory elements. Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W524 - W528. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. Blanchette, A. R. Bataille, X. Chen, C. Poitras, J. Laganiere, C. Lefebvre, G. Deblois, V. Giguere, V. Ferretti, D. Bergeron, et al. Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression Genome Res., May 1, 2006; 16(5): 656 - 668. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Yu. Mitrophanov and M. Borodovsky Statistical significance in biological sequence analysis Brief Bioinform, March 1, 2006; 7(1): 2 - 24. |
||||
![]() |
I. J. Donaldson, M. Chapman, and B. Gottgens TFBScluster: a resource for the characterization of transcriptional regulatory networks Bioinformatics, July 1, 2005; 21(13): 3058 - 3059. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. GuhaThakurta, L. A. Schriefer, R. H. Waterston, and G. D. Stormo Novel transcription regulatory elements in Caenorhabditis elegans muscle genes Genome Res., December 1, 2004; 14(12): 2457 - 2468. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Kreiman Identification of sparsely distributed clusters of cis-regulatory elements in sets of co-expressed genes Nucleic Acids Res., May 20, 2004; 32(9): 2889 - 2900. [Abstract] [Full Text] [PDF] |
||||



