Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Johansson, O.
Right arrow Articles by Lagergren, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Johansson, O.
Right arrow Articles by Lagergren, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 19 Suppl. 1 2003
Pages i169-i176
© 2003 Oxford University Press

Identification of functional clusters of transcription factor binding motifs in genome sequences: the MSCAN algorithm

Ö. Johansson 1, W. Alkema 2, W. W. Wasserman 3,* and J. Lagergren 4

1 Department of Computer Science and Engineering, University of California, San Diego, La Jolla, CA 92093-0114, USA
2 Center for Genomics and Bioinformatics, Karolinska Institutet, SE-171 77 Stockholm, Sweden
3 Centre for Molecular Medicine and Therapeutics, Department of Medical Genetics, University of British Columbia, Vancouver, B.C., V5Z 4H4, Canada
4 Stockholm Bioinformatics Center and Department of Numerical Analysis and Computer Science, KTH, SE-100 44 Stockholm, Sweden

Received on January 6, 2003 ; accepted on February 20, 2003

Motivation:The identification of regulatory control regions within genomes is a major challenge. Studies have demonstrated that regulating regions can be described as locally dense clusters or modules of cis-acting transcription factor binding sites (TFBS). For well-described biological contexts, it is possible to train predictive algorithms to discern novel modules in genome sequences. However, utility of module detection methods has been severely limited by insufficient training data. For only a few tissues can one obtain sufficient numbers of literature-derived regulatory modules.

Results: We present a novel method, MSCAN, that circumvents the training data problem by measuring the statistical significance of any non-overlapping combination of TFBS in a window. Given a set of transcription factor binding profiles, a significance threshold, and a genomic sequence, MSCAN returns putative regulatory regions. We assess performance on two curated collections of regulatory regions; one each for tissue-specific expression in liver and skeletal muscle cells. The efficiency of MSCAN allows for predictive screens of entire genomes.

Availability: http://tfscan.cgb.ki.se/cgi-bin/MSCAN

Contact: wyeth{at}cmmt.ubc.ca

Keywords: transcription, gene networks, modules, motif, promoter.

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
J. Hu, H. Hu, and X. Li
MOPAT: a graph-based method to predict recurrent cis-regulatory modules from known motifs
Nucleic Acids Res., August 1, 2008; 36(13): 4488 - 4497.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
W. S.W. Wong and R. Nielsen
Finding cis-regulatory modules in Drosophila using phylogenetic hidden Markov models
Bioinformatics, August 15, 2007; 23(16): 2031 - 2037.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
V. Ferretti, C. Poitras, D. Bergeron, B. Coulombe, F. Robert, and M. Blanchette
PReMod: a database of genome-wide mammalian cis-regulatory module predictions
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D122 - D126.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. GuhaThakurta
Computational identification of transcriptional regulatory elements in DNA sequence
Nucleic Acids Res., July 19, 2006; 34(12): 3585 - 3598.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
I. J. Donaldson and B. Gottgens
TFBScluster web server for the identification of mammalian composite regulatory elements.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W524 - W528.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
M. Blanchette, A. R. Bataille, X. Chen, C. Poitras, J. Laganiere, C. Lefebvre, G. Deblois, V. Giguere, V. Ferretti, D. Bergeron, et al.
Genome-wide computational prediction of transcriptional regulatory modules reveals new insights into human gene expression
Genome Res., May 1, 2006; 16(5): 656 - 668.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
A. Yu. Mitrophanov and M. Borodovsky
Statistical significance in biological sequence analysis
Brief Bioinform, March 1, 2006; 7(1): 2 - 24.



Home page
BioinformaticsHome page
I. J. Donaldson, M. Chapman, and B. Gottgens
TFBScluster: a resource for the characterization of transcriptional regulatory networks
Bioinformatics, July 1, 2005; 21(13): 3058 - 3059.
[Abstract] [Full Text] [PDF]


Home page
Genome ResHome page
D. GuhaThakurta, L. A. Schriefer, R. H. Waterston, and G. D. Stormo
Novel transcription regulatory elements in Caenorhabditis elegans muscle genes
Genome Res., December 1, 2004; 14(12): 2457 - 2468.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
G. Kreiman
Identification of sparsely distributed clusters of cis-regulatory elements in sets of co-expressed genes
Nucleic Acids Res., May 20, 2004; 32(9): 2889 - 2900.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.