Bioinformatics Vol. 17 no. 90001 2001
Pages S30-S38
© 2001 Oxford University Press
Separating real motifs from their artifacts

Department of Computer Science and Engineering, University of Washington, Box 352350, Seattle, WA 98195-2350, U.S.A.
Received on February 5, 2001; revised on April 2, 2001; accepted on April 2, 2001
The typical output of many computational methods to identify binding sites is a long list of motifs containing some real motifs (those most likely to correspond to the actual binding sites) along with a large number of random variations of these. We present a statistical method to separate real motifs from their artifacts. This produces a short list of high-quality motifs that is sufficient to explain the over-representation of all motifs in the given sequences. Using synthetic data sets, we show that the output of our method is very accurate. On various sets of upstream sequences in S. cerevisiae, our program identifies several known binding sites, as well as a number of significant novel motifs.
Contact: blanchem@cs.washington.edu; saurabh@cs.washington.edu
The two authors contributed equally to the paper.