Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Steffen, N.R.
Right arrow Articles by Lathrop, R.H.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Steffen, N.R.
Right arrow Articles by Lathrop, R.H.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 18 no. 90001 2002
Pages S22-S30
© 2002 Oxford University Press

DNA sequence and structure: direct and indirect recognition in protein-DNA binding

N.R. Steffen 1, S.D. Murphy 1, L. Tolleri 2, G.W. Hatfield 3 and R.H. Lathrop 1

1 Information and Computer Science, University of California, Irvine, Irvine, CA, 92697-3425, USA
2 Chiron S.p.A., Via Fiorentina, 1, Siena, 53100, Italy
3 Molecular Biology, Genetics and Biochemistry, University of California, Irvine, Irvine, CA, 92697-4025, USA

Motivation: Direct recognition, or direct readout, of DNA bases by a DNA-binding protein involves amino acids that interact directly with features specific to each base. Experimental evidence also shows that in many cases the protein achieves partial sequence specificity by indirect recognition, i.e., by recognizing structural properties of the DNA. (1) Could threading a DNA sequence onto a crystal structure of bound DNA help explain the indirect recognition component of sequence specificity? (2) Might the resulting pure-structure computational motif manifest itself in familiar sequence-based computational motifs?

Results: The starting structure motif was a crystal structure of DNA bound to the integration host factor protein (IHF) of E. coli. IHF is known to exhibit both direct and indirect recognition of its binding sites. (1) Threading DNA sequences onto the crystal structure showed statistically significant partial separation of 60 IHF binding sites from random and intragenic sequences and was positively correlated with binding affinity. (2) The crystal structure was shown to be equivalent to a linear Markov network, and so, to a joint probability distribution over sequences, computable in linear time. It was transformed algorithmically into several common pure-sequence representations, including (a) small sets of short exact strings, (b) weight matrices, (c) consensus regular patterns, (d) multiple sequence alignments, and (e) phylogenetic trees. In all cases the pure-sequence motifs retained statistically significant partial separation of the IHF binding sites from random and intragenic sequences. Most exhibited positive correlation with binding affinity. The multiple alignment showed some conserved columns, and the phylogenetic tree partially mixed low-energy sequences with IHF binding sites but separated high-energy sequences. The conclusion is that deformation energy explains part of indirect recognition, which explains part of IHF sequence-specific binding.

Availability: Code and data on request.

Contact:Nick Steffen for code and Lorenzo Tolleri for data. nsteffen{at}uci.edu Lorenzo_Tolleri{at}chiron.it

Keywords: protein-DNA binding sites; sequence motifs or patterns; indirect recognition or readout; integration host factor; IHF.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
N. B. Becker, L. Wolff, and R. Everaers
Indirect readout: detection of optimized subsequences and calculation of relative binding affinities using different DNA elastic potentials
Nucleic Acids Res., November 14, 2006; 34(19): 5638 - 5649.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. GuhaThakurta
Computational identification of transcriptional regulatory elements in DNA sequence
Nucleic Acids Res., July 19, 2006; 34(12): 3585 - 3598.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
K. Florquin, Y. Saeys, S. Degroeve, P. Rouze, and Y. Van de Peer
Large-scale structural analysis of the core promoter in mammalian and plant genomes
Nucleic Acids Res., July 27, 2005; 33(13): 4255 - 4264.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
R. Alazard, M. Blaud, S. Elbaz, C. Vossen, G. Icre, G. Joseph, L. Nieto, and M. Erard
Identification of the 'NORE' (N-Oct-3 responsive element), a novel structural motif and composite element
Nucleic Acids Res., March 14, 2005; 33(5): 1513 - 1523.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
G. Paillard, C. Deremble, and R. Lavery
Looking into DNA recognition: zinc finger binding specificity
Nucleic Acids Res., December 21, 2004; 32(22): 6673 - 6682.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.