Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Krauthammer, M.
Right arrow Articles by Rzhetsky, A.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Krauthammer, M.
Right arrow Articles by Rzhetsky, A.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 18 no. 90001 2002
Pages S249-S257
© 2002 Oxford University Press

Of truth and pathways: chasing bits of information through myriads of articles

Michael Krauthammer 1, Pauline Kra 1,2, Ivan Iossifov 1,2, Shawn M. Gomez 2, George Hripcsak 1, Vasileios Hatzivassiloglou 4, Carol Friedman 1,3 and Andrey Rzhetsky 1,2

1 Department of Medical Informatics, Columbia University, New York, NY 10032, USA
2 Columbia Genome Center, Columbia University, New York, NY 10032, USA
3 Department of Computer Science, Queens College CUNY, Flushing, NY 11367, USA
4 Department of Computer Science, Columbia University, New York, NY 10027, USA

Received on January 24, 2002 ; revised on April 1, 2002 ; accepted on April 1, 2002

Motivation: Knowledge on interactions between molecules in living cells is indispensable for theoretical analysis and practical applications in modern genomics and molecular biology. Building such networks relies on the assumption that the correct molecular interactions are known or can be identified by reading a few research articles. However, this assumption does not necessarly hold, as truth is rather an emerging property based on many potentially conflicting facts. This paper explores the processes of knowledge generation and publishing in the molecular biology literature using modelling and analysis of real molecular interaction data. The data analysed in this article were automatically extracted from 50000 research articles in molecular biology using a computer system called GeneWays containing a natural language processing module. The paper indicates that truthfulness of statements is associated in the minds of scientists with the relative importance (connectedness) of substances under study, revealing a potential selection bias in the reporting of research results. Aiming at understanding the statistical properties of the life cycle of biological facts reported in research articles, we formulate a stochastic model describing generation and propagation of knowledge about molecular interactions through scientific publications. We hope that in the future such a model can be useful for automatically producing consensus views of molecular interaction data.

Contact: ar345{at}columbia.edu

Keywords: statistical modelling; scientometric analysis; molecular interaction data; natural language processing


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
BioinformaticsHome page
H. Shatkay, F. Pan, A. Rzhetsky, and W. J. Wilbur
Multi-dimensional classification of biomedical text: Toward automated, practical provision of high-utility text to diverse users
Bioinformatics, September 15, 2008; 24(18): 2086 - 2093.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
E. R. Jefferson, T. P. Walsh, T. J. Roberts, and G. J. Barton
SNAPPI-DB: a database and API of Structures, iNterfaces and Alignments for Protein-Protein Interactions
Nucleic Acids Res., January 12, 2007; 35(suppl_1): D580 - D589.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.