Bioinformatics Vol. 17 no. 2 2001
Pages 155-161
© 2001 Oxford University Press
Original Paper |
Automated extraction of information on proteinprotein interactions from the biological literature
1 Otsuka GEN Research Institute, Otsuka
Pharmaceutical Co. Ltd, 463-10 Kagasuno, Kawauchi-cho, Tokushima,
771-0192, Japan
2 Human Genome Center, Institute of Medical
Science, The University of Tokyo, 4-6-1 Shirokanedai, Minato-ku,
Tokyo, 108-8639, Japan
Received on September 11, 2000
; revised on September 13, 2000
; accepted on September 28, 2000
Motivation: To understand biological process, we must clarify how proteins interact with each other. However, since information about proteinprotein interactions still exists primarily in the scientific literature, it is not accessible in a computer-readable format. Efficient processing of large amounts of interactions therefore needs an intelligent information extraction method. Our aim is to develop an efficient method for extracting information on proteinprotein interaction from scientific literature.
Results: We present a method for extracting information on
proteinprotein interactions from the scientific literature.
This method, which employs only a protein name dictionary, surface
clues on word patterns and simple part-of-speech rules, achieved
high recall and precision rates for yeast (recall
86.8% and
precision
94.3%) and Escherichia coli (recall
82.5%
and precision
93.5%). The result of extraction suggests that
our method should be applicable to any species for which a protein
name dictionary is constructed.
Availability: The program is available on request from the authors.
Contact: ono{at}otsuka.gr.jp
* To whom correspondence should be addressed.
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
S. Kim, J. Yoon, and J. Yang Kernel approaches for genic interaction extraction Bioinformatics, January 1, 2008; 24(1): 118 - 126. [Abstract] [Full Text] [PDF] |
||||
![]() |
W. Xuan, P. Wang, S. J. Watson, and F. Meng Medline search engine for finding genetic markers with biological significance Bioinformatics, September 15, 2007; 23(18): 2477 - 2484. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. Fundel, R. Kuffner, and R. Zimmer RelEx--Relation extraction using dependency parse trees Bioinformatics, February 1, 2007; 23(3): 365 - 371. [Abstract] [Full Text] [PDF] |
||||
![]() |
Y. Kajikawa, K. Abe, and S. Noda Filling the gap between researchers studying different materials and different methods: a proposal for structured keywords Journal of Information Science, December 1, 2006; 32(6): 511 - 524. [Abstract] [PDF] |
||||
![]() |
T. MITSUMORI, M. MURATA, Y. FUKUDA, K. DOI, and H. DOI Extracting Protein-Protein Interaction Information from Biomedical Text with SVM IEICE Trans D: Information, August 1, 2006; E89-D(8): 2464 - 2466. [Abstract] [PDF] |
||||
![]() |
Y. Hao, X. Zhu, M. Huang, and M. Li Discovering patterns to extract protein-protein interactions from the literature: Part II Bioinformatics, August 1, 2005; 21(15): 3294 - 3300. [Abstract] [Full Text] [PDF] |
||||
![]() |
R. Hoffmann, M. Krallinger, E. Andres, J. Tamames, C. Blaschke, and A. Valencia Text Mining for Metabolic Pathways, Signaling Cascades, and Protein Networks Sci. Signal., May 10, 2005; 2005(283): pe21 - pe21. [Abstract] [Full Text] [PDF] |
||||
![]() |
O. Hofmann and D. Schomburg Concept-based annotation of enzyme classes Bioinformatics, May 1, 2005; 21(9): 2059 - 2066. [Abstract] [Full Text] [PDF] |
||||
![]() |
H. Pan, L. Zuo, V. Choudhary, Z. Zhang, S. H. Leow, F. T. Chong, Y. Huang, V. W. S. Ong, B. Mohanty, S. L. Tan, et al. Dragon TF Association Miner: a system for exploring transcription factor associations through text-mining Nucleic Acids Res., July 1, 2004; 32(suppl_2): W230 - W234. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. F. SCHAEFER Pathway Databases Ann. N.Y. Acad. Sci., May 1, 2004; 1020(1): 77 - 91. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Koike, Y. Kobayashi, and T. Takagi Kinase Pathway Database: An Integrated Protein-Kinase and NLP-Based Protein-Interaction Resource Genome Res., June 1, 2003; 13(6): 1231 - 1243. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Nagashima, D. G. Silva, N. Petrovsky, L. A. Socha, H. Suzuki, R. Saito, T. Kasukawa, I. V. Kurochkin, A. Konagaya, and C. Schonbach Inferring Higher Functional Information for RIKEN Mouse Full-Length cDNA Clones With FACTS Genome Res., June 1, 2003; 13(6): 1520 - 1533. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. J. Rigden, P. Setlow, B. Setlow, I. Bagyan, R. A. Stein, and M. J. Jedrzejas PrfA protein of Bacillus species: Prediction and demonstration of endonuclease activity on DNA Protein Sci., October 1, 2002; 11(10): 2370 - 2381. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Greenbaum, N. M. Luscombe, R. Jansen, J. Qian, and M. Gerstein Interrelating Different Types of Genomic Data, from Proteome to Secretome: 'Oming in on Function Genome Res., September 1, 2001; 11(9): 1463 - 1468. [Abstract] [Full Text] [PDF] |
||||







