Skip Navigation

This Article
Right arrow FREE Full Text (Print PDF) Freely available
Right arrow FREE Full Text (Screen PDF)
Right arrow Alert me when this article is cited
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Add to My Personal Archive
Right arrow Download to citation manager
Right arrow Search for citing articles in:
ISI Web of Science (153)
Right arrowRequest Permissions
Google Scholar
Right arrow Articles by Herrero, J.
Right arrow Articles by Dopazo, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Herrero, J.
Right arrow Articles by Dopazo, J.
Social Bookmarking
 Add to CiteULike   Add to Connotea   Add to Del.icio.us  
What's this?

Bioinformatics Vol. 17 no. 2 2001
Pages 126-136
© 2001 Oxford University Press


Original Paper

A hierarchical unsupervised growing neural network for clustering gene expression patterns

Javier Herrero 1, Alfonso Valencia 2 and Joaquín Dopazo 1,*

1 Bioinformatics, CNIO, Ctra. Majadahonda-Pozuelo, Km 2, Majadahonda, 28220 Madrid
2 Protein Design Group CNB-CSIC, 28049 Madrid, Spain

Received on August 6, 2000 ; revised on September 29, 2000 ; accepted on October 6, 2000

Motivation: We describe a new approach to the analysis of gene expression data coming from DNA array experiments, using an unsupervised neural network. DNA array technologies allow monitoring thousands of genes rapidly and efficiently. One of the interests of these studies is the search for correlated gene expression patterns, and this is usually achieved by clustering them. The Self-Organising Tree Algorithm, (SOTA) (Dopazo,J. and Carazo,J.M. (1997) J. Mol. Evol. , 44, 226–233), is a neural network that grows adopting the topology of a binary tree. The result of the algorithm is a hierarchical cluster obtained with the accuracy and robustness of a neural network.

Results: SOTA clustering confers several advantages over classical hierarchical clustering methods. SOTA is a divisive method: the clustering process is performed from top to bottom, i.e. the highest hierarchical levels are resolved before going to the details of the lowest levels. The growing can be stopped at the desired hierarchical level. Moreover, a criterion to stop the growing of the tree, based on the approximate distribution of probability obtained by randomisation of the original data set, is provided. By means of this criterion, a statistical support for the definition of clusters is proposed. In addition, obtaining average gene expression patterns is a built-in feature of the algorithm. Different neurons defining the different hierarchical levels represent the averages of the gene expression patterns contained in the clusters.

Since SOTA runtimes are approximately linear with the number of items to be classified, it is especially suitable for dealing with huge amounts of data. The method proposed is very general and applies to any data providing that they can be coded as a series of numbers and that a computable measure of similarity between data items can be used.

Availability: A server running the program can be found at: http://bioinfo.cnio.es/sotarray

Contact: jdopazo{at}cnio.es

* To whom correspondence should be addressed.


Add to CiteULike CiteULike   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us    What's this?


This article has been cited by other articles:


Home page
Nucleic Acids ResHome page
J. Tarraga, I. Medina, J. Carbonell, J. Huerta-Cepas, P. Minguez, E. Alloza, F. Al-Shahrour, S. Vegas-Azcarate, S. Goetz, P. Escobar, et al.
GEPAS, a web-based tool for microarray data analysis and interpretation
Nucleic Acids Res., July 1, 2008; 36(suppl_2): W308 - W314.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
L. Brehelin, O. Gascuel, and O. Martin
Using repeated measurements to validate hierarchical gene clusters
Bioinformatics, March 1, 2008; 24(5): 682 - 688.
[Abstract] [Full Text] [PDF]


Home page
J Exp BotHome page
S. Moco, E. Capanoglu, Y. Tikunov, R. J. Bino, D. Boyacioglu, R. D. Hall, J. Vervoort, and R. C. H. De Vos
Tissue specialization at the metabolite level is perceived during the development of tomato fruit
J. Exp. Bot., December 7, 2007; (2007) erm271v1.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
J. Madoz-Gurpide, M. Canamero, L. Sanchez, J. Solano, P. Alfonso, and J. I. Casal
A Proteomics Analysis of Cell Signaling Alterations in Colorectal Cancer
Mol. Cell. Proteomics, December 1, 2007; 6(12): 2150 - 2164.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Kerhornou and R. Guigo
BioMoby web services to support clustering of co-regulated genes based on similarity of promoter configurations
Bioinformatics, July 15, 2007; 23(14): 1831 - 1833.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
M. J. Nueda, A. Conesa, J. A. Westerhuis, H. C. J. Hoefsloot, A. K. Smilde, M. Talon, and A. Ferrer
Discovering gene expression patterns in time course microarray experiments by ANOVA SCA
Bioinformatics, July 15, 2007; 23(14): 1792 - 1800.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
V. Pihur, S. Datta, and S. Datta
Weighted rank aggregation of cluster validation measures: a Monte Carlo cross-entropy approach
Bioinformatics, July 1, 2007; 23(13): 1607 - 1615.
[Abstract] [Full Text] [PDF]


Home page
Appl. Environ. Microbiol.Home page
A. Mendes-Ferreira, M. del Olmo, J. Garcia-Martinez, E. Jimenez-Marti, A. Mendes-Faia, J. E. Perez-Ortin, and C. Leao
Transcriptional Response of Saccharomyces cerevisiae to Different Nitrogen Concentrations during Alcoholic Fermentation
Appl. Envir. Microbiol., May 1, 2007; 73(9): 3049 - 3060.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
D. S. V. Wong, F. K. Wong, and G. R. Wood
A multi-stage approach to clustering and imputation of gene expression profiles
Bioinformatics, April 15, 2007; 23(8): 998 - 1005.
[Abstract] [Full Text] [PDF]


Home page
BloodHome page
A. Sanchez-Aguilera, C. Montalban, P. de la Cueva, L. Sanchez-Verde, M. M. Morente, M. Garcia-Cosio, J. Garcia-Larana, C. Bellas, M. Provencio, V. Romagosa, et al.
Tumor microenvironment and mitotic checkpoint are key factors in the outcome of classic Hodgkin lymphoma
Blood, July 15, 2006; 108(2): 662 - 668.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
D. Montaner, J. Tarraga, J. Huerta-Cepas, J. Burguet, J. M. Vaquerizas, L. Conde, P. Minguez, J. Vera, S. Mukherjee, J. Valls, et al.
Next station in microarray data analysis: GEPAS.
Nucleic Acids Res., July 1, 2006; 34(Web Server issue): W486 - W491.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
A. Conesa, M. J. Nueda, A. Ferrer, and M. Talon
maSigPro: a method to identify significantly differential expression profiles in time-course microarray experiments
Bioinformatics, May 1, 2006; 22(9): 1096 - 1102.
[Abstract] [Full Text] [PDF]


Home page
J. Bacteriol.Home page
C. A. Cummings, H. J. Bootsma, D. A. Relman, and J. F. Miller
Species- and Strain-Specific Control of a Complex, Flexible Regulon by Bordetella BvgAS.
J. Bacteriol., March 1, 2006; 188(5): 1775 - 1785.
[Abstract] [Full Text] [PDF]


Home page
Brief BioinformHome page
P. Larranaga, B. Calvo, R. Santana, C. Bielza, J. Galdiano, I. Inza, J. A. Lozano, R. Armananzas, G. Santafe, A. Perez, et al.
Machine learning in bioinformatics
Brief Bioinform, March 1, 2006; 7(1): 86 - 112.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
N. Lopez-Bigas, B. J. Blencowe, and C. A. Ouzounis
Highly consistent patterns for inherited human diseases at the molecular level
Bioinformatics, February 1, 2006; 22(3): 269 - 277.
[Abstract] [Full Text] [PDF]


Home page
Plant CellHome page
R. Alba, P. Payton, Z. Fei, R. McQuinn, P. Debbie, G. B. Martin, S. D. Tanksley, and J. J. Giovannoni
Transcriptome and Selected Metabolite Analyses Reveal Multiple Points of Ethylene Control during Tomato Fruit Development
PLANT CELL, November 1, 2005; 17(11): 2954 - 2965.
[Abstract] [Full Text] [PDF]


Home page
Mol. Cell. ProteomicsHome page
E. P. Romijn, C. Christis, M. Wieffer, J. W. Gouw, A. Fullaondo, P. van der Sluijs, I. Braakman, and A. J. R. Heck
Expression Clustering Reveals Detailed Co-expression Patterns of Functionally Related Proteins during B Cell Differentiation: A Proteomic Study Using a Combination of One-Dimensional Gel Electrophoresis, LC-MS/MS, and Stable Isotope Labeling by Amino Acids in Cell Culture (SILAC)
Mol. Cell. Proteomics, September 1, 2005; 4(9): 1297 - 1310.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
J. Handl, J. Knowles, and D. B. Kell
Computational cluster validation in post-genomic data analysis
Bioinformatics, August 1, 2005; 21(15): 3201 - 3212.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. M. Vaquerizas, L. Conde, P. Yankilevich, A. Cabezon, P. Minguez, R. Diaz-Uriarte, F. Al-Shahrour, J. Herrero, and J. Dopazo
GEPAS, an experiment-oriented pipeline for the analysis of microarray gene expression data
Nucleic Acids Res., July 1, 2005; 33(suppl_2): W616 - W620.
[Abstract] [Full Text] [PDF]


Home page
Molecular Cancer TherapeuticsHome page
N. Martinez, M. Sanchez-Beato, A. Carnero, V. Moneo, J. C. Tercero, I. Fernandez, M. Navarrete, J. Jimeno, and M. A. Piris
Transcriptional signature of Ecteinascidin 743 (Yondelis, Trabectedin) in human sarcoma cells explanted from chemo-naive patients
Mol. Cancer Ther., May 1, 2005; 4(5): 814 - 823.
[Abstract] [Full Text] [PDF]


Home page
Molecular Cancer TherapeuticsHome page
P. J. Wild, R. C. Krieg, J. Seidl, R. Stoehr, K. Reher, C. Hofmann, J. Louhelainen, A. Rosenthal, A. Hartmann, C. Pilarsky, et al.
RNA expression profiling of normal and tumor cells following photodynamic therapy with 5-aminolevulinic acid-induced protoporphyrin IX in vitro
Mol. Cancer Ther., April 1, 2005; 4(4): 516 - 528.
[Abstract] [Full Text] [PDF]


Home page
BioinformaticsHome page
N. Bolshakova, F. Azuaje, and Pád. Cunningham
An integrated tool for microarray data clustering and cluster validity assessment
Bioinformatics, February 15, 2005; 21(4): 451 - 455.
[Abstract] [Full Text] [PDF]


Home page
J. Clin. Endocrinol. Metab.Home page
P. Hanifi-Moghaddam, S. C. J. P. Gielen, H. J. Kloosterboer, M. E. De Gooyer, A. M. Sijbers, A. J. van Gool, M. Smid, M. Moorhouse, F. H. van Wijk, C. W. Burger, et al.
Molecular Portrait of the Progestagenic and Estrogenic Actions of Tibolone: Behavior of Cellular Networks in Response to Tibolone
J. Clin. Endocrinol. Metab., February 1, 2005; 90(2): 973 - 983.
[Abstract] [Full Text] [PDF]


Home page
Clin. Cancer Res.Home page
B. Martinez-Delgado, B. Melendez, M. Cuadros, J. Alvarez, J. M. Castrillo, A. Ruiz de la Parte, M. Mollejo, C. Bellas, R. Diaz, L. Lombardia, et al.
Expression Profiling of T-Cell Lymphomas Differentiates Peripheral and Lymphoblastic Lymphomas and Defines Survival Related Genes
Clin. Cancer Res., August 1, 2004; 10(15): 4971 - 4982.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
C. I. Castillo-Davis, F. A. Kondrashov, D. L. Hartl, and R. J. Kulathinal
The Functional Genomic Distribution of Protein Divergence in Two Animal Phyla: Coevolution, Genomic Conflict, and Constraint
Genome Res., May 1, 2004; 14(5): 802 - 811.
[Abstract] [Full Text] [PDF]


Home page
BloodHome page
L. Tracey, R. Villuendas, A. M. Dotor, I. Spiteri, P. Ortiz, J. F. Garcia, J. L. R. Peralto, M. Lawler, and M. A. Piris
Mycosis fungoides shows concurrent deregulation of multiple genes involved in the TNF signaling pathway: an expression profile study
Blood, August 1, 2003; 102(3): 1042 - 1050.
[Abstract] [Full Text] [PDF]


Home page
Nucleic Acids ResHome page
J. Herrero, F. Al-Shahrour, R. Diaz-Uriarte, A. Mateos, J. M. Vaquerizas, J. Santoyo, and J. Dopazo
GEPAS: a web-based resource for microarray gene expression data analysis
Nucleic Acids Res., July 1, 2003; 31(13): 3461 - 3467.
[Abstract] [Full Text] [PDF]


Home page
Physiol. GenomicsHome page
H. Ressom, D. Wang, and P. Natarajan
Clustering gene expression data using adaptive double self-organizing map
Physiol Genomics, June 24, 2003; 14(1): 35 - 46.
[Abstract] [Full Text] [PDF]


Home page
Plant Physiol.Home page
D. Honys and D. Twell
Comparative Analysis of the Arabidopsis Pollen Transcriptome
Plant Physiology, June 1, 2003; 132(2): 640 - 652.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
M. J. Martin, J. Herrero, A. Mateos, and J. Dopazo
Comparing Bacterial Genomes Through Conservation Profiles
Genome Res., May 1, 2003; 13(5): 991 - 998.
[Abstract] [Full Text] [PDF]


Home page
FASEB J.Home page
O. TURECI, J. DING, H. HILTON, H. BIAN, H. OHKAWA, M. BRAXENTHALER, G. SEITZ, L. RADDRIZZANI, H. FRIESS, M. BUCHLER, et al.
Computational dissection of tissue contamination for identification of colon cancer-specific expression profiles
FASEB J, March 1, 2003; 17(3): 376 - 385.
[Abstract] [Full Text] [PDF]


Home page
Ann. N. Y. Acad. Sci.Home page
F. VALAFAR
Pattern Recognition Techniques in Microarray Data Analysis: A Survey
Ann. N.Y. Acad. Sci., December 1, 2002; 980(1): 41 - 64.
[Abstract] [Full Text] [PDF]


Home page
Genome Res.Home page
A. Mateos, J. Dopazo, R. Jansen, Y. Tu, M. Gerstein, and G. Stolovitzky
Systematic Learning of Gene Functional Classes From DNA Array Expression Data by Using Multilayer Perceptrons
Genome Res., November 1, 2002; 12(11): 1703 - 1715.
[Abstract] [Full Text] [PDF]



Disclaimer:
Please note that abstracts for content published before 1996 were created through digital scanning and may therefore not exactly replicate the text of the original print issues. All efforts have been made to ensure accuracy, but the Publisher will not be held responsible for any remaining inaccuracies. If you require any further clarification, please contact our Customer Services Department.