Bioinformatics Vol. 17 no. 90001 2001
Pages S22-S29
© 2001 Oxford University Press
Fast optimal leaf ordering for hierarchical clustering
1 Laboratory for Computer Science, MIT, 545
Technology Square, Cambridge, MA, 02139, USA
2 Artificial Intelligence Laboratory, MIT,
545 Technology Square, Cambridge, MA, 02139, USA
Received on February 5, 2001
; revised on March 26, 2001
; accepted on March 26, 2001
We present the first practical algorithm for the optimal linear leaf ordering of trees that are generated by hierarchical clustering. Hierarchical clustering has been extensively used to analyze gene expression data, and we show how optimal leaf ordering can reveal biological structure that is not observed with an existing heuristic ordering method. For a tree with n leaves, there are 2n-1 linear orderings consistent with the structure of the tree. Our optimal leaf ordering algorithm runs in time O(n4), and we present further improvements that make the running time of our algorithm practical.
Contact: {zivbj,gifford}@mit.edu; tommi{at}ai.mit.edu
![]()
CiteULike
Connotea
Del.icio.us What's this?
This article has been cited by other articles:
![]() |
Q. K. Beg, A. Vazquez, J. Ernst, M. A. de Menezes, Z. Bar-Joseph, A.-L. Barabasi, and Z. N. Oltvai Intracellular crowding defines the mode and sequence of substrate uptake by Escherichia coli and constrains its metabolic activity PNAS, July 31, 2007; 104(31): 12663 - 12668. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Grotkjaer, O. Winther, B. Regenberg, J. Nielsen, and L. K. Hansen Robust multi-scale clustering of large DNA microarray datasets with the consensus algorithm Bioinformatics, January 1, 2006; 22(1): 58 - 67. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. Tsafrir, I. Tsafrir, L. Ein-Dor, O. Zuk, D.A. Notterman, and E. Domany Sorting points into neighborhoods (SPIN): data analysis and visualization by ordering distance matrices Bioinformatics, May 15, 2005; 21(10): 2301 - 2308. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Caraux and S. Pinloche PermutMatrix: a graphical environment to arrange gene expression profiles in optimal linear order Bioinformatics, April 1, 2005; 21(7): 1280 - 1281. [Abstract] [Full Text] [PDF] |
||||

