A novel biclustering approach with iterative optimization to analyze gene expression data

Sawannee Sutheeworapong; Motonori Ota; Hiroyuki Ohta; Kengo Kinoshita

doi:10.2147/AABC.S32622

Back to Journals » Advances and Applications in Bioinformatics and Chemistry » Volume 5

Methodology

A novel biclustering approach with iterative optimization to analyze gene expression data

Authors Sutheeworapong S, Motonori O, Hiroyuki O, Kengo K

Received 4 April 2012

Accepted for publication 9 May 2012

Published 7 September 2012 Volume 2012:5 Pages 23—59

DOI https://doi.org/10.2147/AABC.S32622

Review by Single anonymous peer review

Peer reviewer comments 2

Download Article [PDF]

Video abstract presented by Sawannee Sutheeworapong.

Sawannee Sutheeworapong,^1,2 Motonori Ota,⁴ Hiroyuki Ohta,¹ Kengo Kinoshita^2,3
¹Department of Biological Sciences, Graduate School of Biosciences and Biotechnology, Tokyo Institute of Technology, Tokyo, Japan; ²Graduate School of Information Sciences, ³Institute of Development, Aging and Cancer, Tohoku University, Miyagi, Japan; ⁴Graduate School of Information Sciences, Nagoya University, Nagoya, Japan

Objective: With the dramatic increase in microarray data, biclustering has become a promising tool for gene expression analysis. Biclustering has been proven to be superior over clustering in identifying multifunctional genes and searching for co-expressed genes under a few specific conditions; that is, a subgroup of all conditions. Biclustering based on a genetic algorithm (GA) has shown better performance than greedy algorithms, but the overlap state for biclusters must be treated more systematically.
Results: We developed a new biclustering algorithm (binary-iterative genetic algorithm [BIGA]), based on an iterative GA, by introducing a novel, ternary-digit chromosome encoding function. BIGA searches for a set of biclusters by iterative binary divisions that allow the overlap state to be explicitly considered. In addition, the average of the Pearson’s correlation coefficient was employed to measure the relationship of genes within a bicluster, instead of the mean square residual, the popular classical index. As compared to the six existing algorithms, BIGA found highly correlated biclusters, with large gene coverage and reasonable gene overlap. The gene ontology (GO) enrichment showed that most of the biclusters are significant, with at least one GO term over represented.
Conclusion: BIGA is a powerful tool to analyze large amounts of gene expression data, and will facilitate the elucidation of the underlying functional mechanisms in living organisms.

Keywords: biclustering, microarray data, genetic algorithm, Pearson’s correlation coefficient

Creative Commons License © 2012 The Author(s). This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution - Non Commercial (unported, v3.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.

Download Article [PDF]