Back to Journals » Advances and Applications in Bioinformatics and Chemistry » Volume 4

FDR-FET: an optimizing gene set enrichment analysis method

Authors Ji R, Ott K, Yordanova R, Bruccoleri

Published 15 March 2011 Volume 2011:4 Pages 37—42

DOI https://doi.org/10.2147/AABC.S15840

Review by Single anonymous peer review

Peer reviewer comments 2



Rui-Ru Ji1, Karl-Heinz Ott1, Roumyana Yordanova1, Robert E Bruccoleri2
1Applied Genomics, Research and Development, Bristol-Myers Squibb, Pennington, NJ, USA; 2Congenomics, Glastonbury, CT, USA

Abstract: Gene set enrichment analysis for analyzing large profiling and screening experiments can reveal unifying biological schemes based on previously accumulated knowledge represented as “gene sets”. Most of the existing implementations use a fixed fold-change or P value cutoff to generate regulated gene lists. However, the threshold selection in most cases is arbitrary, and has a significant effect on the test outcome and interpretation of the experiment. We developed a new gene set enrichment analysis method, ie, FDR-FET, which dynamically optimizes the threshold choice and improves the sensitivity and selectivity of gene set enrichment analysis. The procedure translates experimental results into a series of regulated gene lists at multiple false discovery rate (FDR) cutoffs, and computes the P value of the overrepresentation of a gene set using a Fisher’s exact test (FET) in each of these gene lists. The lowest P value is retained to represent the significance of the gene set. We also implemented improved methods to define a more relevant global reference set for the FET. We demonstrate the validity of the method using a published microarray study of three protease inhibitors of the human immunodeficiency virus and compare the results with those from other popular gene set enrichment analysis algorithms. Our results show that combining FDR with multiple cutoffs allows us to control the error while retaining genes that increase information content. We conclude that FDR-FET can selectively identify significant affected biological processes. Our method can be used for any user-generated gene list in the area of transcriptome, proteome, and other biological and scientific applications.

Keywords: gene set enrichment analysis, false discovery rate, Fisher’s exact test, microarray profiling, protease inhibitors

Creative Commons License © 2011 The Author(s). This work is published and licensed by Dove Medical Press Limited. The full terms of this license are available at https://www.dovepress.com/terms.php and incorporate the Creative Commons Attribution - Non Commercial (unported, v3.0) License. By accessing the work you hereby accept the Terms. Non-commercial uses of the work are permitted without any further permission from Dove Medical Press Limited, provided the work is properly attributed. For permission for commercial use of this work, please see paragraphs 4.2 and 5 of our Terms.