skip to content
Dovepress - Open Access to Scientific and Medical Research
View our mobile site

8852

Performance of PLS regression coefficients in selecting variables for each response of a multivariate PLS for omics-type data

Original Research

(3972) Views  (1025) Full article downloads

Authors: Giuseppe Palermo, Paolo Piraino, Hans-Dieter Zucht

Published Date May 2009 Volume 2009:2 Pages 57 - 70
DOI: http://dx.doi.org/10.2147/AABC.S3619

Giuseppe Palermo1, Paolo Piraino2, Hans-Dieter Zucht3

1Digilab BioVision GmbH, Hannover, Germany; 2Dr Paolo Piraino Statistical Consulting, Rende (CS), Italy; 3Proteome Sciences R&D GmbH and C. KG, Frankfurt am Main, Germany

Abstract: Multivariate partial least square (PLS) regression allows the modeling of complex biological events, by considering different factors at the same time. It is unaffected by data collinearity, representing a valuable method for modeling high-dimensional biological data (as derived from genomics, proteomics and peptidomics). In presence of multiple responses, it is of particular interest how to appropriately “dissect” the model, to reveal the importance of single attributes with regard to individual responses (for example, variable selection). In this paper, performances of multivariate PLS regression coefficients, in selecting relevant predictors for different responses in omics-type of data, were investigated by means of a receiver operating characteristic (ROC) analysis. For this purpose, simulated data, mimicking the covariance structures of microarray and liquid chromatography mass spectrometric data, were used to generate matrices of predictors and responses. The relevant predictors were set a priori. The influences of noise, the source of data with different covariance structure and the size of relevant predictors were investigated. Results demonstrate the applicability of PLS regression coeffi cients in selecting variables for each response of a multivariate PLS, in omics-type of data. Comparisons with other feature selection methods, such as variable importance in the projection scores, principal component regression, and least absolute shrinkage and selection operator regression were also provided.

Keywords: partial least square regression, regression coefficients, variable selection, biomarker discovery, omics-data








Readers of this article also read:

Role of aliskiren in cardio-renal protection and use in hypertensives with multiple risk factors
Classification of heterodimer interfaces using docking models and construction of scoring functions for the complex structure prediction
Computer applications for prediction of protein–protein interactions and rational drug design
Pharmacogenomics of drug efficacy in the interferon treatment of chronic hepatitis C using classification algorithms
An unsupervised strategy for biomedical image segmentation
Construction of random perfect phylogeny matrix
Zinc oxide nanoparticles as selective killers of proliferating cells
Expression of mannose binding lectin in HIV-1-infected brain: implications for HIV-related neuronal damage and neuroAIDS
Biodiversity: crises past and present, and future challenges
Cumulative clinical experience from over a decade of use of levofloxacin in community-acquired pneumonia: critical appraisal and role in therapy
  • Testimonials

    "... I was impressed at the rapidity of publication from submission to final acceptance." Dr Edwin Thrower, PhD, Yale University