Multi-block PLS discriminant analysis for the joint analysis of metabolomic and epidemiological data.
Discrimination
Epidemiology
Metabolomics
Multi-block
Multiblock PLS discriminant analysis
Journal
Metabolomics : Official journal of the Metabolomic Society
ISSN: 1573-3890
Titre abrégé: Metabolomics
Pays: United States
ID NLM: 101274889
Informations de publication
Date de publication:
03 10 2019
03 10 2019
Historique:
received:
11
06
2019
accepted:
25
09
2019
entrez:
5
10
2019
pubmed:
5
10
2019
medline:
2
6
2020
Statut:
epublish
Résumé
Metabolomics is a powerful phenotyping tool in nutrition and health research, generating complex data that need dedicated treatments to enrich knowledge of biological systems. In particular, to investigate relations between environmental factors, phenotypes and metabolism, discriminant statistical analyses are generally performed separately on metabolomic datasets, complemented by associations with metadata. Another relevant strategy is to simultaneously analyse thematic data blocks by a multi-block partial least squares discriminant analysis (MBPLSDA) allowing determining the importance of variables and blocks in discriminating groups of subjects, taking into account data structure. The present objective was to develop a full open-source standalone tool, allowing all steps of MBPLSDA for the joint analysis of metabolomic and epidemiological data. This tool was based on the mbpls function of the ade4 R package, enriched with functionalities, including some dedicated to discriminant analysis. Provided indicators help to determine the optimal number of components, to check the MBPLSDA model validity, and to evaluate the variability of its parameters and predictions. To illustrate the potential of this tool, MBPLSDA was applied to a real case study involving metabolomics, nutritional and clinical data from a human cohort. The availability of different functionalities in a single R package allowed optimizing parameters for an efficient joint analysis of metabolomics and epidemiological data to obtain new insights into multidimensional phenotypes. In particular, we highlighted the impact of filtering the metabolomic variables beforehand, and the relevance of a MBPLSDA approach in comparison to a standard PLS discriminant analysis method.
Identifiants
pubmed: 31583480
doi: 10.1007/s11306-019-1598-y
pii: 10.1007/s11306-019-1598-y
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
134Références
BMC Bioinformatics. 2008 Nov 28;9:504
pubmed: 19040729
J Proteome Res. 2017 Jun 2;16(6):2262-2272
pubmed: 28440083
Rejuvenation Res. 2007 Sep;10(3):377-86
pubmed: 17708689
Bioinformatics. 2015 May 1;31(9):1493-5
pubmed: 25527831
Brief Bioinform. 2006 Jun;7(2):151-8
pubmed: 16772265
PLoS Comput Biol. 2017 Nov 3;13(11):e1005752
pubmed: 29099853
Curr Drug Metab. 2006 Jun;7(5):525-39
pubmed: 16787160
Curr Opin Chem Biol. 2013 Oct;17(5):841-6
pubmed: 23849548
Anal Chim Acta. 2015 Jun 16;879:10-23
pubmed: 26002472
OMICS. 2014 Nov;18(11):682-95
pubmed: 25387159