Statistical and Machine Learning Techniques in Human Microbiome Studies: Contemporary Challenges and Solutions


Date

2021

Publisher

Frontiers Media S.A.

Open Access Color

GOLD

Green Open Access

Yes

OpenAIRE Downloads

107

OpenAIRE Views

166

Publicly Funded

Yes
Impulse
Top 1%
Influence
Top 10%
Popularity
Top 1%

Abstract

The human microbiome has emerged as a central research topic in human biology and biomedicine. Current microbiome studies generate high-throughput omics data across different body sites, populations, and life stages. While many of the challenges in microbiome research are similar to those in other high-throughput studies, quantitative analyses must also address the heterogeneity of the data, its specific statistical properties, and the remarkable variation in microbiome composition across individuals and body sites. This has led to a broad spectrum of statistical and machine learning challenges that range from study design, data processing, and standardization to analysis, modeling, cross-study comparison, prediction, data science ecosystems, and reproducible reporting. Although many statistical and machine learning approaches and tools have been developed, new techniques are needed to deal with emerging applications and the vast heterogeneity of microbiome data. We review and discuss emerging applications of statistical and machine learning techniques in human microbiome studies and introduce the COST Action CA18131 "ML4Microbiome", which brings together microbiome researchers and machine learning experts to address current challenges such as the standardization of analysis pipelines for reproducible data analysis results, benchmarking, and the improvement of existing tools and ontologies or the development of new ones.

Description

Nedyalkova, Miroslava/0000-0003-0793-3340; Vilne, Baiba/0000-0002-1084-7067; Zomer, Aldert/0000-0002-0758-5190; Vlachakis, Dimitrios/0000-0003-1823-6102; B. Lopes, Marta/0000-0002-4135-1857; Stres, Blaz/0000-0003-2972-2907; D'Elia, Domenica/0000-0003-3787-3836; Claesson, Marcus/0000-0002-5712-0623; Yilmaz, Ercument/0000-0002-3712-7086; Saez-Rodriguez, Julio/0000-0002-8552-8976; Marcos-Zambrano, Laura/0000-0003-1381-6407; Przymus, Piotr/0000-0001-9548-2388; May, Patrick/0000-0001-8698-3770; Klammsteiner, Thomas/0000-0003-1280-5159; Desai, Mahesh S/0000-0002-9223-2209; Elbere, Ilze/0000-0003-4381-885X; Lahti, Leo/0000-0001-5537-637X; Carrillo De Santa Pau, Enrique/0000-0002-2310-2267; Shigdel, Rajesh/0000-0002-8686-8569; Falquet, Laurent/0000-0001-8102-7579;

Keywords

Machine Learning, Microbiome, ML4Microbiome, Personalized Medicine, Biomarker Identification, Microbiology (medical), 570, COLONIZATION RESISTANCE, EMC OR-01, 610, ta3111, Microbiology, METAGENOMICS, Biology, medical applications, Microbiota, Data Science, GUT MICROBIOTA, Reproducibility of Results, health, bioinformatics, Reference Standards, QR1-502, Benchmarking, Research Design, Research, INFERENCE, COMMUNITIES

Fields of Science

0301 basic medicine, 03 medical and health sciences, 0303 health sciences

Citation

WoS Q

Q1

Scopus Q

Q1
OpenCitations Citation Count
69

Source

Frontiers in Microbiology

Volume

12

PlumX Metrics
Citations

Scopus : 86

PubMed : 54

Captures

Mendeley Readers : 221

SCOPUS™ Citations

88

checked on Apr 19, 2026

Web of Science™ Citations

75

checked on Apr 19, 2026

Page Views

3

checked on Apr 19, 2026

Downloads

8

checked on Apr 19, 2026

OpenAlex FWCI
6.5161
