Statistical and Machine Learning Techniques in Human Microbiome Studies: Contemporary Challenges and Solutions

dc.contributor.author Moreno-Indias, Isabel
dc.contributor.author Lahti, Leo
dc.contributor.author Nedyalkova, Miroslava
dc.contributor.author Elbere, Ilze
dc.contributor.author Roshchupkin, Gennady
dc.contributor.author Adilovic, Muhamed
dc.contributor.author Claesson, Marcus J.
dc.date.accessioned 2025-09-25T10:57:37Z
dc.date.available 2025-09-25T10:57:37Z
dc.date.issued 2021
dc.description Nedyalkova, Miroslava/0000-0003-0793-3340; Vilne, Baiba/0000-0002-1084-7067; Zomer, Aldert/0000-0002-0758-5190; Vlachakis, Dimitrios/0000-0003-1823-6102; B. Lopes, Marta/0000-0002-4135-1857; Stres, Blaz/0000-0003-2972-2907; D'Elia, Domenica/0000-0003-3787-3836; Claesson, Marcus/0000-0002-5712-0623; Yilmaz, Ercument/0000-0002-3712-7086; Saez-Rodriguez, Julio/0000-0002-8552-8976; Marcos-Zambrano, Laura/0000-0003-1381-6407; Przymus, Piotr/0000-0001-9548-2388; May, Patrick/0000-0001-8698-3770; Klammsteiner, Thomas/0000-0003-1280-5159; Desai, Mahesh S/0000-0002-9223-2209; Elbere, Ilze/0000-0003-4381-885X; Lahti, Leo/0000-0001-5537-637X; Carrillo De Santa Pau, Enrique/0000-0002-2310-2267; Shigdel, Rajesh/0000-0002-8686-8569; Falquet, Laurent/0000-0001-8102-7579; en_US
dc.description.abstract The human microbiome has emerged as a central research topic in human biology and biomedicine. Current microbiome studies generate high-throughput omics data across different body sites, populations, and life stages. Many of the challenges in microbiome research are similar to other high-throughput studies, the quantitative analyses need to address the heterogeneity of data, specific statistical properties, and the remarkable variation in microbiome composition across individuals and body sites. This has led to a broad spectrum of statistical and machine learning challenges that range from study design, data processing, and standardization to analysis, modeling, cross-study comparison, prediction, data science ecosystems, and reproducible reporting. Nevertheless, although many statistics and machine learning approaches and tools have been developed, new techniques are needed to deal with emerging applications and the vast heterogeneity of microbiome data. We review and discuss emerging applications of statistical and machine learning techniques in human microbiome studies and introduce the COST Action CA18131 "ML4Microbiome" that brings together microbiome researchers and machine learning experts to address current challenges such as standardization of analysis pipelines for reproducibility of data analysis results, benchmarking, improvement, or development of existing and new tools and ontologies. en_US
dc.description.sponsorship COST Action [CA18131]; Instituto de Salud Carlos III - Fondo Europeo de Desarrollo Regional-FEDER [CP16/00163]; project "Information and Communication Technologies for a Single Digital Market in Science, Education and Security" of the Scientific Research Center [NIS-3317]; National roadmaps for research infrastructures (RIs) [NIS-3318]; Academy of Finland [295741]; H2020-EU.4.b. project "Integration of knowledge and biobank resources in comprehensive translational approach for personalized prevention and treatment of metabolic disorders (INTEGROMED)" [857572]; Luxembourg National Research Fund (FNR) CORE grant [C18/BM/12585940] en_US
dc.description.sponsorship This study was supported by the COST Action CA18131 "Statistical and machine learning techniques in human microbiome studies." IM-I was supported by the "MS type I" program (CP16/00163) from the Instituto de Salud Carlos III and co-funded by Fondo Europeo de Desarrollo Regional-FEDER. MN was grateful for the additional support by the project "Information and Communication Technologies for a Single Digital Market in Science, Education and Security" of the Scientific Research Center, NIS-3317 and National roadmaps for research infrastructures (RIs) grant number NIS-3318. LL was supported by Academy of Finland (decision 295741). IE was supported by H2020-EU.4.b. project "Integration of knowledge and biobank resources in comprehensive translational approach for personalized prevention and treatment of metabolic disorders (INTEGROMED)" (grant agreement ID 857572). MD was supported by the Luxembourg National Research Fund (FNR) CORE grant (C18/BM/12585940). en_US
dc.description.sponsorship This study was supported by the COST Action CA18131 Statistical and machine learning techniques in human microbiome studies. IM-I was supported by the MS type I program (CP16/00163) from the Instituto de Salud Carlos III and co-funded by Fondo Europeo de Desarrollo Regional-FEDER. MN was grateful for the additional support by the project Information and Communication Technologies for a Single Digital Market in Science, Education and Security of the Scientific Research Center, NIS-3317 and National roadmaps for research infrastructures (RIs) grant number NIS-3318. LL was supported by Academy of Finland (decision 295741). IE was supported by H2020-EU.4.b. project Integration of knowledge and biobank resources in comprehensive translational approach for personalized prevention and treatment of metabolic disorders (INTEGROMED) (grant agreement ID 857572). MD was supported by the Luxembourg National Research Fund (FNR) CORE grant (C18/BM/12585940).
dc.identifier.doi 10.3389/fmicb.2021.635781
dc.identifier.issn 1664-302X
dc.identifier.scopus 2-s2.0-85102370593
dc.identifier.uri https://doi.org/10.3389/fmicb.2021.635781
dc.identifier.uri https://hdl.handle.net/20.500.12573/4683
dc.language.iso en en_US
dc.publisher Frontiers Media S.A. en_US
dc.relation.ispartof Frontiers in Microbiology en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Machine Learning en_US
dc.subject Microbiome en_US
dc.subject Ml4Microbiome en_US
dc.subject Personalized Medicine en_US
dc.subject Biomarker Identification en_US
dc.title Statistical and Machine Learning Techniques in Human Microbiome Studies: Contemporary Challenges and Solutions en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.id Nedyalkova, Miroslava/0000-0003-0793-3340
gdc.author.id Vilne, Baiba/0000-0002-1084-7067
gdc.author.id Zomer, Aldert/0000-0002-0758-5190
gdc.author.id Vlachakis, Dimitrios/0000-0003-1823-6102
gdc.author.id B. Lopes, Marta/0000-0002-4135-1857
gdc.author.id Stres, Blaz/0000-0003-2972-2907
gdc.author.id D'Elia, Domenica/0000-0003-3787-3836
gdc.author.id Elbere, Ilze/0000-0003-4381-885X
gdc.author.id Falquet, Laurent/0000-0001-8102-7579
gdc.author.id Saez-Roiguez, Julio/0000-0002-8552-8976
gdc.author.scopusid 24332457100
gdc.author.scopusid 8679063700
gdc.author.scopusid 55467311500
gdc.author.scopusid 56871230600
gdc.author.scopusid 57190279096
gdc.author.scopusid 57194340394
gdc.author.scopusid 57219542183
gdc.author.wosid Lopes, Marta/F-5378-2011
gdc.author.wosid Moreno-Indias, Isabel/B-6595-2017
gdc.author.wosid Suharoschi, Ramona/M-9711-2019
gdc.author.wosid Vlachakis, Dimitrios/O-4323-2018
gdc.author.wosid B. Lopes, Marta/F-5378-2011
gdc.author.wosid Sampri, Alexia/Kwu-0954-2024
gdc.author.wosid Promponas, Vasilis/G-5539-2010
gdc.author.wosid Elbere, Ilze/JFS-0327-2023
gdc.author.wosid Truică, Ciprian-Octavian/J-9536-2014
gdc.author.wosid Falquet, Laurent/C-2541-2013
gdc.author.wosid Stres, Blaz/AAF-7279-2020
gdc.author.wosid Saez-Roiguez, Julio/H-7114-2019
gdc.author.wosid D'Elia, Domenica/O-2917-2015
gdc.bip.impulseclass C3
gdc.bip.influenceclass C4
gdc.bip.popularityclass C3
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department Abdullah Gül University en_US
gdc.description.departmenttemp [Moreno-Indias, Isabel] Univ Malaga, Inst Invest Biomed Malaga IBIMA, Hosp Clin Univ Virgen Victoria, Unidad Gest Clin Endocrinol & Nutr, Malaga, Spain; [Moreno-Indias, Isabel] Inst Salud Carlos III, Ctr Invest Biomeid Red Fisiopatol & Obes & Nutr, Madrid, Spain; [Lahti, Leo] Univ Turku, Dept Comp, Turku, Finland; [Nedyalkova, Miroslava] Latvian Biomed Res & Study Ctr, Human Genet & Dis Mech, Riga, Latvia; [Elbere, Ilze] Latvian Biomed Res & Study Ctr, Riga, Latvia; [Roshchupkin, Gennady] Erasmus MC, Dept Epidemiol, Rotterdam, Netherlands; [Adilovic, Muhamed] Int Univ Sarajevo, Dept Genet & Bioengn, Sarajevo, Bosnia & Herceg; [Aydemir, Onder] Karadeniz Tech Univ, Dept Elect & Elect Engn, Trabzon, Turkey; [Bakir-Gungor, Burcu] Abdullah Gul Univ, Dept Comp Engn, Kayseri, Turkey; [Santa Pau, Enrique Carrillo-de; Marcos-Zambrano, Laura Judith] IMDEA Food Inst, Computat Biol Grp, Precis Nutr & Canc Res Program, Madrid, Spain; [D'Elia, Domenica] CNR, Inst Biomed Technol, Dept Biomed Sci, Bari, Italy; [Desai, Mahesh S.] Luxembourg Inst Hlth, Dept Infect & Immun, Esch Sur Alzette, Luxembourg; [Desai, Mahesh S.] Univ Southern Denmark, Odense Univ Hosp, Dept Dermatol, Odense Res Ctr Anaphylaxis, Odense, Denmark; [Desai, Mahesh S.] Univ Southern Denmark, Odense Univ Hosp, Allergy Ctr, Odense, Denmark; [Falquet, Laurent] Univ Fribourg, Dept Biol, Fribourg, Switzerland; [Falquet, Laurent] Swiss Inst Bioinformat, Lausanne, Switzerland; [Gundogdu, Aycan] Erciyes Univ, Dept Microbiol & Clin Microbiol, Fac Med, Kayseri, Turkey; [Gundogdu, Aycan] Erciyes Univ, Genome & Stem Cell Ctr GenKok, Metagen Lab, Kayseri, Turkey; [Hron, Karel] Palacky Univ, Dept Math Anal & Applicat Math, Olomouc, Czech Republic; [Klammsteiner, Thomas] Univ Innsbruck, Dept Microbiol, Innsbruck, Austria; [Lopes, Marta B.] UNL, FCT, NOVA Lab Comp Sci & Informat NOVA LINCS, Caparica, Portugal; [Lopes, Marta B.] UNL, Ctr Matemat & Aplicacoes CMA, FCT, Caparica, Portugal; [Marques, Claudia] Univ Nova Lisboa, NOVA Med Sch, CINTESIS, NMS, Lisbon, Portugal; [Mason, Michael] Sage Bionetworks, Computat Oncol, Seattle, WA USA; [May, Patrick] Univ Luxembourg, Luxembourg Ctr Syst Biomed, Bioinformat Core, Esch Sur Alzette, Luxembourg; [Pasic, Lejla] Univ Sarajevo, Sarajevo Med Sch, Sch Sci & Technol, Sarajevo, Bosnia & Herceg; [Pio, Gianvito] Univ Bari Aldo Moro, Dept Comp Sci, Bari, Italy; [Pongor, Sandor] Pazmany Univ, Fac Informat Tehnol & Bion, Budapest, Hungary; [Promponas, Vasilis J.] Univ Cyprus, Dept Biol Sci, Bioinformat Res Lab, Nicosia, Cyprus; [Przymus, Piotr] Nicolaus Copernicus Univ, Fac Math & Comp Sci, Torun, Poland; [Saez-Rodriguez, Julio] Heidelberg Univ, Inst Computat Biomed, Fac Med, Heidelberg, Germany; [Saez-Rodriguez, Julio] Heidelberg Univ Hosp, Heidelberg, Germany; [Sampri, Alexia] Univ Manchester, Sch Hlth Sci, Div Informat Imaging & Data Sci, Manchester, Lancs, England; [Shigdel, Rajesh] Univ Bergen, Dept Clin Sci, Bergen, Norway; [Stres, Blaz] Jozef Stefan Inst, Ljubljana, Slovenia; [Stres, Blaz] Univ Ljubljana, Biotech Fac, Ljubljana, Slovenia; [Stres, Blaz] Univ Ljubljana, Fac Civil & Geodet Engn, Ljubljana, Slovenia; [Suharoschi, Ramona] Univ Agr Sci & Vet Med Cluj Napoca, Inst Life Sci, Fac Food Sci & Technol, Mol Nutr & Prote Lab, Cluj Napoca, Romania; [Truu, Jaak] Univ Tartu, Inst Mol & Cell Biol, Tartu, Estonia; [Truica, Ciprian-Octavian] Univ Politehn Bucuresti, Fac Automat Control & Comp, Dept Comp Sci & Engn, Bucharest, Romania; [Vilne, Baiba] Riga Stradins Univ, Bioinformat Res Unit, Riga, Latvia; [Vlachakis, Dimitrios] Agr Univ Athens, Sch Appl Biol & Biotechnol, Dept Biotechnol, Lab Genet, Athens, Greece; [Yilmaz, Ercument] Karadeniz Tech Univ, Dept Comp Technol, Trabzon, Turkey; [Zeller, Georg] European Mol Biol Lab, Struct & Computat Biol Unit, Heidelberg, Germany; [Zomer, Aldert L.] Univ Utrecht, Dept Infect Dis & Immunol, Fac Vet Med, Utrecht, Netherlands; [Gomez-Cabrero, David] Univ Publ Navarra UPNA, Complejo Hosp Navarra CHN, Navarrabiomed, IdiSNA, Pamplona, Spain; [Claesson, Marcus J.] Univ Coll Cork, Sch Microbiol, Cork, Ireland; [Claesson, Marcus J.] Univ Coll Cork, APC Microbiome Ireland, Cork, Ireland en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q1
gdc.description.volume 12 en_US
gdc.description.woscitationindex Science Citation Index Expanded
gdc.description.wosquality Q1
gdc.identifier.openalex W3127298225
gdc.identifier.pmid 33692771
gdc.identifier.wos WOS:000625979400001
gdc.index.type WoS
gdc.index.type Scopus
gdc.index.type PubMed
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.downloads 107
gdc.oaire.impulse 55.0
gdc.oaire.influence 4.3179744E-9
gdc.oaire.isgreen true
gdc.oaire.keywords Microbiology (medical)
gdc.oaire.keywords 570
gdc.oaire.keywords COLONIZATION RESISTANCE
gdc.oaire.keywords Biología
gdc.oaire.keywords EMC OR-01
gdc.oaire.keywords 610
gdc.oaire.keywords microbiome
gdc.oaire.keywords ta3111
gdc.oaire.keywords Microbiology
gdc.oaire.keywords METAGENOMICS
gdc.oaire.keywords Machine Learning
gdc.oaire.keywords ML4Microbiome
gdc.oaire.keywords Machine learning
gdc.oaire.keywords Estándares de Referencia
gdc.oaire.keywords Biomarker identification
gdc.oaire.keywords Biology
gdc.oaire.keywords medical applications
gdc.oaire.keywords Microbiota
gdc.oaire.keywords Data Science
gdc.oaire.keywords GUT MICROBIOTA
gdc.oaire.keywords Reproducibility of Results
gdc.oaire.keywords health
gdc.oaire.keywords bioinformatics
gdc.oaire.keywords personalized medicine
gdc.oaire.keywords Reference Standards
gdc.oaire.keywords Personalized medicine
gdc.oaire.keywords QR1-502
gdc.oaire.keywords Benchmarking
gdc.oaire.keywords machine learning
gdc.oaire.keywords Research Design
gdc.oaire.keywords biomarker identification
gdc.oaire.keywords Investigación
gdc.oaire.keywords INFERENCE
gdc.oaire.keywords Microbiome
gdc.oaire.keywords Aprendizaje Automático
gdc.oaire.keywords COMMUNITIES
gdc.oaire.keywords machine learning, microbiome, ML4Microbiome, personalized medicine, biomarker identification
gdc.oaire.popularity 6.029738E-8
gdc.oaire.publicfunded true
gdc.oaire.sciencefields 0301 basic medicine
gdc.oaire.sciencefields 03 medical and health sciences
gdc.oaire.sciencefields 0303 health sciences
gdc.oaire.views 166
gdc.openalex.collaboration International
gdc.openalex.fwci 6.5161
gdc.openalex.normalizedpercentile 0.98
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 69
gdc.plumx.facebookshareslikecount 82
gdc.plumx.mendeley 221
gdc.plumx.newscount 1
gdc.plumx.pubmedcites 54
gdc.plumx.scopuscites 86
gdc.scopus.citedcount 88
gdc.wos.citedcount 75
relation.isOrgUnitOfPublication 665d3039-05f8-4a25-9a3c-b9550bffecef
relation.isOrgUnitOfPublication.latestForDiscovery 665d3039-05f8-4a25-9a3c-b9550bffecef

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Statistical and Machine Learning Techniques in Human Microbiome Studies Contemporary Challenges and Solutions.pdf
Size:
956.31 KB
Format:
Adobe Portable Document Format
Description:
Makale Dosyası

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: