Statistical and Machine Learning Techniques in Human Microbiome Studies: Contemporary Challenges and Solutions
| dc.contributor.author | Moreno-Indias, Isabel | |
| dc.contributor.author | Lahti, Leo | |
| dc.contributor.author | Nedyalkova, Miroslava | |
| dc.contributor.author | Elbere, Ilze | |
| dc.contributor.author | Roshchupkin, Gennady | |
| dc.contributor.author | Adilovic, Muhamed | |
| dc.contributor.author | Claesson, Marcus J. | |
| dc.date.accessioned | 2025-09-25T10:57:37Z | |
| dc.date.available | 2025-09-25T10:57:37Z | |
| dc.date.issued | 2021 | |
| dc.description | Nedyalkova, Miroslava/0000-0003-0793-3340; Vilne, Baiba/0000-0002-1084-7067; Zomer, Aldert/0000-0002-0758-5190; Vlachakis, Dimitrios/0000-0003-1823-6102; B. Lopes, Marta/0000-0002-4135-1857; Stres, Blaz/0000-0003-2972-2907; D'Elia, Domenica/0000-0003-3787-3836; Claesson, Marcus/0000-0002-5712-0623; Yilmaz, Ercument/0000-0002-3712-7086; Saez-Rodriguez, Julio/0000-0002-8552-8976; Marcos-Zambrano, Laura/0000-0003-1381-6407; Przymus, Piotr/0000-0001-9548-2388; May, Patrick/0000-0001-8698-3770; Klammsteiner, Thomas/0000-0003-1280-5159; Desai, Mahesh S/0000-0002-9223-2209; Elbere, Ilze/0000-0003-4381-885X; Lahti, Leo/0000-0001-5537-637X; Carrillo De Santa Pau, Enrique/0000-0002-2310-2267; Shigdel, Rajesh/0000-0002-8686-8569; Falquet, Laurent/0000-0001-8102-7579; | en_US |
| dc.description.abstract | The human microbiome has emerged as a central research topic in human biology and biomedicine. Current microbiome studies generate high-throughput omics data across different body sites, populations, and life stages. Many of the challenges in microbiome research are similar to other high-throughput studies, the quantitative analyses need to address the heterogeneity of data, specific statistical properties, and the remarkable variation in microbiome composition across individuals and body sites. This has led to a broad spectrum of statistical and machine learning challenges that range from study design, data processing, and standardization to analysis, modeling, cross-study comparison, prediction, data science ecosystems, and reproducible reporting. Nevertheless, although many statistics and machine learning approaches and tools have been developed, new techniques are needed to deal with emerging applications and the vast heterogeneity of microbiome data. We review and discuss emerging applications of statistical and machine learning techniques in human microbiome studies and introduce the COST Action CA18131 "ML4Microbiome" that brings together microbiome researchers and machine learning experts to address current challenges such as standardization of analysis pipelines for reproducibility of data analysis results, benchmarking, improvement, or development of existing and new tools and ontologies. | en_US |
| dc.description.sponsorship | COST Action [CA18131]; Instituto de Salud Carlos III - Fondo Europeo de Desarrollo Regional-FEDER [CP16/00163]; project "Information and Communication Technologies for a Single Digital Market in Science, Education and Security" of the Scientific Research Center [NIS-3317]; National roadmaps for research infrastructures (RIs) [NIS-3318]; Academy of Finland [295741]; H2020-EU.4.b. project "Integration of knowledge and biobank resources in comprehensive translational approach for personalized prevention and treatment of metabolic disorders (INTEGROMED)" [857572]; Luxembourg National Research Fund (FNR) CORE grant [C18/BM/12585940] | en_US |
| dc.description.sponsorship | This study was supported by the COST Action CA18131 "Statistical and machine learning techniques in human microbiome studies." IM-I was supported by the "MS type I" program (CP16/00163) from the Instituto de Salud Carlos III and co-funded by Fondo Europeo de Desarrollo Regional-FEDER. MN was grateful for the additional support by the project "Information and Communication Technologies for a Single Digital Market in Science, Education and Security" of the Scientific Research Center, NIS-3317 and National roadmaps for research infrastructures (RIs) grant number NIS-3318. LL was supported by Academy of Finland (decision 295741). IE was supported by H2020-EU.4.b. project "Integration of knowledge and biobank resources in comprehensive translational approach for personalized prevention and treatment of metabolic disorders (INTEGROMED)" (grant agreement ID 857572). MD was supported by the Luxembourg National Research Fund (FNR) CORE grant (C18/BM/12585940). | en_US |
| dc.description.sponsorship | This study was supported by the COST Action CA18131 Statistical and machine learning techniques in human microbiome studies. IM-I was supported by the MS type I program (CP16/00163) from the Instituto de Salud Carlos III and co-funded by Fondo Europeo de Desarrollo Regional-FEDER. MN was grateful for the additional support by the project Information and Communication Technologies for a Single Digital Market in Science, Education and Security of the Scientific Research Center, NIS-3317 and National roadmaps for research infrastructures (RIs) grant number NIS-3318. LL was supported by Academy of Finland (decision 295741). IE was supported by H2020-EU.4.b. project Integration of knowledge and biobank resources in comprehensive translational approach for personalized prevention and treatment of metabolic disorders (INTEGROMED) (grant agreement ID 857572). MD was supported by the Luxembourg National Research Fund (FNR) CORE grant (C18/BM/12585940). | |
| dc.identifier.doi | 10.3389/fmicb.2021.635781 | |
| dc.identifier.issn | 1664-302X | |
| dc.identifier.scopus | 2-s2.0-85102370593 | |
| dc.identifier.uri | https://doi.org/10.3389/fmicb.2021.635781 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.12573/4683 | |
| dc.language.iso | en | en_US |
| dc.publisher | Frontiers Media S.A. | en_US |
| dc.relation.ispartof | Frontiers in Microbiology | en_US |
| dc.rights | info:eu-repo/semantics/openAccess | en_US |
| dc.subject | Machine Learning | en_US |
| dc.subject | Microbiome | en_US |
| dc.subject | Ml4Microbiome | en_US |
| dc.subject | Personalized Medicine | en_US |
| dc.subject | Biomarker Identification | en_US |
| dc.title | Statistical and Machine Learning Techniques in Human Microbiome Studies: Contemporary Challenges and Solutions | en_US |
| dc.type | Article | en_US |
| dspace.entity.type | Publication | |
| gdc.author.id | Nedyalkova, Miroslava/0000-0003-0793-3340 | |
| gdc.author.id | Vilne, Baiba/0000-0002-1084-7067 | |
| gdc.author.id | Zomer, Aldert/0000-0002-0758-5190 | |
| gdc.author.id | Vlachakis, Dimitrios/0000-0003-1823-6102 | |
| gdc.author.id | B. Lopes, Marta/0000-0002-4135-1857 | |
| gdc.author.id | Stres, Blaz/0000-0003-2972-2907 | |
| gdc.author.id | D'Elia, Domenica/0000-0003-3787-3836 | |
| gdc.author.id | Elbere, Ilze/0000-0003-4381-885X | |
| gdc.author.id | Falquet, Laurent/0000-0001-8102-7579 | |
| gdc.author.id | Saez-Roiguez, Julio/0000-0002-8552-8976 | |
| gdc.author.scopusid | 24332457100 | |
| gdc.author.scopusid | 8679063700 | |
| gdc.author.scopusid | 55467311500 | |
| gdc.author.scopusid | 56871230600 | |
| gdc.author.scopusid | 57190279096 | |
| gdc.author.scopusid | 57194340394 | |
| gdc.author.scopusid | 57219542183 | |
| gdc.author.wosid | Lopes, Marta/F-5378-2011 | |
| gdc.author.wosid | Moreno-Indias, Isabel/B-6595-2017 | |
| gdc.author.wosid | Suharoschi, Ramona/M-9711-2019 | |
| gdc.author.wosid | Vlachakis, Dimitrios/O-4323-2018 | |
| gdc.author.wosid | B. Lopes, Marta/F-5378-2011 | |
| gdc.author.wosid | Sampri, Alexia/Kwu-0954-2024 | |
| gdc.author.wosid | Promponas, Vasilis/G-5539-2010 | |
| gdc.author.wosid | Elbere, Ilze/JFS-0327-2023 | |
| gdc.author.wosid | Truică, Ciprian-Octavian/J-9536-2014 | |
| gdc.author.wosid | Falquet, Laurent/C-2541-2013 | |
| gdc.author.wosid | Stres, Blaz/AAF-7279-2020 | |
| gdc.author.wosid | Saez-Roiguez, Julio/H-7114-2019 | |
| gdc.author.wosid | D'Elia, Domenica/O-2917-2015 | |
| gdc.bip.impulseclass | C3 | |
| gdc.bip.influenceclass | C4 | |
| gdc.bip.popularityclass | C3 | |
| gdc.coar.access | open access | |
| gdc.coar.type | text::journal::journal article | |
| gdc.collaboration.industrial | false | |
| gdc.description.department | Abdullah Gül University | en_US |
| gdc.description.departmenttemp | [Moreno-Indias, Isabel] Univ Malaga, Inst Invest Biomed Malaga IBIMA, Hosp Clin Univ Virgen Victoria, Unidad Gest Clin Endocrinol & Nutr, Malaga, Spain; [Moreno-Indias, Isabel] Inst Salud Carlos III, Ctr Invest Biomeid Red Fisiopatol & Obes & Nutr, Madrid, Spain; [Lahti, Leo] Univ Turku, Dept Comp, Turku, Finland; [Nedyalkova, Miroslava] Latvian Biomed Res & Study Ctr, Human Genet & Dis Mech, Riga, Latvia; [Elbere, Ilze] Latvian Biomed Res & Study Ctr, Riga, Latvia; [Roshchupkin, Gennady] Erasmus MC, Dept Epidemiol, Rotterdam, Netherlands; [Adilovic, Muhamed] Int Univ Sarajevo, Dept Genet & Bioengn, Sarajevo, Bosnia & Herceg; [Aydemir, Onder] Karadeniz Tech Univ, Dept Elect & Elect Engn, Trabzon, Turkey; [Bakir-Gungor, Burcu] Abdullah Gul Univ, Dept Comp Engn, Kayseri, Turkey; [Santa Pau, Enrique Carrillo-de; Marcos-Zambrano, Laura Judith] IMDEA Food Inst, Computat Biol Grp, Precis Nutr & Canc Res Program, Madrid, Spain; [D'Elia, Domenica] CNR, Inst Biomed Technol, Dept Biomed Sci, Bari, Italy; [Desai, Mahesh S.] Luxembourg Inst Hlth, Dept Infect & Immun, Esch Sur Alzette, Luxembourg; [Desai, Mahesh S.] Univ Southern Denmark, Odense Univ Hosp, Dept Dermatol, Odense Res Ctr Anaphylaxis, Odense, Denmark; [Desai, Mahesh S.] Univ Southern Denmark, Odense Univ Hosp, Allergy Ctr, Odense, Denmark; [Falquet, Laurent] Univ Fribourg, Dept Biol, Fribourg, Switzerland; [Falquet, Laurent] Swiss Inst Bioinformat, Lausanne, Switzerland; [Gundogdu, Aycan] Erciyes Univ, Dept Microbiol & Clin Microbiol, Fac Med, Kayseri, Turkey; [Gundogdu, Aycan] Erciyes Univ, Genome & Stem Cell Ctr GenKok, Metagen Lab, Kayseri, Turkey; [Hron, Karel] Palacky Univ, Dept Math Anal & Applicat Math, Olomouc, Czech Republic; [Klammsteiner, Thomas] Univ Innsbruck, Dept Microbiol, Innsbruck, Austria; [Lopes, Marta B.] UNL, FCT, NOVA Lab Comp Sci & Informat NOVA LINCS, Caparica, Portugal; [Lopes, Marta B.] UNL, Ctr Matemat & Aplicacoes CMA, FCT, Caparica, Portugal; [Marques, Claudia] Univ Nova Lisboa, NOVA Med Sch, CINTESIS, NMS, Lisbon, Portugal; [Mason, Michael] Sage Bionetworks, Computat Oncol, Seattle, WA USA; [May, Patrick] Univ Luxembourg, Luxembourg Ctr Syst Biomed, Bioinformat Core, Esch Sur Alzette, Luxembourg; [Pasic, Lejla] Univ Sarajevo, Sarajevo Med Sch, Sch Sci & Technol, Sarajevo, Bosnia & Herceg; [Pio, Gianvito] Univ Bari Aldo Moro, Dept Comp Sci, Bari, Italy; [Pongor, Sandor] Pazmany Univ, Fac Informat Tehnol & Bion, Budapest, Hungary; [Promponas, Vasilis J.] Univ Cyprus, Dept Biol Sci, Bioinformat Res Lab, Nicosia, Cyprus; [Przymus, Piotr] Nicolaus Copernicus Univ, Fac Math & Comp Sci, Torun, Poland; [Saez-Rodriguez, Julio] Heidelberg Univ, Inst Computat Biomed, Fac Med, Heidelberg, Germany; [Saez-Rodriguez, Julio] Heidelberg Univ Hosp, Heidelberg, Germany; [Sampri, Alexia] Univ Manchester, Sch Hlth Sci, Div Informat Imaging & Data Sci, Manchester, Lancs, England; [Shigdel, Rajesh] Univ Bergen, Dept Clin Sci, Bergen, Norway; [Stres, Blaz] Jozef Stefan Inst, Ljubljana, Slovenia; [Stres, Blaz] Univ Ljubljana, Biotech Fac, Ljubljana, Slovenia; [Stres, Blaz] Univ Ljubljana, Fac Civil & Geodet Engn, Ljubljana, Slovenia; [Suharoschi, Ramona] Univ Agr Sci & Vet Med Cluj Napoca, Inst Life Sci, Fac Food Sci & Technol, Mol Nutr & Prote Lab, Cluj Napoca, Romania; [Truu, Jaak] Univ Tartu, Inst Mol & Cell Biol, Tartu, Estonia; [Truica, Ciprian-Octavian] Univ Politehn Bucuresti, Fac Automat Control & Comp, Dept Comp Sci & Engn, Bucharest, Romania; [Vilne, Baiba] Riga Stradins Univ, Bioinformat Res Unit, Riga, Latvia; [Vlachakis, Dimitrios] Agr Univ Athens, Sch Appl Biol & Biotechnol, Dept Biotechnol, Lab Genet, Athens, Greece; [Yilmaz, Ercument] Karadeniz Tech Univ, Dept Comp Technol, Trabzon, Turkey; [Zeller, Georg] European Mol Biol Lab, Struct & Computat Biol Unit, Heidelberg, Germany; [Zomer, Aldert L.] Univ Utrecht, Dept Infect Dis & Immunol, Fac Vet Med, Utrecht, Netherlands; [Gomez-Cabrero, David] Univ Publ Navarra UPNA, Complejo Hosp Navarra CHN, Navarrabiomed, IdiSNA, Pamplona, Spain; [Claesson, Marcus J.] Univ Coll Cork, Sch Microbiol, Cork, Ireland; [Claesson, Marcus J.] Univ Coll Cork, APC Microbiome Ireland, Cork, Ireland | en_US |
| gdc.description.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
| gdc.description.scopusquality | Q1 | |
| gdc.description.volume | 12 | en_US |
| gdc.description.woscitationindex | Science Citation Index Expanded | |
| gdc.description.wosquality | Q1 | |
| gdc.identifier.openalex | W3127298225 | |
| gdc.identifier.pmid | 33692771 | |
| gdc.identifier.wos | WOS:000625979400001 | |
| gdc.index.type | WoS | |
| gdc.index.type | Scopus | |
| gdc.index.type | PubMed | |
| gdc.oaire.accesstype | GOLD | |
| gdc.oaire.diamondjournal | false | |
| gdc.oaire.downloads | 107 | |
| gdc.oaire.impulse | 55.0 | |
| gdc.oaire.influence | 4.3179744E-9 | |
| gdc.oaire.isgreen | true | |
| gdc.oaire.keywords | Microbiology (medical) | |
| gdc.oaire.keywords | 570 | |
| gdc.oaire.keywords | COLONIZATION RESISTANCE | |
| gdc.oaire.keywords | Biología | |
| gdc.oaire.keywords | EMC OR-01 | |
| gdc.oaire.keywords | 610 | |
| gdc.oaire.keywords | microbiome | |
| gdc.oaire.keywords | ta3111 | |
| gdc.oaire.keywords | Microbiology | |
| gdc.oaire.keywords | METAGENOMICS | |
| gdc.oaire.keywords | Machine Learning | |
| gdc.oaire.keywords | ML4Microbiome | |
| gdc.oaire.keywords | Machine learning | |
| gdc.oaire.keywords | Estándares de Referencia | |
| gdc.oaire.keywords | Biomarker identification | |
| gdc.oaire.keywords | Biology | |
| gdc.oaire.keywords | medical applications | |
| gdc.oaire.keywords | Microbiota | |
| gdc.oaire.keywords | Data Science | |
| gdc.oaire.keywords | GUT MICROBIOTA | |
| gdc.oaire.keywords | Reproducibility of Results | |
| gdc.oaire.keywords | health | |
| gdc.oaire.keywords | bioinformatics | |
| gdc.oaire.keywords | personalized medicine | |
| gdc.oaire.keywords | Reference Standards | |
| gdc.oaire.keywords | Personalized medicine | |
| gdc.oaire.keywords | QR1-502 | |
| gdc.oaire.keywords | Benchmarking | |
| gdc.oaire.keywords | machine learning | |
| gdc.oaire.keywords | Research Design | |
| gdc.oaire.keywords | biomarker identification | |
| gdc.oaire.keywords | Investigación | |
| gdc.oaire.keywords | INFERENCE | |
| gdc.oaire.keywords | Microbiome | |
| gdc.oaire.keywords | Aprendizaje Automático | |
| gdc.oaire.keywords | COMMUNITIES | |
| gdc.oaire.keywords | machine learning, microbiome, ML4Microbiome, personalized medicine, biomarker identification | |
| gdc.oaire.popularity | 6.029738E-8 | |
| gdc.oaire.publicfunded | true | |
| gdc.oaire.sciencefields | 0301 basic medicine | |
| gdc.oaire.sciencefields | 03 medical and health sciences | |
| gdc.oaire.sciencefields | 0303 health sciences | |
| gdc.oaire.views | 166 | |
| gdc.openalex.collaboration | International | |
| gdc.openalex.fwci | 6.5161 | |
| gdc.openalex.normalizedpercentile | 0.98 | |
| gdc.openalex.toppercent | TOP 10% | |
| gdc.opencitations.count | 69 | |
| gdc.plumx.facebookshareslikecount | 82 | |
| gdc.plumx.mendeley | 221 | |
| gdc.plumx.newscount | 1 | |
| gdc.plumx.pubmedcites | 54 | |
| gdc.plumx.scopuscites | 86 | |
| gdc.scopus.citedcount | 88 | |
| gdc.wos.citedcount | 75 | |
| relation.isOrgUnitOfPublication | 665d3039-05f8-4a25-9a3c-b9550bffecef | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 665d3039-05f8-4a25-9a3c-b9550bffecef |
Files
Original bundle
1 - 1 of 1
Loading...
- Name:
- Statistical and Machine Learning Techniques in Human Microbiome Studies Contemporary Challenges and Solutions.pdf
- Size:
- 956.31 KB
- Format:
- Adobe Portable Document Format
- Description:
- Makale Dosyası
License bundle
1 - 1 of 1
No Thumbnail Available
- Name:
- license.txt
- Size:
- 1.44 KB
- Format:
- Item-specific license agreed upon to submission
- Description:
