A Toolbox of Machine Learning Software to Support Microbiome Analysis

dc.contributor.author Marcos-Zambrano, Laura Judith
dc.contributor.author Lopez-Molina, Victor Manuel
dc.contributor.author Bakir-Gungor, Burcu
dc.contributor.author Frohme, Marcus
dc.contributor.author Karaduzovic-Hadziabdic, Kanita
dc.contributor.author Klammsteiner, Thomas
dc.contributor.author Carrillo de Santa Pau, Enrique
dc.date.accessioned 2025-09-25T10:39:44Z
dc.date.available 2025-09-25T10:39:44Z
dc.date.issued 2023
dc.description Duman, Hatice/0000-0002-4526-6609; B. Lopes, Marta/0000-0002-4135-1857; Kalluci, Eglantina/0009-0001-9039-1310; Simeon, Andrea/0000-0002-7096-7415; Aasmets, Oliver/0009-0001-9872-6031; Lahti, Leo/0000-0001-5537-637X; Dhamo, Xhilda/0000-0002-5157-7075; Araujo, Ricardo/0000-0001-7382-4834; Yilmaz, Ercument/0000-0002-3712-7086; Ibrahimi, Eliana/0000-0003-0956-215X; Lopez Molina, Victor Manuel/0009-0003-8126-8550; Karav, Sercan/0000-0003-4056-1673; Carrillo De Santa Pau, Enrique/0000-0002-2310-2267; Nap, Bram/0000-0003-2910-9109; Pujolassos, Meritxell/0000-0003-0313-3506; May, Patrick/0000-0001-8698-3770 en_US
dc.description.abstract The human microbiome has become an area of intense research due to its potential impact on human health. However, the analysis and interpretation of this data have proven to be challenging due to its complexity and high dimensionality. Machine learning (ML) algorithms can process vast amounts of data to uncover informative patterns and relationships within the data, even with limited prior knowledge. Therefore, there has been a rapid growth in the development of software specifically designed for the analysis and interpretation of microbiome data using ML techniques. These software incorporate a wide range of ML algorithms for clustering, classification, regression, or feature selection, to identify microbial patterns and relationships within the data and generate predictive models. This rapid development with a constant need for new developments and integration of new features require efforts into compile, catalog and classify these tools to create infrastructures and services with easy, transparent, and trustable standards. Here we review the state-of-the-art for ML tools applied in human microbiome studies, performed as part of the COST Action ML4Microbiome activities. This scoping review focuses on ML based software and framework resources currently available for the analysis of microbiome data in humans. The aim is to support microbiologists and biomedical scientists to go deeper into specialized resources that integrate ML techniques and facilitate future benchmarking to create standards for the analysis of microbiome data. The software resources are organized based on the type of analysis they were developed for and the ML techniques they implement. A description of each software with examples of usage is provided including comments about pitfalls and lacks in the usage of software based on ML methods in relation to microbiome data that need to be considered by developers and users. This review represents an extensive compilation to date, offering valuable insights and guidance for researchers interested in leveraging ML approaches for microbiome analysis. en_US
dc.description.sponsorship COST Action [CA18131]; COST (European Cooperation in Science and Technology) en_US
dc.description.sponsorship This article is based upon work from COST Action ML4Microbiome "Statistical and machine learning techniques in human microbiome studies," CA18131, supported by COST (European Cooperation in Science and Technology), www.cost.eu. en_US
dc.identifier.doi 10.3389/fmicb.2023.1250806
dc.identifier.issn 1664-302X
dc.identifier.scopus 2-s2.0-85178948402
dc.identifier.uri https://doi.org/10.3389/fmicb.2023.1250806
dc.identifier.uri https://hdl.handle.net/20.500.12573/3169
dc.language.iso en en_US
dc.publisher Frontiers Media S.A. en_US
dc.relation.ispartof Frontiers in Microbiology en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Microbiome en_US
dc.subject Machine Learning en_US
dc.subject Software en_US
dc.subject Feature Generation en_US
dc.subject Feature Analysis en_US
dc.subject Data Integration en_US
dc.subject Microbial Gene Prediction en_US
dc.subject Microbial Metabolic Modeling en_US
dc.title A Toolbox of Machine Learning Software to Support Microbiome Analysis en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.id Duman, Hatice/0000-0002-4526-6609
gdc.author.id B. Lopes, Marta/0000-0002-4135-1857
gdc.author.id Kalluci, Eglantina/0009-0001-9039-1310
gdc.author.id Simeon, Andrea/0000-0002-7096-7415
gdc.author.id Aasmets, Oliver/0009-0001-9872-6031
gdc.author.id Lahti, Leo/0000-0001-5537-637X
gdc.author.id May, Patrick/0000-0001-8698-3770
gdc.author.scopusid 55858205000
gdc.author.scopusid 58511676300
gdc.author.scopusid 25932029800
gdc.author.scopusid 6602134282
gdc.author.scopusid 55601026300
gdc.author.scopusid 57202114131
gdc.author.scopusid 24478354200
gdc.author.wosid Anagnostopoulos, Ioannis/Aal-6997-2020
gdc.author.wosid Gundogdu, Aycan/F-6465-2018
gdc.author.wosid B. Lopes, Marta/F-5378-2011
gdc.author.wosid De Santa Pau, Enrique/Aav-7544-2021
gdc.author.wosid Havulinna, Aki/Aam-4769-2021
gdc.author.wosid Karav, Sercan/T-8649-2018
gdc.author.wosid May, Patrick/N-1019-2019
gdc.bip.impulseclass C4
gdc.bip.influenceclass C5
gdc.bip.popularityclass C4
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department Abdullah Gül University en_US
gdc.description.departmenttemp [Marcos-Zambrano, Laura Judith; Lopez-Molina, Victor Manuel; Lacruz-Pleguezuelos, Blanca; Carrillo de Santa Pau, Enrique] IMDEA Food Inst, Precis Nutr & Canc Res Program, Computat Biol Grp, Madrid, Spain; [Bakir-Gungor, Burcu] Abdullah Gul Univ, Dept Comp Engn, Kayseri, Turkiye; [Frohme, Marcus; Nechyporenko, Alina; Lode, Daniel] Tech Univ Appl Sci Wildau, Div Mol Biotechnol & Funct Genom, Wildau, Germany; [Karaduzovic-Hadziabdic, Kanita] Int Univ Sarajevo, Fac Engn & Nat Sci, Sarajevo, Bosnia & Herceg; [Klammsteiner, Thomas] Univ Innsbruck, Dept Microbiol, Innsbruck, Austria; [Klammsteiner, Thomas] Univ Innsbruck, Dept Ecol, Innsbruck, Austria; [Ibrahimi, Eliana] Univ Tirana, Dept Biol, Tirana, Albania; [Lahti, Leo] Univ Turku, Dept Comp, Turku, Finland; [Loncar-Turukalo, Tatjana] Univ Novi Sad, Fac Tech Sci, Novi Sad, Serbia; [Dhamo, Xhilda; Kalluci, Eglantina] Univ Tirana, Fac Nat Sci, Dept Appl Math, Tirana, Albania; [Simeon, Andrea] Univ Novi Sad, BioSense Inst, Novi Sad, Serbia; [Nechyporenko, Alina] Kharkiv Natl Univ Radioelect, Dept Syst Engn, Kharkiv, Ukraine; [Pio, Gianvito; Ceci, Michelangelo] Univ Bari Aldo Moro, Dept Comp Sci, Bari, Italy; [Pio, Gianvito; Ceci, Michelangelo] Natl Interuniv Consortium Informat, Big Data Lab, Rome, Italy; [Przymus, Piotr] Nicolaus Copernicus Univ, Fac Math & Comp Sci, Torun, Poland; [Sampri, Alexia] Univ Cambridge, Victor Phillip Dahdaleh Heart & Lung Res Inst, Cambridge, England; [Trajkovik, Vladimir] Ss Cyril & Methodius Univ, Fac Comp Sci & Engn, Skopje, North Macedonia; [Aasmets, Oliver] Univ Tartu, Inst Genom, Estonian Genome Ctr, Tartu, Estonia; [Aasmets, Oliver] Univ Tartu, Inst Mol & Cell Biol, Dept Biotechnol, Tartu, Estonia; [Araujo, Ricardo] Univ Porto, Inst Invest & Inovacao Saude I3S, Nephrol & Infect Dis R&D Grp, Porto, Portugal; [Araujo, Ricardo] Univ Porto, INEB Inst Engn Biomed, Porto, Portugal; [Anagnostopoulos, Ioannis] Univ Piraeus, Dept Informat, Piraeus, Greece; [Anagnostopoulos, Ioannis] Univ Thessaly, Comp Sci & Biomed Informat Dept, Lamia, Greece; [Aydemir, Onder] Karadeniz Tech Univ, Dept Elect & Elect Engn, Trabzon, Turkiye; [Berland, Magali] Univ Paris Saclay, INRAE, MetaGenoPolis, Jouy En Josas, France; [Calle, M. Luz; Pujolassos, Meritxell] Cent Univ Catalonia, Univ Vic, Fac Sci Technol & Engn, Vic, Barcelona, Spain; [Calle, M. Luz] Fundacio Inst Recerca & Innovacio Ciencies Vida &, IRIS CC, Vic, Barcelona, Spain; [Duman, Hatice] Canakkale Onsekiz Mart Univ, Dept Mol Biol & Genet, Canakkale, Turkiye; [Gundogdu, Aycan] Erciyes Univ, Fac Med, Dept Microbiol & Clin Microbiol, Kayseri, Turkiye; [Gundogdu, Aycan] Erciyes Univ, Genome & Stem Cell Ctr GenKok, Metagen Lab, Kayseri, Turkiye; [Havulinna, Aki S.] Finnish Inst Hlth & Welf THL, Helsinki, Finland; [Havulinna, Aki S.] FIMM HiLIFE, Inst Mol Med Finland, Helsinki, Finland; [Kaka Bra, Kardokh Hama Najib; Truu, Jaak] Univ Tartu, Inst Mol & Cell Biol, Tartu, Estonia; [Karav, Sercan] Canakkale Onsekiz Mart Univ, Dept Mol Biol & Genet, Canakkale, Turkiye; [Lopes, Marta B.] NOVA Sch Sci & Technol, Ctr Math & Applicat NOVA Math, Dept Math, Caparica, Portugal; [Lopes, Marta B.] NOVA Sch Sci & Technol, Dept Mech & Ind Engn, UNIDEMI, Caparica, Portugal; [May, Patrick] Univ Luxembourg, Luxembourg Ctr Syst Biomed, Bioinformat Core, Esch Sur Alzette, Luxembourg; [Nap, Bram; Thiele, Ines] Univ Galway, Sch Med, Galway, Ireland; [Nedyalkova, Miroslava] Univ Sofia, Fac Chem & Pharm, Dept Inorgan Chem, Sofia, Bulgaria; [Paciencia, Ines] Univ Oulu, Res Unit Populat Hlth, Ctr Environm & Resp Hlth Res CERH, Oulu, Finland; [Paciencia, Ines] Univ Oulu, Bioctr Oulu, Oulu, Finland; [Pasic, Lejla] Univ Sarajevo, Sch Sci & Technol, Sarajevo Med Sch, Sarajevo, Bosnia & Herceg; [Shigdel, Rajesh] Univ Bergen, Dept Clin Sci, Bergen, Norway; [Susin, Antonio] UPC Barcelona Tech, Math Dept, Barcelona, Spain; [Thiele, Ines; Claesson, Marcus Joakim] Univ Coll Cork, APC Microbiome Ireland, Cork, Ireland; [Truica, Ciprian-Octavian] Natl Univ Sci & Technol Politehn, Fac Automat Control & Comp, Comp Sci & Engn Dept, Bucharest, Romania; [Wilmes, Paul] Luxembourg Ctr Syst Biomed, Syst Ecol Grp, Esch Sur Alzette, Luxembourg; [Wilmes, Paul] Univ Luxembourg, Fac Sci Technol & Med, Dept Life Sci & Med, Belvaux, Luxembourg; [Yilmaz, Ercument] Karadeniz Tech Univ, Dept Comp Technol, Trabzon, Turkiye; [Yousef, Malik] Zefat Acad Coll, Dept Informat Syst, Safed, Israel; [Yousef, Malik] Zefat Acad Coll, Galilee Digital Hlth Res Ctr GDH, Safed, Israel; [Claesson, Marcus Joakim] Univ Coll Cork, Sch Microbiol, Cork, Ireland en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q1
gdc.description.volume 14 en_US
gdc.description.woscitationindex Science Citation Index Expanded
gdc.description.wosquality Q1
gdc.identifier.openalex W4388946655
gdc.identifier.pmid 38075858
gdc.identifier.wos WOS:001116447600001
gdc.index.type WoS
gdc.index.type Scopus
gdc.index.type PubMed
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.downloads 51
gdc.oaire.impulse 22.0
gdc.oaire.influence 2.9821714E-9
gdc.oaire.isgreen true
gdc.oaire.keywords 570
gdc.oaire.keywords ddc:006
gdc.oaire.keywords Feature generation
gdc.oaire.keywords Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial
gdc.oaire.keywords 610
gdc.oaire.keywords microbiome
gdc.oaire.keywords Microbiology
gdc.oaire.keywords Microbial metabolic modeling
gdc.oaire.keywords microbial metabolic modeling
gdc.oaire.keywords Machine learning
gdc.oaire.keywords Aprenentatge automàtic
gdc.oaire.keywords microbial gene prediction
gdc.oaire.keywords data integration
gdc.oaire.keywords Feature analysis
gdc.oaire.keywords 11832 Microbiology and virology
gdc.oaire.keywords feature analysis
gdc.oaire.keywords software
gdc.oaire.keywords feature generation
gdc.oaire.keywords Microbiologia mèdica
gdc.oaire.keywords 006
gdc.oaire.keywords Medical microbiology
gdc.oaire.keywords QR1-502
gdc.oaire.keywords 004
gdc.oaire.keywords machine learning
gdc.oaire.keywords Microbial gene prediction
gdc.oaire.keywords feature analysi
gdc.oaire.keywords Data integration
gdc.oaire.keywords ddc:570
gdc.oaire.keywords Microbiome
gdc.oaire.keywords Software
gdc.oaire.popularity 1.7757886E-8
gdc.oaire.publicfunded true
gdc.oaire.views 138
gdc.openalex.collaboration International
gdc.openalex.fwci 5.0793
gdc.openalex.normalizedpercentile 0.96
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 16
gdc.plumx.crossrefcites 3
gdc.plumx.facebookshareslikecount 61
gdc.plumx.mendeley 80
gdc.plumx.pubmedcites 14
gdc.plumx.scopuscites 27
gdc.scopus.citedcount 27
gdc.virtual.author Güngör, Burcu
gdc.wos.citedcount 19
relation.isAuthorOfPublication e17be1f8-1c9a-45f2-bf0d-f8b348d2dba0
relation.isAuthorOfPublication.latestForDiscovery e17be1f8-1c9a-45f2-bf0d-f8b348d2dba0
relation.isOrgUnitOfPublication 665d3039-05f8-4a25-9a3c-b9550bffecef
relation.isOrgUnitOfPublication 52f507ab-f278-4a1f-824c-44da2a86bd51
relation.isOrgUnitOfPublication ef13a800-4c99-4124-81e0-3e25b33c0c2b
relation.isOrgUnitOfPublication.latestForDiscovery 665d3039-05f8-4a25-9a3c-b9550bffecef

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
fmicb-14-1250806.pdf
Size:
2.04 MB
Format:
Adobe Portable Document Format
Description:
Makale Dosyası

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: