A Toolbox of Machine Learning Software to Support Microbiome Analysis
| dc.contributor.author | Marcos-Zambrano, Laura Judith | |
| dc.contributor.author | Lopez-Molina, Victor Manuel | |
| dc.contributor.author | Bakir-Gungor, Burcu | |
| dc.contributor.author | Frohme, Marcus | |
| dc.contributor.author | Karaduzovic-Hadziabdic, Kanita | |
| dc.contributor.author | Klammsteiner, Thomas | |
| dc.contributor.author | Carrillo de Santa Pau, Enrique | |
| dc.date.accessioned | 2025-09-25T10:39:44Z | |
| dc.date.available | 2025-09-25T10:39:44Z | |
| dc.date.issued | 2023 | |
| dc.description | Duman, Hatice/0000-0002-4526-6609; B. Lopes, Marta/0000-0002-4135-1857; Kalluci, Eglantina/0009-0001-9039-1310; Simeon, Andrea/0000-0002-7096-7415; Aasmets, Oliver/0009-0001-9872-6031; Lahti, Leo/0000-0001-5537-637X; Dhamo, Xhilda/0000-0002-5157-7075; Araujo, Ricardo/0000-0001-7382-4834; Yilmaz, Ercument/0000-0002-3712-7086; Ibrahimi, Eliana/0000-0003-0956-215X; Lopez Molina, Victor Manuel/0009-0003-8126-8550; Karav, Sercan/0000-0003-4056-1673; Carrillo De Santa Pau, Enrique/0000-0002-2310-2267; Nap, Bram/0000-0003-2910-9109; Pujolassos, Meritxell/0000-0003-0313-3506; May, Patrick/0000-0001-8698-3770 | en_US |
| dc.description.abstract | The human microbiome has become an area of intense research due to its potential impact on human health. However, the analysis and interpretation of this data have proven to be challenging due to its complexity and high dimensionality. Machine learning (ML) algorithms can process vast amounts of data to uncover informative patterns and relationships within the data, even with limited prior knowledge. Therefore, there has been a rapid growth in the development of software specifically designed for the analysis and interpretation of microbiome data using ML techniques. These software incorporate a wide range of ML algorithms for clustering, classification, regression, or feature selection, to identify microbial patterns and relationships within the data and generate predictive models. This rapid development with a constant need for new developments and integration of new features require efforts into compile, catalog and classify these tools to create infrastructures and services with easy, transparent, and trustable standards. Here we review the state-of-the-art for ML tools applied in human microbiome studies, performed as part of the COST Action ML4Microbiome activities. This scoping review focuses on ML based software and framework resources currently available for the analysis of microbiome data in humans. The aim is to support microbiologists and biomedical scientists to go deeper into specialized resources that integrate ML techniques and facilitate future benchmarking to create standards for the analysis of microbiome data. The software resources are organized based on the type of analysis they were developed for and the ML techniques they implement. A description of each software with examples of usage is provided including comments about pitfalls and lacks in the usage of software based on ML methods in relation to microbiome data that need to be considered by developers and users. This review represents an extensive compilation to date, offering valuable insights and guidance for researchers interested in leveraging ML approaches for microbiome analysis. | en_US |
| dc.description.sponsorship | COST Action [CA18131]; COST (European Cooperation in Science and Technology) | en_US |
| dc.description.sponsorship | This article is based upon work from COST Action ML4Microbiome "Statistical and machine learning techniques in human microbiome studies," CA18131, supported by COST (European Cooperation in Science and Technology), www.cost.eu. | en_US |
| dc.identifier.doi | 10.3389/fmicb.2023.1250806 | |
| dc.identifier.issn | 1664-302X | |
| dc.identifier.scopus | 2-s2.0-85178948402 | |
| dc.identifier.uri | https://doi.org/10.3389/fmicb.2023.1250806 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.12573/3169 | |
| dc.language.iso | en | en_US |
| dc.publisher | Frontiers Media S.A. | en_US |
| dc.relation.ispartof | Frontiers in Microbiology | en_US |
| dc.rights | info:eu-repo/semantics/openAccess | en_US |
| dc.subject | Microbiome | en_US |
| dc.subject | Machine Learning | en_US |
| dc.subject | Software | en_US |
| dc.subject | Feature Generation | en_US |
| dc.subject | Feature Analysis | en_US |
| dc.subject | Data Integration | en_US |
| dc.subject | Microbial Gene Prediction | en_US |
| dc.subject | Microbial Metabolic Modeling | en_US |
| dc.title | A Toolbox of Machine Learning Software to Support Microbiome Analysis | en_US |
| dc.type | Article | en_US |
| dspace.entity.type | Publication | |
| gdc.author.id | Duman, Hatice/0000-0002-4526-6609 | |
| gdc.author.id | B. Lopes, Marta/0000-0002-4135-1857 | |
| gdc.author.id | Kalluci, Eglantina/0009-0001-9039-1310 | |
| gdc.author.id | Simeon, Andrea/0000-0002-7096-7415 | |
| gdc.author.id | Aasmets, Oliver/0009-0001-9872-6031 | |
| gdc.author.id | Lahti, Leo/0000-0001-5537-637X | |
| gdc.author.id | May, Patrick/0000-0001-8698-3770 | |
| gdc.author.scopusid | 55858205000 | |
| gdc.author.scopusid | 58511676300 | |
| gdc.author.scopusid | 25932029800 | |
| gdc.author.scopusid | 6602134282 | |
| gdc.author.scopusid | 55601026300 | |
| gdc.author.scopusid | 57202114131 | |
| gdc.author.scopusid | 24478354200 | |
| gdc.author.wosid | Anagnostopoulos, Ioannis/Aal-6997-2020 | |
| gdc.author.wosid | Gundogdu, Aycan/F-6465-2018 | |
| gdc.author.wosid | B. Lopes, Marta/F-5378-2011 | |
| gdc.author.wosid | De Santa Pau, Enrique/Aav-7544-2021 | |
| gdc.author.wosid | Havulinna, Aki/Aam-4769-2021 | |
| gdc.author.wosid | Karav, Sercan/T-8649-2018 | |
| gdc.author.wosid | May, Patrick/N-1019-2019 | |
| gdc.bip.impulseclass | C4 | |
| gdc.bip.influenceclass | C5 | |
| gdc.bip.popularityclass | C4 | |
| gdc.coar.access | open access | |
| gdc.coar.type | text::journal::journal article | |
| gdc.collaboration.industrial | false | |
| gdc.description.department | Abdullah Gül University | en_US |
| gdc.description.departmenttemp | [Marcos-Zambrano, Laura Judith; Lopez-Molina, Victor Manuel; Lacruz-Pleguezuelos, Blanca; Carrillo de Santa Pau, Enrique] IMDEA Food Inst, Precis Nutr & Canc Res Program, Computat Biol Grp, Madrid, Spain; [Bakir-Gungor, Burcu] Abdullah Gul Univ, Dept Comp Engn, Kayseri, Turkiye; [Frohme, Marcus; Nechyporenko, Alina; Lode, Daniel] Tech Univ Appl Sci Wildau, Div Mol Biotechnol & Funct Genom, Wildau, Germany; [Karaduzovic-Hadziabdic, Kanita] Int Univ Sarajevo, Fac Engn & Nat Sci, Sarajevo, Bosnia & Herceg; [Klammsteiner, Thomas] Univ Innsbruck, Dept Microbiol, Innsbruck, Austria; [Klammsteiner, Thomas] Univ Innsbruck, Dept Ecol, Innsbruck, Austria; [Ibrahimi, Eliana] Univ Tirana, Dept Biol, Tirana, Albania; [Lahti, Leo] Univ Turku, Dept Comp, Turku, Finland; [Loncar-Turukalo, Tatjana] Univ Novi Sad, Fac Tech Sci, Novi Sad, Serbia; [Dhamo, Xhilda; Kalluci, Eglantina] Univ Tirana, Fac Nat Sci, Dept Appl Math, Tirana, Albania; [Simeon, Andrea] Univ Novi Sad, BioSense Inst, Novi Sad, Serbia; [Nechyporenko, Alina] Kharkiv Natl Univ Radioelect, Dept Syst Engn, Kharkiv, Ukraine; [Pio, Gianvito; Ceci, Michelangelo] Univ Bari Aldo Moro, Dept Comp Sci, Bari, Italy; [Pio, Gianvito; Ceci, Michelangelo] Natl Interuniv Consortium Informat, Big Data Lab, Rome, Italy; [Przymus, Piotr] Nicolaus Copernicus Univ, Fac Math & Comp Sci, Torun, Poland; [Sampri, Alexia] Univ Cambridge, Victor Phillip Dahdaleh Heart & Lung Res Inst, Cambridge, England; [Trajkovik, Vladimir] Ss Cyril & Methodius Univ, Fac Comp Sci & Engn, Skopje, North Macedonia; [Aasmets, Oliver] Univ Tartu, Inst Genom, Estonian Genome Ctr, Tartu, Estonia; [Aasmets, Oliver] Univ Tartu, Inst Mol & Cell Biol, Dept Biotechnol, Tartu, Estonia; [Araujo, Ricardo] Univ Porto, Inst Invest & Inovacao Saude I3S, Nephrol & Infect Dis R&D Grp, Porto, Portugal; [Araujo, Ricardo] Univ Porto, INEB Inst Engn Biomed, Porto, Portugal; [Anagnostopoulos, Ioannis] Univ Piraeus, Dept Informat, Piraeus, Greece; [Anagnostopoulos, Ioannis] Univ Thessaly, Comp Sci & Biomed Informat Dept, Lamia, Greece; [Aydemir, Onder] Karadeniz Tech Univ, Dept Elect & Elect Engn, Trabzon, Turkiye; [Berland, Magali] Univ Paris Saclay, INRAE, MetaGenoPolis, Jouy En Josas, France; [Calle, M. Luz; Pujolassos, Meritxell] Cent Univ Catalonia, Univ Vic, Fac Sci Technol & Engn, Vic, Barcelona, Spain; [Calle, M. Luz] Fundacio Inst Recerca & Innovacio Ciencies Vida &, IRIS CC, Vic, Barcelona, Spain; [Duman, Hatice] Canakkale Onsekiz Mart Univ, Dept Mol Biol & Genet, Canakkale, Turkiye; [Gundogdu, Aycan] Erciyes Univ, Fac Med, Dept Microbiol & Clin Microbiol, Kayseri, Turkiye; [Gundogdu, Aycan] Erciyes Univ, Genome & Stem Cell Ctr GenKok, Metagen Lab, Kayseri, Turkiye; [Havulinna, Aki S.] Finnish Inst Hlth & Welf THL, Helsinki, Finland; [Havulinna, Aki S.] FIMM HiLIFE, Inst Mol Med Finland, Helsinki, Finland; [Kaka Bra, Kardokh Hama Najib; Truu, Jaak] Univ Tartu, Inst Mol & Cell Biol, Tartu, Estonia; [Karav, Sercan] Canakkale Onsekiz Mart Univ, Dept Mol Biol & Genet, Canakkale, Turkiye; [Lopes, Marta B.] NOVA Sch Sci & Technol, Ctr Math & Applicat NOVA Math, Dept Math, Caparica, Portugal; [Lopes, Marta B.] NOVA Sch Sci & Technol, Dept Mech & Ind Engn, UNIDEMI, Caparica, Portugal; [May, Patrick] Univ Luxembourg, Luxembourg Ctr Syst Biomed, Bioinformat Core, Esch Sur Alzette, Luxembourg; [Nap, Bram; Thiele, Ines] Univ Galway, Sch Med, Galway, Ireland; [Nedyalkova, Miroslava] Univ Sofia, Fac Chem & Pharm, Dept Inorgan Chem, Sofia, Bulgaria; [Paciencia, Ines] Univ Oulu, Res Unit Populat Hlth, Ctr Environm & Resp Hlth Res CERH, Oulu, Finland; [Paciencia, Ines] Univ Oulu, Bioctr Oulu, Oulu, Finland; [Pasic, Lejla] Univ Sarajevo, Sch Sci & Technol, Sarajevo Med Sch, Sarajevo, Bosnia & Herceg; [Shigdel, Rajesh] Univ Bergen, Dept Clin Sci, Bergen, Norway; [Susin, Antonio] UPC Barcelona Tech, Math Dept, Barcelona, Spain; [Thiele, Ines; Claesson, Marcus Joakim] Univ Coll Cork, APC Microbiome Ireland, Cork, Ireland; [Truica, Ciprian-Octavian] Natl Univ Sci & Technol Politehn, Fac Automat Control & Comp, Comp Sci & Engn Dept, Bucharest, Romania; [Wilmes, Paul] Luxembourg Ctr Syst Biomed, Syst Ecol Grp, Esch Sur Alzette, Luxembourg; [Wilmes, Paul] Univ Luxembourg, Fac Sci Technol & Med, Dept Life Sci & Med, Belvaux, Luxembourg; [Yilmaz, Ercument] Karadeniz Tech Univ, Dept Comp Technol, Trabzon, Turkiye; [Yousef, Malik] Zefat Acad Coll, Dept Informat Syst, Safed, Israel; [Yousef, Malik] Zefat Acad Coll, Galilee Digital Hlth Res Ctr GDH, Safed, Israel; [Claesson, Marcus Joakim] Univ Coll Cork, Sch Microbiol, Cork, Ireland | en_US |
| gdc.description.publicationcategory | Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı | en_US |
| gdc.description.scopusquality | Q1 | |
| gdc.description.volume | 14 | en_US |
| gdc.description.woscitationindex | Science Citation Index Expanded | |
| gdc.description.wosquality | Q1 | |
| gdc.identifier.openalex | W4388946655 | |
| gdc.identifier.pmid | 38075858 | |
| gdc.identifier.wos | WOS:001116447600001 | |
| gdc.index.type | WoS | |
| gdc.index.type | Scopus | |
| gdc.index.type | PubMed | |
| gdc.oaire.accesstype | GOLD | |
| gdc.oaire.diamondjournal | false | |
| gdc.oaire.downloads | 51 | |
| gdc.oaire.impulse | 22.0 | |
| gdc.oaire.influence | 2.9821714E-9 | |
| gdc.oaire.isgreen | true | |
| gdc.oaire.keywords | 570 | |
| gdc.oaire.keywords | ddc:006 | |
| gdc.oaire.keywords | Feature generation | |
| gdc.oaire.keywords | Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial | |
| gdc.oaire.keywords | 610 | |
| gdc.oaire.keywords | microbiome | |
| gdc.oaire.keywords | Microbiology | |
| gdc.oaire.keywords | Microbial metabolic modeling | |
| gdc.oaire.keywords | microbial metabolic modeling | |
| gdc.oaire.keywords | Machine learning | |
| gdc.oaire.keywords | Aprenentatge automàtic | |
| gdc.oaire.keywords | microbial gene prediction | |
| gdc.oaire.keywords | data integration | |
| gdc.oaire.keywords | Feature analysis | |
| gdc.oaire.keywords | 11832 Microbiology and virology | |
| gdc.oaire.keywords | feature analysis | |
| gdc.oaire.keywords | software | |
| gdc.oaire.keywords | feature generation | |
| gdc.oaire.keywords | Microbiologia mèdica | |
| gdc.oaire.keywords | 006 | |
| gdc.oaire.keywords | Medical microbiology | |
| gdc.oaire.keywords | QR1-502 | |
| gdc.oaire.keywords | 004 | |
| gdc.oaire.keywords | machine learning | |
| gdc.oaire.keywords | Microbial gene prediction | |
| gdc.oaire.keywords | feature analysi | |
| gdc.oaire.keywords | Data integration | |
| gdc.oaire.keywords | ddc:570 | |
| gdc.oaire.keywords | Microbiome | |
| gdc.oaire.keywords | Software | |
| gdc.oaire.popularity | 1.7757886E-8 | |
| gdc.oaire.publicfunded | true | |
| gdc.oaire.views | 138 | |
| gdc.openalex.collaboration | International | |
| gdc.openalex.fwci | 5.0793 | |
| gdc.openalex.normalizedpercentile | 0.96 | |
| gdc.openalex.toppercent | TOP 10% | |
| gdc.opencitations.count | 16 | |
| gdc.plumx.crossrefcites | 3 | |
| gdc.plumx.facebookshareslikecount | 61 | |
| gdc.plumx.mendeley | 80 | |
| gdc.plumx.pubmedcites | 14 | |
| gdc.plumx.scopuscites | 27 | |
| gdc.scopus.citedcount | 27 | |
| gdc.virtual.author | Güngör, Burcu | |
| gdc.wos.citedcount | 19 | |
| relation.isAuthorOfPublication | e17be1f8-1c9a-45f2-bf0d-f8b348d2dba0 | |
| relation.isAuthorOfPublication.latestForDiscovery | e17be1f8-1c9a-45f2-bf0d-f8b348d2dba0 | |
| relation.isOrgUnitOfPublication | 665d3039-05f8-4a25-9a3c-b9550bffecef | |
| relation.isOrgUnitOfPublication | 52f507ab-f278-4a1f-824c-44da2a86bd51 | |
| relation.isOrgUnitOfPublication | ef13a800-4c99-4124-81e0-3e25b33c0c2b | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 665d3039-05f8-4a25-9a3c-b9550bffecef |
