A Toolbox of Machine Learning Software to Support Microbiome Analysis

Loading...
Publication Logo

Date

2023

Journal Title

Journal ISSN

Volume Title

Publisher

Frontiers Media S.A.

Open Access Color

GOLD

Green Open Access

Yes

OpenAIRE Downloads

51

OpenAIRE Views

138

Publicly Funded

Yes
Impulse
Top 10%
Influence
Average
Popularity
Top 10%

Research Projects

Journal Issue

Abstract

The human microbiome has become an area of intense research due to its potential impact on human health. However, the analysis and interpretation of this data have proven to be challenging due to its complexity and high dimensionality. Machine learning (ML) algorithms can process vast amounts of data to uncover informative patterns and relationships within the data, even with limited prior knowledge. Therefore, there has been a rapid growth in the development of software specifically designed for the analysis and interpretation of microbiome data using ML techniques. These software incorporate a wide range of ML algorithms for clustering, classification, regression, or feature selection, to identify microbial patterns and relationships within the data and generate predictive models. This rapid development with a constant need for new developments and integration of new features require efforts into compile, catalog and classify these tools to create infrastructures and services with easy, transparent, and trustable standards. Here we review the state-of-the-art for ML tools applied in human microbiome studies, performed as part of the COST Action ML4Microbiome activities. This scoping review focuses on ML based software and framework resources currently available for the analysis of microbiome data in humans. The aim is to support microbiologists and biomedical scientists to go deeper into specialized resources that integrate ML techniques and facilitate future benchmarking to create standards for the analysis of microbiome data. The software resources are organized based on the type of analysis they were developed for and the ML techniques they implement. A description of each software with examples of usage is provided including comments about pitfalls and lacks in the usage of software based on ML methods in relation to microbiome data that need to be considered by developers and users. This review represents an extensive compilation to date, offering valuable insights and guidance for researchers interested in leveraging ML approaches for microbiome analysis.

Description

Duman, Hatice/0000-0002-4526-6609; B. Lopes, Marta/0000-0002-4135-1857; Kalluci, Eglantina/0009-0001-9039-1310; Simeon, Andrea/0000-0002-7096-7415; Aasmets, Oliver/0009-0001-9872-6031; Lahti, Leo/0000-0001-5537-637X; Dhamo, Xhilda/0000-0002-5157-7075; Araujo, Ricardo/0000-0001-7382-4834; Yilmaz, Ercument/0000-0002-3712-7086; Ibrahimi, Eliana/0000-0003-0956-215X; Lopez Molina, Victor Manuel/0009-0003-8126-8550; Karav, Sercan/0000-0003-4056-1673; Carrillo De Santa Pau, Enrique/0000-0002-2310-2267; Nap, Bram/0000-0003-2910-9109; Pujolassos, Meritxell/0000-0003-0313-3506; May, Patrick/0000-0001-8698-3770

Keywords

Microbiome, Machine Learning, Software, Feature Generation, Feature Analysis, Data Integration, Microbial Gene Prediction, Microbial Metabolic Modeling, 570, ddc:006, Feature generation, Àrees temàtiques de la UPC::Informàtica::Intel·ligència artificial, 610, microbiome, Microbiology, Microbial metabolic modeling, microbial metabolic modeling, Machine learning, Aprenentatge automàtic, microbial gene prediction, data integration, Feature analysis, 11832 Microbiology and virology, feature analysis, software, feature generation, Microbiologia mèdica, 006, Medical microbiology, QR1-502, 004, machine learning, Microbial gene prediction, feature analysi, Data integration, ddc:570, Microbiome, Software

Fields of Science

Citation

WoS Q

Q1

Scopus Q

Q1
OpenCitations Logo
OpenCitations Citation Count
16

Source

Frontiers in Microbiology

Volume

14

Issue

Start Page

End Page

PlumX Metrics
Citations

CrossRef : 3

Scopus : 27

PubMed : 14

Captures

Mendeley Readers : 80

SCOPUS™ Citations

27

checked on Mar 04, 2026

Web of Science™ Citations

19

checked on Mar 04, 2026

Page Views

1

checked on Mar 04, 2026

Downloads

2

checked on Mar 04, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
5.0793
Altmetrics Badge

Sustainable Development Goals

SDG data is not available