Inflammatory bowel disease biomarkers of human gut microbiota selected via different feature selection methods

dc.contributor.author Bakır Güngör, Burcu
dc.contributor.author Hacılar, Hilal
dc.contributor.author Jabeer, Amhar
dc.contributor.author Nalbantoğlu, Özkan Ufuk
dc.contributor.author Aran, Oya
dc.contributor.author Yousef, Malik
dc.contributor.department AGÜ, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü en_US
dc.contributor.institutionauthor Bakır Güngör, Burcu
dc.date.accessioned 2022-07-01T08:44:22Z
dc.date.available 2022-07-01T08:44:22Z
dc.date.issued 2022 en_US
dc.description.abstract The tremendous boost in next generation sequencing and in the “omics” technologies makes it possible to characterize the human gut microbiome—the collective genomes of the microbial community that reside in our gastrointestinal tract. Although some of these microorganisms are considered to be essential regulators of our immune system, the alteration of the complexity and eubiotic state of microbiota might promote autoimmune and inflammatory disorders such as diabetes, rheumatoid arthritis, Inflammatory bowel diseases (IBD), obesity, and carcinogenesis. IBD, comprising Crohn’s disease and ulcerative colitis, is a gut-related, multifactorial disease with an unknown etiology. IBD presents defects in the detection and control of the gut microbiota, associated with unbalanced immune reactions, genetic mutations that confer susceptibility to the disease, and complex environmental conditions such as westernized lifestyle. Although some existing studies attempt to unveil the composition and functional capacity of the gut microbiome in relation to IBD diseases, a comprehensive picture of the gut microbiome in IBD patients is far from being complete. Due to the complexity of metagenomic studies, the applications of the state-of-the-art machine learning techniques became popular to address a wide range of questions in the field of metagenomic data analysis. In this regard, using IBD associated metagenomics dataset, this study utilizes both supervised and unsupervised machine learning algorithms, (i) to generate a classification model that aids IBD diagnosis, (ii) to discover IBD-associated biomarkers, (iii) to discover subgroups of IBD patients using k-means and hierarchical clustering approaches. To deal with the high dimensionality of features, we applied robust feature selection algorithms such as Conditional Mutual Information Maximization (CMIM), Fast Correlation Based Filter (FCBF), min redundancy max relevance (mRMR), Select K Best (SKB), Information Gain (IG) and Extreme Gradient Boosting (XGBoost). In our experiments with 100-fold Monte Carlo cross-validation (MCCV), XGBoost, IG, and SKB methods showed a considerable effect in terms of minimizing the microbiota used for the diagnosis of IBD and thus reducing the cost and time. We observed that compared to Decision Tree, Support Vector Machine, Logitboost, Adaboost, and stacking ensemble classifiers, our Random Forest classifier resulted in better performance measures for the classification of IBD. Our findings revealed potential microbiome-mediated mechanisms of IBD and these findings might be useful for the development of microbiome-based diagnostics. en_US
dc.description.sponsorship Abdullah Gul University Support Foundation (AGUV) en_US
dc.identifier.endpage 38 en_US
dc.identifier.issn 2167-8359
dc.identifier.other WOS:000792143800005
dc.identifier.startpage 1 en_US
dc.identifier.uri https://doi.org/10.7717/peerj.13205
dc.identifier.uri https://hdl.handle.net/20.500.12573/1308
dc.identifier.volume 10 en_US
dc.language.iso eng en_US
dc.publisher PEERJ INC en_US
dc.relation.isversionof 10.7717/peerj.13205 en_US
dc.relation.journal PeerJ en_US
dc.relation.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Feature selection en_US
dc.subject Human gut microbiome en_US
dc.subject Biomarker discovery en_US
dc.subject Classification en_US
dc.subject Metagenomics en_US
dc.title Inflammatory bowel disease biomarkers of human gut microbiota selected via different feature selection methods en_US
dc.type article en_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Inflammatory bowel disease biomarkers of human gut microbiota selected via different feature selection methods.pdf
Size:
2.85 MB
Format:
Adobe Portable Document Format
Description:
Makale Dosyası

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: