Identifying Taxonomic Biomarkers of Colorectal Cancer in Human Intestinal Microbiota Using Multiple Feature Selection Methods

dc.contributor.author Jabeer, Amhar
dc.contributor.author Kocak, Aysegul
dc.contributor.author Akkas, Huseyin
dc.contributor.author Yenisert, Ferhan
dc.contributor.author Nalbantoglu, Ozkan Ufuk
dc.contributor.author Yousef, Malik
dc.contributor.author Bakir Gungor, Burcu
dc.contributor.authorID 0000-0002-6367-7823 en_US
dc.contributor.authorID 0000-0002-2272-6270 en_US
dc.contributor.department AGÜ, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü en_US
dc.contributor.institutionauthor Jabeer, Amhar
dc.contributor.institutionauthor Kocak, Aysegul
dc.contributor.institutionauthor Akkas, Huseyin
dc.contributor.institutionauthor Yenisert, Ferhan
dc.contributor.institutionauthor Bakir Gungor, Burcu
dc.date.accessioned 2024-05-22T12:28:43Z
dc.date.available 2024-05-22T12:28:43Z
dc.date.issued 2022 en_US
dc.description.abstract A variety of bacterial species called gut microbiota work together to maintain a steady intestinal environment. The gastrointestinal tract contains tremendous amount of different species including archaea, bacteria, fungi, and viruses. While these organisms are crucial immune system stabilizers, the dysbiosis of the intestinal flora has been related to gastrointestinal disorders including Colorectal cancer (CRC), intestinal cancer, irritable bowel syndrome and inflammatory bowel disease. In the last decade, next-generation sequencing (NGS) methods have accelerated the identification of human gut flora. CRC is a deathly condition that has been on the rise in the last century, affecting half a million people each year. Since early CRC diagnosis is critical for an effective treatment, there is an immediate requirement for a classification system that can expedite CRC diagnosis. In this study, via analyzing the available metagenomics data on CRC, we aim to facilitate the CRC diagnosis via finding biomarkers linked with CRC, and via building a classification model. We have obtained the metagenomic sequencing data of the healthy individuals and CRC patients from a metagenome-wide association analysis and we have classified this data according to the disease stages. Conditional Mutual Information Maximization (CMIM), Fast Correlation Based Filter (FCBF), Extreme Gradient Boosting (XGBoost), min redundancy max relevance (mRMR), Information Gain (IG) and Select K Best (SKB) feature selection algorithms were utilized to cope with the complexity of the features. We observed that the SKB, IG, and XGBoost techniques made significant contributions to decrease the microbiota in use for CRC diagnosis, thereby reducing cost and time. We realized that our Random Forest classifier outperformed Adaboost, Support Vector Machine, Decision Tree, Logitboost and stacking ensemble classifiers in terms of CRC classification performance. Our results reiterated some known and some potential microbiome associated mechanisms in CRC, which could aid the design of new diagnostics based on the microbiome. en_US
dc.identifier.endpage 6 en_US
dc.identifier.isbn 978-166548894-5
dc.identifier.startpage 1 en_US
dc.identifier.uri https://doi.org/10.1109/ASYU56188.2022.9925551
dc.identifier.uri https://hdl.handle.net/20.500.12573/2140
dc.language.iso eng en_US
dc.publisher Institute of Electrical and Electronics Engineers Inc. en_US
dc.relation.isversionof 10.1109/ASYU56188.2022.9925551 en_US
dc.relation.journal Proceedings - 2022 Innovations in Intelligent Systems and Applications Conference, ASYU 2022 en_US
dc.relation.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Feature selection en_US
dc.subject Metagenomics en_US
dc.subject Human gut microbiome en_US
dc.subject Classification en_US
dc.subject Biomarker discovery en_US
dc.title Identifying Taxonomic Biomarkers of Colorectal Cancer in Human Intestinal Microbiota Using Multiple Feature Selection Methods en_US
dc.type conferenceObject en_US

Files

Original bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
Identifying_Taxonomic_Biomarkers_of_Colorectal_Cancer_in_Human_Intestinal_Microbiota_Using_Multiple_Feature_Selection_Methods.pdf
Size:
4.09 MB
Format:
Adobe Portable Document Format
Description:
Konferans Ögesi

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: