Integrating Gene Ontology Based Grouping and Ranking into the Machine Learning Algorithm for Gene Expression Data Analysis

dc.contributor.author Yousef, Malik
dc.contributor.author Sayici, Ahmet
dc.contributor.author Bakir-Gungor, Burcu
dc.contributor.department AGÜ, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü en_US
dc.contributor.institutionauthor Sayici, Ahmet
dc.contributor.institutionauthor Bakir-Gungor, Burcu
dc.date.accessioned 2022-02-03T09:05:03Z
dc.date.available 2022-02-03T09:05:03Z
dc.date.issued 2021 en_US
dc.description.abstract Recent advances in the high throughput technologies resulted in the production of large gene expression data sets for several phenotypes. Via comparing the gene expression levels under different conditions, such as disease vs. control, treated vs. not treated, drug A vs. drug B, etc., one could identify biomarkers. As opposed to traditional gene selection approaches, integrative gene selection approaches incorporate domain knowledge from external biological resources during gene selection, which improves interpretability and predictive performance. In this respect, Gene Ontology provides cellular component, molecular function and biological process terms for the products of each gene. In this study, we present Gene Ontology based feature selection approach for gene expression data analysis. In our approach, we used the ontology information as grouping (term) information and embedded this information into a machine learning algorithm for selecting the most significant groups (terms) of ontology. Those groups are used to build the machine learning model in order to perform the classification task. The output of the tool is a significant ontology group for the task of 2-class classification applied on the gene expression data. This knowledge allows the researcher to perform more advanced gene expression analyses. We tested our approach on 8 different gene expression datasets. In our experiments, we observed that the tool successfully found the significant Ontology terms that would be used as a classification model. We believe that our tool will help the geneticists to identify affected genes in transcriptomic data and this information could enable the design of platforms to assist diagnosis, to assess patients' prognoses, and to create patient treatment plans. en_US
dc.description.sponsorship SponsorsSoftware Competence Ctr Hagenberg; JKU Inst Telecooperat; iiwas en_US
dc.identifier.issn 1865-0929
dc.identifier.issn 1865-0937
dc.identifier.uri https //doi.org/10.1007/978-3-030-87101-7_20
dc.identifier.uri https://hdl.handle.net/20.500.12573/1116
dc.identifier.volume Volume 1479 Page 205-214 en_US
dc.language.iso eng en_US
dc.publisher SPRINGER INTERNATIONAL PUBLISHING AGGEWERBESTRASSE 11, CHAM, CH-6330, SWITZERLAND en_US
dc.relation.isversionof 10.1007/978-3-030-87101-7_20 en_US
dc.relation.journal DATABASE AND EXPERT SYSTEMS APPLICATIONS - DEXA 2021 WORKSHOPS en_US
dc.relation.publicationcategory Kitap Bölümü - Uluslararası en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.title Integrating Gene Ontology Based Grouping and Ranking into the Machine Learning Algorithm for Gene Expression Data Analysis en_US
dc.title.alternative Communications in Computer and Information Science en_US
dc.type bookPart en_US

Files

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: