GeNetOntology: Identifying Affected Gene Ontology Terms via Grouping, Scoring, and Modeling of Gene Expression Data Utilizing Biological Knowledge-Based Machine Learning

dc.contributor.author Ersoz, Nur Sebnem
dc.contributor.author Bakir-Gungor, Burcu
dc.contributor.author Yousef, Malik
dc.date.accessioned 2025-09-25T10:47:42Z
dc.date.available 2025-09-25T10:47:42Z
dc.date.issued 2023
dc.description.abstract Introduction: Identifying significant sets of genes that are up/downregulated under specific conditions is vital to understand disease development mechanisms at the molecular level. Along this line, in order to analyze transcriptomic data, several computational feature selection (i.e., gene selection) methods have been proposed. On the other hand, uncovering the core functions of the selected genes provides a deep understanding of diseases. In order to address this problem, biological domain knowledge-based feature selection methods have been proposed. Unlike computational gene selection approaches, these domain knowledge-based methods take the underlying biology into account and integrate knowledge from external biological resources. Gene Ontology (GO) is one such biological resource that provides ontology terms for defining the molecular function, cellular component, and biological process of the gene product.Methods: In this study, we developed a tool named GeNetOntology which performs GO-based feature selection for gene expression data analysis. In the proposed approach, the process of Grouping, Scoring, and Modeling (G-S-M) is used to identify significant GO terms. GO information has been used as the grouping information, which has been embedded into a machine learning (ML) algorithm to select informative ontology terms. The genes annotated with the selected ontology terms have been used in the training part to carry out the classification task of the ML model. The output is an important set of ontologies for the two-class classification task applied to gene expression data for a given phenotype.Results: Our approach has been tested on 11 different gene expression datasets, and the results showed that GeNetOntology successfully identified important disease-related ontology terms to be used in the classification model.Discussion: GeNetOntology will assist geneticists and scientists to identify a range of disease-related genes and ontologies in transcriptomic data analysis, and it will also help doctors design diagnosis platforms and improve patient treatment plans. en_US
dc.description.sponsorship The work of MY has been supported by the Zefat Academic College. The work of BB-G has been supported by the Abdullah Gul University Support Foundation (AGUV).; Zefat Academic College; Abdullah Gul University Support Foundation (AGUV) en_US
dc.description.sponsorship The work of MY has been supported by the Zefat Academic College. The work of BB-G has been supported by the Abdullah Gul University Support Foundation (AGUV). en_US
dc.description.sponsorship Abdullah Gul University Support Foundation; Zefat Academic College
dc.identifier.doi 10.3389/fgene.2023.1139082
dc.identifier.issn 1664-8021
dc.identifier.scopus 2-s2.0-85169677317
dc.identifier.uri https://doi.org/10.3389/fgene.2023.1139082
dc.identifier.uri https://hdl.handle.net/20.500.12573/3887
dc.language.iso en en_US
dc.publisher Frontiers Media S.A. en_US
dc.relation.ispartof Frontiers in Genetics en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Gene Ontology en_US
dc.subject Gene Expression Data Analysis en_US
dc.subject Machine Learning en_US
dc.subject Feature Selection en_US
dc.subject Enrichment Analysis en_US
dc.subject Feature Scoring en_US
dc.subject Feature Grouping en_US
dc.subject Classification en_US
dc.title GeNetOntology: Identifying Affected Gene Ontology Terms via Grouping, Scoring, and Modeling of Gene Expression Data Utilizing Biological Knowledge-Based Machine Learning en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.scopusid 57423006700
gdc.author.scopusid 25932029800
gdc.author.scopusid 14029389000
gdc.bip.impulseclass C4
gdc.bip.influenceclass C5
gdc.bip.popularityclass C4
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department Abdullah Gül University en_US
gdc.description.departmenttemp [Ersoz, Nur Sebnem] Abdullah Gul Univ, Grad Sch Engn & Sci, Dept Bioengn, Kayseri, Turkiye; [Bakir-Gungor, Burcu] Abdullah Gul Univ, Fac Engn, Dept Comp Engn, Kayseri, Turkiye; [Bakir-Gungor, Burcu] Abdullah Gul Univ, Fac Life & Nat Sci, Dept Bioengn, Kayseri, Turkiye; [Yousef, Malik] Zefat Acad Coll, Dept Informat Syst, Safed, Israel; [Yousef, Malik] Zefat Acad Coll, Galilee Digital Hlth Res Ctr GDH, Safed, Israel en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q2
gdc.description.volume 14 en_US
gdc.description.woscitationindex Science Citation Index Expanded
gdc.description.wosquality Q2
gdc.identifier.openalex W4386022877
gdc.identifier.pmid 37671046
gdc.identifier.wos WOS:001057833100001
gdc.index.type WoS
gdc.index.type Scopus
gdc.index.type PubMed
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.downloads 100
gdc.oaire.impulse 16.0
gdc.oaire.influence 3.0574898E-9
gdc.oaire.isgreen true
gdc.oaire.keywords machine learning
gdc.oaire.keywords feature selection
gdc.oaire.keywords feature scoring
gdc.oaire.keywords classification
gdc.oaire.keywords Genetics
gdc.oaire.keywords gene ontology
gdc.oaire.keywords feature grouping
gdc.oaire.keywords QH426-470
gdc.oaire.keywords enrichment analysis
gdc.oaire.keywords gene expression data analysis
gdc.oaire.popularity 1.4181255E-8
gdc.oaire.publicfunded false
gdc.oaire.views 162
gdc.openalex.collaboration International
gdc.openalex.fwci 2.8629
gdc.openalex.normalizedpercentile 0.91
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 13
gdc.plumx.mendeley 23
gdc.plumx.newscount 1
gdc.plumx.pubmedcites 6
gdc.plumx.scopuscites 21
gdc.scopus.citedcount 21
gdc.virtual.author Güngör, Burcu
gdc.wos.citedcount 16
relation.isAuthorOfPublication e17be1f8-1c9a-45f2-bf0d-f8b348d2dba0
relation.isAuthorOfPublication.latestForDiscovery e17be1f8-1c9a-45f2-bf0d-f8b348d2dba0
relation.isOrgUnitOfPublication 665d3039-05f8-4a25-9a3c-b9550bffecef
relation.isOrgUnitOfPublication 52f507ab-f278-4a1f-824c-44da2a86bd51
relation.isOrgUnitOfPublication ef13a800-4c99-4124-81e0-3e25b33c0c2b
relation.isOrgUnitOfPublication.latestForDiscovery 665d3039-05f8-4a25-9a3c-b9550bffecef

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
fgene-14-1139082.pdf
Size:
4.69 MB
Format:
Adobe Portable Document Format
Description:
Makale Dosyası

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: