The Identification of Discriminative Single Nucleotide Polymorphism Sets for the Classification of Behçet's Disease
| dc.contributor.author | Görmez, Yasin | |
| dc.contributor.author | Işik, Yunus Emre | |
| dc.contributor.author | Bakir-Güngör, Burcu | |
| dc.date.accessioned | 2025-09-25T10:59:06Z | |
| dc.date.available | 2025-09-25T10:59:06Z | |
| dc.date.issued | 2018 | |
| dc.description.abstract | Behçet's disease is a long-term multisystem inflammatory disorder, characterized by recurrent attacks affecting several organs. As the genotyping individuals get cheaper and easier following the developments in genomic technologies, genome-wide association studies (GWAS) emerged. By this means, via studying big-sized case-control groups for a specific disease, potential genetic variations, single nucleotide polymorphisms (SNPs) are identified. Although several genetic risk factors are identified for Behçet's disease with the help of these studies via scanning around a million of SNPs, these variations could only explain up to 20% of the disease's genetic risk. In this study, for Behçet's disease classification, via comparing all the SNPs genotyped in GWAS, with the SNPs selected via using genetic knowledge, gain ratio and information gain; both reduction in the feature size and improvement in the classification accuracy is aimed. Also, using different classification algorithms such as random forest, k-nearest neighbour and logistic regression, their effects on the classification accuracy are investigated. Our results showed that compared to other feature selection methods, with at least 81% success rate, the selection of the SNPs using the genetic information (of their GWAS p-values, indicating the significance of the SNP against the disease) provides 15% to 42% improvement in all classification algorithms. This improvement is statistically sound. While gain ratio and information gain feature selection techniques yield similar classification accuracies, the models using all SNPs could not exceed 50% accuracies and results in the worst performance. © 2019 Elsevier B.V., All rights reserved. | en_US |
| dc.identifier.doi | 10.1109/UBMK.2018.8566517 | |
| dc.identifier.isbn | 9781538678930 | |
| dc.identifier.scopus | 2-s2.0-85060642821 | |
| dc.identifier.uri | https://doi.org/10.1109/UBMK.2018.8566517 | |
| dc.identifier.uri | https://hdl.handle.net/20.500.12573/4803 | |
| dc.language.iso | en | en_US |
| dc.publisher | Institute of Electrical and Electronics Engineers Inc. | en_US |
| dc.relation.ispartof | -- 3rd International Conference on Computer Science and Engineering, UBMK 2018 -- Sarajevo -- 143560 | en_US |
| dc.rights | info:eu-repo/semantics/closedAccess | en_US |
| dc.subject | Behçet's Disease | en_US |
| dc.subject | Feature Selection | en_US |
| dc.subject | Genome-Wide Association Study (GWAS) | en_US |
| dc.subject | Machine Learning | en_US |
| dc.subject | Single Nucleotide Polymorphism (Snp) | en_US |
| dc.subject | Data Mining | en_US |
| dc.subject | Decision Trees | en_US |
| dc.subject | Disease Control | en_US |
| dc.subject | Feature Extraction | en_US |
| dc.subject | Genes | en_US |
| dc.subject | Learning Systems | en_US |
| dc.subject | Nearest Neighbor Search | en_US |
| dc.subject | Nucleotides | en_US |
| dc.subject | Classification Accuracy | en_US |
| dc.subject | Classification Algorithm | en_US |
| dc.subject | Disease Classification | en_US |
| dc.subject | Feature Selection Methods | en_US |
| dc.subject | Genome-Wide Association Studies | en_US |
| dc.subject | Inflammatory Disorders | en_US |
| dc.subject | Selection Techniques | en_US |
| dc.subject | Single-Nucleotide Polymorphisms | en_US |
| dc.subject | Classification (Of Information) | en_US |
| dc.title | The Identification of Discriminative Single Nucleotide Polymorphism Sets for the Classification of Behçet's Disease | en_US |
| dc.type | Conference Object | en_US |
| dspace.entity.type | Publication | |
| gdc.author.scopusid | 57195222392 | |
| gdc.author.scopusid | 57195215625 | |
| gdc.author.scopusid | 25932029800 | |
| gdc.bip.impulseclass | C5 | |
| gdc.bip.influenceclass | C5 | |
| gdc.bip.popularityclass | C5 | |
| gdc.coar.access | metadata only access | |
| gdc.coar.type | text::conference output | |
| gdc.collaboration.industrial | false | |
| gdc.description.department | Abdullah Gül University | en_US |
| gdc.description.departmenttemp | [Görmez] Yasin, Iktisadi ve Idari Bilimler Fakultesi, Cumhuriyet Üniversitesi, Sivas, Turkey; [Işik] Yunus Emre, Iktisadi ve Idari Bilimler Fakultesi, Cumhuriyet Üniversitesi, Sivas, Turkey; [Bakir-Güngör] Burcu, Bilgisayar Mühendisliǧi, Abdullah Gül Üniversitesi, Kayseri, Turkey | en_US |
| gdc.description.endpage | 447 | en_US |
| gdc.description.publicationcategory | Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı | en_US |
| gdc.description.scopusquality | N/A | |
| gdc.description.startpage | 443 | en_US |
| gdc.description.wosquality | N/A | |
| gdc.identifier.openalex | W2905098304 | |
| gdc.index.type | Scopus | |
| gdc.oaire.diamondjournal | false | |
| gdc.oaire.impulse | 0.0 | |
| gdc.oaire.influence | 2.4895952E-9 | |
| gdc.oaire.isgreen | false | |
| gdc.oaire.keywords | feature selection | |
| gdc.oaire.keywords | machine learning | |
| gdc.oaire.keywords | Behçet's disease | |
| gdc.oaire.keywords | single nucleotide polymorphism (SNP) | |
| gdc.oaire.keywords | genome-wide association study (GWAS) | |
| gdc.oaire.keywords | Behcet's disease | |
| gdc.oaire.popularity | 1.0376504E-9 | |
| gdc.oaire.publicfunded | false | |
| gdc.openalex.collaboration | National | |
| gdc.openalex.fwci | 1.6087 | |
| gdc.openalex.normalizedpercentile | 0.89 | |
| gdc.openalex.toppercent | TOP 10% | |
| gdc.opencitations.count | 1 | |
| gdc.plumx.mendeley | 3 | |
| gdc.plumx.scopuscites | 1 | |
| gdc.scopus.citedcount | 1 | |
| gdc.virtual.author | Güngör, Burcu | |
| relation.isAuthorOfPublication | e17be1f8-1c9a-45f2-bf0d-f8b348d2dba0 | |
| relation.isAuthorOfPublication.latestForDiscovery | e17be1f8-1c9a-45f2-bf0d-f8b348d2dba0 | |
| relation.isOrgUnitOfPublication | 665d3039-05f8-4a25-9a3c-b9550bffecef | |
| relation.isOrgUnitOfPublication | 52f507ab-f278-4a1f-824c-44da2a86bd51 | |
| relation.isOrgUnitOfPublication | ef13a800-4c99-4124-81e0-3e25b33c0c2b | |
| relation.isOrgUnitOfPublication.latestForDiscovery | 665d3039-05f8-4a25-9a3c-b9550bffecef |
