The Identification of Discriminative Single Nucleotide Polymorphism Sets for the Classification of Behcet's Disease

dc.contributor.author Gormez, Yasin
dc.contributor.author Isik, Yunus Emre
dc.contributor.author Bakir-Gungor, Burcu
dc.date.accessioned 2025-09-25T10:37:59Z
dc.date.available 2025-09-25T10:37:59Z
dc.date.issued 2018
dc.description Bakir-Gungor, Burcu/0000-0002-2272-6270 en_US
dc.description.abstract Behcet's disease is a long-term multisystem inflammatory disorder, characterized by recurrent attacks affecting several organs. As the genotyping individuals get cheaper and easier following the developments in genomic technologies, genome-wide association studies (GWAS) emerged. By this means, via studying big-sized case-control groups for a specific disease, potential genetic variations, single nucleotide polymorphisms (SNPs) are identified. Although several genetic risk factors are identified for Behcet's disease with the help of these studies via scanning around a million of SNPs, these variations could only explain up to 200/u of the disease's genetic risk. In this study, for Behcet's disease classification, via comparing all the SNPs genotyped in GWAS, with the SNPs selected via using genetic knowledge, gain ratio and information gain; both reduction in the feature size and improvement in the classification accuracy is aimed. Also, using different classification algorithms such as random forest, k-nearest neighbour and logistic regression, their effects on the classification accuracy are investigated. Our results showed that compared to other feature selection methods, with at least 81% success rate, the selection of the SNPs using the genetic information (of their GWAS p-values, indicating the significance of the SNP against the disease) provides 15% to 42% improvement in all classification algorithms. This improvement is statistically sound. While gain ratio and information gain feature selection techniques yield similar classification accuracies, the models using all SNPs could not exceed 50% accuracies and results in the worst performance. en_US
dc.identifier.isbn 9781538678930
dc.identifier.uri https://hdl.handle.net/20.500.12573/3000
dc.language.iso tr en_US
dc.publisher IEEE en_US
dc.relation.ispartof 3rd International Conference on Computer Science and Engineering (UBMK) -- SEP 20-23, 2018 -- Sarajevo, BOSNIA & HERCEG en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Behcet's Disease en_US
dc.subject Machine Learning en_US
dc.subject Feature Selection en_US
dc.subject Single Nucleotide Polymorphism (Snp) en_US
dc.subject Genome-Wide Association Study (GWAS) en_US
dc.title The Identification of Discriminative Single Nucleotide Polymorphism Sets for the Classification of Behcet's Disease en_US
dc.title.alternative The Identification of Discriminative Single Nucleotide Polymorphism Sets for the Classification of Behcet's Disease en_US
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.id Bakir-Gungor, Burcu/0000-0002-2272-6270
gdc.author.institutional Güngör, Burcu
gdc.author.wosid Işik, Yunus/Jep-8357-2023
gdc.author.wosid Görmez, Yasin/Jef-8096-2023
gdc.description.department Abdullah Gül University en_US
gdc.description.departmenttemp [Gormez, Yasin; Isik, Yunus Emre] Cumhuriyet Univ, Iktisadi Idari Bilimler Fak, Yonetim Bilisim Sistemleri, Sivas, Turkey; [Bakir-Gungor, Burcu] Abdullah Gul Univ, Muhendislik Fak, Bilgisayar Muhendisligi, Kayseri, Turkey en_US
gdc.description.endpage 447 en_US
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality N/A
gdc.description.startpage 443 en_US
gdc.description.woscitationindex Conference Proceedings Citation Index - Science
gdc.description.wosquality N/A
gdc.identifier.wos WOS:000459847400085
gdc.wos.citedcount 1
relation.isAuthorOfPublication e17be1f8-1c9a-45f2-bf0d-f8b348d2dba0
relation.isAuthorOfPublication.latestForDiscovery e17be1f8-1c9a-45f2-bf0d-f8b348d2dba0
relation.isOrgUnitOfPublication 665d3039-05f8-4a25-9a3c-b9550bffecef
relation.isOrgUnitOfPublication 52f507ab-f278-4a1f-824c-44da2a86bd51
relation.isOrgUnitOfPublication ef13a800-4c99-4124-81e0-3e25b33c0c2b
relation.isOrgUnitOfPublication.latestForDiscovery 665d3039-05f8-4a25-9a3c-b9550bffecef

Files