Browsing by Author "Işik, Yunus Emre"
Conference Object
Citation - Scopus: 1
The Identification of Discriminative Single Nucleotide Polymorphism Sets for the Classification of Behçet's Disease
(Institute of Electrical and Electronics Engineers Inc., 2018) Görmez, Yasin; Işik, Yunus Emre; Bakir-Güngör, Burcu
Behçet's disease is a long-term multisystem inflammatory disorder characterized by recurrent attacks affecting several organs. As genotyping individuals has become cheaper and easier with developments in genomic technologies, genome-wide association studies (GWAS) have emerged. By studying large case-control groups for a specific disease, these studies identify potential genetic variations, single nucleotide polymorphisms (SNPs). Although GWAS scanning around a million SNPs have identified several genetic risk factors for Behçet's disease, these variations explain only up to 20% of the disease's genetic risk. In this study, for Behçet's disease classification, all the SNPs genotyped in GWAS are compared with the SNPs selected using genetic knowledge, gain ratio and information gain; the aim is both to reduce the feature size and to improve the classification accuracy. The effects of different classification algorithms, such as random forest, k-nearest neighbour and logistic regression, on the classification accuracy are also investigated. Our results showed that, compared to other feature selection methods, selecting the SNPs using genetic information (their GWAS p-values, indicating the significance of the SNP against the disease) achieves a success rate of at least 81% and provides a 15% to 42% improvement in all classification algorithms. This improvement is statistically sound. While the gain ratio and information gain feature selection techniques yield similar classification accuracies, the models using all SNPs could not exceed 50% accuracy and result in the worst performance. © 2019 Elsevier B.V., All rights reserved.

Conference Object
Citation - WoS: 3
Citation - Scopus: 12
NSEM: Duygu Analizi için Özgün Yığınlanmiş Topluluk Yöntemi [NSEM: A Novel Stacked Ensemble Method for Sentiment Analysis]
(Institute of Electrical and Electronics Engineers Inc., 2019) Işik, Yunus Emre; Görmez, Yasin; Kaynar, Oğuz; Aydin, Zafer
Today, people often share their ideas, opinions and feelings through forums, social media sites, blogs and similar platforms. For this reason, access to these data has become very easy. The increase in the number of shares makes it possible to analyze and use these data for marketing and politics. However, because of the large volume of data, it is not feasible for humans to perform this analysis. Sentiment analysis methods automatically determine which type of emotion a text expresses. In these methods, the text is represented as a mathematical vector and classified by machine learning methods. Ensemble methods are among the most important classifiers used in sentiment analysis. In these methods, the errors of one classifier are corrected by another classifier. In sentiment analysis, the feature vector that describes the text is as important as the classifier. Feature vectors obtained using different methods can make mistakes in different places. For this reason, this study proposes NSEM for sentiment analysis, a new ensemble method that uses 2 different classifiers and 2 different feature extraction methods. In the analysis, the proposed method was the most successful, with an accuracy rate of 79.1%. © 2019 Elsevier B.V., All rights reserved.

Conference Object
Citation - Scopus: 3
Protein İkincil Yapı Tahmini Için Makine Öğrenmesi Yöntemlerinin Karşılaştırılması [Comparison of Machine Learning Methods for Protein Secondary Structure Prediction]
(Institute of Electrical and Electronics Engineers Inc., 2018) Aydin, Zafer; Kaynar, Oğuz; Görmez, Yasin; Işik, Yunus Emre
Three-dimensional structure prediction is one of the important problems in bioinformatics and theoretical chemistry. One of the most important steps in three-dimensional structure prediction is the estimation of secondary structure. Due to rapidly growing databases and recent feature extraction methods, datasets used for predicting secondary structure can potentially contain a large number of samples and dimensions. For this reason, it is important to use algorithms that are fast and accurate. In this study, various classification algorithms have been optimized for the second phase of a two-stage classifier on the EVAset benchmark, both in the original input space and in the space reduced using the information gain metric. The support vector machine is the most accurate classifier, while the extreme learning machine is significantly faster in model training. © 2018 Elsevier B.V., All rights reserved.
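
Two of the abstracts above mention information gain, once as a feature selection technique for SNPs and once as the metric used to reduce the input space. The abstracts do not describe how it was computed, so the following is only a minimal sketch of the textbook definition (entropy of the labels minus their expected entropy given a discrete feature). The function names, the toy genotype coding and the data are all hypothetical and are not taken from the papers.

```python
from collections import Counter
from math import log2


def entropy(labels):
    """Shannon entropy (in bits) of a sequence of class labels."""
    total = len(labels)
    return -sum((n / total) * log2(n / total) for n in Counter(labels).values())


def information_gain(feature, labels):
    """Entropy of the labels minus their expected entropy given the feature."""
    total = len(labels)
    groups = {}
    for value, label in zip(feature, labels):
        groups.setdefault(value, []).append(label)
    conditional = sum(len(g) / total * entropy(g) for g in groups.values())
    return entropy(labels) - conditional


def top_k_features(columns, labels, k):
    """Return the names of the k features with the highest information gain."""
    ranked = sorted(
        columns,
        key=lambda name: information_gain(columns[name], labels),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical toy data: genotypes coded 0/1/2, labels 1 = case, 0 = control.
labels = [1, 1, 1, 0, 0, 0]
columns = {
    "snp_a": [2, 2, 2, 0, 0, 0],  # perfectly separates cases from controls
    "snp_b": [0, 1, 0, 1, 0, 1],  # only weakly informative
    "snp_c": [1, 1, 1, 1, 1, 1],  # constant, carries no information
}
print(top_k_features(columns, labels, 2))
```

In this toy example the feature that separates the two classes ranks first and the constant feature ranks last. A real pipeline would also need to handle continuous features, which this sketch does not.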
