Scopus-Indexed Publications Collection

Permanent URI for this collection: https://hdl.handle.net/20.500.12573/395

Search Results

Now showing 1 - 7 of 7
  • Conference Object
    Linear Vs. Non-Linear Embedding Methods in Recommendation Systems
    (Institute of Electrical and Electronics Engineers Inc., 2022-09-07) Gurler, Kerem; Coşkun, Mustafa; Karagenc, Safak; Orun, Gokhan; Kuleli Pak, Burcu Kuleli; Güngör, Vehbi Çağrı; Coskun, Mustafa; Pak, Burcu Kuleli
    Predicting customer interest in items is crucial in direct marketing because it can potentially boost sales. Data mining techniques have been developed to predict which items a particular user might be interested in, based on their purchase history or explicit feedback in the form of ratings or comments. Recently, both linear and non-linear methods have been developed for this purpose. In this study, we applied Neighborhood-based Collaborative Filtering (CF), Matrix Factorization (MF), Singular Value Decomposition (SVD), Neural Graph CF (NGCF) and Light Graph Convolutional Network (LightGCN) to explicit user-product rating data acquired from HADI, an online gaming and mobile entertainment platform. We compared the node embedding methods in terms of Precision@k, Recall@k and NDCG@k. SVD and LightGCN showed the best test performance, and SVD was significantly faster to train than LightGCN. To further increase the predictive performance of SVD, we applied classification with Logistic Regression and Deep Random Forest to the user and item embeddings created by SVD. © 2022 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - WoS: 22
    Citation - Scopus: 52
    Evaluation of Classification Algorithms, Linear Discriminant Analysis and a New Hybrid Feature Selection Methodology for the Diagnosis of Coronary Artery Disease
    (Institute of Electrical and Electronics Engineers Inc., 2018-12) Kolukisa, Burak; Hacilar, Hilal; Göy, Gökhan; Kus, Mustafa; Bakir-Güngör, Burcu; Aral, Atilla; Güngör, Vehbi Çağrı
    According to the World Health Organization (WHO), 31% of the world's total deaths in 2016 (17.9 million) were due to cardiovascular diseases (CVD). With the development of information technologies, it has become possible to predict at a lower cost whether people have heart disease by checking certain physical and biochemical values. In this study, we have evaluated a set of different classification algorithms and linear discriminant analysis, and proposed a new hybrid feature selection methodology for the diagnosis of coronary heart diseases (CHD). Throughout this research effort, using three publicly available heart disease diagnosis datasets (UCI Machine Learning Repository), we have conducted comparative performance evaluations in terms of accuracy, sensitivity, specificity, F-measure, AUC and running time. © 2023 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - Scopus: 21
    Assessing Employee Attrition Using Classifications Algorithms
    (Association for Computing Machinery, 2020-05-15) Ozdemir, Fatma; Coşkun, Mustafa; Gezer, Cengiz; Güngör, Vehbi Çağrı; Coskun, Mustafa; Cagri Gungor, V.
    Employees leave an organization when other organizations offer better opportunities than their current one. Continuity, sustenance and even completion of jobs are crucial for companies to avoid financial losses. Especially when talented employees in critical positions leave, it becomes difficult for organizations to maintain their businesses. Today, organizations would like to predict the attrition of their employees and to plan and prepare for it. However, the HR departments of organizations are not advanced enough to make such predictions manually. For this reason, organizations are looking for new systems or methods that automate the prediction of employee attrition using data mining methods. In this study, we use the IBM HR data set and apply different classification methods, such as Support Vector Machine (SVM), Random Forest, J48, LogitBoost, Multilayer Perceptron (MLP), K-Nearest Neighbors (KNN), Linear Discriminant Analysis (LDA), Naive Bayes, Bagging, AdaBoost and Logistic Regression, to predict employee attrition. Unlike existing studies, we systematically evaluate our findings with various classification metrics, such as F-measure, Area Under Curve, accuracy, sensitivity, and specificity. We observe that data mining methods can be useful for predicting employee attrition. © 2022 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - Scopus: 2
    Makine Öğrenmesi Teknikleri Ile İnternet Servis Sağlayıcısı için Müşteri Kayıp Tahmini
    (Institute of Electrical and Electronics Engineers Inc., 2020-09) Göy, Gökhan; Kolukisa, Burak; Bahçevan, Cenk Anıl; Güngör, Vehbi Çağrı
    With technology developing in every field, a competitive marketing environment has arisen. In this competitive environment, analyzing customer behavior has become vital. In particular, the ease with which customers can switch service providers has become critical to a company's survival. At the same time, the financial resources spent on retaining customers are much lower than those needed to acquire new clients. In this context, traditional methods of examining the vast amounts of data available today to build decision support systems have lost their validity. In this study, we used a dataset provided by TurkNet, an internet service provider in Turkey. Various preprocessing steps were performed on this dataset, and classification algorithms were then run. The results were compared and analyzed in terms of the area under the curve value. The most successful classifier was the Random Trees algorithm, with a value of 0.936. © 2020 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - WoS: 2
    Citation - Scopus: 6
    Makine Öğrenmesi Yöntemleri ile Kredi Kartı Sahteciliğinin Tespiti
    (Institute of Electrical and Electronics Engineers Inc., 2019-09) Göy, Gökhan; Gezer, Cengiz; Güngör, Vehbi Çağrı
    With the increase in credit card usage, the number of credit card transactions has increased dramatically. It is difficult to identify fraudulent transactions among the vast number of credit card transactions. Although fraudulent transactions are few in number, they cause serious financial losses for individuals and organizations. Even though a large number of studies have been conducted to solve this problem, there is no generally accepted solution. In this paper, a publicly available data set is used. The class imbalance problem of the data set was solved by using hybrid sampling methods together. On this data set, comparative performance evaluations have been conducted. Unlike other studies, the Area Under the Curve (AUC) metric, which reflects performance on such data sets, has also been used in addition to standard performance metrics. Since it is also important to detect fraudulent credit card transactions quickly, the running time of the different methods is presented as another performance metric. © 2020 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - Scopus: 1
    Koroner Arter Hastalığı Tanısı İçin Alan Bilgisi İçeren Topluluk Öznitelik Seçim Yöntemi
    (Institute of Electrical and Electronics Engineers Inc., 2020-10-05) Kolukisa, Burak; Güngör, Vehbi Çağrı; Bakir-Güngör, Burcu; Gungor, Burcu Bakir
    Coronary Artery Disease (CAD) is a condition in which the heart is not adequately nourished because of the accumulation of fatty matter, called atheroma, in the walls of the arteries. In 2016, CAD accounted for 31% (17.9 million) of the world's total deaths, and its diagnosis is difficult. It is estimated that approximately 23.6 million people will die from this disease in 2030. With the development of machine learning and data mining techniques, it might be possible to diagnose CAD inexpensively and easily by examining some physical and biochemical values. In this study, a novel ensemble feature selection methodology that incorporates domain knowledge is proposed for the CAD classification problem. The proposed methodology is applied to the UCI Cleveland CAD dataset with different classification algorithms, and the performance metrics are compared. Our experiments show that when the Multilayer Perceptron classifier is used with 9 selected features, the proposed solution reached 85.47% accuracy, 82.96% accuracy and 0.839 F-Measure. As future work, we aim to build a machine learning model that can quickly diagnose CAD on real-time data in hospitals. © 2021 Elsevier B.V., All rights reserved.
  • Conference Object
    Evaluation of Hybrid Classification Approaches: Case Studies on Credit Datasets
    (Springer Verlag, 2018) Cetiner, Erkan; Güngör, Vehbi Çağrı; Kocak, Taskin
    Hybrid classification approaches in the credit domain are widely used to obtain valuable information about customer behaviours. Single classification algorithms such as neural networks, support vector machines and regression analysis have been used for years in this area. In this paper, we propose hybrid classification approaches that combine several classifiers and ensemble learners to boost classification accuracy. We worked with two credit datasets: the German dataset, which is public, and a Turkish Corporate Bank dataset. The goal of using such diverse datasets is to examine the generalization ability of the proposed model. Results show that feature selection plays a vital role in classification accuracy, that hybrid approaches built with ensemble learners outperform single classification techniques, and that hybrid approaches that include SVM achieve better accuracy than other hybrid approaches. © 2018 Elsevier B.V., All rights reserved.
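
The abstracts in this listing report results with a recurring set of metrics: accuracy, sensitivity, specificity, F-measure and AUC for the classification studies, and Precision@k, Recall@k and NDCG@k for the recommendation study. As a reading aid, the sketch below shows one common way to compute them with the Python standard library only. It is not taken from any of the listed papers; the function names, the binary-relevance form of NDCG and the toy data are all assumptions made for illustration.

```python
import math


def classification_metrics(y_true, y_pred):
    """Accuracy, sensitivity, specificity and F-measure for binary labels (1 = positive)."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)
    sensitivity = tp / (tp + fn) if tp + fn else 0.0  # recall on positives
    specificity = tn / (tn + fp) if tn + fp else 0.0  # recall on negatives
    precision = tp / (tp + fp) if tp + fp else 0.0
    denom = precision + sensitivity
    f_measure = 2 * precision * sensitivity / denom if denom else 0.0
    return {
        "accuracy": (tp + tn) / len(pairs),
        "sensitivity": sensitivity,
        "specificity": specificity,
        "f_measure": f_measure,
    }


def auc(y_true, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly; ties count half."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative example")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def precision_recall_at_k(ranked, relevant, k):
    """Precision@k and Recall@k for a ranked item list and a set of relevant items."""
    hits = sum(1 for item in ranked[:k] if item in relevant)
    return hits / k, (hits / len(relevant) if relevant else 0.0)


def ndcg_at_k(ranked, relevant, k):
    """NDCG@k with binary relevance (an assumption; graded relevance is also common)."""
    dcg = sum(1.0 / math.log2(i + 2) for i, item in enumerate(ranked[:k]) if item in relevant)
    ideal = sum(1.0 / math.log2(i + 2) for i in range(min(k, len(relevant))))
    return dcg / ideal if ideal else 0.0


# Toy data, invented purely for illustration.
y_true = [1, 1, 1, 0, 0, 0, 0, 1]
y_pred = [1, 0, 1, 0, 1, 0, 0, 1]
scores = [0.9, 0.4, 0.8, 0.3, 0.6, 0.2, 0.1, 0.7]
ranked = ["a", "b", "c", "d", "e"]
relevant = {"a", "c", "f"}

print(classification_metrics(y_true, y_pred))
print(auc(y_true, scores))
print(precision_recall_at_k(ranked, relevant, 3))
print(ndcg_at_k(ranked, relevant, 3))
```

AUC is computed here by pairwise comparison, which is quadratic in the number of examples; it keeps the definition visible but would likely be replaced by a sort-based or library implementation on data sets of realistic size.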