Scopus İndeksli Yayınlar Koleksiyonu

Permanent URI for this collectionhttps://hdl.handle.net/20.500.12573/395

Browse

Search Results

Now showing 1 - 5 of 5
  • Article
    Citation - Scopus: 6
    Network Intrusion Detection Based on Machine Learning Strategies: Performance Comparisons on Imbalanced Wired, Wireless, and Software-Defined Networking (SDN) Network Traffics
    (Turkiye Klinikleri, 2024-07-26) Hacilar, Hilal; Aydin, Zafer; Güngör, Vehbi Çağrı
    The rapid growth of computer networks emphasizes the urgency of addressing security issues. Organizations rely on network intrusion detection systems (NIDSs) to protect sensitive data from unauthorized access and theft. These systems analyze network traffic to detect suspicious activities, such as attempted breaches or cyberattacks. However, existing studies lack a thorough assessment of class imbalances and classification performance for different types of network intrusions: wired, wireless, and software-defined networking (SDN). This research aims to fill this gap by examining these networks’ imbalances, feature selection, and binary classification to enhance intrusion detection system efficiency. Various techniques such as SMOTE, ROS, ADASYN, and SMOTETomek are used to handle imbalanced datasets. Additionally, eXtreme Gradient Boosting (XGBoost) identifies key features, and an autoencoder (AE) assists in feature extraction for the classification task. The study evaluates datasets such as AWID, UNSW, and InSDN, yielding the best results with different numbers of selected features. Bayesian optimization fine-tunes parameters, and diverse machine learning algorithms (SVM, kNN, XGBoost, random forest, ensemble classifiers, and autoencoders) are employed. The optimal results, considering F1-measure, overall accuracy, detection rate, and false alarm rate, have been achieved for the UNSW-NB15, preprocessed AWID, and InSDN datasets, with values of [0.9356, 0.9289, 0.9328, 0.07597], [0.997, 0.9995, 0.9999, 0.0171], and [0.9998, 0.9996, 0.9998, 0.0012], respectively. These findings demonstrate that combining Bayesian optimization with oversampling techniques significantly enhances classification performance across wired, wireless, and SDN networks when compared to previous research conducted on these datasets. © 2024 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - Scopus: 19
    A Novel Feature Design and Stacking Approach for Non-Technical Electricity Loss Detection
    (Institute of Electrical and Electronics Engineers Inc., 2018-05) Aydin, Zafer; Güngör, Vehbi Çağrı
    Non-technical electricity losses continue to jeopardize economic and social well-being of many countries. In this work, we develop machine learning classifiers that can identify anomalous electricity consumption in Turkey. Starting from weekly electricity usage data, we develop new features that capture statistical and frequency domain characteristics of the customers and their consumption patterns. We analyze the effect of reducing number of feature descriptors through dimensionality reduction and feature selection techniques. To overcome the class imbalance problem, we implement several ensemble methods and compare their prediction accuracy to those of the standard classifiers. The proposed features and combining strengths of different classifiers bring significant improvements on performance metrics, which is demonstrated through detailed simulations on shopping mall sector. We anticipate that advances in this field will contribute to the economies considerably. © 2018 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - Scopus: 2
    Makine Öǧrenmesi Teknikleri Ile İnternet Servis Saǧlayıcısı için Müşteri Kayıp Tahmini
    (Institute of Electrical and Electronics Engineers Inc., 2020-09) Göy, Gökhan; Kolukisa, Burak; Bahçevan, Cenk Anıl; Güngör, Vehbi Çağrı
    With the developing technology in every fields, a competitive marketing environment has been arised. In this competitive environment, analyzing customer behavior has become vital. In particular, the ability to easily change any service provider has become very critical for the company to continue its existence. At the same time, the amount of financial resources spent on retaining customers much less than to obtain new clients. In this context, the traditional methods of examining vast amount of data obtained today for establishing decision support systems have lost their validities. In this study, we used a dataset which is provided by TurkNet serving as an internet service provider in Turkey. Various preprocessing steps has performed on this dataset and then classification algorithms ran. Afterwards results have obtained and compared. The results of these experiments analyzed in terms of the area under the curve value. In this context, the most successful classifier algorithm has been determined as the Random Trees algorithm with a value of 0.936. © 2020 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - Scopus: 1
    Koroner Arter Hastalığı Tanısı İçin Alan Bilgisi İçeren Topluluk Öznitelik Seçim Yöntemi
    (Institute of Electrical and Electronics Engineers Inc., 2020-10-05) Kolukisa, Burak; Güngör, Vehbi Çağrı; Bakir-Güngör, Burcu; Gungor, Burcu Bakir
    Coronary Artery Disease (CAD) is the condition where, the heart is not fed enough as a result of the accumulation of fatty matter called atheroma in the walls of the arteries. In 2016, CAD accounts for 31% (17.9 million) of the world's total deaths and its diagnosis is difficult. It is estimated that approximately 23.6 million people will die from this disease in 2030. With the development of machine learning and data mining techniques, it might be possible to diagnose CAD inexpensively and easily via examining some physical and biochemical values. In this study, for the CAD classification problem, a novel ensemble feature selection methodology that incorporates domain knowledge is proposed. Via applying the proposed methodology on the UCI Cleveland CAD dataset and using different classification algorithms, performance metrics are compared. It is shown that in our experiments, when Multilayer Perceptron classifier is used with 9 selected features, our proposed solution reached 85.47% accuracy, 82.96% accuracy and 0.839 F-Measure. As a future work, we aim to generate a machine learning model that can quickly diagnose CAD on real-time data in hospitals. © 2021 Elsevier B.V., All rights reserved.
  • Conference Object
    Evaluation of Hybrid Classification Approaches: Case Studies on Credit Datasets
    (Springer Verlag service@springer.de, 2018) Cetiner, Erkan; Güngör, Vehbi Çağrı; Kocak, Taskin
    Hybrid classification approaches on credit domain are widely used to obtain valuable information about customer behaviours. Single classification algorithms such as neural networks, support vector machines and regression analysis have been used since years on related area. In this paper, we propose hybrid classification approaches, which try to combine several classifiers and ensemble learners to boost accuracy on classification results. We worked with two credit datasets, German dataset which is a public dataset and a Turkish Corporate Bank dataset. The goal of using such diverse datasets is to search for generalization ability of proposed model. Results show that feature selection plays a vital role on classification accuracy, hybrid approaches which shaped with ensemble learners outperform single classification techniques and hybrid approaches which consists SVM has better accuracy performance than other hybrid approaches. © 2018 Elsevier B.V., All rights reserved.