Scopus-Indexed Publications Collection
Permanent URI for this collection: https://hdl.handle.net/20.500.12573/395
4 results
Search Results
Article
Citation - Scopus: 6
Network Intrusion Detection Based on Machine Learning Strategies: Performance Comparisons on Imbalanced Wired, Wireless, and Software-Defined Networking (SDN) Network Traffics
(Turkiye Klinikleri, 2024-07-26) Hacilar, Hilal; Aydin, Zafer; Güngör, Vehbi Çağrı
The rapid growth of computer networks emphasizes the urgency of addressing security issues. Organizations rely on network intrusion detection systems (NIDSs) to protect sensitive data from unauthorized access and theft. These systems analyze network traffic to detect suspicious activities, such as attempted breaches or cyberattacks. However, existing studies lack a thorough assessment of class imbalances and classification performance for different types of network intrusions: wired, wireless, and software-defined networking (SDN). This research aims to fill this gap by examining these networks’ imbalances, feature selection, and binary classification to enhance intrusion detection system efficiency. Various techniques such as SMOTE, ROS, ADASYN, and SMOTETomek are used to handle imbalanced datasets. Additionally, eXtreme Gradient Boosting (XGBoost) identifies key features, and an autoencoder (AE) assists in feature extraction for the classification task. The study evaluates datasets such as AWID, UNSW, and InSDN, yielding the best results with different numbers of selected features. Bayesian optimization fine-tunes parameters, and diverse machine learning algorithms (SVM, kNN, XGBoost, random forest, ensemble classifiers, and autoencoders) are employed. The optimal results, considering F1-measure, overall accuracy, detection rate, and false alarm rate, have been achieved for the UNSW-NB15, preprocessed AWID, and InSDN datasets, with values of [0.9356, 0.9289, 0.9328, 0.07597], [0.997, 0.9995, 0.9999, 0.0171], and [0.9998, 0.9996, 0.9998, 0.0012], respectively.
These findings demonstrate that combining Bayesian optimization with oversampling techniques significantly enhances classification performance across wired, wireless, and SDN networks when compared to previous research conducted on these datasets. © 2024 Elsevier B.V., All rights reserved.

Conference Object
Citation - Scopus: 21
Assessing Employee Attrition Using Classifications Algorithms
(Association for Computing Machinery, 2020-05-15) Ozdemir, Fatma; Coşkun, Mustafa; Gezer, Cengiz; Güngör, Vehbi Çağrı; Coskun, Mustafa; Cagri Gungor, V.
Employees leave an organization when other organizations offer better opportunities than their current one. Continuity, sustainability, and even completion of work are crucial for companies to avoid financial losses. In particular, when talented employees in critical positions leave, it becomes difficult for organizations to maintain their businesses. Today, organizations would like to predict the attrition of their employees and plan and prepare for it. However, the HR departments of organizations are not advanced enough to make such predictions manually. For this reason, organizations are looking for new systems or methods that automate the prediction of employee attrition using data mining methods. In this study, we use the IBM HR data set and apply different classification methods, such as Support Vector Machine (SVM), Random Forest, J48, LogitBoost, Multilayer Perceptron (MLP), K-Nearest Neighbors (KNN), Linear Discriminant Analysis (LDA), Naive Bayes, Bagging, AdaBoost, and Logistic Regression, to predict employee attrition. Unlike existing studies, we systematically evaluate our findings with various classification metrics, such as F-measure, Area Under Curve, accuracy, sensitivity, and specificity. We observe that data mining methods can be useful for predicting employee attrition.
© 2022 Elsevier B.V., All rights reserved.

Conference Object
Evaluation of Hybrid Classification Approaches: Case Studies on Credit Datasets
(Springer Verlag, 2018) Cetiner, Erkan; Güngör, Vehbi Çağrı; Kocak, Taskin
Hybrid classification approaches in the credit domain are widely used to obtain valuable information about customer behaviours. Single classification algorithms such as neural networks, support vector machines, and regression analysis have been used for years in this area. In this paper, we propose hybrid classification approaches that combine several classifiers and ensemble learners to boost classification accuracy. We worked with two credit datasets: the German dataset, which is public, and a Turkish Corporate Bank dataset. The goal of using such diverse datasets is to test the generalization ability of the proposed model. Results show that feature selection plays a vital role in classification accuracy, that hybrid approaches built with ensemble learners outperform single classification techniques, and that hybrid approaches that include SVM achieve better accuracy than other hybrid approaches. © 2018 Elsevier B.V., All rights reserved.

Conference Object
Citation - Scopus: 1
Man-Hour Prediction for Complex Industrial Products
(Institute of Electrical and Electronics Engineers Inc., 2023) Unal, Ahmet Emin; Boyar, Halit; Kuleli Pak, Burcu Kuleli; Cem Yildiz, Mehmet; Erten, Ali Erman; Güngör, Vehbi Çağrı; Pak, Burcu Kuleli; Cagri Gungor, Vehbi
Accurately predicting the cost is crucial for the success of complex industrial projects. There can be several sources contributing to the cost. Traditional methods for cost estimation may not provide the required accuracy and speed to ensure the success of the project. Recently, machine learning techniques have shown promising results in improving cost estimation in various industrial products.
This study investigates the performance of gradient-boosting machine learning models and feature engineering techniques on a private dataset of metal sheet project man-hour costs. A comparison of distinct models is conducted, key aspects influencing cost are identified, and the implications of incorporating domain-specific knowledge, including its advantages and disadvantages, are assessed based on performance outcomes. Experimental results demonstrate that LightGBM and XGBoost outperform other models, and feature selection and synthetic data generation techniques improve the performance. Overall, this study highlights the potential of machine learning in metal sheet sampling projects and emphasizes the importance of feature engineering and domain expertise for better model performance. © 2024 Elsevier B.V., All rights reserved.
