Scopus İndeksli Yayınlar Koleksiyonu

Permanent URI for this collectionhttps://hdl.handle.net/20.500.12573/395

Browse

Search Results

Now showing 1 - 10 of 11
  • Article
    Citation - WoS: 6
    Citation - Scopus: 7
    The Determination of Distinctive Single Nucleotide Polymorphism Sets for the Diagnosis of Behcet's Disease
    (IEEE Computer Soc, 2022-05-01) Isik, Yunus Emre; Gormez, Yasin; Aydin, Zafer; Bakir-Gungor, Burcu
    Behcet's Disease (BD) is a multi-system inflammatory disorder in which the etiology remains unclear. The most probable hypothesis is that genetic tendency and environmental factors play roles in the development of BD. In order to find the essential reasons, genetic changes on thousands of genes should be analyzed. Besides, there is a need for extra analysis to find out which genetic factor affects the disease. Machine learning approaches have high potential for extracting the knowledge from genomics and selecting the representative Single Nucleotide Polymorphisms (SNPs) as the most effective features for the clinical diagnosis process. In this study, we have attempted to identify representative SNPs using feature selection methods, incorporating biological information and aimed to develop a machine-learning model for diagnosing Behcet's disease. By combining biological information and machine learning classifiers, up to 99.64 percent accuracy of disease prediction is achieved using only 13,611 out of 311,459 SNPs. In addition, we revealed the SNPs that are most distinctive by performing repeated feature selection in cross-validation experiments.
  • Article
    Citation - WoS: 4
    Citation - Scopus: 4
    Sample Reduction Strategies for Protein Secondary Structure Prediction
    (MDPI, 2019-10-18) Atasever, Sema; Aydin, Zafer; Erbay, Hasan; Sabzekar, Mostafa
    Predicting the secondary structure from protein sequence plays a crucial role in estimating the 3D structure, which has applications in drug design and in understanding the function of proteins. As new genes and proteins are discovered, the large size of the protein databases and datasets that can be used for training prediction models grows considerably. A two-stage hybrid classifier, which employs dynamic Bayesian networks and a support vector machine (SVM) has been shown to provide state-of-the-art prediction accuracy for protein secondary structure prediction. However, SVM is not efficient for large datasets due to the quadratic optimization involved in model training. In this paper, two techniques are implemented on CB513 benchmark for reducing the number of samples in the train set of the SVM. The first method randomly selects a fraction of data samples from the train set using a stratified selection strategy. This approach can remove approximately 50% of the data samples from the train set and reduce the model training time by 73.38% on average without decreasing the prediction accuracy significantly. The second method clusters the data samples by a hierarchical clustering algorithm and replaces the train set samples with nearest neighbors of the cluster centers in order to improve the training time. To cluster the feature vectors, the hierarchical clustering method is implemented, for which the number of clusters and the number of nearest neighbors are optimized as hyper-parameters by computing the prediction accuracy on validation sets. It is found that clustering can reduce the size of the train set by 26% without reducing the prediction accuracy. Among the clustering techniques Ward's method provided the best accuracy on test data.
  • Article
    Citation - Scopus: 6
    Network Intrusion Detection Based on Machine Learning Strategies: Performance Comparisons on Imbalanced Wired, Wireless, and Software-Defined Networking (SDN) Network Traffics
    (Turkiye Klinikleri, 2024-07-26) Hacilar, Hilal; Aydin, Zafer; Güngör, Vehbi Çağrı
    The rapid growth of computer networks emphasizes the urgency of addressing security issues. Organizations rely on network intrusion detection systems (NIDSs) to protect sensitive data from unauthorized access and theft. These systems analyze network traffic to detect suspicious activities, such as attempted breaches or cyberattacks. However, existing studies lack a thorough assessment of class imbalances and classification performance for different types of network intrusions: wired, wireless, and software-defined networking (SDN). This research aims to fill this gap by examining these networks’ imbalances, feature selection, and binary classification to enhance intrusion detection system efficiency. Various techniques such as SMOTE, ROS, ADASYN, and SMOTETomek are used to handle imbalanced datasets. Additionally, eXtreme Gradient Boosting (XGBoost) identifies key features, and an autoencoder (AE) assists in feature extraction for the classification task. The study evaluates datasets such as AWID, UNSW, and InSDN, yielding the best results with different numbers of selected features. Bayesian optimization fine-tunes parameters, and diverse machine learning algorithms (SVM, kNN, XGBoost, random forest, ensemble classifiers, and autoencoders) are employed. The optimal results, considering F1-measure, overall accuracy, detection rate, and false alarm rate, have been achieved for the UNSW-NB15, preprocessed AWID, and InSDN datasets, with values of [0.9356, 0.9289, 0.9328, 0.07597], [0.997, 0.9995, 0.9999, 0.0171], and [0.9998, 0.9996, 0.9998, 0.0012], respectively. These findings demonstrate that combining Bayesian optimization with oversampling techniques significantly enhances classification performance across wired, wireless, and SDN networks when compared to previous research conducted on these datasets. © 2024 Elsevier B.V., All rights reserved.
  • Article
    Citation - WoS: 8
    Citation - Scopus: 10
    Lung Cancer Subtype Differentiation From Positron Emission Tomography Images
    (Tubitak Scientific & Technological Research Council Turkey, 2020-01-27) Ayyildiz, Oguzhan; Aydin, Zafer; Yilmaz, Bulent; Karacavus, Seyhan; Senkaya, Kubra; Icer, Semra; Kaya, Eser; Taşdemir, Arzu
    Lung cancer is one of the deadly cancer types, and almost 85% of lung cancers are nonsmall cell lung cancer (NSCLC). In the present study we investigated classification and feature selection methods for the differentiation of two subtypes of NSCLC, namely adenocarcinoma (ADC) and squamous cell carcinoma (SqCC). The major advances in understanding the effects of therapy agents suggest that future targeted therapies will be increasingly subtype specific. We obtained positron emission tomography (PET) images of 93 patients with NSCLC, 39 of which had ADC while the rest had SqCC. Random walk segmentation was applied to delineate three-dimensional tumor volume, and 39 texture features were extracted to grade the tumor subtypes. We examined 11 classifiers with two different feature selection methods and the effect of normalization on accuracy. The classifiers we used were the k-nearest-neighbor, logistic regression, support vector machine, Bayesian network, decision tree, radial basis function network, random forest, AdaBoostM1, and three stacking methods. To evaluate the prediction accuracy we performed a leave-one-out cross-validation experiment on the dataset. We also considered optimizing certain hyperparameters of these models by performing 10-fold cross-validation separately on each training set. We found that the stacking ensemble classifier, which combines a decision tree, AdaBoostM1, and logistic regression methods by a metalearner, was the most accurate method for detecting subtypes of NSCLC, and normalization of feature sets improved the accuracy of the classification method.
  • Article
    Citation - WoS: 10
    Citation - Scopus: 11
    Knowledge Based Response Correction Method for Design of Reconfigurable N-Shaped Microstrip Patch Antenna Using Inverse Anns
    (Wiley, 2015-12-18) Aoad, Ashrf; Simsek, Murat; Aydin, Zafer
    Artificial neural networks (ANNs) have been often used for engineering design problems. In this work, an inverse model of a reconfigurable N-shaped microstrip patch antenna which is formed by ANN is considered to find design parameters. For this task, knowledge-based response correction consists of two steps, which include generating response using multilayer perceptron as a first step and correcting this response using knowledge based methods such as source difference, prior knowledge input, and prior knowledge input with difference as a second step. The proposed antenna has four states of operation controlled by two Positive-Intrinsic-Negative (PIN) diodes with ON/OFF states. The two-step ANN models are inversely trained using the optimum of the resonant frequency parameter as the input and the physical dimensions of the proposed antenna as outputs of the multilayer perceptron. The outputs and, in some methods, the input parameters of the multilayer perceptron are sent as input to the knowledge-based models while the obtained outputs from the two steps are the results of the new physical dimensions of the redesigned reconfigurable antenna that will be compared and analyzed. This input/output complexity of the proposed reconfigurable antenna allows an accurate and fast inverse model to be developed with less training data. Users may use this antenna and its ANN models to develop new products in the market where any frequency in the operating region can be given to the input to result an appropriate form of the new reconfigurable antenna. Copyright (c) 2015 John Wiley & Sons, Ltd.
  • Article
    Citation - WoS: 13
    Citation - Scopus: 20
    IGPRED: Combination of Convolutional Neural and Graph Convolutional Networks for Protein Secondary Structure Prediction
    (Wiley, 2021-05-25) Gormez, Yasin; Sabzekar, Mostafa; Aydin, Zafer
    There is a close relationship between the tertiary structure and the function of a protein. One of the important steps to determine the tertiary structure is protein secondary structure prediction (PSSP). For this reason, predicting secondary structure with higher accuracy will give valuable information about the tertiary structure. Recently, deep learning techniques have obtained promising improvements in several machine learning applications including PSSP. In this article, a novel deep learning model, based on convolutional neural network and graph convolutional network is proposed. PSIBLAST PSSM, HHMAKE PSSM, physico-chemical properties of amino acids are combined with structural profiles to generate a rich feature set. Furthermore, the hyper-parameters of the proposed network are optimized using Bayesian optimization. The proposed model IGPRED obtained 89.19%, 86.34%, 87.87%, 85.76%, and 86.54% Q3 accuracies for CullPDB, EVAset, CASP10, CASP11, and CASP12 datasets, respectively.
  • Article
    Citation - WoS: 3
    Citation - Scopus: 6
    IGPRED-Multitask: A Deep Learning Model to Predict Protein Secondary Structure, Torsion Angles and Solvent Accessibility
    (IEEE Computer Soc, 2023-03-01) Gormez, Yasin; Aydin, Zafer
    Protein secondary structure, solvent accessibility and torsion angle predictions are preliminary steps to predict 3D structure of a protein. Deep learning approaches have achieved significant improvements in predicting various features of protein structure. In this study, IGPRED-Multitask, a deep learning model with multi task learning architecture based on deep inception network, graph convolutional network and a bidirectional long short-term memory is proposed. Moreover, hyper-parameters of the model are fine-tuned using Bayesian optimization, which is faster and more effective than grid search. The same benchmark test data sets as in the OPUS-TASS paper including TEST2016, TEST2018, CASP12, CASP13, CASPFM, HARD68, CAMEO93, CAMEO93_HARD, as well as the train and validation sets, are used for fair comparison with the literature. Statistically significant improvements are observed in secondary structure prediction on 4 datasets, in phi angle prediction on 2 datasets and in psi angel prediction on 3 datasets compared to the state-of-the-art methods. For solvent accessibility prediction, TEST2016 and TEST2018 datasets are used only to assess the performance of the proposed model.
  • Article
    Citation - WoS: 7
    Citation - Scopus: 5
    Effect of Interpolation on Specular Reflections in Texture-Based Automatic Colonic Polyp Detection
    (Wiley, 2020-06-26) Kacmaz, Rukiye Nur; Yilmaz, Bulent; Aydin, Zafer
    Reflections of LED light cause unwanted noise effects called specular reflection (SR) on colonoscopic images. The aim of this study was to seek answers to the following two questions. (a) How are the texture features used in automatic detection of polyps affected by the interpolation on specular reflections? (b) If they are affected does it really affect the classification performance? In order to answer these questions, we used 610 colonoscopy images, and divided each image into tiles whose sizes were 32-by-32 pixels. From these tiles, we selected the ones without any specular reflection. We added different shape and size specular reflections cropped from real images onto the reflection-free tiles. We then used the nearest neighbors, bilinear and bicubic interpolation techniques on the tiles on which SRs were added. On these tiles we extracted 116 texture features using 3 second-order approaches, and 4 first-order statistics. First, we used paired samplettest. Second, we performed automatic classification of polyps and background using random forest and k nearest neighbors (k-NN) approaches using the texture features for different combinations of specular reflections added on the tiles from the polyp or background. The results showed that depending on the size of specular reflection, interpolation can cause a significant difference between the texture features that were coming from reflection-free tiles and the same tiles on which interpolation was performed. In addition, we note that bicubic interpolation may be preferred to eliminate specular reflection when texture features are used for background and polyp discrimination.
  • Article
    Citation - WoS: 2
    Citation - Scopus: 4
    Deep-Learning AI-Model for Predicting Dental Plaque in the Young Permanent Teeth of Children Aged 8-13 Years
    (MDPI, 2025-04-07) Tez, Banu Cicek; Guzel, Yasin; Eliacik, Bahar Basak Kiziltan; Aydin, Zafer; Kızıltan Eliaçık, Bahar Başak
    Background/Objectives: Dental plaque is a significant contributor to various prevalent oral health conditions, including caries, gingivitis, and periodontitis. Consequently, its detection and management are of paramount importance for maintaining oral health. Manual plaque assessment is time-consuming, error-prone, and particularly challenging in uncooperative pediatric patients. These limitations have encouraged researchers to seek faster, more reliable methods. Accordingly, this study aims to develop a deep learning model for detecting and segmenting plaque in young permanent teeth and to evaluate its diagnostic precision. Methods: The dataset comprises 506 dental images from 31 patients aged between 8 and 13 years. Six state-of-the-art models were trained and evaluated using this dataset. The U-Net Transformer model, which yielded the best performance, was further compared against three experienced pediatric dentists for clinical feasibility using 35 randomly selected images from the test set. The clinical trial was registered on under the ID NCT06603233 (1 June 2023). Results: The Intersection over Union (IoU) score of the U-Net Transformer on the test set was measured as 0.7845, and the p-values obtained from the three t-tests conducted for comparison with dentists were found to be below 0.05. Compared with three experienced pediatric dentists, the deep learning model exhibited clinically superior performance in the detection and segmentation of dental plaque in young permanent teeth. Conclusions: This finding highlights the potential of AI-driven technologies in enhancing the accuracy and reliability of dental plaque detection and segmentation in pediatric dentistry.
  • Article
    Citation - WoS: 14
    Citation - Scopus: 20
    A Deep Learning Approach With Bayesian Optimization and Ensemble Classifiers for Detecting Denial of Service Attacks
    (Wiley, 2020-05-06) Gormez, Yasin; Aydin, Zafer; Karademir, Ramazan; Gungor, Vehbi C.
    Detecting malicious behavior is important for preventing security threats in a computer network. Denial of Service (DoS) is among the popular cyber attacks targeted at web sites of high-profile organizations and can potentially have high economic and time costs. In this paper, several machine learning methods including ensemble models and autoencoder-based deep learning classifiers are compared and tuned using Bayesian optimization. The autoencoder framework enables to extract new features by mapping the original input to a new space. The methods are trained and tested both for binary and multi-class classification on Digiturk and Labris datasets, which were introduced recently for detecting various types of DDoS attacks. The best performing methods are found to be ensembles though deep learning classifiers achieved comparable level of accuracy.