WoS İndeksli Yayınlar Koleksiyonu
Permanent URI for this collectionhttps://hdl.handle.net/20.500.12573/394
Browse
11 results
Search Results
Article Frequency-Based Deep Occlusion Awareness Instance Segmentation(MDPI, 2026-02-26) Guzel, Yasin; Aydin, Zafer; Talu, Muhammed FatihOne major challenge faced by deep learning-based methods that detect target objects in the form of bounding boxes is object occlusion. High degrees of occlusion significantly diminish the accuracy of instance segmentation. Nonetheless, complex-valued Fourier descriptors can robustly represent object boundaries using minimal information. In this study, the impact of integrating Fourier descriptors-renowned for their strong representational capacity-with deep network models (UNet) that exhibit high generalization performance on instance segmentation accuracy was investigated. Within the scope of the research, nine network models were designed based on different strategies for utilizing frequency components. These variants fall into four strategy families: (i) UNet-style spectrum regression on fixed low-frequency windows (FUNet), (ii) magnitude-guided frequency selection/ROI construction (FUNet-Thr, FUNet-BBox), (iii) sequence models over tokenized FFT coefficients (BiLSTM Patch/Sorted), and (iv) encoder-only spectrum predictors with different depth/capacity (EncoderFFT1/2). To fairly evaluate the models' performance in segmenting objects subjected to disruptive factors (e.g., occlusion, blurring, noise), a specialized synthetic dataset was prepared. The task is formulated as single-target (single-instance), single-class segmentation. This dataset, automatically generated according to initial parameter values, contains images of objects moving at various speeds within a single frame. Among these models, the one termed FUNet, which relies on partial matching of central frequency components, achieved the highest segmentation accuracy despite the disruptive effects. Under the challenging Dataset 8 setting, the proposed FUNet achieved the highest overlap-based performance (Dice = 0.9329, IoU = 0.8842) among Attention U-Net, U-Net, and FourierNet, with statistically significant gains confirmed by paired per-image tests.Article Citation - WoS: 4Citation - Scopus: 4Sample Reduction Strategies for Protein Secondary Structure Prediction(MDPI, 2019-10-18) Atasever, Sema; Aydin, Zafer; Erbay, Hasan; Sabzekar, MostafaPredicting the secondary structure from protein sequence plays a crucial role in estimating the 3D structure, which has applications in drug design and in understanding the function of proteins. As new genes and proteins are discovered, the large size of the protein databases and datasets that can be used for training prediction models grows considerably. A two-stage hybrid classifier, which employs dynamic Bayesian networks and a support vector machine (SVM) has been shown to provide state-of-the-art prediction accuracy for protein secondary structure prediction. However, SVM is not efficient for large datasets due to the quadratic optimization involved in model training. In this paper, two techniques are implemented on CB513 benchmark for reducing the number of samples in the train set of the SVM. The first method randomly selects a fraction of data samples from the train set using a stratified selection strategy. This approach can remove approximately 50% of the data samples from the train set and reduce the model training time by 73.38% on average without decreasing the prediction accuracy significantly. The second method clusters the data samples by a hierarchical clustering algorithm and replaces the train set samples with nearest neighbors of the cluster centers in order to improve the training time. To cluster the feature vectors, the hierarchical clustering method is implemented, for which the number of clusters and the number of nearest neighbors are optimized as hyper-parameters by computing the prediction accuracy on validation sets. It is found that clustering can reduce the size of the train set by 26% without reducing the prediction accuracy. Among the clustering techniques Ward's method provided the best accuracy on test data.Article Citation - WoS: 1New Modeling of Reconfigurable Microstrip Antenna Using Hybrid Structure of Simulation Driven and Knowledge Based Artificial Neural Networks(Pamukkale Univ, 2020) Aoad, Ashrf; Aydin, ZaferKnowledge-based modeling has a critical role to embed existing knowledge to improve modeling performance. Since reconfigurable antenna can provide more operational frequencies than the classical antennas, a knowledge-based hybrid structure is used in this work to obtain efficient model and producing optimum new models for a reconfigurable microstrip antenna. The hybrid structure consists of two phases. The first phase generates initial knowledge which is used in knowledge-based modeling structure to obtain design parameters. Artificial neural network based multilayer perceptron can generate necessary knowledge for a knowledge-based model after the training process. Knowledge-based modeling improves the accuracy of the initial model to determine design parameters corresponding to the design target. Source difference, prior knowledge Input and prior knowledge input with difference can be applied to realize an efficient knowledge-based strategy. 3D-EM simulation generates the new model in terms of the design parameters of the proposed application. It has three switching states for operating, which are organized by two resistor circuits representing ON/OFF states. Switch positions and geometrical parameters can be used for satisfying design targets between 1 GHz and 6 GHz for the efficient antenna design.Article Citation - Scopus: 6Network Intrusion Detection Based on Machine Learning Strategies: Performance Comparisons on Imbalanced Wired, Wireless, and Software-Defined Networking (SDN) Network Traffics(Turkiye Klinikleri, 2024-07-26) Hacilar, Hilal; Aydin, Zafer; Güngör, Vehbi ÇağrıThe rapid growth of computer networks emphasizes the urgency of addressing security issues. Organizations rely on network intrusion detection systems (NIDSs) to protect sensitive data from unauthorized access and theft. These systems analyze network traffic to detect suspicious activities, such as attempted breaches or cyberattacks. However, existing studies lack a thorough assessment of class imbalances and classification performance for different types of network intrusions: wired, wireless, and software-defined networking (SDN). This research aims to fill this gap by examining these networks’ imbalances, feature selection, and binary classification to enhance intrusion detection system efficiency. Various techniques such as SMOTE, ROS, ADASYN, and SMOTETomek are used to handle imbalanced datasets. Additionally, eXtreme Gradient Boosting (XGBoost) identifies key features, and an autoencoder (AE) assists in feature extraction for the classification task. The study evaluates datasets such as AWID, UNSW, and InSDN, yielding the best results with different numbers of selected features. Bayesian optimization fine-tunes parameters, and diverse machine learning algorithms (SVM, kNN, XGBoost, random forest, ensemble classifiers, and autoencoders) are employed. The optimal results, considering F1-measure, overall accuracy, detection rate, and false alarm rate, have been achieved for the UNSW-NB15, preprocessed AWID, and InSDN datasets, with values of [0.9356, 0.9289, 0.9328, 0.07597], [0.997, 0.9995, 0.9999, 0.0171], and [0.9998, 0.9996, 0.9998, 0.0012], respectively. These findings demonstrate that combining Bayesian optimization with oversampling techniques significantly enhances classification performance across wired, wireless, and SDN networks when compared to previous research conducted on these datasets. © 2024 Elsevier B.V., All rights reserved.Article Citation - WoS: 26Citation - Scopus: 48Metabolic Imaging Based Sub-Classification of Lung Cancer(IEEE-Inst Electrical Electronics Engineers Inc, 2020) Bicakci, Mustafa; Ayyildiz, Oguzhan; Aydin, Zafer; Basturk, Alper; Karacavus, Seyhan; Yilmaz, BulentLung cancer is one of the deadliest cancer types whose 84% is non-small cell lung cancer (NSCLC). In this study, deep learning-based classification methods were investigated comprehensively to differentiate two subtypes of NSCLC, namely adenocarcinoma (ADC) and squamous cell carcinoma (SqCC). The study used 1457 F-18-FDG PET images/slices with tumor from 94 patients (88 men), 38 of which were ADC and the rest were SqCC. Three experiments were carried out to examine the contribution of peritumoral areas in PET images on subtype classification of tumors. We assessed multilayer perceptron (MLP) and three convolutional neural network (CNN) models such as SqueezeNet, VGG16 and VGG19 using three kinds of images in these experiments: 1) Whole slices without cropping or segmentation, 2) cropped image portions (square subimages) that include the tumor and 3) segmented image portions corresponding to tumors using random walk method. Several optimizers and regularization methods were used to optimize each model for the diagnostic classification. The classification models were trained and evaluated by performing stratified 10-fold cross validation, and F-score and area-under-curve (AUC) metrics were used to quantify the performance. According to our results, it is possible to say that inclusion of peritumoral regions/tissues both contributes to the success of models and makes segmentation effort unnecessary. To the best of our knowledge, deep learning-based models have not been applied to the subtype classification of NSCLC in PET imaging, therefore, this study is a significant cornerstone providing thorough comparisons and evaluations of several deep learning models on metabolic imaging for lung cancer. Even simpler deep learning models are found promising in this domain, indicating that any improvement in deep learning models in machine learning community can be reflected well in this domain as well.Article Citation - WoS: 8Citation - Scopus: 10Lung Cancer Subtype Differentiation From Positron Emission Tomography Images(Tubitak Scientific & Technological Research Council Turkey, 2020-01-27) Ayyildiz, Oguzhan; Aydin, Zafer; Yilmaz, Bulent; Karacavus, Seyhan; Senkaya, Kubra; Icer, Semra; Kaya, Eser; Taşdemir, ArzuLung cancer is one of the deadly cancer types, and almost 85% of lung cancers are nonsmall cell lung cancer (NSCLC). In the present study we investigated classification and feature selection methods for the differentiation of two subtypes of NSCLC, namely adenocarcinoma (ADC) and squamous cell carcinoma (SqCC). The major advances in understanding the effects of therapy agents suggest that future targeted therapies will be increasingly subtype specific. We obtained positron emission tomography (PET) images of 93 patients with NSCLC, 39 of which had ADC while the rest had SqCC. Random walk segmentation was applied to delineate three-dimensional tumor volume, and 39 texture features were extracted to grade the tumor subtypes. We examined 11 classifiers with two different feature selection methods and the effect of normalization on accuracy. The classifiers we used were the k-nearest-neighbor, logistic regression, support vector machine, Bayesian network, decision tree, radial basis function network, random forest, AdaBoostM1, and three stacking methods. To evaluate the prediction accuracy we performed a leave-one-out cross-validation experiment on the dataset. We also considered optimizing certain hyperparameters of these models by performing 10-fold cross-validation separately on each training set. We found that the stacking ensemble classifier, which combines a decision tree, AdaBoostM1, and logistic regression methods by a metalearner, was the most accurate method for detecting subtypes of NSCLC, and normalization of feature sets improved the accuracy of the classification method.Article Citation - WoS: 2Citation - Scopus: 4Deep-Learning AI-Model for Predicting Dental Plaque in the Young Permanent Teeth of Children Aged 8-13 Years(MDPI, 2025-04-07) Tez, Banu Cicek; Guzel, Yasin; Eliacik, Bahar Basak Kiziltan; Aydin, Zafer; Kızıltan Eliaçık, Bahar BaşakBackground/Objectives: Dental plaque is a significant contributor to various prevalent oral health conditions, including caries, gingivitis, and periodontitis. Consequently, its detection and management are of paramount importance for maintaining oral health. Manual plaque assessment is time-consuming, error-prone, and particularly challenging in uncooperative pediatric patients. These limitations have encouraged researchers to seek faster, more reliable methods. Accordingly, this study aims to develop a deep learning model for detecting and segmenting plaque in young permanent teeth and to evaluate its diagnostic precision. Methods: The dataset comprises 506 dental images from 31 patients aged between 8 and 13 years. Six state-of-the-art models were trained and evaluated using this dataset. The U-Net Transformer model, which yielded the best performance, was further compared against three experienced pediatric dentists for clinical feasibility using 35 randomly selected images from the test set. The clinical trial was registered on under the ID NCT06603233 (1 June 2023). Results: The Intersection over Union (IoU) score of the U-Net Transformer on the test set was measured as 0.7845, and the p-values obtained from the three t-tests conducted for comparison with dentists were found to be below 0.05. Compared with three experienced pediatric dentists, the deep learning model exhibited clinically superior performance in the detection and segmentation of dental plaque in young permanent teeth. Conclusions: This finding highlights the potential of AI-driven technologies in enhancing the accuracy and reliability of dental plaque detection and segmentation in pediatric dentistry.Article Citation - WoS: 4Citation - Scopus: 5Comparative Analysis of Machine Learning Approaches for Predicting Respiratory Virus Infection and Symptom Severity(PeerJ Inc, 2023-06-30) Isik, Yunus Emre; Aydin, ZaferRespiratory diseases are among the major health problems causing a burden on hospitals. Diagnosis of infection and rapid prediction of severity without time-consuming clinical tests could be beneficial in preventing the spread and progression of the disease, especially in countries where health systems remain incapable. Personalized medicine studies involving statistics and computer technologies could help to address this need. In addition to individual studies, competitions are also held such as Dialogue for Reverse Engineering Assessment and Methods (DREAM) challenge which is a community-driven organization with a mission to research biology, bioinformatics, and biomedicine. One of these competitions was the Respiratory Viral DREAM Challenge, which aimed to develop early predictive biomarkers for respiratory virus infections. These efforts are promising, however, the prediction performance of the computational methods developed for detecting respiratory diseases still has room for improvement. In this study, we focused on improving the performance of predicting the infection and symptom severity of individuals infected with various respiratory viruses using gene expression data collected before and after exposure. The publicly available gene expression dataset in the Gene Expression Omnibus, named GSE73072, containing samples exposed to four respiratory viruses (H1N1, H3N2, human rhinovirus (HRV), and respiratory syncytial virus (RSV)) was used as input data. Various preprocessing methods and machine learning algorithms were implemented and compared to achieve the best prediction performance. The experimental results showed that the proposed approaches obtained a prediction performance of 0.9746 area under the precision-recall curve (AUPRC) for infection (i.e., shedding) prediction (SC-1), 0.9182 AUPRC for symptom class prediction (SC-2), and 0.6733 Pearson correlation for symptom score prediction (SC-3) by outperforming the best leaderboard scores of Respiratory Viral DREAM Challenge (a 4.48% improvement for SC-1, a 13.68% improvement for SC-2, and a 13.98% improvement for SC-3). Additionally, over-representation analysis (ORA), which is a statistical method for objectively determining whether certain genes are more prevalent in pre-defined sets such as pathways, was applied using the most significant genes selected by feature selection methods. The results show that pathways associated with the 'adaptive immune system' and 'immune disease' are strongly linked to pre-infection and symptom development. These findings contribute to our knowledge about predicting respiratory infections and are expected to facilitate the development of future studies that concentrate on predicting not only infections but also the associated symptoms.Article Citation - WoS: 17Citation - Scopus: 19A Deep Ensemble Approach for Long-Term Traffic Flow Prediction(Springer Heidelberg, 2024-01-27) Cini, Nevin; Aydin, ZaferIn the last 50 years, with the growth of cities and increase in the number of vehicles and mobility, traffic has become troublesome. As a result, traffic flow prediction started to attract attention as an important research area. However, despite the extensive literature, traffic flow prediction still remains as an open research problem, specifically for long-term traffic flow prediction. Compared to the models developed for short-term traffic flow prediction, the number of models developed for long-term traffic flow prediction is very few. Based on this shortcoming, in this study, we focus on long-term traffic flow prediction and propose a novel deep ensemble model (DEM). In order to build this ensemble model, first, we developed a convolutional neural network (CNN), a long short-term memory (LSTM) network and a gated recurrent unit (GRU) network as deep learning models, which formed the base learners. In the next step, we combine the output of these models according to their individual forecasting success. We use another deep learning model to determine the success of the individual models. Our proposed model is a flexible ensemble prediction model that can be updated based on traffic data. To evaluate the performance of the proposed model, we use a publicly available dataset. Experimental results show that the developed DEM model has a mean square error of 0.06 and a mean absolute error of 0.15 for single-step prediction; it shows that achieves a mean square error of 0.25 and a mean absolute error of 0.32 for multi-step prediction. We compared our proposed model with many models in different categories; individual deep learning models (i.e., LSTM, CNN, GRU), selected traditional machine learning models (i.e., linear regression, decision tree regression, k-nearest-neighbors regression) and other ensemble models such as random-forest regression. These results also support the claim that ensemble learning models perform better than individual models.Article Citation - WoS: 13Citation - Scopus: 13A Continuously Benchmarked and Crowdsourced Challenge for Rapid Development and Evaluation of Models to Predict COVID-19 Diagnosis and Hospitalization(Amer Medical Assoc, 2021-10-11) Yan, Yao; Schaffter, Thomas; Bergquist, Timothy; Yu, Thomas; Prosser, Justin; Aydin, Zafer; Mooney, SeanIMPORTANCE Machine learning could be used to predict the likelihood of diagnosis and severity of illness. Lack of COVID-19 patient data has hindered the data science community in developing models to aid in the response to the pandemic. OBJECTIVES To describe the rapid development and evaluation of clinical algorithms to predict COVID-19 diagnosis and hospitalization using patient data by citizen scientists, provide an unbiased assessment of model performance, and benchmark model performance on subgroups. DESIGN, SETTING, AND PARTICIPANTS This diagnostic and prognostic study operated a continuous, crowdsourced challenge using a model-to-data approach to securely enable the use of regularly updated COVID-19 patient data from the University of Washington by participants from May 6 to December 23, 2020. A postchallenge analysis was conducted from December 24, 2020, to April 7, 2021, to assess the generalizability of models on the cumulative data set as well as subgroups stratified by age, sex, race, and time of COVID-19 test. By December 23, 2020, this challenge engaged 482 participants from 90 teams and 7 countries. MAIN OUTCOMES AND MEASURES Machine learning algorithms used patient data and output a score that represented the probability of patients receiving a positive COVID-19 test result or being hospitalized within 21 days after receiving a positive COVID-19 test result. Algorithms were evaluated using area under the receiver operating characteristic curve (AUROC) and area under the precision recall curve (AUPRC) scores. Ensemble models aggregating models from the top challenge teams were developed and evaluated. RESULTS In the analysis using the cumulative data set, the best performance for COVID-19 diagnosis prediction was an AUROC of 0.776 (95% CI, 0.775-0.777) and an AUPRC of 0.297, and for hospitalization prediction, an AUROC of 0.796 (95% CI, 0.794-0.798) and an AUPRC of 0.188. Analysis on top models submitting to the challenge showed consistently better model performance on the female group than the male group. Among all age groups, the best performance was obtained for the 25- to 49-year age group, and the worst performance was obtained for the group aged 17 years or younger. CONCLUSIONS AND RELEVANCE In this diagnostic and prognostic study, models submitted by citizen scientists achieved high performance for the prediction of COVID-19 testing and hospitalization outcomes. Evaluation of challenge models on demographic subgroups and prospective data revealed performance discrepancies, providing insights into the potential bias and limitations in the models.
