Scopus Indexed Publications Collection

Permanent URI for this collection: https://hdl.handle.net/20.500.12573/395

Search Results

Now showing 1 - 10 of 78
  • Conference Object
    Impact of Gene Duplicate Handling Strategies on Classification Performance and Feature Selection in Gene Expression Data
    (Institute of Electrical and Electronics Engineers Inc., 2025-09-17) Kuzudisli, Cihan; Qaqish, Bahjat; Gungor, Burcu Bakir; Yousef, Malik
  • Conference Object
    Citation - Scopus: 3
    Examining Tongue Movement Intentions in EEG with Machine and Deep Learning: An Approach for Dysphagia Rehabilitation
    (European Signal Processing Conference, EUSIPCO, 2024-08-26) Aslan, Sevgi Gökçe; Yılmaz, Bülent
  • Article
    Supervised Learning-Driven Dead Band Control of Occupant Thermostats for Energy-Efficient Residential HVAC
    (Elsevier, 2026-03) Savasci, Alper; Ceylan, Oguzhan; Paudyal, Sumit
    Heating, ventilation, and air conditioning (HVAC) systems play a crucial role in demand-side management (DSM) by shaping residential electricity consumption and enabling flexible, grid-responsive operation. Thermostats in HVAC systems regulate indoor temperature as part of a closed-loop control framework, typically incorporating a fixed temperature dead band (a range around the setpoint where no action is taken) to reduce energy use and prevent frequent cycling of the HVAC system. Although essential for efficiency and equipment longevity, fixed dead bands limit adaptability, as dynamically adjusting them under varying environmental conditions remains challenging for occupants. To address this limitation, we propose a machine learning (ML)-based dead band tuning framework that optimally adjusts thermostat settings in real time. The method integrates conventional optimization with data-driven modeling: a mixed-integer linear programming (MILP) model is first used to generate optimal dead band values under measured outdoor temperature records (diverse seasonal weather scenarios), which are then employed to train the ML-based predictor to learn a real-time discrete dead band decision policy that approximates the MILP-optimal hysteresis-aware decisions. Among the evaluated models, Random Forest demonstrates superior predictive performance, achieving a mean squared error (MSE) of 0.0399 and a coefficient of determination (R²) of 95.75%.
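The dead band idea described in the abstract above can be illustrated with a minimal sketch. This is not the authors' MILP or Random Forest pipeline; it only shows, under assumed names (`thermostat_step`, `count_switches`) and an assumed symmetric band around the setpoint for a cooling unit, how a wider dead band reduces on/off cycling.

```python
def thermostat_step(temp_c, setpoint_c, dead_band_c, cooling_on):
    """Return the next on/off state of a cooling unit (hypothetical helper).

    Assumption: the dead band is centred on the setpoint. Inside the band
    no action is taken, so the previous state is kept (hysteresis).
    """
    upper = setpoint_c + dead_band_c / 2.0
    lower = setpoint_c - dead_band_c / 2.0
    if temp_c > upper:
        return True       # too warm: switch cooling on
    if temp_c < lower:
        return False      # cool enough: switch cooling off
    return cooling_on     # inside the dead band: keep the current state


def count_switches(temps_c, setpoint_c, dead_band_c):
    """Count on/off transitions over a temperature trace, starting from off."""
    state = False
    switches = 0
    for temp_c in temps_c:
        new_state = thermostat_step(temp_c, setpoint_c, dead_band_c, state)
        if new_state != state:
            switches += 1
        state = new_state
    return switches


# Illustrative trace oscillating around a 24 °C setpoint (made-up values).
trace = [23.8, 24.3, 23.8, 24.3, 23.8]
narrow = count_switches(trace, setpoint_c=24.0, dead_band_c=0.2)
wide = count_switches(trace, setpoint_c=24.0, dead_band_c=2.0)
```

With the narrow band the unit toggles on every swing of the trace, while the wide band absorbs the same fluctuations without switching. A learned policy of the kind the abstract describes would choose the band width per time step instead of fixing it.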
  • Article
    Citation - WoS: 1
    A Comprehensive Analysis of Acoustic Emission Signals To Distinguish the Different Damage Types for Fiber-Reinforced Polymers: A Review
    (Wiley, 2025-12-03) Yilmaz, Cagatay
    Fiber-reinforced polymers (FRP) attract the attention of key industries, such as aerospace, wind energy, and automotive, as they can reduce the weight of structural components without compromising their mechanical properties. Due to FRP's anisotropic and non-homogeneous structure, their failure under different loading conditions and the corresponding failure mechanisms must be investigated. One method that progressively monitors the failure of FRP under load is Acoustic Emission (AE). AE can register the elastic stress waves, in the form of digitized waveforms, released by the discontinuous events that occur in the FRP under load. These discontinuities can be clustered and identified as transverse cracking, fiber/matrix interface debonding, delamination, and fiber failure by analyzing the AE waveforms. Recently, numerous clustering approaches using machine learning algorithms, along with the varying features of AE waveforms, have been developed and are being used. These algorithms include supervised and unsupervised clustering, deep learning algorithms, and neural network methods, among others. While supervised algorithms require a training dataset to classify AE signals, unsupervised algorithms can perform clustering without training datasets. Deep learning and neural network algorithms can train themselves to cluster data, but they may require a significant amount of computing power when the dataset is large. This review paper provides comprehensive information on the clustering algorithms, along with the AE wave features, the range of features for different damage types, and the type of reinforcer.
  • Conference Object
    Enhancing Complex Disease Group Scoring with Mirgedinet: A Multi-Algorithm Machine Learning Framework Based on the GSM Approach
    (IEEE, 2025-06-25) Qumsiyeh, Emma; Bakir-Gungor, Burcu; Yousef, Malik
    Integrating biological prior knowledge for disease gene associations has shown significant promise in discovering new biomarkers with potential translational applications. This work investigates the application of a multi-algorithm machine learning framework based on the Grouping-Scoring-Modeling (G-S-M) approach for improving the prediction of complex diseases. The study identifies the primary gene and miRNA interactions in various complex diseases with the help of miRGediNET, which is a machine-learning based tool that integrates data from three biological databases. Traditional methods have only focused on independence between features; the G-S-M method focuses on aggregating genes based on biological interactions, pinpointing the scoring of gene groups for a disease, and modeling its predictive capability using advanced machine learning algorithms. In this research paper, seven algorithms, including Support Vector Machine, Decision Tree, and CatBoost, were applied to eight datasets extracted from the GEO database. This framework proved very robust in ranking gene clusters, thus predicting critical biomarkers while doing 100-fold randomized cross-validation within the evaluation. The results indicate this approach's high potential for refining disease and supporting research for choosing the best algorithm that can provide biological insights and computational advances.
  • Conference Object
    Exploring Microbiome Signatures in Autism Spectrum Disorder via Grouping-Scoring Based Machine Learning
    (IEEE, 2025-06-25) Temiz, Mustafa; Ersoz, Nur Sebnem; Yousef, Malik; Bakir-Gungor, Burcu
    The rapid increase in omic data production has increased the importance of machine learning (ML) methods to analyze these data. In particular, the use of metagenomic data in the diagnosis, prognosis and treatment of diseases is becoming widespread. Autism Spectrum Disorder (ASD) is a neurodevelopmental disease that occurs in early childhood and continues lifelong. The aim of this study is to increase ML performance, reduce computational costs and achieve successful classification performance using a small number of metagenomic features. In addition, disease prediction is performed, and ASD-associated biomarkers are determined using microBiomeGSM on metagenomic data. Classification is performed at three different taxonomic levels (genus, family and order) using the relative abundance values of species. The best performance metric (0.95 AUC) was obtained at the order taxonomic level using an average of 416 features with microBiomeGSM. The identified ASD-related taxonomic species are presented.
  • Conference Object
    Ensemble Churn Prediction for Internet Service Provider with Machine Learning Techniques
    (IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA, 2020) Goy, Gokhan; Kolukisa, Burak; Bahcevan, Cenk; Gungor, Vehbi Cagri
    With technology developing in every field, a competitive marketing environment has arisen. In this competitive environment, analyzing customer behavior has become vital. In particular, the ease with which customers can switch service providers has become very critical for a company's continued existence. At the same time, the financial resources spent on retaining existing customers are much lower than those needed to acquire new clients. In this context, traditional methods of examining the vast amounts of data obtained today to establish decision support systems have lost their validity. In this study, we used a dataset provided by TurkNet, an internet service provider in Turkey. Various preprocessing steps were performed on this dataset, and then classification algorithms were run. Afterwards, the results were obtained and compared. The results of these experiments were analyzed in terms of the area under the curve (AUC) value. In this context, the most successful classifier was determined to be the Random Trees algorithm, with a value of 0.936.
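Since the abstract above compares classifiers by area under the curve, a small sketch of how that metric can be computed may help. It uses the pairwise-ranking definition of ROC AUC (the fraction of positive/negative pairs in which the positive example receives the higher score, ties counting half). The function name `roc_auc` and the toy scores are assumptions for illustration and are unrelated to the TurkNet dataset.

```python
def roc_auc(scores, labels):
    """ROC AUC from the pairwise-ranking definition.

    scores: predicted churn scores (higher means more likely to churn).
    labels: 1 for churners (positives), 0 for non-churners (negatives).
    Runs in O(P * N) time, which is fine for a small illustration.
    """
    positives = [s for s, y in zip(scores, labels) if y == 1]
    negatives = [s for s, y in zip(scores, labels) if y == 0]
    if not positives or not negatives:
        raise ValueError("AUC needs at least one positive and one negative")
    wins = 0.0
    for p in positives:
        for n in negatives:
            if p > n:
                wins += 1.0   # positive ranked above negative
            elif p == n:
                wins += 0.5   # ties count as half
    return wins / (len(positives) * len(negatives))


# Toy example: one churner is ranked below one non-churner.
toy_auc = roc_auc([0.9, 0.8, 0.3, 0.1], [1, 0, 1, 0])
```

An AUC of 1.0 means every churner is scored above every non-churner, and 0.5 corresponds to random ranking.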
  • Conference Object
    Citation - Scopus: 2
    miRcorrNetPro: Unraveling Algorithmic Insights Through Cross-Validation in Multi-Omics Integration for Comprehensive Data Analysis
    (Institute of Electrical and Electronics Engineers Inc., 2023-12-05) Ünlü Yazici, Miray; Yousef, Malik; Marron, J. S.; Bakir-Güngör, Burcu
    High-throughput -omics technologies facilitate the investigation of regulatory mechanisms of complex diseases. Along this line, scientists develop promising tools and methods to extend our understanding at the molecular and functional levels. To this end, the miRcorrNet tool performs integrative analysis of microRNA (miRNA) and gene expression profiles via a machine learning (ML) approach to identify significant miRNA groups and their associated target genes. In this study, we propose the miRcorrNetPro tool, which extends miRcorrNet by tracking group scoring, ranking and other information through the cross-validation iterations. Heatmap visualizations enable deep novel insights into the collective behavior of clusters of groups in cellular signaling and hence facilitate detection of potential biomarkers for the disease under investigation. Although miRcorrNetPro is designed as a generic tool, here we present our findings and potential miRNA biomarkers for Breast Cancer (BRCA). The miRcorrNetPro tool and all other supplementary files are available at https://github.com/Miray-Unlu/miRcorrNetPro. © 2024 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - Scopus: 1
    Words Speak Louder Than Actions: Decoding Emotions Through NLP
    (Institute of Electrical and Electronics Engineers Inc., 2024-10-26) Paksoy, Melda; Bakal, Gokhan
    Emotion detection in text remains a significant challenge in Natural Language Processing due to the complexity and subtle nuances of human emotions. This paper presents multiple experimental models for emotion classification using an up-to-date dataset curated to address 13 emotions implied in Twitter posts. We evaluated various machine learning (ML) models, including Logistic Regression, Random Forest, SVM, and XGBoost, alongside deep learning (DL) architectures such as LSTM and CNN. Our results demonstrate the efficacy of deep learning models, particularly the CNN model, which achieved an F1 score of 0.99. This study contributes to emotion detection capabilities, paving the way for more nuanced and accurate sentiment analysis (SA) in various text analysis applications. © 2025 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - Scopus: 1
    The Identification of Discriminative Single Nucleotide Polymorphism Sets for the Classification of Behçet's Disease
    (Institute of Electrical and Electronics Engineers Inc., 2018-09) Görmez, Yasin; Işik, Yunus Emre; Bakir-Güngör, Burcu
    Behçet's disease is a long-term multisystem inflammatory disorder characterized by recurrent attacks affecting several organs. As genotyping individuals has become cheaper and easier following developments in genomic technologies, genome-wide association studies (GWAS) have emerged. In these studies, potential genetic variations, single nucleotide polymorphisms (SNPs), are identified by studying large case-control groups for a specific disease. Although several genetic risk factors have been identified for Behçet's disease with the help of these studies by scanning around a million SNPs, these variations can explain only up to 20% of the disease's genetic risk. In this study, for Behçet's disease classification, we compare all the SNPs genotyped in GWAS with the SNPs selected using genetic knowledge, gain ratio, and information gain, aiming at both a reduction in feature size and an improvement in classification accuracy. We also investigate the effects of different classification algorithms, such as random forest, k-nearest neighbour, and logistic regression, on classification accuracy. Our results showed that, compared to the other feature selection methods, selecting SNPs using genetic information (their GWAS p-values, which indicate the significance of the SNP against the disease) achieves a success rate of at least 81% and provides a 15% to 42% improvement in all classification algorithms. This improvement is statistically sound. While the gain ratio and information gain feature selection techniques yield similar classification accuracies, the models using all SNPs could not exceed 50% accuracy and resulted in the worst performance. © 2019 Elsevier B.V., All rights reserved.
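The two filter criteria named in the abstract above, information gain and gain ratio, can be sketched for a single categorical feature as follows. The genotype coding (0/1/2), the function names, and the toy data are assumptions for illustration; the paper's actual feature selection tooling is not specified in this listing.

```python
import math
from collections import Counter


def entropy(values):
    """Shannon entropy (in bits) of a list of categorical values."""
    n = len(values)
    return -sum((c / n) * math.log2(c / n) for c in Counter(values).values())


def information_gain(feature, labels):
    """Reduction in label entropy after splitting on the feature's values."""
    n = len(labels)
    conditional = 0.0
    for value in set(feature):
        subset = [y for x, y in zip(feature, labels) if x == value]
        conditional += (len(subset) / n) * entropy(subset)
    return entropy(labels) - conditional


def gain_ratio(feature, labels):
    """Information gain normalised by the entropy of the feature itself."""
    split_info = entropy(feature)
    if split_info == 0:
        return 0.0  # a constant feature carries no information
    return information_gain(feature, labels) / split_info


# Toy data: genotype codes for two hypothetical SNPs across four individuals.
labels = ["case", "case", "control", "control"]
informative_snp = [0, 0, 2, 2]    # separates cases from controls perfectly
uninformative_snp = [0, 2, 0, 2]  # independent of the label

ig_good = information_gain(informative_snp, labels)
ig_bad = information_gain(uninformative_snp, labels)
gr_good = gain_ratio(informative_snp, labels)
```

Ranking SNPs by either score and keeping the top-scoring ones is one common way to reduce feature size before training a classifier. Gain ratio additionally penalises features that split the data into many small groups.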