Scopus İndeksli Yayınlar Koleksiyonu
Permanent URI for this collectionhttps://hdl.handle.net/20.500.12573/395
Browse
11 results
Search Results
Conference Object Enhancing Complex Disease Group Scoring with Mirgedinet: A Multi-Algorithm Machine Learning Framework Based on the GSM Approach(IEEE, 2025-06-25) Qumsiyeh, Emma; Bakir-Gungor, Burcu; Yousef, MalikIntegrating biological prior knowledge for disease gene associations has shown significant promise in discovering new biomarkers with potential translational applications. This work investigates the application of a multi-algorithm machine learning framework based on the Grouping-Scoring-Modeling (G-S-M) approach for improving the prediction of complex diseases. The study identifies the primary gene and miRNA interactions in various complex diseases with the help of miRGediNET, which is a machine-learning based tool that integrates data from three biological databases. Traditional methods have only focused on independence between features; the G-S-M method focuses on aggregating genes based on biological interactions, pinpointing the scoring of gene groups for a disease, and modeling its predictive capability using advanced machine learning algorithms. In this research paper, seven algorithms, including Support Vector Machine, Decision Tree, and CatBoost, were applied to eight datasets extracted from the GEO database. This framework proved very robust in ranking gene clusters, thus predicting critical biomarkers while doing 100-fold randomized cross-validation within the evaluation. The results indicate this approach's high potential for refining disease and supporting research for choosing the best algorithm that can provide biological insights and computational advances.Conference Object Exploring Microbiome Signatures in Autism Spectrum Disorder via Grouping-Scoring Based Machine Learning(IEEE, 2025-06-25) Temiz, Mustafa; Ersoz, Nur Sebnem; Yousef, Malik; Bakir-Gungor, BurcuThe rapid increase in omic data production increased the importance of machine learning (ML) methods to analze these data. In particular, the use of metagenomic data in the diagnosis, prognosis and treatment of diseases is becoming widespread. Autism Spectrum Disorder (ASD) is a neurodevelopmental disease that occurs in early childhood and continues lifelong. The aim of this study is to increase ML performance, reduce computational costs and achieve successful classification performance using a small number of metagenomic features. In addition, disease prediction is performed; ASD associated biomarkers are determined using the microBiomeGSM on metagenomic data. Classification is performed at three different taxonomic levels (genus, family and order) using the relative abundance values of species. The best performance metric (0.95 AUC) was obtained at the order taxonomic level using an average of 416 features with microBiomeGSM. The identified ASD-related taxonomic species are presented.Conference Object Citation - WoS: 1Citation - Scopus: 1Prediction of Type 2 Diabetes Using Metagenomic Data and Identification of Taxonomic Biomarkers(IEEE, 2024-05-15) Temiz, Mustafa; Kuzudisli, Cihan; Yousef, Malik; Bakir-Gungor, BurcuNowadays, different molecular levels of -omics data on diseases are generated and analyzing these data with machine learning methods is one of the popular research topics. Among these data, the use of metagenomic data to facilitate the diagnosis, detection and treatment of diseases is increasing day by day. Type 2 diabetes (T2D) is a chronic disease characterized by insulin resistance and progressive dysfunction of pancreatic beta cells. While the number of people with diabetes is increasing by around 8% annually, the cost of treating the disease is rising by 18% per year. Therefore, the number of studies on the diagnosis, development and progression of T2D is increasing over time. The aim of this study is to achieve higher machine learning performance by using fewer metagenomic features and to achieve better classification performance by reducing computational costs. In this study, we compare the performance of three different methods using T2D-related metagenomic data. First, the MetaPhlAn tool is used to calculate the taxonomic species and their relative abundances in each sample. The SVM-RCE, RCE-IFE and microBiomeGSM tools used in this study are methods that perform classification by grouping and scoring features and are known to work well on complex datasets. In this study, the best results were obtained with the RCE-IFE tool with an AUC of 0.72 with an average of 125 features information. In addition, key taxonomic species identified by these tools as associated with T2D are presented in comparison to the literature.Conference Object Citation - Scopus: 1NLP-Driven Fake News Detection: A Machine Learning Perspective(IEEE, 2025-05-23) Coban, Mert Korkut; Bakal, GokhanThe rapid spread of fake news poses a significant challenge, impacting public opinion, decision-making, and societal trust. This study explores the application of Natural Language Processing (NLP) and Machine Learning (ML) techniques for robust fake news detection. Using datasets such as ISOT Fake News, WELFake, and Football Fake News, the project employs advanced preprocessing methods and feature extraction techniques, including TF-IDF, Word2Vec, and GloVe. A comprehensive evaluation of machine learning models-Random Forest, Support Vector Machines (SVM), and Neural Networks-was conducted to identify the optimal configuration. Results demonstrate that Random Forest with TF-IDF excels in in-domain detection, achieving an F1-score of 99.70%, while Neural Networks paired with Word2Vec and GloVe embeddings outperform in cross-dataset scenarios. The study highlights the importance of dataset size, domain relevance, and feature representation in achieving high generalizability. These findings provide a scalable framework for combating misinformation on digital platforms.Conference Object Metagenomic Data Analysis With Machine Learning to Discover Colorectal Cancer-Associated Enzymes(IEEE, 2024-05-15) Ersoz, Nur Sebnem; Kuzudisli, Cihan; Yousef, Malik; Bakir-Gungor, BurcuThe human gut microbiome comprises over 10 trillion microbes and plays important roles in maintaining metabolism, body homeostasis, impacting immune function. Metagenomics which studies genomic data from clinical and environmental samples is crucial in understanding the interplay between the host and the gut microbiome. Recently, functional profiling of metagenomes helps to identify alterations in microbial functions, particularly enzyme-encoding genes. Colorectal cancer (CRC) is known as one of the leading causes of cancer-related deaths. In this study, we aimed to find the CRC-associated enzymes by analyzing metagenomic data with different machine learning methods. A total of 1262 samples including CRC and control groups from different countries were used in this study. This dataset was obtained by functionally profiling metagenomics data and estimating community level enzyme commission (EC) abundance values. For the analysis of this dataset, RCE-IFE and SVM-RCE machine learning methods, which are group-based feature selection methods, were compared with 6 different individual feature selection methods. 10 times Monte-Carlo Cross Validation was used in our experiments. It was observed that RCE-IFE, Extreme Gradient Boosting and Select K Best methods similarly provided the best performances. Especially in this study, besides the its high performance, the group-based feature selection method RCE-IFE grouped enzymes into clusters unlike TFS, and then identified biologically relevant CRC-associated enzymes.Conference Object Citation - WoS: 3Citation - Scopus: 3Examining Tongue Movement Intentions in EEG With Machine and Deep Learning: An Approach for Dysphagia Rehabilitation(IEEE, 2024-08-26) Aslan, Sevgi Gokce; Yilmaz, BulentDysphagia, a common swallowing disorder particularly prevalent among older adults and often associated with neurological conditions, significantly affects individuals' quality of life by negatively impacting their eating habits, physical health, and social interactions. This study investigates the potential of brain-computer interface (BCI) technologies in dysphagia rehabilitation, focusing specifically on motor imagery paradigms based on EEG signals and integration with machine learning and deep learning methods for tongue movement. Traditional machine learning classifiers, such as K-Nearest Neighbors (KNN), Support Vector Machine (SVM), Decision Tree, Naive Bayes, Random Forest, AdaBoost, Bagging, Kernel, and Neural Network were employed in discrimination of rest and imagination phases of EEG signals obtained from 30 healthy subjects. Scalogram images obtained using continuous wavelet transform of EEG signals corresponding to the rest and imagination phases of the experiment were used as the input images to the CNN architecture. As a result, KNN and SVM, exhibited lower accuracy rates compared to ensemble methods like AdaBoost and Random Forest, which are effective in handling complex datasets. Additionally, a deep learning approach achieved an accuracy rate of 83%. Overall, this study demonstrates the promising role of BCI technologies and machine learning techniques in dysphagia rehabilitation.Conference Object Data-Driven Local Control Design for Dead Band Control of Load Tap Changers(IEEE, 2024-09-02) Savasci, Alper; Ceylan, Oguzhan; Paudyal, SumitThis study presents an off-line optimization-guided machine learning approach for coordinating the local control rules of on-load tap changers (OLTCs) and step-voltage regulators (SVRs). Based on a bang-bang control rule, these legacy devices autonomously regulate the feeder voltage around the nominal level by varying the tap position in the lower or raise direction. The characterizing parameter of the local control rule is the dead band, which affects the number of tap switching in operation and is directly related to the economical use life of the equipment. The bandwidth is typically set within a standard voltage range and is generally kept constant in daily operation. However, adjusting the bandwidth dynamically can prevent excessive tap switching while maintaining satisfactory voltage regulation for varying loading and distributed generation conditions. Our approach aims to set the bandwidth parameter systematically and efficiently through a machine learning-based scheme, which is trained with a dataset formed by solving the distribution network optimal power flow (DOPF) problem. The performance of learning the bandwidth parameter is demonstrated on the modified 33-node feeder, which is promising for integrated voltage control schemes.Conference Object Citation - WoS: 2Citation - Scopus: 2Classification of Breast Cancer Molecular Subtypes With Grouping-Scoring Approach That Incorporates Disease-Disease Association Information(IEEE, 2024-05-15) Qumsiyeh, Emma; Bakir-Gungor, Burcu; Yousef, MalikThis study uses modern sequencing technology and large biological databases to investigate the molecular intricacies of complicated diseases like cancer. Using gene expression databases and biomarkers, the research aims to improve breast cancer molecular subtype identification for better patient outcomes. Using BRCA LumAB_ Her2Basal dataset, this study compares an integrative machine learning-based strategy (GediNET) to traditional feature selection approaches across machine learning classifiers. GediNET excels at uncovering crucial disease-disease connections and potential biomarkers using the Grouping-Scoring-Modeling (GSM) approach, which favors gene groupings above individual genes. Our comparative analysis highlights GediNET's exceptional performance, notably in terms of accuracy and Area Under the Curve metrics, underscoring its effectiveness in uncovering the genetic intricacies of breast cancer. GediNET's promise to improve disease classification and biomarker identification by improving biological mechanism understanding goes beyond exceeding traditional approaches. The work shows that GediNET's integrative method can promote bioinformatics research by identifying the most informative genes associated with certain diseases, enabling focused and customized medicine.Conference Object Citation - Scopus: 3A Hybrid Adaptive Neuro-Fuzzy Inference System (Anfis) Approach for Professional Bloggers Classification(IEEE, 2019-11) Asim, Yousra; Raza, Basit; Malik, Ahmad Kamran; Shahid, Ahmad R.; Faheem, Muhammad; Kumar, Yogan JayaDespite their small numbers, some users of the online social networks demonstrate the ability to influence others. Bloggers are one of such kind of users that through their ideas and opinions on different topics, influence other users. Their identification may be beneficial for several purposes, such as online marketing for products. Much effort has been expanded towards finding the impact of such bloggers within the blogging community. We have expanded on their work by identifying influential bloggers using labeled data. We have improved upon the accuracy of the classification of professional and non-professional bloggers. We have made use of Adaptive Neuro-Fuzzy Inference System (ANFIS), and the Fuzzy Inference System (FIS) models. Their performance has been gauged and compared with the existing techniques and approaches, such as an Artificial Neural Network (ANN), Alternating Decision Tree (ADTree) algorithm, and Classification Based on Associations (CBA) algorithm. Adaptive techniques (ANFIS and ANN) are found better than the aforementioned rule-based classifiers. The FIS model outperformed the CBA algorithm, but showed similar performance to the ADTree algorithm. Our proposed ANFIS model showed improved results in terms of performance measures with 93% accuracy for blogger classification.Conference Object Citation - Scopus: 1A Comprehensive Investigation into Strip Steel Defect Detection Using Traditional Machine Learning and Deep Learning Models(IEEE, 2025-05-23) Erkantarci, Betul; Kurban, Rifat; Bakal, Mehmet Gokhan; Kose, AbdulkadirThe steel manufacturing sector places great importance on guaranteeing the quality of strip steel products, which has led to a thorough investigation of defect detection approaches. This work conducts a comparative analysis of traditional machine learning and deep learning models to determine their efficacy in detecting defects in strip steel. Our analysis is based on a dataset that includes a variety of images of strip steel surfaces showing different types of defects. In this work, we adopt image preprocessing techniques to improve the quality of input images prior to the application of classification methods. We employ traditional ML algorithms including Support Vector Machine and Random Forest, and deep learning model AlexNet Convolutional Neural Networks for effective defect classification. Consequently, we present comparative evaluations that highlight the strengths and weaknesses of each approach, considering accuracy scores.
