Browsing by Author "Bakir-Güngör, Burcu"
Now showing 1 - 20 of 34
- Results Per Page
- Sort Options
Article Citation - Scopus: 3Aguhyper: a Hyperledger-Based Electronic Health Record Management Framework(PeerJ Inc., 2024) Dedeturk, Beyhan Adanur; Bakir-Güngör, Burcu; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiThe increasing importance of healthcare records, particularly given the emergence of new diseases, emphasizes the need for secure electronic storage and dissemination. With these records dispersed across diverse healthcare entities, their physical maintenance proves to be excessively time-consuming. The prevalent management of electronic healthcare records (EHRs) presents inherent security vulnerabilities, including susceptibility to attacks and potential breaches orchestrated by malicious actors. To tackle these challenges, this article introduces AguHyper, a secure storage and sharing solution for EHRs built on a permissioned blockchain framework. AguHyper utilizes Hyperledger Fabric and the InterPlanetary Distributed File System (IPFS). Hyperledger Fabric establishes the blockchain network, while IPFS manages the off-chain storage of encrypted data, with hash values securely stored within the blockchain. Focusing on security, privacy, scalability, and data integrity, AguHyper’s decentralized architecture eliminates single points of failure and ensures transparency for all network participants. The study develops a prototype to address gaps identified in prior research, providing insights into blockchain technology applications in healthcare. Detailed analyses of system architecture, AguHyper’s implementation configurations, and performance assessments with diverse datasets are provided. The experimental setup incorporates CouchDB and the Raft consensus mechanism, enabling a thorough comparison of system performance against existing studies in terms of throughput and latency. This contributes significantly to a comprehensive evaluation of the proposed solution and offers a unique perspective on existing literature in the field. © 2024 Elsevier B.V., All rights reserved.Conference Object Citation - Scopus: 2ATGRUVAE: Reducing Noise and Improving Forecasting Performance in Stock Data(Institute of Electrical and Electronics Engineers Inc., 2024) Akkaş, Huseyin; Kolukisa, Burak; Bakir-Güngör, Burcu; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiNowadays, to maximize their income, investors and researchers try to predict the future prices of stocks in the market using artificial intelligence algorithms. However, noise in stock price fluctuations negatively a ffects t he accuracy of the forecasts. To this end, Attention Based Variational Autoencoders with Gated Recurrent Units (ATGRUVAE) method is developed to remove the noise in stock price fluctuations a nd compared with variational, basic and noise removing autoencoders. Exper-iments are conducted using historical stock prices of well-known companies such as Apple, Google and Amazon and 9 different indicator values derived from these stock prices. The noise cleaned stocks are then trained and tested on Extreme Gradient Boosting (XGBoost), Long Short-Term Memory (LSTM) and Linear Regression (LR) models. The results show that the proposed ATGRUVAE model outperforms all three models and demonstrates its ability to capture complex patterns in stock market data. © 2025 Elsevier B.V., All rights reserved.Article Citation - Scopus: 4CCPred: Global and Population-Specific Colorectal Cancer Prediction and Metagenomic Biomarker Identification at Different Molecular Levels Using Machine Learning Techniques(Elsevier Ltd, 2024) Bakir-Güngör, Burcu; Temiz, Mustafa; Inal, Yasin; Cicekyurt, Emre; Yousef, Malik; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiColorectal cancer (CRC) ranks as the third most common cancer globally and the second leading cause of cancer-related deaths. Recent research highlights the pivotal role of the gut microbiota in CRC development and progression. Understanding the complex interplay between disease development and metagenomic data is essential for CRC diagnosis and treatment. Current computational models employ machine learning to identify metagenomic biomarkers associated with CRC, yet there is a need to improve their accuracy through a holistic biological knowledge perspective. This study aims to evaluate CRC-associated metagenomic data at species, enzymes, and pathway levels via conducting global and population-specific analyses. These analyses utilize relative abundance values from human gut microbiome sequencing data and robust classification models are built for disease prediction and biomarker identification. For global CRC prediction and biomarker identification, the features that are identified by SelectKBest (SKB), Information Gain (IG), and Extreme Gradient Boosting (XGBoost) methods are combined. Population-based analysis includes within-population, leave-one-dataset-out (LODO) and cross-population approaches. Four classification algorithms are employed for CRC classification. Random Forest achieved an AUC of 0.83 for species data, 0.78 for enzyme data and 0.76 for pathway data globally. On the global scale, potential taxonomic biomarkers include ruthenibacterium lactatiformanas; enzyme biomarkers include RNA 2′ 3′ cyclic 3′ phosphodiesterase; and pathway biomarkers include pyruvate fermentation to acetone pathway. This study underscores the potential of machine learning models trained on metagenomic data for improved disease prediction and biomarker discovery. The proposed model and associated files are available at https://github.com/TemizMus/CCPRED. © 2024 Elsevier B.V., All rights reserved.Conference Object A Comparative Study on Psychiatric Disorders: Identification of Shared Pathways and Common Agents(Institute of Electrical and Electronics Engineers Inc., 2022) Kuzudisli, Cihan; Bakir-Güngör, Burcu; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiDistinct but closely related diseases generally present shared symptoms, which address possible overlaps among their pathogenic mechanisms. Identification of significantly impacted shared pathways and other common agents are expected to elucidate etiology of these disorders and to help design better intervention strategies. In this research effort, we studied six psychiatric disorders including schizophrenia (SCZ), anorexia (AN), bipolar disorder (BD), depressive disorder (DD), autism (AU) and attention deficit hyperactivity disorder (ADHD). Our methodology can be classified into the following two parts: In Part I, common susceptibility genes; and in Part II, genome-wide association studies (GWAS) data were used to find enriched pathways of psychiatric disorders. 59 KEGG pathways were commonly identified in both parts. 31 of these pathways are disease pathways. Pathways related to cancer and infectious diseases were predominant compared to others. Most of the acquired pathways were in accordance with previous studies in literature. A combination of susceptibility genes and GWAS data is an effective approach to identify significantly impacted pathways in multifactorial diseases. In this respect, shared modules were determined after applying hierarchical clustering of the enriched pathways. These identified modules may tell us the association of psychiatric disorders with the enriched pathways. Taken all together, common pathways and shared modules are expected to highlight the causative factors and important mechanisms behind complex psychiatric diseases, leading to effective drug discovery. © 2022 Elsevier B.V., All rights reserved.Conference Object Comparison of Disease Specific Sub-Network Identification Programs(Institute of Electrical and Electronics Engineers Inc., 2018) Dedeturk, Beyhan Adanur; Bakir-Güngör, Burcu; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiActive sub-network search aims to identify a group of interconnected genes in a protein-protein interaction network that contains most of the disease-associated genes. In recent years, to address active sub-network search problem, various algorithms and programs are developed. In this study, performances of disease specific sub-network identification programs are compared. The same input dataset is run in jActiveModules, ActiveSubnetworkGA, CytoHubba, ClusterViz, MCODE, CytoMOBAS, PathFindR, PINBPA and PEWCC programs. Then, functional enrichment analysis is applied on obtained sub-networks. Finally, they are compared according to the results of GO Enrichment Analysis. In addition to these, work performances, features and requirements of programs are compared. © 2019 Elsevier B.V., All rights reserved.Article Citation - Scopus: 4CSA-DE-LR Enhancing Cardiovascular Disease Diagnosis With a Novel Hybrid Machine Learning Approach(PeerJ Inc., 2024) Dedeturk, Beyhan Adanur; Dedeturk, Bilge Kagan; Bakir-Güngör, Burcu; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiCardiovascular diseases (CVD) are a leading cause of mortality globally, necessitating the development of efficient diagnostic tools. Machine learning (ML) and metaheuristic algorithms have become prevalent in addressing these challenges, providing promising solutions in medical diagnostics. However, traditional ML approaches often need to be improved in feature selection and optimization, leading to suboptimal performance in complex diagnostic tasks. To overcome these limitations, this study introduces a new hybrid method called CSA-DE-LR, which combines the clonal selection algorithm (CSA) and differential evolution (DE) with logistic regression. This integration is designed to optimize logistic regression weights efficiently for the accurate classification of CVD. The methodology employs three optimization strategies based on the F1 score, the Matthews correlation coefficient (MCC), and the mean absolute error (MAE). Extensive evaluations on benchmark datasets, namely Cleveland and Statlog, reveal that CSA-DELR outperforms state-of-the-art ML methods. In addition, generalization is evaluated using the Breast Cancer Wisconsin Original (WBCO) and Breast Cancer Wisconsin Diagnostic (WBCD) datasets. Significantly, the proposed model demonstrates superior efficacy compared to previous research studies in this domain. This study’s findings highlight the potential of hybrid machine learning approaches for improving diagnostic accuracy, offering a significant advancement in the fields of medical data analysis and CVD diagnosis. © 2024 Elsevier B.V., All rights reserved.Conference Object The Effect of Different Classifiers on Recursive Cluster Elimination in the Analysis of Transcriptomic Data(Institute of Electrical and Electronics Engineers Inc., 2023) Bulut, Nurten; Bakir-Güngör, Burcu; Qaqish, Bahjat F.; Yousef, Malik; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiGene expression data with limited sample size and a large number of genes are frequently encountered in genetic studies. In such high-dimensional data, identification of genes that distinguish between disease states is a challenging task. Feature selection (FS) is a useful approach in dealing with high dimensionality. Support Vector Machines Recursive Cluster Elimination (SVM-RCE) is a technique for FS in high-dimensional data. The SVM-RCE approach has been utilized for identification of clusters of genes whose expression levels correlate with pathological state. A key step in SVM-RCE is the use of an SVM classifier to assign an area under the curve (AUC) score to each gene cluster based on its ability to predict class labels. In this study, we investigate the use of alternative classifiers in the cluster-scoring step. Specifically, we compare Support Vector Machines, Random Forest, XgBoost, Naive Bayes, and linear logistic regression. In addition to AUC score performance evaluation, the algorithms are compared in terms of the number of selected genes at different levels of clustering and in terms of the running time. © 2023 Elsevier B.V., All rights reserved.Conference Object Citation - Scopus: 2Effect of Recursive Cluster Elimination With Different Clustering Algorithms Applied to Gene Expression Data(Institute of Electrical and Electronics Engineers Inc., 2023) Kuzudisli, Cihan; Bakir-Güngör, Burcu; Qaqish, Bahjat F.; Yousef, Malik; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiFeature selection (FS) is an effective tool in dealing with high dimensionality and reducing computational cost. Support Vector Machines-Recursive Cluster Elimination (SVM-RCE) is one of several algorithms that have been developed for FS in high dimensional data. SVM-RCE involves a clustering step which originally is k-means. Using various performance metrics, three alternative algorithms are evaluated in this context; k-medoids, Hierarchical Clustering (HC), and Gaussian Mixture Model (GMM). Comparisons will be carried out on five publicly available gene expression datasets. The results show that k-means in SVM-RCE obtains higher performance than other tested algorithms in terms of classification performance. Additionally, HC shows a similar performance to k-means. Our findings show superiority of using k-means. This study can contribute to the development of SVM-RCE with different variations, leading to decrease in the number of selected genes, and an increase in prediction performance. © 2023 Elsevier B.V., All rights reserved.Article Citation - Scopus: 1eTNT: Enhanced Textnettopics With Filtered LDA Topics and Sequential Forward / Backward Topic Scoring Approaches(Science and Information Organization, 2024) Voskergian, Daniel; Jayousi, Rashid; Bakir-Güngör, Burcu; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiTextNetTopics is a novel text classification-based topic modelling approach that focuses on topic selection rather than individual word selection to train a machine learning algorithm. However, one key limitation of TextNetTopics is its scoring component, which evaluates each topic in isolation and ranks them accordingly, ignoring the potential relationships between topics. In addition, the chosen topics may contain redundant or irrelevant features, potentially increasing the feature set size and introducing noise that can degrade the overall model performance. To address these limitations and improve the classification performance, this study introduces an enhancement to TextNetTopics. eTNT integrates two novel scoring approaches: Sequential Forward Topic Scoring (SFTS) and Sequential Backward Topic Scoring (SBTS), which consider topic interactions by assessing sets of topics simultaneously. Moreover, it incorporates a filtering component that aims to enhance topics' quality and discriminative power by removing non-informative features from each topic using Random Forest feature importance values. These integrations aim to streamline the topic selection process and enhance classifier efficiency for text classification. The results obtained from the WOS-5736, LitCovid, and MultiLabel datasets provide valuable insights into the superior effectiveness of eTNT compared to its counterpart, TextNetTopics. © 2024 Elsevier B.V., All rights reserved.Conference Object Evaluating the Impact of Sentiment Analysis on Deep Reinforcement Learning-Based Trading Strategies(Institute of Electrical and Electronics Engineers Inc., 2024) Etcil, Mustafa; Kolukisa, Burak; Bakir-Güngör, Burcu; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiPortfolio optimization is a form of investment management that aims to maximize returns while minimizing risks. However, the inherent complexity and unpredictability of financial markets pose a challenge. Recent advancements in machine learning, particularly in deep reinforcement learning (DRL), offer promising solutions by enabling dynamic and adaptive trading strategies. This paper presents a comprehensive evaluation of three actor-critic-based DRL algorithms-Advantage Actor-Critic (A2C), Deep Deterministic Policy Gradient (DDPG), and Proximal Policy Optimization (PPO)-applied to portfolio optimization. These strategies were implemented in both sentiment-aware and non-sentiment-aware versions, allowing for a direct comparison of their performance. The sentiment-aware models incorporated sentiment analysis using FinBERT and knowledge graphs to measure market sentiment from financial news, while the non-sentiment-aware models relied solely on stock prices and technical indicators. Our comparative study demonstrates that incorporating sentiment analysis resulted in consistently superior risk-adjusted returns and portfolio resilience during market fluctuations compared to non-sentiment-aware strategies. © 2025 Elsevier B.V., All rights reserved.Conference Object Citation - WoS: 22Citation - Scopus: 51Evaluation of Classification Algorithms, Linear Discriminant Analysis and a New Hybrid Feature Selection Methodology for the Diagnosis of Coronary Artery Disease(Institute of Electrical and Electronics Engineers Inc., 2018) Kolukisa, Burak; Hacilar, Hilal; Göy, Gökhan; Kus, Mustafa; Bakir-Güngör, Burcu; Aral, Atilla; Güngör, Vehbi Çağrı; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiAccording to the World Health Organization (WHO), 31% of the world's total deaths in 2016 (17.9 million) was due to cardiovascular diseases (CVD). With the development of information technologies, it has become possible to predict whether people have heart diseases or not by checking certain physical and biochemical values at a lower cost. In this study, we have evalated a set of different classification algorithms, linear discriminant analysis and proposed a new hybrid feature selection methodology for the diagnosis of coronary heart diseases (CHD). Throughout this research effort, using three publicly available Heart Disease diagnosis datasets (UCI Machine Learning Repository), we have conducted comparative performance evaluations in terms of accuracy, sensitivity, specificity, F-measure, AUC and running time. © 2023 Elsevier B.V., All rights reserved.Conference Object Citation - Scopus: 1A Federated Learning Framework for Classifying the Images in Ultrasonic Nondestructive Testing(Institute of Electrical and Electronics Engineers Inc., 2024) Gulsen, Abdulkadir; Hacilar, Hilal; Kolukisa, Burak; Bakir-Güngör, Burcu; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiUltrasonic inspection is a critical technique in non-destructive testing that ensures the safety and integrity of the material by detecting internal defects. Defect classification within this context is vital for preventing failures and extending the lifespan of materials. However, the advancement of ultrasonic testing technology is hindered by a scarcity of publicly available, realistic datasets, which are essential for developing accurate models. To address these challenges, this paper introduces a Federated Learning (FL) framework employing a Convolutional Neural Network (CNN) model for defect classification using ultrasonic inspection images. This innovative approach allows for the decentralized training of models on private datasets without the need for data exchange, thus preserving data privacy. Our comparative analysis demonstrates that the FL achieves performance comparable to traditional methods while maintaining the confidentiality of sensitive information. The framework also proves to be robust and scalable with an increase in the number of participating clients. This pioneering study highlights the potential of FL in transforming ultrasonic defect classification and suggests possibilities for its application in other areas of non-destructive testing where publicly available datasets are scarce. These findings would encourage researchers to develop a federated platform for enhanced collaboration and explore advanced CNN architectures to improve training efficiency. © 2025 Elsevier B.V., All rights reserved.Conference Object Hepatoselüler Karsinom Oluşumunda Etkili Moleküler Mekanizmaların İn Siliko Yöntemlerle Araştırılması(Institute of Electrical and Electronics Engineers Inc., 2020) Doǧan, Refika Sultan; Saka, Samed; Bakir-Güngör, Burcu; 01. Abdullah Gül University; 04. Yaşam ve Doğa Bilimleri Fakültesi; 04.01. Biyomühendislik; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiHepatocellular carcinoma (HCC) is the most common cause of cancer-related death in the world. The molecular changes in the organism during the development of HCC are not fully understood. The aim of the present study is to contribute to the identification of critical genes and pathways associated with HCC via integrating various bioinformatics methods. In this study, experiments were conducted on gene expression data of 14 HCC tissues and noncancerous control tissues. A total of 1229 genes, which show a statistically significant change between the groups, were identified. Among these, 681 genes were upregulated and 548 genes were downregulated. Via mapping the detected genes into protein protein interaction networks, active subnetwork search, subnetwork topological analysis and functional enrichment analyses were carried out. The interactions between the molecular pathways affected by HCC were also presented. © 2020 Elsevier B.V., All rights reserved.Conference Object Citation - Scopus: 1The Identification of Discriminative Single Nucleotide Polymorphism Sets for the Classification of Behçet's Disease(Institute of Electrical and Electronics Engineers Inc., 2018) Görmez, Yasin; Işik, Yunus Emre; Bakir-Güngör, Burcu; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiBehçet's disease is a long-term multisystem inflammatory disorder, characterized by recurrent attacks affecting several organs. As the genotyping individuals get cheaper and easier following the developments in genomic technologies, genome-wide association studies (GWAS) emerged. By this means, via studying big-sized case-control groups for a specific disease, potential genetic variations, single nucleotide polymorphisms (SNPs) are identified. Although several genetic risk factors are identified for Behçet's disease with the help of these studies via scanning around a million of SNPs, these variations could only explain up to 20% of the disease's genetic risk. In this study, for Behçet's disease classification, via comparing all the SNPs genotyped in GWAS, with the SNPs selected via using genetic knowledge, gain ratio and information gain; both reduction in the feature size and improvement in the classification accuracy is aimed. Also, using different classification algorithms such as random forest, k-nearest neighbour and logistic regression, their effects on the classification accuracy are investigated. Our results showed that compared to other feature selection methods, with at least 81% success rate, the selection of the SNPs using the genetic information (of their GWAS p-values, indicating the significance of the SNP against the disease) provides 15% to 42% improvement in all classification algorithms. This improvement is statistically sound. While gain ratio and information gain feature selection techniques yield similar classification accuracies, the models using all SNPs could not exceed 50% accuracies and results in the worst performance. © 2019 Elsevier B.V., All rights reserved.Conference Object Identify Commonly Affected Pathways in Psychiatric Diseases(Institute of Electrical and Electronics Engineers Inc., 2018) Bulut, Umit; Bakir-Güngör, Burcu; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiGenome-wide association studies (GWAS) are an extraordinary source of information when it comes to revealing the common variations of human complex diseases. Until now, the large amount of data generated from these studies have not been shown its full potential enough to identify the molecular and functional framework to be able to understand how a molecular system works. Following a more specific perspective, this study focused on the identification of commonly affected pathways of psychiatric diseases. The pathway term as used in molecular biology, depicts a simplified model of a process within the cell or tissue. Lately, several GWAS datasets are publicly available for various disease types such as psychiatric, immune-related, neurodegenerative, cardiovascular and such. A study on each disease and pairwise comparison to understand the behavior of disease and system would be time consuming and exhaustive. Instead of handling the results of these studies one by one, grouping diseases by target points is a more efficient way. This work aims to get one step closer to reveal key points of diseases and target these points to develop personalized medicine approaches. Especially for complex diseases, every drug doesn't show the same effect in every people. This paper contains the definition of molecular pathways, methods to identify disease related pathways, and to find common pathways pairwise in psychiatric diseases. © 2019 Elsevier B.V., All rights reserved.Conference Object Citation - Scopus: 5Identifying Taxonomic Biomarkers of Colorectal Cancer in Human Intestinal Microbiota Using Multiple Feature Selection Methods(Institute of Electrical and Electronics Engineers Inc., 2022) Jabeer, Amhar; Kocak, Aysegul; Akkaş, Huseyin; Yenisert, Ferhan; Nalbantoĝlu, Özkan Ufuk; Yousef, Malik; Bakir-Güngör, Burcu; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiA variety of bacterial species called gut microbiota work together to maintain a steady intestinal environment. The gastrointestinal tract contains tremendous amount of different species including archaea, bacteria, fungi, and viruses. While these organisms are crucial immune system stabilizers, the dysbiosis of the intestinal flora has been related to gastrointestinal disorders including Colorectal cancer (CRC), intestinal cancer, irritable bowel syndrome and inflammatory bowel disease. In the last decade, next-generation sequencing (NGS) methods have accelerated the identification of human gut flora. CRC is a deathly condition that has been on the rise in the last century, affecting half a million people each year. Since early CRC diagnosis is critical for an effective treatment, there is an immediate requirement for a classification system that can expedite CRC diagnosis. In this study, via analyzing the available metagenomics data on CRC, we aim to facilitate the CRC diagnosis via finding biomarkers linked with CRC, and via building a classification model. We have obtained the metagenomic sequencing data of the healthy individuals and CRC patients from a metagenome-wide association analysis and we have classified this data according to the disease stages. Conditional Mutual Information Maximization (CMIM), Fast Correlation Based Filter (FCBF), Extreme Gradient Boosting (XGBoost), min redundancy max relevance (mRMR), Information Gain (IG) and Select K Best (SKB) feature selection algorithms were utilized to cope with the complexity of the features. We observed that the SKB, IG, and XGBoost techniques made significant contributions to decrease the microbiota in use for CRC diagnosis, thereby reducing cost and time. We realized that our Random Forest classifier outperformed Adaboost, Support Vector Machine, Decision Tree, Logitboost and stacking ensemble classifiers in terms of CRC classification performance. Our results reiterated some known and some potential microbiome associated mechanisms in CRC, which could aid the design of new diagnostics based on the microbiome. © 2022 Elsevier B.V., All rights reserved.Conference Object Citation - Scopus: 1Integrative Analyses in Omics Data: Machine Learning Perspective(Deutsche Gesellschaft fur Medizinische Informatik, Biometrie und Epidemiologie e.V., 2023) Ünlü Yazici, Miray; Bakir-Güngör, Burcu; Yousef, Malik; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik Fakültesi; 04. Yaşam ve Doğa Bilimleri Fakültesi; 04.01. BiyomühendislikDevelopments in the high throughput technologies have enabled the production of an immense amount of knowledge at the multi-omics level. Considering complex diseases which are affected by multi-factors, single omics datasets might not be sufficient to unveil the molecular mechanisms of heterogeneous diseases. Providing a comprehensive and systematic overview to explain disease hallmarks in significant depth is critical. Utilizing multi-omics datasets has led to the development of a variety of tools and platforms. Machine learning models are utilized in a wide variety of tools to tackle the complexity of disorders and to identify new biomolecular signatures and potential markers. Underlying aspects of these approaches are based on training the models for making predictions and classification of the given data. In this review, we describe current machine learning-based approaches and available implementations. Challenges in the enlightenment of disease mechanisms of onset and progression and future development of the field of medicine will be discussed. The prominence of biological interpretation of model output with corresponding biological knowledge will be also covered in this review. © 2023 Elsevier B.V., All rights reserved.Conference Object Citation - WoS: 1Citation - Scopus: 1Kolon Kanserinde Etkilenen Yolak Alt Ağlarini Ve Kümelenmelerini Belirlemek için Yeni Bir Yöntem(Institute of Electrical and Electronics Engineers Inc., 2019) Göy, Gökhan; Ünlü Yazici, Miray; Bakir-Güngör, Burcu; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik Fakültesi; 04. Yaşam ve Doğa Bilimleri Fakültesi; 04.01. BiyomühendislikNowadays new technological developments that play an important role in the production of big data have brought about the interpretation, sharing and storage of data related to complex diseases. Combining multi-omic data in different molecular levels is potentially important for understanding the biological origin of complex diseases. One of these complex diseases is cancer of different types, which has one of the highest causes of death worldwide. The integration of multiple omic data in the framework of a comprehensive analysis and identification of relevant pathways contribute to the development of therapeutic approaches related to disease. In this study, RNA and methylation data (genes and p values) of colon adenocarcinoma were obtained from TCGA data portal and combined with Fisher's method. While protein subnetworks affected by the disease were identified by using subnetwork algorithm, pathways related to the disease and genes associated with these pathways were determined by functional enrichment analysis. Using gene-pathway relationship matrix, kappa scores of pathways were determined by similarity calculation. In this way, the pathways were clustered according to the hierarchically optimal number, as a result, the most important pathway clusters and related genes that are effective in disease formation identified. © 2020 Elsevier B.V., All rights reserved.Conference Object Citation - Scopus: 1Koroner Arter Hastalığı Tanısı İçin Alan Bilgisi İçeren Topluluk Öznitelik Seçim Yöntemi(Institute of Electrical and Electronics Engineers Inc., 2020) Kolukisa, Burak; Güngör, Vehbi Çağrı; Bakir-Güngör, Burcu; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiCoronary Artery Disease (CAD) is the condition where, the heart is not fed enough as a result of the accumulation of fatty matter called atheroma in the walls of the arteries. In 2016, CAD accounts for 31% (17.9 million) of the world's total deaths and its diagnosis is difficult. It is estimated that approximately 23.6 million people will die from this disease in 2030. With the development of machine learning and data mining techniques, it might be possible to diagnose CAD inexpensively and easily via examining some physical and biochemical values. In this study, for the CAD classification problem, a novel ensemble feature selection methodology that incorporates domain knowledge is proposed. Via applying the proposed methodology on the UCI Cleveland CAD dataset and using different classification algorithms, performance metrics are compared. It is shown that in our experiments, when Multilayer Perceptron classifier is used with 9 selected features, our proposed solution reached 85.47% accuracy, 82.96% accuracy and 0.839 F-Measure. As a future work, we aim to generate a machine learning model that can quickly diagnose CAD on real-time data in hospitals. © 2021 Elsevier B.V., All rights reserved.Conference Object Leveraging MicroRNA-Gene Associations With Mirgedinet: An Intelligent Approach for Enhanced Classification of Breast Cancer Molecular Subtypes(Springer Science and Business Media Deutschland GmbH, 2025) Qumsiyeh, Emma; Bakir-Güngör, Burcu; Yousef, Malik; 01. Abdullah Gül University; 02. 04. Bilgisayar Mühendisliği; 02. Mühendislik FakültesiUnderstanding the molecular subtypes of breast cancer is crucial for advancing targeted therapies and precision medicine. For the BRCA molecular subtype prediction problem, this study employs miRGediNET, a machine-learning approach that integrates data from miRTarBase, DisGeNET, and HMDD databases to investigate shared gene associations between MicroRNA (miRNA) activity and disease mechanisms. Using the BRCA LumAB_Her2Basal dataset, we evaluate miRGediNET’s performance against traditional feature selection methods, including CMIM, mRmR, Information Gain (IG), SelectKBest (SKB), Fast Correlation-Based Filter (FCBF), and XGBoost (XGB). These feature selection techniques were assessed using various classification algorithms including Random Forest (RF), Support Vector Machine (SVM), LogitBoost, Decision Tree, and AdaBoost, all executed with default parameters. The feature selection methods were tested using Monte Carlo Cross-Validation, where performance metrics obtained for each iteration were averaged to ensure robustness. Our findings reveal that miRGediNET outperforms traditional methods in accuracy and Area Under the Curve (AUC), emphasizing its superior capability to identify key genes that bridge miRNA interactions and breast cancer mechanisms. Notably, both miRGediNET and Information Gain (IG) feature selection consistently identified ESR1, a critical biomarker frequently reported in recent research associated with breast cancer prognosis and resistance to endocrine therapies. This integrative approach provides deeper biological insights into miRNA-disease interactions, paving the way for enhanced patient stratification, biomarker discovery, and personalized medicine strategies. The miRGediNET tool, developed on the KNIME platform, offers a practical resource for further exploration in the field of bioinformatics and oncology. © 2025 Elsevier B.V., All rights reserved.
