Scopus İndeksli Yayınlar Koleksiyonu

Permanent URI for this collectionhttps://hdl.handle.net/20.500.12573/395

Browse

Search Results

Now showing 1 - 10 of 32

An Attention-Based Autoencoder Model with Gated Recurrent Unit for Stock Price Movement Prediction
(Springer Nature, 2026) Kolukisa, Burak; Bakir-Gungor, Burcu; Akkas, Huseyin
Predicting stock movements is crucial for investors looking to maximize their profits in rapidly changing financial markets. However, noise in stock price data makes it harder to detect the trends and ultimately decreases the performance of predictive models. To address these challenges that are caused by noise, this study proposes two novel models, i.e., an Attention-based Autoencoder (ABA) and an Attention-based Variational Autoencoder with Gated Recurrent Units (AGRUA), which uniquely integrate denoising, attention, and GRU layers. Firstly, a data set is created by adding 9 different technical indicators to the historical data of Borsa Istanbul (BIST 30) stocks. Secondly, proposed methods and other well-known deep learning models were used to remove noise from the data sets. Finally, each denoised dataset was fed separately to the Extreme Gradient Boosting model and subjected to a buy-sell process. The results were measured using trading indicators such as amount of the profits and Sharpe Ratio, Sortino Ratio and Maximum Drawdown. The proposed models produced substantial financial gains, with AGRUA achieving the highest total profit and ABA achieving the lowest average Maximum Drawdown, thereby demonstrating superior risk-adjusted performance. Lastly, Friedman and Nemenyi tests confirmed that AGRUA and ABA surpass most of the benchmarks in profit and risk-adjusted returns. The performance of the proposed method demonstrates its value in capturing nonlinear market patterns and improving decision accuracy, emphasizing the need for noise reduction before forecasting.
Citation - Scopus: 4
RCE-IFE: Recursive Cluster Elimination with Intra-Cluster Feature Elimination
(PeerJ Inc., 2025-02-07) Kuzudisli, Cihan; Bakir-Gungor, Burcu; Qaqish, Bahjat; Yousef, Malik
Citation - Scopus: 7
Network Anomaly Detection Using Deep Autoencoder and Parallel Artificial Bee Colony Algorithm-Trained Neural Network
(PeerJ Inc., 2024-10-08) Dedeturk, Bilge Kagan; Bakir-Gungor, Burcu; Hacılar, Hilal; Gungor, Vehbi Cagri
G-S a Prior Biological Knowledge-Based Pattern Detection and Enrichment Framework for Multi-Omics Data Integration
(MDPI, 2025-11-29) Unlu Yazici, Miray; Bakir-Gungor, Burcu; Yousef, Malik
The rapid advancements in high-throughput technologies have led to a dramatic increase in diverse -omics data types, enabling comprehensive analyses, especially for complex diseases like cancer. Despite the development of multi-omics approaches, the challenges of scaling integration to massive, heterogeneous -omics datasets suggest that novel computational tools need to be designed. In this study, we propose an approach for integrating microRNA (miRNA) and messenger RNA (mRNA) expression data, incorporating prior biological knowledge (PBK). This approach scores and ranks groups of miRNAs and their associated genes using cross-validation iterations. The proposed method incorporates a Pattern detection (P) component to identify molecular motifs unique to each biological group. The analysis also facilitates the visualization of the groups, facilitating the identification of co-occurring groups and their characteristic features across iterations. Furthermore, the groups are scored using an over-representation analysis through a new Enrichment (E) component in each iteration. The clusters of the groups based on the Enrichment Scores (ESs) are visualized in a heatmap to obtain novel insights into the collective behavior and dependencies of the groups, aiming to understand the molecular mechanisms of complex diseases. The developed G-S-M-E tool not only provides performance metrics and biological scores at the group level but also offers comprehensive insights into intricate multi-omics interactions. In summary, our study emphasizes the importance of mathematical and data science methodologies in elucidating intricate multi-omics integration, yielding a formalized approach that deepens our comprehension of complex diseases.
Enhancing Complex Disease Group Scoring with Mirgedinet: A Multi-Algorithm Machine Learning Framework Based on the GSM Approach
(IEEE, 2025-06-25) Qumsiyeh, Emma; Bakir-Gungor, Burcu; Yousef, Malik
Integrating biological prior knowledge for disease gene associations has shown significant promise in discovering new biomarkers with potential translational applications. This work investigates the application of a multi-algorithm machine learning framework based on the Grouping-Scoring-Modeling (G-S-M) approach for improving the prediction of complex diseases. The study identifies the primary gene and miRNA interactions in various complex diseases with the help of miRGediNET, which is a machine-learning based tool that integrates data from three biological databases. Traditional methods have only focused on independence between features; the G-S-M method focuses on aggregating genes based on biological interactions, pinpointing the scoring of gene groups for a disease, and modeling its predictive capability using advanced machine learning algorithms. In this research paper, seven algorithms, including Support Vector Machine, Decision Tree, and CatBoost, were applied to eight datasets extracted from the GEO database. This framework proved very robust in ranking gene clusters, thus predicting critical biomarkers while doing 100-fold randomized cross-validation within the evaluation. The results indicate this approach's high potential for refining disease and supporting research for choosing the best algorithm that can provide biological insights and computational advances.
Exploring Microbiome Signatures in Autism Spectrum Disorder via Grouping-Scoring Based Machine Learning
(IEEE, 2025-06-25) Temiz, Mustafa; Ersoz, Nur Sebnem; Yousef, Malik; Bakir-Gungor, Burcu
The rapid increase in omic data production increased the importance of machine learning (ML) methods to analze these data. In particular, the use of metagenomic data in the diagnosis, prognosis and treatment of diseases is becoming widespread. Autism Spectrum Disorder (ASD) is a neurodevelopmental disease that occurs in early childhood and continues lifelong. The aim of this study is to increase ML performance, reduce computational costs and achieve successful classification performance using a small number of metagenomic features. In addition, disease prediction is performed; ASD associated biomarkers are determined using the microBiomeGSM on metagenomic data. Classification is performed at three different taxonomic levels (genus, family and order) using the relative abundance values of species. The best performance metric (0.95 AUC) was obtained at the order taxonomic level using an average of 416 features with microBiomeGSM. The identified ASD-related taxonomic species are presented.
Correction: Engineering Novel Features for Diabetes Complication Prediction Using Synthetic Electronic Health Records
(Frontiers Media S.A., 2025-08-29) Voskergian, Daniel; Bakir-Gungor, Burcu; Yousef, Malik
In-silico Identification of Papillary Thyroid Carcinoma Molecular Mechanisms
(IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA, 2019-04) Ersoz, Nur Sebnem; Guzel, Yasin; Bakir-Gungor, Burcu
Representing approximately 70% to 80% of thyroid cancers, papillary thyroid cancer (PTC) is the most common type of thyroid cancers. PTC is seen in all age groups, but it is seen more frequently in women than in men. Detection of biomarker proteins of papillary thyroid cancinoma plays an important role in the diagnosis of the disease. In this study, we aim to find target genes and pathways that are associated with papillar thyroid carcinoma, by integrating different bioinformatics methods. For this purpose, usingin-silico methodologies, candidate genes and pathways that could explain disease development mechanisms are identified. Throughout this study, firstly we identified differentially expressed genes as the amount of their protein product differ between patient and healthy groups. Secondly, by using active subnetworks search algorithms, topologic analyses and functional enrichment tests, candidate proteins,which could be thought as PTC biomarkers, and affected pathways are identified.
In-Silico Methods to Identify Common MicroRNAs and Pathways of Neuromuscular Diseases
(IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA, 2019-04) Yazici, Miray Unlu; Menges, Evrim Aksu; Ulum, Yeliz Z. Akkaya; Hayta, Burcu Balci; Bakir-Gungor, Burcu; Balcihayta, Burcu; Akkaya Ulum, Yeliz Z.
Neuromuscular disorders (NMD) are a heterogeneous group of diseases characterized by the loss of function of the peripheral nerves and muscles. However, there are no effective and widespread therapeutic approaches to prevent or delay the progression of these disease types. microRNAs (miRNAs) which cause significant changes in gene expression by binding to target messenger RNAs (mRNAs), are known to have an effect on disease mechanisms. In this study, by integrating different bioinformatics methods, we aim to find miRNAs, target genes and pathways related to a group of neuromuscular diseases. For this purpose, we determined 17 miRNAs that show significant expression changes between patient and healthy groups; predicted target genes of these miRNAs; and identified affected pathways using subnetwork discovery, functional enrichment based algorithms. In our study, we integrated different in-silico approaches that proceed in top-down manner or bottom-up manner. The identified candidate miRNAs, genes and pathways, which could help to explain neuromuscular disease development mechanisms, are now under investigation in wet-lab.
Citation - WoS: 1
Citation - Scopus: 1
Textnettopics-SFTS-SBTS Textnettopics Scoring Approaches Based Sequential Forward and Backward
(Springer International Publishing AG, 2024) Voskergian, Daniel; Bakir-Gungor, Burcu; Yousef, Malik
TextNetTopics is a text classification-based topic modeling approach that performs topic selection rather than word selection to train a machine learning algorithm. However, one main limitation of TextNetTopics is that its scoring component (the S component) assesses each topic independently and ranks them accordingly, neglecting the potential relationship between topics. In order to address this limitation and improve the classification performance, this study introduces an enhancement to TextNetTopics. TextNetTopics-SFTS-SBTS integrates two novel scoring approaches: Sequential Forward Topic Scoring (SFTS) and Sequential Backward Topic Scoring (SBTS), which consider topic interactions by assessing sets of topics simultaneously. This integration aims to streamline the topic selection process and enhance classifier efficiency for text classification. The results obtained across three datasets offer valuable insights into the context-dependent effectiveness of the new scoring mechanisms across diverse datasets and varying numbers of topics involved in the analysis.

Scopus İndeksli Yayınlar Koleksiyonu

Browse

Filters

Settings

Sort By

Results per page

Search Results