Scopus Indexed Publications Collection

Permanent URI for this collection: https://hdl.handle.net/20.500.12573/395

Predicting Respiratory Infection and Symptoms Development Using Gene Set Enrichment Scores and Machine Learning
(Elsevier Sci Ltd, 2026) Aydin, Zafer; Isik, Yunus Emre
Recent advancements in precision medicine enable personalized predictions grounded in individual-level genetic data. However, relying solely on a single type of data can decrease prediction accuracy and limit the biological interpretability of the resulting models. Incorporating predefined genetic knowledge, such as derived gene sets, can improve performance and provide deeper biological insights for complex diseases, including respiratory infections. This study aimed to evaluate the usability of enrichment scores (ES), calculated using gene sets from the Molecular Signatures Database (MSigDB), as a feature representation for machine learning models to predict respiratory viral infections and symptom development. In addition, the proposed feature representation approach was extensively compared with the de facto gene-level expression representation. A total of 36,834 predefined gene sets were compiled from the MSigDB, and their ES values were calculated. Experiments used the GSE73072 dataset from Gene Expression Omnibus, containing gene expression profiles before and after virus exposure. Various machine learning and feature selection algorithms were applied to ES-based and probe-level feature sets. The results showed that both feature representation approaches achieved an area under the precision-recall curve (AUPRC) value greater than 0.90 for all tasks. Compared with the Respiratory Viral DREAM Challenge leaderboard phase, our models showed a 14.8% improvement in pre-exposure predictions (T0) and a 17.4% improvement in symptom classification. Using enrichment scores as a feature representation generally resulted in better performance than probe-level representation when predicting respiratory infections and symptom development. Identifying key gene sets through feature selection and comparing them with essential genes for respiratory viruses enabled a more comprehensive analysis, providing deeper insights into the pathways that contribute to these predictions.
Toward AI-Enhanced Robotics and Smart Platforms for Sustainable Agriculture and Wetland Coexistence
(Institute of Electrical and Electronics Engineers Inc., 2025) Dubinsky, Yael; Aydin, Zafer; Winokur, Michael; Kohen-Vacs, Dan; Bukhshtaber, Natalia; Berselli, Giovanni; Zabulis, Xenophon
Citation - Scopus: 1
Feature Selection for Protein Dihedral Angle Prediction
(Institute of Electrical and Electronics Engineers Inc., 2017-09) Aydin, Zafer; Kaynar, Oğuz; Görmez, Yasin
GraphUnet-SS: A Novel Deep Learning Model for Protein Secondary Structure Prediction Based on U-Net Architecture
(Elsevier Ltd, 2026-04) Aydin, Zafer; Görmez, Yasin; Sabzekar, Mostafa
Citation - WoS: 7
Citation - Scopus: 8
The Determination of Distinctive Single Nucleotide Polymorphism Sets for the Diagnosis of Behcet's Disease
(IEEE Computer Soc, 2022-05-01) Isik, Yunus Emre; Gormez, Yasin; Aydin, Zafer; Bakir-Gungor, Burcu
Behcet's Disease (BD) is a multi-system inflammatory disorder in which the etiology remains unclear. The most probable hypothesis is that genetic tendency and environmental factors play roles in the development of BD. In order to find the essential reasons, genetic changes on thousands of genes should be analyzed. Besides, there is a need for extra analysis to find out which genetic factor affects the disease. Machine learning approaches have high potential for extracting the knowledge from genomics and selecting the representative Single Nucleotide Polymorphisms (SNPs) as the most effective features for the clinical diagnosis process. In this study, we have attempted to identify representative SNPs using feature selection methods, incorporating biological information and aimed to develop a machine-learning model for diagnosing Behcet's disease. By combining biological information and machine learning classifiers, up to 99.64 percent accuracy of disease prediction is achieved using only 13,611 out of 311,459 SNPs. In addition, we revealed the SNPs that are most distinctive by performing repeated feature selection in cross-validation experiments.
Citation - WoS: 3
Citation - Scopus: 3
Template Scoring Methods for Protein Torsion Angle Prediction
(Springer-Verlag Berlin, 2015) Aydin, Zafer; Baker, David; Noble, William Stafford
Prediction of backbone torsion angles provides important constraints on the 3D structure of a protein and is receiving growing interest in the structure prediction community. In this paper, we introduce a three-stage machine learning classifier to predict the 7-state torsion angles of a protein. The first two stages employ dynamic Bayesian and neural networks to produce an ab-initio prediction of torsion angle states starting from sequence profiles. The third stage is a committee classifier, which combines the ab-initio prediction with a structural frequency profile derived from templates obtained by HHsearch. We develop several structural profile models and obtain significant improvements over the Laplacian scoring technique through: (1) scaling templates by integer powers of sequence identity score, (2) incorporating other alignment scores as multiplicative factors, and (3) adjusting or optimizing parameters of the profile models with respect to the similarity interval of the target. We also demonstrate that the torsion angle prediction accuracy improves at all levels of target-template similarity, even when templates are distant from the target. The improvement grows significantly as template structures get closer to the target.
Citation - WoS: 2
Citation - Scopus: 2
Structural Profile Matrices for Predicting Structural Properties of Proteins
(World Scientific Publ Co Pte Ltd, 2020-07-10) Azginoglu, Nuh; Aydin, Zafer; Celik, Mete
Predicting structural properties of proteins plays a key role in predicting the 3D structure of proteins. In this study, new structural profile matrices (SPM) are developed for protein secondary structure, solvent accessibility and torsion angle class predictions, which could be used as input to 3D prediction algorithms. The structural templates employed in computing SPMs are detected by eight alignment methods in LOMETS server, gap affine alignment method, ScanProsite, PfamScan, and HHblits. The contribution of each template is weighted by its similarity to target, which is assessed by several sequence alignment scores. For comparison, the SPMs are also computed using Homolpro, which uses BLAST for target template alignments and does not assign weights to templates. Incorporating the SPMs into DSPRED classifier, the prediction accuracy improves significantly as demonstrated by cross-validation experiments on two difficult benchmarks. The most accurate predictions are obtained using the SPMs derived by threading methods in LOMETS server. On the other hand, the computational cost of computing these SPMs was the highest.
Citation - WoS: 1
Citation - Scopus: 8
Short Term Electricity Load Forecasting: A Case Study of Electric Utility Market in Turkey
(Institute of Electrical and Electronics Engineers Inc., 2015-04) Ishik, Muhammed Yasin; Göze, Tolga; Ozcan, Ihsan; Güngör, Vehbi Çağrı; Aydin, Zafer; Yasin, Muhammed
With the recent developments in the energy sector, the pricing of electricity is now governed by the spot market, where a variety of market mechanisms are in effect. After the new legislation on market liberalization in Turkey, competition based on hourly price has received growing interest in the energy market, which has required generators and electric utility companies to add new dimensions to their scope of operation: short-term load and price forecasting. The field offers several opportunities, though it is not free from challenges. The dynamic behavior of the market price has caused the electric load to become variable and non-stationary. Furthermore, the number of nodes at which the load must be predicted is no longer constant and can no longer be estimated by experts alone. In this competitive scenario, statistical forecasting methods that can automatically and accurately process thousands of data samples are essential. The purpose of this study is to demonstrate the importance of short-term load forecasting, to show how it has received growing interest in Turkey, and to propose an artificial neural network that can forecast the short-term electricity load. Through detailed performance evaluations, we demonstrate that our forecasting method is capable of predicting the hourly load accurately.
Citation - Scopus: 3
ROSE: A Novel Approach for Protein Secondary Structure Prediction
(Springer Science and Business Media Deutschland GmbH, 2021) Görmez, Yasin; Aydin, Zafer
The three-dimensional structure of a protein gives important information about the protein's function. Since it is time-consuming and costly to determine protein structure by experimental methods, estimating three-dimensional structures of proteins through computational methods has been an efficient alternative. One of the most important steps in 3-D protein structure prediction is protein secondary structure prediction. Proteins that contain different numbers and sequences of amino acids may have similar structures. Thus, extracting appropriate input features is of crucial importance for secondary structure prediction. In this study, a novel model, ROSE, is proposed for secondary structure prediction that obtains probability distributions as a feature vector by using two position-specific scoring matrices obtained by PSIBLAST and HHblits. ROSE is a two-stage hybrid classifier that uses a one-dimensional bi-directional recurrent neural network at the first stage and a support vector machine at the second stage. It is also combined with the DSPRED method, which employs dynamic Bayesian networks and a support vector machine. ROSE obtained results comparable to DSPRED in cross-validation experiments performed on a difficult benchmark and can be used as an alternative method for protein secondary structure prediction.
Citation - WoS: 11
Citation - Scopus: 20
ROI Detection in Mammogram Images Using Wavelet-Based Haralick and Hog Features
(IEEE, 2018-12) Tasdemir, Sena Busra Yengec; Tasdemir, Kasim; Aydin, Zafer; Yengec Tasdemir, Sena Busra
Digital mammography is a widespread medical imaging technique that is used for early detection and diagnosis of breast cancer. Detecting the region of interest (ROI) helps to locate the abnormal areas, which may be analyzed further by a radiologist or a CAD system. In this paper, a new classification method is proposed for ROI detection in mammography images. Features are extracted using Wavelet transform, Haralick and HOG descriptors. To reduce the number of dimensions and eliminate irrelevant features, a wrapper-based feature selection method is implemented. Several feature extraction methods and machine learning classifiers are compared by performing a leave-one-image-out cross-validation experiment on a difficult dataset. The proposed feature extraction method provides the best accuracy of 87.5% and the second-best area under curve (AUC) score of 84% when employed in a random forest classifier.
