Scopus İndeksli Yayınlar Koleksiyonu

Permanent URI for this collectionhttps://hdl.handle.net/20.500.12573/395

Browse

Search Results

Now showing 1 - 10 of 33
  • Conference Object
    Citation - Scopus: 1
    Feature Selection for Protein Dihedral Angle Prediction
    (Institute of Electrical and Electronics Engineers Inc., 2017-09) Aydin, Zafer; Kaynar, Oǧuz; Görmez, Yasin
  • Article
    GraphUnet-SS: A Novel Deep Learning Model for Protein Secondary Structure Prediction Based on U-Net Architecture
    (Elsevier Ltd, 2026-04) Aydin, Zafer; Görmez, Yasin; Sabzekar, Mostafa
  • Article
    Citation - WoS: 6
    Citation - Scopus: 7
    The Determination of Distinctive Single Nucleotide Polymorphism Sets for the Diagnosis of Behcet's Disease
    (IEEE Computer Soc, 2022-05-01) Isik, Yunus Emre; Gormez, Yasin; Aydin, Zafer; Bakir-Gungor, Burcu
    Behcet's Disease (BD) is a multi-system inflammatory disorder in which the etiology remains unclear. The most probable hypothesis is that genetic tendency and environmental factors play roles in the development of BD. In order to find the essential reasons, genetic changes on thousands of genes should be analyzed. Besides, there is a need for extra analysis to find out which genetic factor affects the disease. Machine learning approaches have high potential for extracting the knowledge from genomics and selecting the representative Single Nucleotide Polymorphisms (SNPs) as the most effective features for the clinical diagnosis process. In this study, we have attempted to identify representative SNPs using feature selection methods, incorporating biological information and aimed to develop a machine-learning model for diagnosing Behcet's disease. By combining biological information and machine learning classifiers, up to 99.64 percent accuracy of disease prediction is achieved using only 13,611 out of 311,459 SNPs. In addition, we revealed the SNPs that are most distinctive by performing repeated feature selection in cross-validation experiments.
  • Conference Object
    Citation - WoS: 3
    Citation - Scopus: 3
    Template Scoring Methods for Protein Torsion Angle Prediction
    (Springer-Verlag Berlin, 2015) Aydin, Zafer; Baker, David; Noble, William Stafford
    Prediction of backbone torsion angles provides important constraints about the 3D structure of a protein and is receiving a growing interest in the structure prediction community. In this paper, we introduce a three-stage machine learning classifier to predict the 7-state torsion angles of a protein. The first two stages employ dynamic Bayesian and neural networks to produce an ab-initio prediction of torsion angle states starting from sequence profiles. The third stage is a committee classifier, which combines the ab-initio prediction with a structural frequency profile derived from templates obtained by HHsearch. We develop several structural profile models and obtain significant improvements over the Laplacian scoring technique through: (1) scaling templates by integer powers of sequence identity score, (2) incorporating other alignment scores as multiplicative factors (3) adjusting or optimizing parameters of the profile models with respect to the similarity interval of the target. We also demonstrate that the torsion angle prediction accuracy improves at all levels of target-template similarity even when templates are distant from the target. The improvement is at significantly higher rates as template structures gradually get closer to target.
  • Article
    Citation - WoS: 2
    Citation - Scopus: 2
    Structural Profile Matrices for Predicting Structural Properties of Proteins
    (World Scientific Publ Co Pte Ltd, 2020-07-10) Azginoglu, Nuh; Aydin, Zafer; Celik, Mete
    Predicting structural properties of proteins plays a key role in predicting the 3D structure of proteins. In this study, new structural profile matrices (SPM) are developed for protein secondary structure, solvent accessibility and torsion angle class predictions, which could be used as input to 3D prediction algorithms. The structural templates employed in computing SPMs are detected by eight alignment methods in LOMETS server, gap affine alignment method, ScanProsite, PfamScan, and HHblits. The contribution of each template is weighted by its similarity to target, which is assessed by several sequence alignment scores. For comparison, the SPMs are also computed using Homolpro, which uses BLAST for target template alignments and does not assign weights to templates. Incorporating the SPMs into DSPRED classifier, the prediction accuracy improves significantly as demonstrated by cross-validation experiments on two difficult benchmarks. The most accurate predictions are obtained using the SPMs derived by threading methods in LOMETS server. On the other hand, the computational cost of computing these SPMs was the highest.
  • Conference Object
    Citation - WoS: 1
    Citation - Scopus: 8
    Short Term Electricity Load Forecasting: A Case Study of Electric Utility Market in Turkey
    (Institute of Electrical and Electronics Engineers Inc., 2015-04) Ishik, Muhammed Yasin; Göze, Tolga; Ozcan, Ihsan; Güngör, Vehbi Çağrı; Aydin, Zafer; Yasin, Muhammed
    With the recent developments in energy sector, the pricing of electricity is now governed by the spot market where a variety of market mechanisms are effective. After the new legislation of market liberalization in Turkey, competition-based on hourly price has received a growing interest in the energy market, which necessitated generators and electric utility companies to add new dimensions to their scope of operation: short-term load and price forecasting. The field has several opportunities though not free from challenges. The dynamic behavior of the market price has caused the electric load to become variable and non-stationary. Furthermore, the number of nodes, in which the load must be predicted, is not constant anymore and can no longer be estimated by experts alone. In this competitive scenario, statistical forecasting methods that can automatically and accurately process thousands of data samples are essential. The purpose of this study is to demonstrate the importance of short-term load forecasting, how it has received a growing interest in Turkey and to propose an artificial neural network that can forecast the short term electricity load. Through detailed performance evaluations, we demonstrate that our forecasting method is capable of predicting the hourly load accurately. © 2017 Elsevier B.V., All rights reserved.
  • Book Part
    Citation - Scopus: 3
    ROSE: A Novel Approach for Protein Secondary Structure Prediction
    (Springer Science and Business Media Deutschland GmbH, 2021) Görmez, Yasin; Aydin, Zafer
    Three-dimensional structure of protein gives important information about protein’s function. Since it is time-consuming and costly to find the structure of protein by experimental methods, estimation of three-dimensional structures of proteins through computational methods has been an efficient alternative. One of the most important steps for the 3-D protein structure prediction is protein secondary structure prediction. Proteins which contain different number and sequences of amino acids may have similar structures. Thus, extracting appropriate input features has crucial importance for secondary structure prediction. In this study, a novel model, ROSE, is proposed for secondary structure prediction that obtains probability distributions as a feature vector by using two position specific scoring matrices obtained by PSIBLAST and HHblits. ROSE is a two-stage hybrid classifier that uses a one-dimensional bi-directional recurrent neural network at the first stage and a support vector machine at the second stage. It is also combined with DSPRED method, which employs dynamic Bayesian networks and a support vector machine. ROSE obtained comparable results to DSPRED in cross-validation experiments performed on a difficult benchmark and can be used as an alternative to protein secondary structure prediction. © 2021 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - WoS: 11
    Citation - Scopus: 20
    ROI Detection in Mammogram Images Using Wavelet-Based Haralick and Hog Features
    (IEEE, 2018-12) Tasdemir, Sena Busra Yengec; Tasdemir, Kasim; Aydin, Zafer; Yengec Tasdemir, Sena Busra
    Digital mammography is a widespread medical imaging technique that is used for early detection and diagnosis of breast cancer. Detecting the region of interest (ROI) helps to locate the abnormal areas, which may be analyzed further by a radiologist or a CAD system. In this paper, a new classification method is proposed for ROI detection in mammography images. Features are extracted using Wavelet transform, Haralick and HOG descriptors. To reduce the number of dimensions and eliminate irrelevant features, a wrapper-based feature selection method is implemented. Several feature extraction methods and machine learning classifiers are compared by performing a leave-one-image-out cross-validation experiment on a difficult dataset. The proposed feature extraction method provides the best accuracy of 87.5% and the second-best area under curve (AUC) score of 84% when employed in a random forest classifier.
  • Article
    Citation - WoS: 7
    Citation - Scopus: 11
    Protein Β-Sheet Prediction Using an Efficient Dynamic Programming Algorithm
    (Elsevier Sci Ltd, 2017-10) Sabzekar, Mostafa; Naghibzadeh, Mahmoud; Eghdami, Mandie; Aydin, Zafer
    Predicting the beta-sheet structure of a protein is one of the most important intermediate steps towards the identification of its tertiary structure. However, it is regarded as the primary bottleneck due to the presence of non-local interactions between several discontinuous regions in beta-sheets. To achieve reliable long-range interactions, a promising approach is to enumerate and rank all beta-sheet conformations for a given protein and find the one with the highest score. The problem with this solution is that the search space of the problem grows exponentially with respect to the number of beta-strands. Additionally, brute force calculation in this conformational space leads to dealing with a combinatorial explosion problem with intractable computational complexity. The main contribution of this paper is to generate and search the space of the problem efficiently to reduce the time complexity of the problem. To achieve this, two tree structures, called sheet-tree and grouping-tree, are proposed. They model the search space by breaking it into sub-problems. Then, an advanced dynamic programming is proposed that stores the intermediate results, avoids repetitive calculation by repeatedly uses them efficiently in successive steps and reduces the space of the problem by removing those intermediate results that will no longer be required in later steps. As a consequence, the following contributions have been made. Firstly, more accurate beta-sheet structures are found by searching all possible conformations, and secondly, the time complexity of the problem is reduced by searching the space of the problem efficiently which makes the proposed method applicable to predict beta-sheet structures with high number of beta-strands. Experimental results on the BetaSheet916 dataset showed significant improvements of the proposed method in both execution time and the prediction accuracy in comparison with the state-of-the-art beta-sheet structure prediction methods Moreover, we investigate the effect of different contact map predictors on the performance of the proposed method using BetaSheet1452 dataset. The source code is available at http://www.conceptsgate.com/BetaTop.rar. (C) 2017 Elsevier Ltd. All rights reserved.
  • Conference Object
    Citation - WoS: 2
    Citation - Scopus: 4
    Open Source Slurm Computer Cluster System Design and a Sample Application
    (Institute of Electrical and Electronics Engineers Inc., 2017-10) Azgınoglu, Nuh; Atasever, Mehmet Umut; Aydin, Zafer; Celik, Mete; Erbay, Hasan
    Cluster computing combines the resources of multiple computers as they act like a single high-performance computer. In this study, a computer cluster consisting of Lustre distributed file system with one cluster server based on Slurm resource management system and thirteen calculation nodes were built by using available and inert computers that have different processors. Different bioinformatics algorithms were run using different data sets in the cluster, and the performance of the clusters was evaluated with the amount of time the computing cluster spent to finish the jobs. © 2018 Elsevier B.V., All rights reserved.