PubMed-Indexed Publications Collection

Permanent URI for this collection: https://hdl.handle.net/20.500.12573/397

Search Results

Now showing 1 - 10 of 11

Predicting Respiratory Infection and Symptoms Development Using Gene Set Enrichment Scores and Machine Learning
(Elsevier Sci Ltd, 2026) Aydin, Zafer; Isik, Yunus Emre
Recent advancements in precision medicine enable personalized predictions grounded in individual-level genetic data. However, relying solely on a single type of data can decrease prediction accuracy and limit the biological interpretability of the resulting models. Incorporating predefined genetic knowledge, such as derived gene sets, can improve performance and provide deeper biological insights for complex diseases, including respiratory infections. This study aimed to evaluate the usability of enrichment scores (ES), calculated using gene sets from the Molecular Signatures Database (MSigDB), as a feature representation for machine learning models to predict respiratory viral infections and symptom development. In addition, the proposed feature representation approach was extensively compared with the de facto gene-level expression representation. A total of 36,834 predefined gene sets were compiled from the MSigDB, and their ES values were calculated. Experiments used the GSE73072 dataset from Gene Expression Omnibus, containing gene expression profiles before and after virus exposure. Various machine learning and feature selection algorithms were applied to ES-based and probe-level feature sets. The results showed that both feature representation approaches achieved an area under the precision-recall curve (AUPRC) value greater than 0.90 for all tasks. Compared with the Respiratory Viral DREAM Challenge leaderboard phase, our models showed a 14.8% improvement in pre-exposure predictions (T0) and a 17.4% improvement in symptom classification. Using enrichment scores as a feature representation generally resulted in better performance than probe-level representation when predicting respiratory infections and symptom development. Identifying key gene sets through feature selection and comparing them with essential genes for respiratory viruses enabled a more comprehensive analysis, providing deeper insights into the pathways that contribute to these predictions.
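As an illustration of the AUPRC metric reported above, the following minimal Python sketch computes average precision, one common step-wise estimate of the area under the precision-recall curve. The function name and toy data are hypothetical and are not taken from the study, which does not state how its AUPRC values were computed.

```python
def average_precision(y_true, scores):
    """Step-wise estimate of the area under the precision-recall curve.

    y_true: list of 0/1 labels (1 = positive, e.g. infected).
    scores: list of predicted scores, higher = more likely positive.
    """
    if len(y_true) != len(scores):
        raise ValueError("y_true and scores must have the same length")
    total_pos = sum(y_true)
    if total_pos == 0:
        raise ValueError("need at least one positive label")

    # Rank samples from highest to lowest score (ties keep input order).
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)

    hits = 0
    ap = 0.0
    for rank, i in enumerate(order, start=1):
        if y_true[i] == 1:
            hits += 1
            ap += hits / rank  # precision at the rank of each positive
    return ap / total_pos


# Hypothetical toy example: two positives ranked 1st and 3rd.
toy_labels = [1, 0, 1, 0]
toy_scores = [0.9, 0.8, 0.7, 0.1]
toy_ap = average_precision(toy_labels, toy_scores)
```

Because the precision-recall curve ignores true negatives, this kind of metric is often preferred over accuracy when the positive class is rare. Library implementations may handle tied scores differently from this sketch.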
GraphUnet-SS: A Novel Deep Learning Model for Protein Secondary Structure Prediction Based on U-Net Architecture
(Elsevier Ltd, 2026-04) Aydin, Zafer; Görmez, Yasin; Sabzekar, Mostafa
Citation - WoS: 7
Citation - Scopus: 8
The Determination of Distinctive Single Nucleotide Polymorphism Sets for the Diagnosis of Behcet's Disease
(IEEE Computer Soc, 2022-05-01) Isik, Yunus Emre; Gormez, Yasin; Aydin, Zafer; Bakir-Gungor, Burcu
Behcet's Disease (BD) is a multi-system inflammatory disorder whose etiology remains unclear. The most probable hypothesis is that genetic predisposition and environmental factors play roles in the development of BD. To find the underlying causes, genetic changes in thousands of genes should be analyzed. In addition, further analysis is needed to determine which genetic factors affect the disease. Machine learning approaches have high potential for extracting knowledge from genomic data and selecting representative Single Nucleotide Polymorphisms (SNPs) as the most effective features for the clinical diagnosis process. In this study, we attempted to identify representative SNPs using feature selection methods that incorporate biological information, and aimed to develop a machine learning model for diagnosing Behcet's disease. By combining biological information and machine learning classifiers, up to 99.64 percent disease prediction accuracy is achieved using only 13,611 out of 311,459 SNPs. In addition, we identified the most distinctive SNPs by performing repeated feature selection in cross-validation experiments.
Citation - WoS: 2
Citation - Scopus: 2
Structural Profile Matrices for Predicting Structural Properties of Proteins
(World Scientific Publ Co Pte Ltd, 2020-07-10) Azginoglu, Nuh; Aydin, Zafer; Celik, Mete
Predicting structural properties of proteins plays a key role in predicting the 3D structure of proteins. In this study, new structural profile matrices (SPM) are developed for protein secondary structure, solvent accessibility and torsion angle class predictions, which could be used as input to 3D prediction algorithms. The structural templates employed in computing SPMs are detected by eight alignment methods in LOMETS server, gap affine alignment method, ScanProsite, PfamScan, and HHblits. The contribution of each template is weighted by its similarity to target, which is assessed by several sequence alignment scores. For comparison, the SPMs are also computed using Homolpro, which uses BLAST for target template alignments and does not assign weights to templates. Incorporating the SPMs into DSPRED classifier, the prediction accuracy improves significantly as demonstrated by cross-validation experiments on two difficult benchmarks. The most accurate predictions are obtained using the SPMs derived by threading methods in LOMETS server. On the other hand, the computational cost of computing these SPMs was the highest.
Citation - WoS: 25
Citation - Scopus: 33
Improved Classification of Colorectal Polyps on Histopathological Images With Ensemble Learning and Stain Normalization
(Elsevier Ireland Ltd, 2023-04) Yengec-Tasdemir, Sena Busra; Aydin, Zafer; Akay, Ebru; Dogan, Serkan; Yilmaz, Bulent
Background and Objective: Early detection of colon adenomatous polyps is critically important because detecting them correctly significantly reduces the risk of developing colon cancer in the future. The key challenge in detecting adenomatous polyps is differentiating them from their visually similar counterparts, non-adenomatous tissues. Currently, this depends solely on the experience of the pathologist. To assist pathologists, the objective of this work is to provide a novel non-knowledge-based Clinical Decision Support System (CDSS) for improved detection of adenomatous polyps on colon histopathology images. Methods: The domain shift problem arises when the training and test data come from different distributions, with diverse settings and unequal color levels. This problem, which can be tackled by stain normalization techniques, prevents machine learning models from attaining higher classification accuracies. In this work, the proposed method integrates stain normalization techniques with an ensemble of ConvNexts, which are competitively accurate, scalable, and robust variants of CNNs. The improvement is empirically analyzed for five widely employed stain normalization techniques. The classification performance of the proposed method is evaluated on three datasets comprising more than 10k colon histopathology images. Results: The comprehensive experiments demonstrate that the proposed method outperforms state-of-the-art deep convolutional neural network based models by attaining 95% classification accuracy on the curated dataset, and 91.1% and 90% on the EBHI and UniToPatho public datasets, respectively. Conclusions: These results show that the proposed method can accurately classify colon adenomatous polyps on histopathology images. It retains remarkable performance scores even for different datasets coming from different distributions. This indicates that the model has a notable generalization ability. (c) 2023 Elsevier B.V. All rights reserved.
Citation - WoS: 13
Citation - Scopus: 21
IGPRED: Combination of Convolutional Neural and Graph Convolutional Networks for Protein Secondary Structure Prediction
(Wiley, 2021-05-25) Gormez, Yasin; Sabzekar, Mostafa; Aydin, Zafer
There is a close relationship between the tertiary structure and the function of a protein. One of the important steps to determine the tertiary structure is protein secondary structure prediction (PSSP). For this reason, predicting secondary structure with higher accuracy will give valuable information about the tertiary structure. Recently, deep learning techniques have obtained promising improvements in several machine learning applications including PSSP. In this article, a novel deep learning model, based on convolutional neural network and graph convolutional network is proposed. PSIBLAST PSSM, HHMAKE PSSM, physico-chemical properties of amino acids are combined with structural profiles to generate a rich feature set. Furthermore, the hyper-parameters of the proposed network are optimized using Bayesian optimization. The proposed model IGPRED obtained 89.19%, 86.34%, 87.87%, 85.76%, and 86.54% Q3 accuracies for CullPDB, EVAset, CASP10, CASP11, and CASP12 datasets, respectively.
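For readers unfamiliar with the Q3 metric quoted above, the sketch below shows how three-state per-residue accuracy is typically computed: the percentage of residues whose predicted class (helix H, strand E, or coil C) matches the reference. The function name and the example strings are hypothetical, and the abstract does not say whether the reported values are pooled over residues or averaged per protein.

```python
def q3_accuracy(true_ss, pred_ss):
    """Percentage of residues whose 3-state label (H, E, C) is predicted correctly."""
    if len(true_ss) != len(pred_ss):
        raise ValueError("sequences must have the same length")
    if not true_ss:
        raise ValueError("sequences must not be empty")
    correct = sum(t == p for t, p in zip(true_ss, pred_ss))
    return 100.0 * correct / len(true_ss)


# Hypothetical 8-residue example: one coil residue is mispredicted as helix.
example_q3 = q3_accuracy("HHHEECCC", "HHHEECCH")
```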
Citation - WoS: 4
Citation - Scopus: 7
IGPRED-Multitask: A Deep Learning Model to Predict Protein Secondary Structure, Torsion Angles and Solvent Accessibility
(IEEE Computer Soc, 2023-03-01) Gormez, Yasin; Aydin, Zafer
Protein secondary structure, solvent accessibility and torsion angle predictions are preliminary steps in predicting the 3D structure of a protein. Deep learning approaches have achieved significant improvements in predicting various features of protein structure. In this study, IGPRED-Multitask, a deep learning model with a multi-task learning architecture based on a deep inception network, a graph convolutional network and a bidirectional long short-term memory, is proposed. Moreover, hyper-parameters of the model are fine-tuned using Bayesian optimization, which is faster and more effective than grid search. The same benchmark test data sets as in the OPUS-TASS paper, including TEST2016, TEST2018, CASP12, CASP13, CASPFM, HARD68, CAMEO93, CAMEO93_HARD, as well as the train and validation sets, are used for fair comparison with the literature. Statistically significant improvements are observed in secondary structure prediction on 4 datasets, in phi angle prediction on 2 datasets and in psi angle prediction on 3 datasets compared to the state-of-the-art methods. For solvent accessibility prediction, TEST2016 and TEST2018 datasets are used only to assess the performance of the proposed model.
Citation - WoS: 3
Citation - Scopus: 4
Deep-Learning AI-Model for Predicting Dental Plaque in the Young Permanent Teeth of Children Aged 8-13 Years
(MDPI, 2025-04-07) Tez, Banu Cicek; Guzel, Yasin; Eliacik, Bahar Basak Kiziltan; Aydin, Zafer; Kızıltan Eliaçık, Bahar Başak
Background/Objectives: Dental plaque is a significant contributor to various prevalent oral health conditions, including caries, gingivitis, and periodontitis. Consequently, its detection and management are of paramount importance for maintaining oral health. Manual plaque assessment is time-consuming, error-prone, and particularly challenging in uncooperative pediatric patients. These limitations have encouraged researchers to seek faster, more reliable methods. Accordingly, this study aims to develop a deep learning model for detecting and segmenting plaque in young permanent teeth and to evaluate its diagnostic precision. Methods: The dataset comprises 506 dental images from 31 patients aged between 8 and 13 years. Six state-of-the-art models were trained and evaluated using this dataset. The U-Net Transformer model, which yielded the best performance, was further compared against three experienced pediatric dentists for clinical feasibility using 35 randomly selected images from the test set. The clinical trial was registered under the ID NCT06603233 (1 June 2023). Results: The Intersection over Union (IoU) score of the U-Net Transformer on the test set was measured as 0.7845, and the p-values obtained from the three t-tests conducted for comparison with dentists were found to be below 0.05. Compared with three experienced pediatric dentists, the deep learning model exhibited clinically superior performance in the detection and segmentation of dental plaque in young permanent teeth. Conclusions: This finding highlights the potential of AI-driven technologies in enhancing the accuracy and reliability of dental plaque detection and segmentation in pediatric dentistry.
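The Intersection over Union (IoU) score mentioned above can be illustrated with a short sketch. It assumes binary segmentation masks stored as nested lists of 0/1 values; the function name, the mask layout, and the convention of returning 1.0 when both masks are empty are assumptions for illustration, not details from the study.

```python
def iou(pred_mask, true_mask):
    """Intersection over Union for two binary masks of identical shape."""
    intersection = 0
    union = 0
    for pred_row, true_row in zip(pred_mask, true_mask):
        for p, t in zip(pred_row, true_row):
            if p and t:
                intersection += 1  # pixel marked as plaque in both masks
            if p or t:
                union += 1  # pixel marked as plaque in at least one mask
    # Assumed convention: two empty masks count as a perfect match.
    return intersection / union if union else 1.0


# Hypothetical 2x3 masks: 2 shared pixels out of 4 marked in either mask.
predicted = [[1, 1, 0],
             [0, 1, 0]]
reference = [[1, 0, 0],
             [0, 1, 1]]
example_iou = iou(predicted, reference)
```

An IoU of 1.0 means the predicted and reference regions coincide exactly, while 0.0 means they do not overlap at all.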
Citation - WoS: 5
Citation - Scopus: 7
Comparative Analysis of Machine Learning Approaches for Predicting Respiratory Virus Infection and Symptom Severity
(PeerJ Inc, 2023-06-30) Isik, Yunus Emre; Aydin, Zafer
Respiratory diseases are among the major health problems that place a burden on hospitals. Diagnosing infection and rapidly predicting severity without time-consuming clinical tests could help prevent the spread and progression of the disease, especially in countries where health systems have limited capacity. Personalized medicine studies involving statistics and computer technologies could help to address this need. In addition to individual studies, competitions are also held, such as the Dialogue for Reverse Engineering Assessment and Methods (DREAM) challenge, which is a community-driven organization with a mission to research biology, bioinformatics, and biomedicine. One of these competitions was the Respiratory Viral DREAM Challenge, which aimed to develop early predictive biomarkers for respiratory virus infections. These efforts are promising; however, the prediction performance of the computational methods developed for detecting respiratory diseases still has room for improvement. In this study, we focused on improving the performance of predicting the infection and symptom severity of individuals infected with various respiratory viruses using gene expression data collected before and after exposure. The publicly available gene expression dataset in the Gene Expression Omnibus, named GSE73072, containing samples exposed to four respiratory viruses (H1N1, H3N2, human rhinovirus (HRV), and respiratory syncytial virus (RSV)) was used as input data. Various preprocessing methods and machine learning algorithms were implemented and compared to achieve the best prediction performance.
The experimental results showed that the proposed approaches obtained a prediction performance of 0.9746 area under the precision-recall curve (AUPRC) for infection (i.e., shedding) prediction (SC-1), 0.9182 AUPRC for symptom class prediction (SC-2), and 0.6733 Pearson correlation for symptom score prediction (SC-3), outperforming the best leaderboard scores of the Respiratory Viral DREAM Challenge (a 4.48% improvement for SC-1, a 13.68% improvement for SC-2, and a 13.98% improvement for SC-3). Additionally, over-representation analysis (ORA), which is a statistical method for objectively determining whether certain genes are more prevalent in pre-defined sets such as pathways, was applied using the most significant genes selected by feature selection methods. The results show that pathways associated with the 'adaptive immune system' and 'immune disease' are strongly linked to pre-infection and symptom development. These findings contribute to our knowledge about predicting respiratory infections and are expected to facilitate the development of future studies that concentrate on predicting not only infections but also the associated symptoms.
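The over-representation analysis described above is commonly carried out with a one-sided hypergeometric test, which asks how likely it is to see at least the observed overlap between the selected genes and a pathway by chance. The sketch below assumes that test; the abstract does not state which test the authors used, and the function name, parameter names, and toy numbers are hypothetical.

```python
from math import comb


def ora_p_value(universe_size, set_size, selected_size, overlap):
    """One-sided hypergeometric p-value, P(X >= overlap).

    universe_size: number of genes in the background (N)
    set_size:      number of background genes in the pathway (K)
    selected_size: number of selected (significant) genes (n)
    overlap:       selected genes that fall in the pathway (k)
    """
    max_overlap = min(set_size, selected_size)
    denominator = comb(universe_size, selected_size)
    # Sum the probability of every overlap at least as large as the observed one.
    tail = sum(
        comb(set_size, k) * comb(universe_size - set_size, selected_size - k)
        for k in range(overlap, max_overlap + 1)
    )
    return tail / denominator


# Hypothetical toy numbers: 10 background genes, a 4-gene pathway,
# 3 selected genes, 2 of which belong to the pathway.
toy_p = ora_p_value(universe_size=10, set_size=4, selected_size=3, overlap=2)
```

In practice this p-value is computed for many pathways at once, so a multiple-testing correction is usually applied before interpreting the results.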
Citation - Scopus: 15
An Effective Colorectal Polyp Classification for Histopathological Images Based on Supervised Contrastive Learning
(Elsevier Ltd, 2024-04) Yengec-Tasdemir, Sena Busra; Aydin, Zafer; Akay, Ebru; Doǧan, Serkan; Yilmaz, Bulent
Early detection of colon adenomatous polyps is pivotal in reducing colon cancer risk. In this context, accurately distinguishing between adenomatous polyp subtypes, especially tubular and tubulovillous, from hyperplastic variants is crucial. This study introduces a cutting-edge computer-aided diagnosis system optimized for this task. Our system employs advanced Supervised Contrastive learning to ensure precise classification of colon histopathology images. Significantly, we have integrated the Big Transfer model, which has gained prominence for its exemplary adaptability to visual tasks in medical imaging. Our novel approach discerns between in-class and out-of-class images, thereby elevating its discriminatory power for polyp subtypes. We validated our system using two datasets: a specially curated one and the publicly accessible UniToPatho dataset. The results reveal that our model markedly surpasses traditional deep convolutional neural networks, registering classification accuracies of 87.1% and 70.3% for the custom and UniToPatho datasets, respectively. Such results emphasize the transformative potential of our model in polyp classification endeavors. © 2024 Elsevier B.V., All rights reserved.
