Scopus İndeksli Yayınlar Koleksiyonu

Permanent URI for this collectionhttps://hdl.handle.net/20.500.12573/395

Browse

Search Results

Now showing 1 - 3 of 3
  • Article
    Citation - WoS: 3
    Citation - Scopus: 3
    MicroRNA Prediction Based on 3D Graphical Representation of RNA Secondary Structures
    (Tubitak Scientific & Technological Research Council Turkey, 2019-08-05) Sacar Demirci, Muserref Duygu; Demirci, Müşerref Duygu Saçar
    MicroRNAs (miRNAs) are posttranscriptional regulators of gene expression. While a miRNA can target hundreds of messenger RNA (mRNAs), an mRNA can be targeted by different miRNAs, not to mention that a single miRNA might have various binding sites in an mRNA sequence. Therefore, it is quite involved to investigate miRNAs experimentally. Thus, machine learning (ML) is frequently used to overcome such challenges. The key parts of a ML analysis largely depend on the quality of input data and the capacity of the features describing the data. Previously, more than 1000 features were suggested for miRNAs. Here, it is shown that using 36 features representing the RNA secondary structure and its dynamic 3D graphical representation provides up to 98% accuracy values. In this study, a new approach for ML-based miRNA prediction is proposed. Thousands of models are generated through classification of known human miRNAs and pseudohairpins with 3 classifiers: decision tree, naive Bayes, and random forest. Although the method is based on human data, the best model was able to correctly assign 96% of nonhuman hairpins from MirGeneDB, suggesting that this approach might be useful for the analysis of miRNAs from other species.
  • Article
    Citation - Scopus: 5
    Hyperplastic and Tubular Polyp Classification Using Machine Learning and Feature Selection
    (Elsevier B.V., 2024) Doǧan, Refika Sultan; Akay, Ebru; Doǧan, Serkan; Yilmaz, Bulent
    Purpose: The aim of this study is to develop an effective approach for differentiating between hyperplastic and tubular adenoma colon polyps, which is one of the most difficult tasks in colonoscopy procedures. The main research challenge is how to improve the classification of these polyp subtypes applying various focusing levels on the polyp images, data preprocessing approaches, and classification algorithms. Methods: This study employed 202 colonoscopy videos from a total of 201 patients, focusing on 59 videos containing hyperplastic and tubular adenoma polyps. Manually extract key frames and several feature extraction and classification techniques were applied. The influence of different datasets with various focuses as well as data preprocessing steps on the performance of classification was examined, and AUC values were calculated using ten classifiers. Results: The study discovered that the optimal dataset, data preprocessing method, and classification algorithm all had significant effects on classification results. The Random Forest model with the Recursive Feature Elimination (RFE) feature selection approach, for example, consistently outperformed other models and achieved the highest AUC value of 0.9067. In terms of accuracy, F1 score, recall, and AUC, the suggested model outperformed a gastroenterologist, nevertheless precision remained slightly lower. Conclusion: This study emphasizes the importance of dataset selection, data preprocessing, and feature selection in enhancing the classification of difficult colon polyp subtypes. The suggested model offers a promising model for the clinical differentiation of hyperplastic and tubular adenoma polyps, potentially improving diagnostic accuracy in gastroenterology. © 2024 Elsevier B.V., All rights reserved.
  • Article
    Citation - Scopus: 4
    CCPred: Global and Population-Specific Colorectal Cancer Prediction and Metagenomic Biomarker Identification at Different Molecular Levels Using Machine Learning Techniques
    (Elsevier Ltd, 2024-11) Bakir-Güngör, Burcu; Temiz, Mustafa; Inal, Yasin; Cicekyurt, Emre; Yousef, Malik
    Colorectal cancer (CRC) ranks as the third most common cancer globally and the second leading cause of cancer-related deaths. Recent research highlights the pivotal role of the gut microbiota in CRC development and progression. Understanding the complex interplay between disease development and metagenomic data is essential for CRC diagnosis and treatment. Current computational models employ machine learning to identify metagenomic biomarkers associated with CRC, yet there is a need to improve their accuracy through a holistic biological knowledge perspective. This study aims to evaluate CRC-associated metagenomic data at species, enzymes, and pathway levels via conducting global and population-specific analyses. These analyses utilize relative abundance values from human gut microbiome sequencing data and robust classification models are built for disease prediction and biomarker identification. For global CRC prediction and biomarker identification, the features that are identified by SelectKBest (SKB), Information Gain (IG), and Extreme Gradient Boosting (XGBoost) methods are combined. Population-based analysis includes within-population, leave-one-dataset-out (LODO) and cross-population approaches. Four classification algorithms are employed for CRC classification. Random Forest achieved an AUC of 0.83 for species data, 0.78 for enzyme data and 0.76 for pathway data globally. On the global scale, potential taxonomic biomarkers include ruthenibacterium lactatiformanas; enzyme biomarkers include RNA 2′ 3′ cyclic 3′ phosphodiesterase; and pathway biomarkers include pyruvate fermentation to acetone pathway. This study underscores the potential of machine learning models trained on metagenomic data for improved disease prediction and biomarker discovery. The proposed model and associated files are available at https://github.com/TemizMus/CCPRED. © 2024 Elsevier B.V., All rights reserved.