1. Home
  2. Browse by Author

Browsing by Author "Orhan, Mehmet Emin"

Filter results by typing the first few letters
Now showing 1 - 3 of 3
  • Results Per Page
  • Sort Options
  • Loading...
    Thumbnail Image
    masterthesis.listelement.badge
    Advancing machine learning analysis of non-coding RNA: A novel approach of negative sequence generation
    (Abdullah Gül Üniversitesi, Fen Bilimleri Enstitüsü, 2024) Orhan, Mehmet Emin; AGÜ, Fen Bilimleri Enstitüsü, Biyomühendislik Ana Bilim Dalı
    Many supervised machine learning models have been developed for the classification and identification of non-coding RNA (ncRNA) sequences. These models play a significant role in the diagnosis and treatment of various diseases. During such analyses, positive learning datasets typically consist of known ncRNA examples, some of which may even be confirmed with strong experimental evidence. However, there is no database of validated negative sequences for ncRNA classes or standardized methodologies for generating high quality negative samples. To overcome this challenge, a new method for generating negative data called the NeRNA (Negative RNA) method has been developed in this study. NeRNA generates negative sequences using known ncRNA sequences and their octal representations, similar with frame shift mutations found in biology but without base deletions or insertions. In this thesis, the NeRNA method was tested separately with four different ncRNA datasets, including microRNA (miRNA), transfer RNA (tRNA), long non-coding RNA (lncRNA), and circular RNA (circRNA). Additionally, a species-specific case study was conducted to demonstrate and compare the performance of the study's miRNA predictions. The results of 1000-fold cross-validation on machine learning algorithms such as Decision Trees, Naive Bayes, Random Forest classifiers, and deep learning algorithms like Multilayer Perceptrons, Convolutional Neural Networks, and Simple Feedforward Neural Networks showed that models developed using datasets generated by NeRNA exhibited significantly high prediction performance. NeRNA has been published as an easy-to-use, updatable, and modifiable KNIME workflow, along with example datasets and required extensions that can be downloaded and utilized. NeRNA is designed specifically as a powerful tool for RNA sequence data analysis.
  • Loading...
    Thumbnail Image
    Article
    CiliaMiner: an integrated database for ciliopathy genes and ciliopathies
    (OXFORD UNIV PRESS, 2023) Turan, Merve Gül; Orhan, Mehmet Emin; Cevik, Sebiha; Kaptan, Oktay I.; 0000-0001-5783-7168; 0000-0002-1757-1374; 0000-0002-0935-1929; 0000-0002-8733-0920; AGÜ, Yaşam ve Doğa Bilimleri Fakültesi, Biyomühendislik Bölümü; Turan, Merve Gül; Orhan, Mehmet Emin; Cevik, Sebiha; Kaptan, Oktay I.
    Cilia are found in eukaryotic species ranging from single-celled organisms, such as Chlamydomonas reinhardtii, to humans, but not in plants. The ability to respond to repellents and/or attractants, regulate cell proliferation and differentiation and provide cellular mobility are just a few examples of how crucial cilia are to cells and organisms. Over 30 distinct rare disorders generally known as ciliopathy are caused by abnormalities or functional impairments in cilia and cilia-related compartments. Because of the complexity of ciliopathies and the rising number of ciliopathies and ciliopathy genes, a ciliopathy-oriented and up-to-date database is required. Here, we present CiliaMiner, a manually curated ciliopathy database that includes ciliopathy lists collected from articles and databases. Analysis reveals that there are 55 distinct disorders likely related to ciliopathy, with over 4000 clinical manifestations. Based on comparative symptom analysis and subcellular localization data, diseases are classifed as primary, secondary or atypical ciliopathies. CiliaMiner provides easy access to all of these diseases and disease genes, as well as clinical features and gene-specifc clinical features, as well as subcellular localization of each protein. Additionally, the orthologs of disease genes are also provided for mice, zebrafsh, Xenopus, Drosophila, Caenorhabditis elegans and Chlamydomonas reinhardtii. CiliaMiner (https://kaplanlab.shinyapps.io/ciliaminer) aims to serve the cilia community with its comprehensive content and highly enriched interactive heatmaps, and will be continually updated.
  • Loading...
    Thumbnail Image
    Article
    NeRNA: A negative data generation framework for machine learning applications of noncoding RNAs
    (PERGAMON-ELSEVIER SCIENCE, 2023) Orhan, Mehmet Emin; Demirci, Yilmaz Mehmet; Sacar Demirci, Muserref Duygu; 0000-0002-1757-1374; 0000-0003-3802-4211; 0000-0003-2012-0598; AGÜ, Yaşam ve Doğa Bilimleri Fakültesi, Biyomühendislik Bölümü; Orhan, Mehmet Emin; Demirci, Yilmaz Mehmet; Sacar Demirci, Muserref Duygu
    Many supervised machine learning based noncoding RNA (ncRNA) analysis methods have been developed to classify and identify novel sequences. During such analysis, the positive learning datasets usually consist of known examples of ncRNAs and some of them might even have weak or strong experimental validation. On the contrary, there are neither databases listing the confirmed negative sequences for a specific ncRNA class nor standardized methodologies developed to generate high quality negative examples. To overcome this challenge, a novel negative data generation method, NeRNA (negative RNA), is developed in this work. NeRNA uses known examples of given ncRNA sequences and their calculated structures for octal representation to create negative sequences in a manner similar to frameshift mutations but without deletion or insertion. NeRNA is tested individually with four different ncRNA datasets including microRNA (miRNA), transfer RNA (tRNA), long noncoding RNA (lncRNA), and circular RNA (circRNA). Furthermore, a species-specific case analysis is performed to demonstrate and compare the performance of NeRNA for miRNA prediction. The results of 1000 fold cross-validation on Decision Tree, Naïve Bayes and Random Forest classifiers, and deep learning algorithms such as Multilayer Perceptron, Convolutional Neural Network, and Simple feedforward Neural Networks indicate that models obtained by using NeRNA generated datasets, achieves substantially high prediction performance. NeRNA is released as an easy-to-use, updatable and modifiable KNIME workflow that can be downloaded with example datasets and required extensions. In particular, NeRNA is designed to be a powerful tool for RNA sequence data analysis.