Scopus İndeksli Yayınlar Koleksiyonu

Permanent URI for this collectionhttps://hdl.handle.net/20.500.12573/395

Browse

Search Results

Now showing 1 - 4 of 4

Developing a Label Propagation Approach for Cancer Subtype Classification Problem
(TUBITAK, 2021) Güner, P.; Bakir-Güngör, B.; Coşkun, M.; Şahan, Pınar Güner
Cancer is a disease in which abnormal cells grow uncontrollably and invade other tissues. Several types of cancer have various subtypes with different clinical and biological implications. Based on these differences, treatment methods need to be customized. The identification of distinct cancer subtypes is an important problem in bioinformatics, since it can guide future precision medicine applications. In order to design targeted treatments, bioinformatics methods attempt to discover common molecular pathology of different cancer subtypes. Along this line, several computational methods have been proposed to discover cancer subtypes or to stratify cancer into informative subtypes. However, existing works do not consider the sparseness of data (genes having low degrees) and result in an ill-conditioned solution. To address this shortcoming, in this paper, we propose an alternative unsupervised method to stratify cancer patients into subtypes using applied numerical algebra techniques. More specifically, we applied a label propagation-based approach to stratify somatic mutation profiles of colon, head and neck, uterine, bladder, and breast tumors. We evaluated the performance of our method by comparing it to the baseline methods. Extensive experiments demonstrate that our approach highly renders tumor classification tasks by largely outperforming the state-of-the-art unsupervised and supervised approaches. © 2022 Elsevier B.V., All rights reserved.
Forecasting the Consumer Price Index in Türkiye Using Machine Learning Models: A Comparative Analysis
(Gazi Univ, 2025-09-01) Söylemez, İsmet; Ünlü, Ramazan; Nalici, Mehmet Eren
This study utilizes machine learning models to forecast Türkiye's Consumer Price Index (CPI), thereby addressing a critical gap in inflation prediction methodologies. The central research problem involves the forecasting of CPI in a volatile economic environment, which is essential for informed policymaking. The primary objective of this study is to evaluate the performance of three machine learning models, such as Decision Tree (DT), Random Forest (RF), and Support Vector Machine (SVM), in forecasting CPI over periods ranging from one to six months, utilizing data from 2012 to 2024. The study's unique contribution lies in the application of the \"SelectKBest\" method, which identifies the most relevant indices, thereby enhancing the efficiency of the models. An ensemble method, Averaging Voting, is also employed to combine the strengths of these models, producing more accurate and robust predictions. The findings indicate that while the RF model consistently generates the most accurate forecasts across all shifts, the SVM model demonstrates a particular strength in the domain of short-term predictions. The ensemble model demonstrates a substantial performance improvement, with a R2 value of 0.962 for one-month ahead of estimates and 0.956 for five-month forecasts. This combined approach has been shown to outperform individual models, offering a more reliable framework for CPI forecasting. The findings offer valuable insights for economic policymakers, enabling more precise and stable inflation predictions in Türkiye.
Citation - WoS: 3
Citation - Scopus: 3
MicroRNA Prediction Based on 3D Graphical Representation of RNA Secondary Structures
(Tubitak Scientific & Technological Research Council Turkey, 2019-08-05) Sacar Demirci, Muserref Duygu; Demirci, Müşerref Duygu Saçar
MicroRNAs (miRNAs) are posttranscriptional regulators of gene expression. While a miRNA can target hundreds of messenger RNA (mRNAs), an mRNA can be targeted by different miRNAs, not to mention that a single miRNA might have various binding sites in an mRNA sequence. Therefore, it is quite involved to investigate miRNAs experimentally. Thus, machine learning (ML) is frequently used to overcome such challenges. The key parts of a ML analysis largely depend on the quality of input data and the capacity of the features describing the data. Previously, more than 1000 features were suggested for miRNAs. Here, it is shown that using 36 features representing the RNA secondary structure and its dynamic 3D graphical representation provides up to 98% accuracy values. In this study, a new approach for ML-based miRNA prediction is proposed. Thousands of models are generated through classification of known human miRNAs and pseudohairpins with 3 classifiers: decision tree, naive Bayes, and random forest. Although the method is based on human data, the best model was able to correctly assign 96% of nonhuman hairpins from MirGeneDB, suggesting that this approach might be useful for the analysis of miRNAs from other species.
Citation - WoS: 8
Citation - Scopus: 10
Lung Cancer Subtype Differentiation From Positron Emission Tomography Images
(Tubitak Scientific & Technological Research Council Turkey, 2020-01-27) Ayyildiz, Oguzhan; Aydin, Zafer; Yilmaz, Bulent; Karacavus, Seyhan; Senkaya, Kubra; Icer, Semra; Kaya, Eser; Taşdemir, Arzu
Lung cancer is one of the deadly cancer types, and almost 85% of lung cancers are nonsmall cell lung cancer (NSCLC). In the present study we investigated classification and feature selection methods for the differentiation of two subtypes of NSCLC, namely adenocarcinoma (ADC) and squamous cell carcinoma (SqCC). The major advances in understanding the effects of therapy agents suggest that future targeted therapies will be increasingly subtype specific. We obtained positron emission tomography (PET) images of 93 patients with NSCLC, 39 of which had ADC while the rest had SqCC. Random walk segmentation was applied to delineate three-dimensional tumor volume, and 39 texture features were extracted to grade the tumor subtypes. We examined 11 classifiers with two different feature selection methods and the effect of normalization on accuracy. The classifiers we used were the k-nearest-neighbor, logistic regression, support vector machine, Bayesian network, decision tree, radial basis function network, random forest, AdaBoostM1, and three stacking methods. To evaluate the prediction accuracy we performed a leave-one-out cross-validation experiment on the dataset. We also considered optimizing certain hyperparameters of these models by performing 10-fold cross-validation separately on each training set. We found that the stacking ensemble classifier, which combines a decision tree, AdaBoostM1, and logistic regression methods by a metalearner, was the most accurate method for detecting subtypes of NSCLC, and normalization of feature sets improved the accuracy of the classification method.

Scopus İndeksli Yayınlar Koleksiyonu

Browse

Filters

Settings

Sort By

Results per page

Search Results