Scopus İndeksli Yayınlar Koleksiyonu

Permanent URI for this collectionhttps://hdl.handle.net/20.500.12573/395

Browse

Search Results

Now showing 1 - 6 of 6
  • Conference Object
    The Effect of Different Classifiers on Recursive Cluster Elimination in the Analysis of Transcriptomic Data
    (Institute of Electrical and Electronics Engineers Inc., 2023-10-11) Bulut, Nurten; Bakir-Güngör, Burcu; Qaqish, Bahjat F.; Yousef, Malik
    Gene expression data with limited sample size and a large number of genes are frequently encountered in genetic studies. In such high-dimensional data, identification of genes that distinguish between disease states is a challenging task. Feature selection (FS) is a useful approach in dealing with high dimensionality. Support Vector Machines Recursive Cluster Elimination (SVM-RCE) is a technique for FS in high-dimensional data. The SVM-RCE approach has been utilized for identification of clusters of genes whose expression levels correlate with pathological state. A key step in SVM-RCE is the use of an SVM classifier to assign an area under the curve (AUC) score to each gene cluster based on its ability to predict class labels. In this study, we investigate the use of alternative classifiers in the cluster-scoring step. Specifically, we compare Support Vector Machines, Random Forest, XgBoost, Naive Bayes, and linear logistic regression. In addition to AUC score performance evaluation, the algorithms are compared in terms of the number of selected genes at different levels of clustering and in terms of the running time. © 2023 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - Scopus: 5
    Identifying Taxonomic Biomarkers of Colorectal Cancer in Human Intestinal Microbiota Using Multiple Feature Selection Methods
    (Institute of Electrical and Electronics Engineers Inc., 2022-09-07) Jabeer, Amhar; Kocak, Aysegul; Akkaş, Huseyin; Yenisert, Ferhan; Nalbantoĝlu, Özkan Ufuk; Yousef, Malik; Bakir-Güngör, Burcu; Bakir Gungor, Burcu
    A variety of bacterial species called gut microbiota work together to maintain a steady intestinal environment. The gastrointestinal tract contains tremendous amount of different species including archaea, bacteria, fungi, and viruses. While these organisms are crucial immune system stabilizers, the dysbiosis of the intestinal flora has been related to gastrointestinal disorders including Colorectal cancer (CRC), intestinal cancer, irritable bowel syndrome and inflammatory bowel disease. In the last decade, next-generation sequencing (NGS) methods have accelerated the identification of human gut flora. CRC is a deathly condition that has been on the rise in the last century, affecting half a million people each year. Since early CRC diagnosis is critical for an effective treatment, there is an immediate requirement for a classification system that can expedite CRC diagnosis. In this study, via analyzing the available metagenomics data on CRC, we aim to facilitate the CRC diagnosis via finding biomarkers linked with CRC, and via building a classification model. We have obtained the metagenomic sequencing data of the healthy individuals and CRC patients from a metagenome-wide association analysis and we have classified this data according to the disease stages. Conditional Mutual Information Maximization (CMIM), Fast Correlation Based Filter (FCBF), Extreme Gradient Boosting (XGBoost), min redundancy max relevance (mRMR), Information Gain (IG) and Select K Best (SKB) feature selection algorithms were utilized to cope with the complexity of the features. We observed that the SKB, IG, and XGBoost techniques made significant contributions to decrease the microbiota in use for CRC diagnosis, thereby reducing cost and time. We realized that our Random Forest classifier outperformed Adaboost, Support Vector Machine, Decision Tree, Logitboost and stacking ensemble classifiers in terms of CRC classification performance. Our results reiterated some known and some potential microbiome associated mechanisms in CRC, which could aid the design of new diagnostics based on the microbiome. © 2022 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - Scopus: 2
    Effect of Recursive Cluster Elimination With Different Clustering Algorithms Applied to Gene Expression Data
    (Institute of Electrical and Electronics Engineers Inc., 2023-10-11) Kuzudisli, Cihan; Bakir-Güngör, Burcu; Qaqish, Bahjat F.; Yousef, Malik
    Feature selection (FS) is an effective tool in dealing with high dimensionality and reducing computational cost. Support Vector Machines-Recursive Cluster Elimination (SVM-RCE) is one of several algorithms that have been developed for FS in high dimensional data. SVM-RCE involves a clustering step which originally is k-means. Using various performance metrics, three alternative algorithms are evaluated in this context; k-medoids, Hierarchical Clustering (HC), and Gaussian Mixture Model (GMM). Comparisons will be carried out on five publicly available gene expression datasets. The results show that k-means in SVM-RCE obtains higher performance than other tested algorithms in terms of classification performance. Additionally, HC shows a similar performance to k-means. Our findings show superiority of using k-means. This study can contribute to the development of SVM-RCE with different variations, leading to decrease in the number of selected genes, and an increase in prediction performance. © 2023 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - Scopus: 3
    İki Durumlu Bir Beyin Bilgisayar Arayüzünde Özellik Çıkarımı ve Sınıflandırma
    (Institute of Electrical and Electronics Engineers Inc., 2016-10) Altindis, Fatih; Yilmaz, Bulent
    Brain Computer Interface (BCI) technology is used to help patients who do not have control over motor neurons such as ALS or paralyzed patients, to communicate with outer world. This work aims to classify motor imageries using real-time EEG dataset, which was published by Graz University, Austria. The dataset consists of two-channel EEG signals of right-hand movement imagery and left-hand movement imagery of 8 subjects. There are a total of 120 motor imagery trials (60 left and 60 right) EEG signals recorded from each subject. EEG signals are filtered and feature vectors were extracted that consist of 24, 32 and 40 relative band power values (RBPV). In this work, feature vectors classified by three different methods, linear discriminant analysis (LDA), K nearest neighbor (KNN) and support vector machines (SVM). Results show that best performance was achieved by 24 RBPV feature vector and LDA classification method. © 2017 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - Scopus: 1
    Man-Hour Prediction for Complex Industrial Products
    (Institute of Electrical and Electronics Engineers Inc., 2023) Unal, Ahmet Emin; Boyar, Halit; Kuleli Pak, Burcu Kuleli; Cem Yildiz, Mehmet; Erten, Ali Erman; Güngör, Vehbi Çağrı; Pak, Burcu Kuleli; Cagri Gungor, Vehbi
    Accurately predicting the cost is crucial for the success of complex industrial projects. There can be several sources contributing to the cost. Traditional methods for cost estimation may not provide the required accuracy and speed to ensure the success of the project. Recently, machine learning techniques have shown promising results in improving cost estimation in various industrial products. This study investigates the performance of gradient-boosting machine learning models and feature engineering techniques on a private dataset of metal sheet project man-hour costs. A comparison of distinct models is conducted, key aspects influencing cost are identified, and the implications of incorporating domain-specific knowledge, including its advantages and disadvantages, are assessed based on performance outcomes. Experimental results demonstrate that LightGBM and XGBoost outperform other models, and feature selection and synthetic data generation techniques improve the performance. Overall, this study highlights the potential of machine learning in metal sheet sampling projects and emphasizes the importance of feature engineering and domain expertise for better model performance. © 2024 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - Scopus: 5
    Emotion Detection Using Multivariate Synchrosqueezing Transform via 2D Circumplex Model
    (Institute of Electrical and Electronics Engineers Inc., 2018) Ozel, Pinar; Akan, Aydin; Yilmaz, Bulent; Özel, Pınar; Akan, Aydin I.; Yilmaz, Bulent
    Emotion detection by utilizing signal processing methods is a challenging area. An open issue in emotional modeling is to obtain an optimum feature set to use for the classification process. This study proposes an approach for emotional state classification by the investigation of EEG signals via multivariate synchrosqueezing transform (MSST). MSST is a post-processing technique to compose a localized time-frequency representation yielding multivariate syncyrosqueezing coefficients. After obtaining these coefficients from EEG signals for 18 subjects from DEAP dataset, coefficients and self-assessment-mannequins (SAM) labels of those subjects are used for emotional state classification by using support vector machines (SVM) nearest neighbor, decision tree, and ensemble methods. The accuracy rate is 70.6% for high valence high arousal (HVHA), 75.4% for low valence high arousal (LVHA), 77.8% for high valence low arousal (HVLA), and 77.2% for low valence low arousal (LVLA) cases using SVM. © 2019 Elsevier B.V., All rights reserved.