Scopus İndeksli Yayınlar Koleksiyonu

Permanent URI for this collectionhttps://hdl.handle.net/20.500.12573/395

Browse

Search Results

Now showing 1 - 10 of 12

Ensemble Churn Prediction for Internet Service Provider with Machine Learning Techniques
(IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA, 2020) Goy, Gokhan; Kolukisa, Burak; Bahcevan, Cenk; Gungor, Vehbi Cagri
With the developing technology in every fields, a competitive marketing environment has been arised In this competitive environment analyzing customer behavior has become vital In particular, the ability to easily change any service provider has become vet) , critical for the company to continue its existence At the same time, the amount of financial resources spent on retaining instituters much less than to obtain new clients. In this context, the traditional methods of examining vast amount of data obtained today for establishing decision support systems have lost their validities In this study. we used a dataset which is provided by TurkNet serving as an internet service provider in Turkey. Various preprocessing steps has performed on this dataset and then classification algorithms ran. Afterwards results have obtained and compared. The results of these experiments analyzed in terms of the area under the curve value In this context the aunt successful classifier algorithm has been determined as the Random Trees algorithm with a value of 0.936.
Citation - WoS: 15
Citation - Scopus: 15
PriPath: Identifying Dysregulated Pathways From Differential Gene Expression via Grouping, Scoring, and Modeling With an Embedded Feature Selection Approach
(BMC, 2023-02-23) Yousef, Malik; Ozdemir, Fatma; Jaber, Amhar; Allmer, Jens; Bakir-Gungor, Burcu
BackgroundCell homeostasis relies on the concerted actions of genes, and dysregulated genes can lead to diseases. In living organisms, genes or their products do not act alone but within networks. Subsets of these networks can be viewed as modules that provide specific functionality to an organism. The Kyoto encyclopedia of genes and genomes (KEGG) systematically analyzes gene functions, proteins, and molecules and combines them into pathways. Measurements of gene expression (e.g., RNA-seq data) can be mapped to KEGG pathways to determine which modules are affected or dysregulated in the disease. However, genes acting in multiple pathways and other inherent issues complicate such analyses. Many current approaches may only employ gene expression data and need to pay more attention to some of the existing knowledge stored in KEGG pathways for detecting dysregulated pathways. New methods that consider more precompiled information are required for a more holistic association between gene expression and diseases.ResultsPriPath is a novel approach that transfers the generic process of grouping and scoring, followed by modeling to analyze gene expression with KEGG pathways. In PriPath, KEGG pathways are utilized as the grouping function as part of a machine learning algorithm for selecting the most significant KEGG pathways. A machine learning model is trained to differentiate between diseases and controls using those groups. We have tested PriPath on 13 gene expression datasets of various cancers and other diseases. Our proposed approach successfully assigned biologically and clinically relevant KEGG terms to the samples based on the differentially expressed genes. We have comparatively evaluated the performance of PriPath against other tools, which are similar in their merit. For each dataset, we manually confirmed the top results of PriPath in the literature and found that most predictions can be supported by previous experimental research.ConclusionsPriPath can thus aid in determining dysregulated pathways, which applies to medical diagnostics. In the future, we aim to advance this approach so that it can perform patient stratification based on gene expression and identify druggable targets. Thereby, we cover two aspects of precision medicine.
Multifunction Optoelectronic Gate
(Wiley-Blackwell, 2015-02-24) Ozharar, Sarper; Ozdur, Ibrahim; Delfyett, Peter J.
A multifunction optoelectronic gate that can perform as any desired logic gate of two variables was theoretically proposed and a simplified version is experimentally demonstrated. The proposed optoelectronic gate is dynamically configurable, and being wavelength independent, it can act on multiple input optical bits and realize different functions simultaneously. (c) 2015 Wiley Periodicals, Inc. Microwave Opt Technol Lett 57:969-972, 2015
Citation - WoS: 5
Citation - Scopus: 5
Investigating the Carbon Border Adjustment Mechanism Transition Process With Linguistic Summarization Method: A Situational Analysis of Exporting Countries
(Elsevier Sci Ltd, 2024-08) Fidan, Fatma Sener; Aydogan, Sena; Akay, Diyar; Şener Fidan, Fatma
The Paris Agreement holds significant importance since it establishes a global framework for addressing the issue of climate change and endeavors to mitigate the release of greenhouse gases. The Carbon Border Adjustment Mechanism was introduced as an integral component of this agreement, aiming to oversee the carbon emissions associated with imported items within the European Union and provide compensation for the emissions from the nations engaged in importation. It is essential to analyze the countries involved in exporting to the European Union within the Carbon Border Adjustment Mechanism context to mitigate carbon leakage and effectively support the objectives outlined in the Paris Agreement. This research investigated 104 nations engaged in exporting activities to 27 European Union member countries. The linguistic summarization method, a descriptive data analytics tool, was employed for the analysis. A total of 42 Combined Nomenclature codes were encompassed within the scope of evaluation throughout the transition phase of the Carbon Border Adjustment Mechanism. This study examines the characteristics of exporting nations based on three variables: The Environmental Performance Index, a sustainability indicator; the Region in which the countries are located as classified by the World Bank; and the quantity of Renewable Energy Consumption. Additionally, the study explores the characteristics of EU countries, focusing on their Environmental Performance Index score and geography. The study employed fuzzy sets and the fuzzy c-means algorithm as parts of the linguistic summarization technique. Polyadic quantifiers were used to extract linguistic summaries, resulting in the acquisition of 124,227 summaries. A total of 1594 summaries have a truth degree exceeding 0.9. The findings were effectively utilized to assess the influence of the linguistic summarization approach and offered a valuable viewpoint for decisionmakers needing more expertise in this domain.
Citation - WoS: 5
Citation - Scopus: 5
Integrated Querying and Version Control of Context-Specific Biological Networks
(Oxford Univ Press, 2020) Cowman, Tyler; Coskun, Mustafa; Grama, Ananth; Koyuturk, Mehmet
Motivation: Biomolecular data stored in public databases is increasingly specialized to organisms, context/pathology and tissue type, potentially resulting in significant overhead for analyses. These networks are often specializations of generic interaction sets, presenting opportunities for reducing storage and computational cost. Therefore, it is desirable to develop effective compression and storage techniques, along with efficient algorithms and a flexible query interface capable of operating on compressed data structures. Current graph databases offer varying levels of support for network integration. However, these solutions do not provide efficient methods for the storage and querying of versioned networks. Results: We present VerTIoN, a framework consisting of novel data structures and associated query mechanisms for integrated querying of versioned context-specific biological networks. As a use case for our framework, we study network proximity queries in which the user can select and compose a combination of tissue-specific and generic networks. Using our compressed version tree data structure, in conjunction with state-of-the-art numerical techniques, we demonstrate real-time querying of large network databases. Conclusion: Our results show that it is possible to support flexible queries defined on heterogeneous networks composed at query time while drastically reducing response time for multiple simultaneous queries. The flexibility offered by VerTIoN in composing integrated network versions opens significant new avenues for the utilization of ever increasing volume of context-specific network data in a broad range of biomedical applications. Availability and Implementation: VerTIoN is implemented as a C++ library and is available at http://compbio.case.edu/omics/software/vertion and https://github.com/tjcowman/vertion Contact: tyler.cowman@case.edu
Identify Commonly Affected Pathways in Psychiatric Diseases
(Institute of Electrical and Electronics Engineers Inc., 2018-09) Bulut, Umit; Bakir-Güngör, Burcu
Genome-wide association studies (GWAS) are an extraordinary source of information when it comes to revealing the common variations of human complex diseases. Until now, the large amount of data generated from these studies have not been shown its full potential enough to identify the molecular and functional framework to be able to understand how a molecular system works. Following a more specific perspective, this study focused on the identification of commonly affected pathways of psychiatric diseases. The pathway term as used in molecular biology, depicts a simplified model of a process within the cell or tissue. Lately, several GWAS datasets are publicly available for various disease types such as psychiatric, immune-related, neurodegenerative, cardiovascular and such. A study on each disease and pairwise comparison to understand the behavior of disease and system would be time consuming and exhaustive. Instead of handling the results of these studies one by one, grouping diseases by target points is a more efficient way. This work aims to get one step closer to reveal key points of diseases and target these points to develop personalized medicine approaches. Especially for complex diseases, every drug doesn't show the same effect in every people. This paper contains the definition of molecular pathways, methods to identify disease related pathways, and to find common pathways pairwise in psychiatric diseases. © 2019 Elsevier B.V., All rights reserved.
Citation - WoS: 23
Citation - Scopus: 52
Evaluation of Classification Algorithms, Linear Discriminant Analysis and a New Hybrid Feature Selection Methodology for the Diagnosis of Coronary Artery Disease
(Institute of Electrical and Electronics Engineers Inc., 2018-12) Kolukisa, Burak; Hacilar, Hilal; Göy, Gökhan; Kus, Mustafa; Bakir-Güngör, Burcu; Aral, Atilla; Güngör, Vehbi Çağrı
According to the World Health Organization (WHO), 31% of the world's total deaths in 2016 (17.9 million) was due to cardiovascular diseases (CVD). With the development of information technologies, it has become possible to predict whether people have heart diseases or not by checking certain physical and biochemical values at a lower cost. In this study, we have evalated a set of different classification algorithms, linear discriminant analysis and proposed a new hybrid feature selection methodology for the diagnosis of coronary heart diseases (CHD). Throughout this research effort, using three publicly available Heart Disease diagnosis datasets (UCI Machine Learning Repository), we have conducted comparative performance evaluations in terms of accuracy, sensitivity, specificity, F-measure, AUC and running time. © 2023 Elsevier B.V., All rights reserved.
Citation - WoS: 2
Citation - Scopus: 4
Data Mining Techniques in Direct Marketing on Imbalanced Data Using Tomek Link Combined With Random Under-Sampling
(Assoc Computing Machinery, 2021-05-27) Yilmaz, Umit; Gezer, Cengiz; Aydin, Zafer; Gungor, V. CaGri; Yllmaz, Ümit; Aydln, Zafer
Determining the potential customers is very important in direct marketing. Data mining techniques are one of the most important methods for companies to determine potential customers. However, since the number of potential customers is very low compared to the number of non-potential customers, there is a class imbalance problem that significantly affects the performance of data mining techniques. In this paper, different combinations of basic and advanced resampling techniques such as Synthetic Minority Over-sampling Technique (SMOTE), Tomek Link, RUS, and ROS were evaluated to improve the performance of customer classification. Different feature selection techniques are used in order the decrease the number of non-informative features from the data such as Information Gain, Gain Ratio, Chi-squared, and Relief. Classification performance was compared and utilized using several data mining techniques, such as LightGBM, XGBoost, Gradient Boost, Random Forest, AdaBoost, ANN, Logistic Regression, Decision Trees, SVC, Bagging Classifier based on ROC AUC and sensitivity metrics. A combination of Tomek Link and Random Under-Sampling as a resampling technique and Chi-squared method as feature selection algorithm showed superior performance among the other combinations. Detailed performance evaluations demonstrated that with the proposed approach, LightGBM, which is a gradient boosting algorithm based on decision tree, gave the best results among the other classifiers with 0.947 sensitivity and 0.896 ROC AUC value.
A Data Mining Method for Refining Groups in Data Using Dynamic Model Based Clustering
(IEEE, 2013-06) Servi, Tayfun; Erol, H.
A new data mining method is proposed for determining the number and structure of clusters, and refining groups in multivariate heterogeneous data set including groups, partly and completely overlapped group structures by using dynamic model based clustering. It is called dynamic model based clustering since the structure of model changes at each stage of refinement process dynamically. The proposed data mining method works without data reduction for high dimensional data in which some of variables including completely overlapped situations. © 2013 IEEE. © 2013 Elsevier B.V., All rights reserved.
Citation - WoS: 3
Citation - Scopus: 13
NSEM: Duygu Analizi için Özgün Yıǧınlanmiş Topluluk Yöntemi
(Institute of Electrical and Electronics Engineers Inc., 2018-09) Işik, Yunus Emre; Görmez, Yasin; Kaynar, Oǧuz; Aydin, Zafer; Emre Isik, Yunus
Today, people often share their ideas, opinions and feelings through forums, social media sites, blogs and similar platforms. For this reason, access to these data has become very easy. Increase in the number of shares makes it possible to analyze and use these data in terms of marketing and politics. However, due to the large number of data, it is impossible that this analysis will be done by humans. Determination of what type of emotion is included automatically is done by sentiment analysis methods. In these methods, the text is defined as a mathematical vector and classified by machine learning methods. Ensemble methods are one of the most important methods used as classifiers in sentiment analysis. In these methods, a classifier error is tried to be solved by another classifier. In sentiment analysis, the feature vector that describes the text is as important as the classifier. Feature vectors obtained using different methods can make mistakes in different places. For this reason, in this study, NSEM is proposed for sentiment analysis, which is a new ensemble method that uses 2 different classifiers and 2 different feature extraction methods. As a result of the analysis, the proposed method is the most successful method with an accuracy rate of 79.1%. © 2019 Elsevier B.V., All rights reserved.

Scopus İndeksli Yayınlar Koleksiyonu

Browse

Filters

Settings

Sort By

Results per page

Search Results