Browsing by Author "Goy, Gokhan"

Now showing 1 - 6 of 6

Ensemble Churn Prediction for Internet Service Provider with Machine Learning Techniques
(IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA, 2020) Goy, Gokhan; Kolukisa, Burak; Bahcevan, Cenk; Gungor, Vehbi Cagri
With the developing technology in every fields, a competitive marketing environment has been arised In this competitive environment analyzing customer behavior has become vital In particular, the ability to easily change any service provider has become vet) , critical for the company to continue its existence At the same time, the amount of financial resources spent on retaining instituters much less than to obtain new clients. In this context, the traditional methods of examining vast amount of data obtained today for establishing decision support systems have lost their validities In this study. we used a dataset which is provided by TurkNet serving as an internet service provider in Turkey. Various preprocessing steps has performed on this dataset and then classification algorithms ran. Afterwards results have obtained and compared. The results of these experiments analyzed in terms of the area under the curve value In this context the aunt successful classifier algorithm has been determined as the Random Trees algorithm with a value of 0.936.
Citation - WoS: 25
Citation - Scopus: 31
miRmoduleNet: Detecting miRNA-mRNA Regulatory Modules
(Frontiers Media S.A., 2022) Yousef, Malik; Goy, Gokhan; Bakir-Gungor, Burcu
Increasing evidence that MicroRNAs (miRNAs) play a key role in carcinogenesis has revealed the need for elucidating the mechanisms of miRNA regulation and the roles of miRNAs in gene-regulatory networks. A better understanding of the interactions between miRNAs and their mRNA targets will provide a better understanding of the complex biological processes that occur during carcinogenesis. Increased efforts to reveal these interactions have led to the development of a variety of tools to detect and understand these interactions. We have recently described a machine learning approach miRcorrNet, based on grouping and scoring (ranking) groups of genes, where each group is associated with a miRNA and the group members are genes with expression patterns that are correlated with this specific miRNA. The miRcorrNet tool requires two types of -omics data, miRNA and mRNA expression profiles, as an input file. In this study we describe miRModuleNet, which groups mRNA (genes) that are correlated with each miRNA to form a star shape, which we identify as a miRNA-mRNA regulatory module. A scoring procedure is then applied to each module to further assess their contribution in terms of classification. An important output of miRModuleNet is that it provides a hierarchical list of significant miRNA-mRNA regulatory modules. miRModuleNet was further validated on external datasets for their disease associations, and functional enrichment analysis was also performed. The application of miRModuleNet aids the identification of functional relationships between significant biomarkers and reveals essential pathways involved in cancer pathogenesis.
Makine Öğrenmesi Teknikleri ile İnternet Servis Sağlayicisi için Müşteri Kayip Tahmini
(IEEE, 2020) Goy, Gokhan; Kolukisa, Burak; Bahcevan, Cenk; Gungor, Vehbi Cagri
With the developing technology in every fields, a competitive marketing environment has been arised In this competitive environment analyzing customer behavior has become vital In particular, the ability to easily change any service provider has become vet) , critical for the company to continue its existence At the same time, the amount of financial resources spent on retaining instituters much less than to obtain new clients. In this context, the traditional methods of examining vast amount of data obtained today for establishing decision support systems have lost their validities In this study. we used a dataset which is provided by TurkNet serving as an internet service provider in Turkey. Various preprocessing steps has performed on this dataset and then classification algorithms ran. Afterwards results have obtained and compared. The results of these experiments analyzed in terms of the area under the curve value In this context the aunt successful classifier algorithm has been determined as the Random Trees algorithm with a value of 0.936.
Enlightening the Molecular Mechanisms of Type 2 Diabetes With a Novel Pathway Clustering and Pathway Subnetwork Approach
(Tubitak Scientific & Technological Research Council Turkey, 2022) Bakir-Gungor, Burcu; Yazici, Miray Unlu; Goy, Gokhan; Temiz, Mustafa
Type 2 diabetes mellitus (T2D) constitutes 90% of the diabetes cases, and it is a complex multifactorial disease. In the last decade, genome-wide association studies (GWASs) for T2D successfully pinpointed the genetic variants (typically single nucleotide polymorphisms, SNPs) that associate with disease risk. In order to diminish the burden of multiple testing in GWAS, researchers attempted to evaluate the collective effects of interesting variants. In this regard, pathway-based analyses of GWAS became popular to discover novel multigenic functional associations. Still, to reveal the unaccounted 85 to 90% of T2D variation, which lies hidden in GWAS datasets, new post-GWAS strategies need to be developed. In this respect, here we reanalyze three metaanalysis data of GWAS in T2D, using the methodology that we have developed to identify disease-associated pathways by combining nominally significant evidence of genetic association with the known biochemical pathways, protein-protein interaction (PPI) networks, and the functional information of selected SNPs. In this research effort, to enlighten the molecular mechanisms underlying T2D development and progress, we integrated different in silico approaches that proceed in top-down manner and bottom-up manner, and presented a comprehensive analysis at protein subnetwork, pathway, and pathway subnetwork levels. Using the mutual information based on the shared genes, the identified protein subnetworks and the affected pathways of each dataset were compared. While most of the identified pathways recapitulate the pathophysiology of T2D, our results show that incorporating SNP functional properties, PPI networks into GWAS can dissect leading molecular pathways, and it could offer improvement over traditional enrichment strategies.
A New Method to Identify Affected Pathway Subnetworks and Clusters in Colon Cancer
(IEEE, 345 E 47TH ST, NEW YORK, NY 10017 USA, 2019) Goy, Gokhan; Yazici, Miray Unlu; Bakir-Gungor, Buren
Nowadays new technological developments that play an important role in the production of big data have brought about the interpretation, sharing and storage of data related to complex diseases. Combining multi-omic data in different molecular levels is potentially important for understanding the biological origin of complex diseases. One of these complex diseases is cancer of different types, which has one of the highest causes of death worldwide. The integration of multiple omic data in the framework of a comprehensive analysis and identification of relevant pathways contribute to the development of therapeutic approaches related to disease. In this study, RNA and methylation data (genes and p values) of colon adenocarcinoma were obtained from TCGA data portal and combined with Fisher's method. While protein subnetworks affected by the disease were identified by using subnetwork algorithm, pathways related to the disease and genes associated with these pathways were determined by functional enrichment analysis. Using gene-pathway relationship matrix, kappa scores of pathways were determined by similarity calculation. In this way, the pathways were clustered according to the hierarchically optimal number, as a result, the most important pathway clusters and related genes that are effective in disease formation identified.
Citation - WoS: 2
Credit Card Fraud Detection With Machine Learning Methods
(IEEE, 2019) Goy, Gokhan; Gezer, Cengiz; Gungor, Vehbi Cagri
With the increase in credit card usage of people, the credit card transactions increase dramatically. It is difficult to identify fraudulent transactions among the vast amount of credit card transactions. Although credit card fraud is limited in number of transactions, it causes serious problems in terms of financial losses for individuals and organizations. Even though large number of studies has been conducted to solve this problem, there is no generally accepted solution. In this paper, a publicly available data set is used. The unbalance problem of the data set was solved by using hybrid sampling methods together. On this data set, comparative performance evaluations have been conducted. Different from other studies, the Area Under the Curve (AUC) metric, which expresses the success in such data sets, has also been used in addition to standard performance metrics. Since it is also important to quickly detect credit card fraud transactions; the running time of different methods is also presented as another performance metric.