WoS İndeksli Yayınlar Koleksiyonu
Permanent URI for this collectionhttps://hdl.handle.net/20.500.12573/394
Browse
10 results
Search Results
Article Citation - WoS: 4Relationship Between Neutrophil Gelatinase-Associated Lipocalin and Mortality in Acute Kidney Injury(Galenos Yayincilik, 2018-12-03) Kayaalti, Selda; Kayaalti, Omer; Aksebzeci, Bekir HakanObjective: Almost half of intensive care patients are affected by acute kidney injury (AKI). The purpose of this study is to determine parameters that can be used for predicting of early (within 28 days) and late (within 90 days) mortality in patients who are followed-up with AKI in intensive care units. Materials and Methods: In this study, a dataset that contains 50 patients with AKI in intensive care units was used. This dataset contains blood urea nitrogen, creatinine, plasma and urinary neutrophil gelatinase-associated hpocalin (NGAL), lactate dehydrogenase, alkaline phosphatase and gammaglutamyl transpeptidase values of patients who were admitted to intensive care for various reasons and who developed AKI on the days 1, 3 and 7. In addition to these values, laboratory results such as serum electrolytes on day 1, blood gas; vital signs such as mean arterial pressure, central venous pressure; and demographic data were also recorded. Data mining techniques were applied to determine correlation between all of these data and mortality. Results: The threshold level of urinary NGAL on day 7 was determined to be 69 ng/mL, and strong correlation was found between this threshold level and early mortality. Similarly, the threshold level of plasma NGAL on day 7 was determined to be 150 ng/mL, and this was highly correlated with early mortality. Besides, strong correlation was also found between the difference in the urinary NGAL levels on day 1 and 7, and early mortality. Conclusion: In this study, plasma and urinary NGAL levels were found to be closely related to early mortality in patients who were followed-up with AKI in intensive care units. On the other hand, any parameter associated with late mortality was not found.Conference Object Identify Commonly Affected Pathways in Psychiatric Diseases(Institute of Electrical and Electronics Engineers Inc., 2018-09) Bulut, Umit; Bakir-Güngör, BurcuGenome-wide association studies (GWAS) are an extraordinary source of information when it comes to revealing the common variations of human complex diseases. Until now, the large amount of data generated from these studies have not been shown its full potential enough to identify the molecular and functional framework to be able to understand how a molecular system works. Following a more specific perspective, this study focused on the identification of commonly affected pathways of psychiatric diseases. The pathway term as used in molecular biology, depicts a simplified model of a process within the cell or tissue. Lately, several GWAS datasets are publicly available for various disease types such as psychiatric, immune-related, neurodegenerative, cardiovascular and such. A study on each disease and pairwise comparison to understand the behavior of disease and system would be time consuming and exhaustive. Instead of handling the results of these studies one by one, grouping diseases by target points is a more efficient way. This work aims to get one step closer to reveal key points of diseases and target these points to develop personalized medicine approaches. Especially for complex diseases, every drug doesn't show the same effect in every people. This paper contains the definition of molecular pathways, methods to identify disease related pathways, and to find common pathways pairwise in psychiatric diseases. © 2019 Elsevier B.V., All rights reserved.Conference Object Citation - WoS: 22Citation - Scopus: 52Evaluation of Classification Algorithms, Linear Discriminant Analysis and a New Hybrid Feature Selection Methodology for the Diagnosis of Coronary Artery Disease(Institute of Electrical and Electronics Engineers Inc., 2018-12) Kolukisa, Burak; Hacilar, Hilal; Göy, Gökhan; Kus, Mustafa; Bakir-Güngör, Burcu; Aral, Atilla; Güngör, Vehbi ÇağrıAccording to the World Health Organization (WHO), 31% of the world's total deaths in 2016 (17.9 million) was due to cardiovascular diseases (CVD). With the development of information technologies, it has become possible to predict whether people have heart diseases or not by checking certain physical and biochemical values at a lower cost. In this study, we have evalated a set of different classification algorithms, linear discriminant analysis and proposed a new hybrid feature selection methodology for the diagnosis of coronary heart diseases (CHD). Throughout this research effort, using three publicly available Heart Disease diagnosis datasets (UCI Machine Learning Repository), we have conducted comparative performance evaluations in terms of accuracy, sensitivity, specificity, F-measure, AUC and running time. © 2023 Elsevier B.V., All rights reserved.Conference Object Citation - WoS: 2Citation - Scopus: 4Data Mining Techniques in Direct Marketing on Imbalanced Data Using Tomek Link Combined With Random Under-Sampling(Assoc Computing Machinery, 2021-05-27) Yilmaz, Umit; Gezer, Cengiz; Aydin, Zafer; Gungor, V. CaGri; Yllmaz, Ümit; Aydln, ZaferDetermining the potential customers is very important in direct marketing. Data mining techniques are one of the most important methods for companies to determine potential customers. However, since the number of potential customers is very low compared to the number of non-potential customers, there is a class imbalance problem that significantly affects the performance of data mining techniques. In this paper, different combinations of basic and advanced resampling techniques such as Synthetic Minority Over-sampling Technique (SMOTE), Tomek Link, RUS, and ROS were evaluated to improve the performance of customer classification. Different feature selection techniques are used in order the decrease the number of non-informative features from the data such as Information Gain, Gain Ratio, Chi-squared, and Relief. Classification performance was compared and utilized using several data mining techniques, such as LightGBM, XGBoost, Gradient Boost, Random Forest, AdaBoost, ANN, Logistic Regression, Decision Trees, SVC, Bagging Classifier based on ROC AUC and sensitivity metrics. A combination of Tomek Link and Random Under-Sampling as a resampling technique and Chi-squared method as feature selection algorithm showed superior performance among the other combinations. Detailed performance evaluations demonstrated that with the proposed approach, LightGBM, which is a gradient boosting algorithm based on decision tree, gave the best results among the other classifiers with 0.947 sensitivity and 0.896 ROC AUC value.Conference Object Citation - WoS: 2Credit Card Fraud Detection With Machine Learning Methods(IEEE, 2019-09) Goy, Gokhan; Gezer, Cengiz; Gungor, Vehbi CagriWith the increase in credit card usage of people, the credit card transactions increase dramatically. It is difficult to identify fraudulent transactions among the vast amount of credit card transactions. Although credit card fraud is limited in number of transactions, it causes serious problems in terms of financial losses for individuals and organizations. Even though large number of studies has been conducted to solve this problem, there is no generally accepted solution. In this paper, a publicly available data set is used. The unbalance problem of the data set was solved by using hybrid sampling methods together. On this data set, comparative performance evaluations have been conducted. Different from other studies, the Area Under the Curve (AUC) metric, which expresses the success in such data sets, has also been used in addition to standard performance metrics. Since it is also important to quickly detect credit card fraud transactions; the running time of different methods is also presented as another performance metric.Conference Object A Data Mining Method for Refining Groups in Data Using Dynamic Model Based Clustering(IEEE, 2013-06) Servi, Tayfun; Erol, H.A new data mining method is proposed for determining the number and structure of clusters, and refining groups in multivariate heterogeneous data set including groups, partly and completely overlapped group structures by using dynamic model based clustering. It is called dynamic model based clustering since the structure of model changes at each stage of refinement process dynamically. The proposed data mining method works without data reduction for high dimensional data in which some of variables including completely overlapped situations. © 2013 IEEE. © 2013 Elsevier B.V., All rights reserved.Conference Object Citation - WoS: 3Citation - Scopus: 13NSEM: Duygu Analizi için Özgün Yıǧınlanmiş Topluluk Yöntemi(Institute of Electrical and Electronics Engineers Inc., 2018-09) Işik, Yunus Emre; Görmez, Yasin; Kaynar, Oǧuz; Aydin, Zafer; Emre Isik, YunusToday, people often share their ideas, opinions and feelings through forums, social media sites, blogs and similar platforms. For this reason, access to these data has become very easy. Increase in the number of shares makes it possible to analyze and use these data in terms of marketing and politics. However, due to the large number of data, it is impossible that this analysis will be done by humans. Determination of what type of emotion is included automatically is done by sentiment analysis methods. In these methods, the text is defined as a mathematical vector and classified by machine learning methods. Ensemble methods are one of the most important methods used as classifiers in sentiment analysis. In these methods, a classifier error is tried to be solved by another classifier. In sentiment analysis, the feature vector that describes the text is as important as the classifier. Feature vectors obtained using different methods can make mistakes in different places. For this reason, in this study, NSEM is proposed for sentiment analysis, which is a new ensemble method that uses 2 different classifiers and 2 different feature extraction methods. As a result of the analysis, the proposed method is the most successful method with an accuracy rate of 79.1%. © 2019 Elsevier B.V., All rights reserved.Conference Object Makine Öğrenmesi Teknikleri ile İnternet Servis Sağlayicisi için Müşteri Kayip Tahmini(IEEE, 2020) Goy, Gokhan; Kolukisa, Burak; Bahcevan, Cenk; Gungor, Vehbi CagriWith the developing technology in every fields, a competitive marketing environment has been arised In this competitive environment analyzing customer behavior has become vital In particular, the ability to easily change any service provider has become vet) , critical for the company to continue its existence At the same time, the amount of financial resources spent on retaining instituters much less than to obtain new clients. In this context, the traditional methods of examining vast amount of data obtained today for establishing decision support systems have lost their validities In this study. we used a dataset which is provided by TurkNet serving as an internet service provider in Turkey. Various preprocessing steps has performed on this dataset and then classification algorithms ran. Afterwards results have obtained and compared. The results of these experiments analyzed in terms of the area under the curve value In this context the aunt successful classifier algorithm has been determined as the Random Trees algorithm with a value of 0.936.Conference Object Citation - WoS: 2Citation - Scopus: 6Makine Öğrenmesi Yöntemleri ile Kredi Kartı Sahteciliğinin Tespiti(Institute of Electrical and Electronics Engineers Inc., 2019-09) Göy, Gökhan; Gezer, Cengiz; Güngör, Vehbi ÇağrıWith the increase in credit card usage of people, the credit card transactions increase dramatically. It is difficult to identify fraudulent transactions among the vast amount of credit card transactions. Although credit card fraud is limited in number of transactions, it causes serious problems in terms of financial losses for individuals and organizations. Even though large number of studies has been conducted to solve this problem, there is no generally accepted solution. In this paper, a publicly available data set is used. The unbalance problem of the data set was solved by using hybrid sampling methods together. On this data set, comparative performance evaluations have been conducted. Different from other studies, the Area Under the Curve (AUC) metric, which expresses the success in such data sets, has also been used in addition to standard performance metrics. Since it is also important to quickly detect credit card fraud transactions; the running time of different methods is also presented as another performance metric. © 2020 Elsevier B.V., All rights reserved.Conference Object Citation - Scopus: 1Koroner Arter Hastalığı Tanısı İçin Alan Bilgisi İçeren Topluluk Öznitelik Seçim Yöntemi(Institute of Electrical and Electronics Engineers Inc., 2020-10-05) Kolukisa, Burak; Güngör, Vehbi Çağrı; Bakir-Güngör, Burcu; Gungor, Burcu BakirCoronary Artery Disease (CAD) is the condition where, the heart is not fed enough as a result of the accumulation of fatty matter called atheroma in the walls of the arteries. In 2016, CAD accounts for 31% (17.9 million) of the world's total deaths and its diagnosis is difficult. It is estimated that approximately 23.6 million people will die from this disease in 2030. With the development of machine learning and data mining techniques, it might be possible to diagnose CAD inexpensively and easily via examining some physical and biochemical values. In this study, for the CAD classification problem, a novel ensemble feature selection methodology that incorporates domain knowledge is proposed. Via applying the proposed methodology on the UCI Cleveland CAD dataset and using different classification algorithms, performance metrics are compared. It is shown that in our experiments, when Multilayer Perceptron classifier is used with 9 selected features, our proposed solution reached 85.47% accuracy, 82.96% accuracy and 0.839 F-Measure. As a future work, we aim to generate a machine learning model that can quickly diagnose CAD on real-time data in hospitals. © 2021 Elsevier B.V., All rights reserved.
