Bakal, Mehmet Gökhan

Loading...
Profile Picture
Name Variants
Bakal, G
Bakal, Gokhan
Bakal, Mehmet
Job Title
Dr. Öğr. Üyesi
Email Address
gokhan.bakal@agu.edu.tr
Main Affiliation
02. 04. Bilgisayar Mühendisliği
02. Mühendislik Fakültesi
01. Abdullah Gül University
Status
Current Staff
Website
Scopus Author ID
Turkish CoHE Profile ID
Google Scholar ID
WoS Researcher ID

Sustainable Development Goals

3

GOOD HEALTH AND WELL-BEING
GOOD HEALTH AND WELL-BEING Logo

4

Research Products

9

INDUSTRY, INNOVATION AND INFRASTRUCTURE
INDUSTRY, INNOVATION AND INFRASTRUCTURE Logo

1

Research Products

14

LIFE BELOW WATER
LIFE BELOW WATER Logo

1

Research Products
Documents

22

Citations

137

h-index

7

Documents

11

Citations

73

Scholarly Output

23

Articles

9

Views / Downloads

74/0

Supervised MSc Theses

2

Supervised PhD Theses

0

WoS Citation Count

26

Scopus Citation Count

55

WoS h-index

3

Scopus h-index

5

Patents

0

Projects

0

WoS Citations per Publication

1.13

Scopus Citations per Publication

2.39

Open Access Source

5

Supervised Theses

2

Google Analytics Visitor Traffic

JournalCount
-- 6th International Conference on Computer Science and Engineering, UBMK 2021 -- Ankara -- 1768262
-- 9th International Conference on Computer Science and Engineering, UBMK 2024 -- Antalya -- 2049062
7th International Congress on Human-Computer Interaction, Optimization and Robotic Applications-ICHORA -- MAY 23-24, 2025 -- Ankara, TURKIYE2
-- 2023 International Conference on Smart Applications, Communications and Networking, SmartNets 2023 -- Istanbul -- 1919022
Applied Soft Computing1
Current Page: 1 / 4

Scopus Quartile Distribution

Competency Cloud

GCRIS Competency Cloud

Scholarly Output Search Results

Now showing 1 - 10 of 23
  • Article
    Enhancing Deep Learning Performance Through a Genetic Algorithm-Enhanced Approach: Focusing on LSTM
    (2024) Bakal, Mehmet; Şen, Tarık Üveys
    Derin öğrenme, görüntü sınıflandırma, doğal dil işleme ve konuşma tanıma gibi çeşitli uygulamalarda dikkat çekici başarılar elde etmiştir. Ancak, derin sinir ağlarını eğitmek, karmaşık mimarileri ve gereken parametre sayısı nedeniyle zorlu bir süreçtir. Genetik algoritmalar, derin öğrenme için alternatif bir optimizasyon teknik olarak önerilmiştir ve optimal bir ağ parametre setini minimize eden bir amaç fonksiyonu bulmak için etkili bir alternatif yöntem sunar. Bu makalede, derin öğrenme ile genetik algoritmaları entegre eden, özellikle LSTM modellerini kullanarak performansı artırmayı amaçlayan yeni bir yaklaşım öneriyoruz. Yöntemimiz, genetik algoritmalar aracılığıyla öğrenme hızı, grup boyutu, katman başına nöron sayısı ve katman derinliği gibi kritik hiper-parametreleri optimize eder. Ayrıca, genetik algoritma parametrelerinin optimizasyon sürecini nasıl etkilediğine dair kapsamlı bir analiz yaparak, LSTM model performansını iyileştirmedeki önemli etkilerini gösteriyoruz. Genel olarak, sunulan yöntem, derin sinir ağlarının performansını artırmak için güçlü bir mekanizma sunmakta olup bu nedenle yapay zekâ disiplininde gelecekteki uygulamalar için önemli bir potansiyele sahip olduğuna inanıyoruz.
  • Conference Object
    Multi-Method Text Summarization: Evaluating Extractive and BART-Based Approaches on CNN/Daily Mail
    (Institute of Electrical and Electronics Engineers Inc., 2025) Inal, Yasin; Bakal, Gokhan; Esit, Muhammed
    With the exponential growth of digital content, efficient text summarization has become increasingly crucial for managing information overload. This paper presents a comprehensive approach to text summarization using both extractive and abstractive methods, implemented on the CNN/Daily Mail dataset. We leverage pre-trained BART (Bidirectional and AutoRegressive Transformers) models and fine-tuning techniques to generate high-quality summaries. Our approach demonstrates significant improvements, with our best model trained on 287 k samples achieving ROUGE-1 F1 scores of 0.4174, ROUGE-2 F1 scores of 0.1932, and ROUGE-L F1 scores of 0.2910. We provide detailed comparisons between extractive methods and various BART model configurations, analyzing the impact of training dataset size and model architecture on summarization quality. Additionally, we share our implementation through an opensource NLP toolkit to facilitate further research and practical applications in the field. © 2025 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - Scopus: 1
    Improving Salary Offer Processes With Classification Based Machine Learning Models
    (Institute of Electrical and Electronics Engineers Inc., 2024) Kaya, Rukiye; Saatci, Mehtap; Bakal, Gokhan
    In job applications, salary is major motivational factor for employees and making accurate salary prediction is crucial for both employers and employees. Utilizing advanced technologies can significantly enhance the accuracy and efficiency of salary prediction process. In this study, we explore Machine Learning (ML) methods to enhance salary prediction process. We evaluated seven classification models for predicting salary categories, with the Artificial Neural Network (ANN) model achieving the highest accuracy at 58.2% on the test dataset, followed by the K-Nearest Neighbors (KNN) model with an accuracy of 56.8%. Additionally, we employed ensemble models to further enhance prediction accuracy. Among these, the Majority Voting Classifier using Hard Voting achieved the highest accuracy at 59.3%, demonstrating the potential of ensemble techniques in refining salary predictions. The developed salary prediction tool estimates the most appropriate salary category for each candidate and help mitigate potential biases in manual salary assessments, hence enables a more objective and consistent compensation system. ∗CRITICAL: Do Not Use Symbols, Special Characters, or Math in Paper Title or Abstract, and do not cite other papers in the abstract. © 2024 Elsevier B.V., All rights reserved.
  • Article
    Document Classification With Contextually Enriched Word Embeddings
    (2024) Akbaş, Ayhan; Mahmood, Raad; Bakal, Mehmet
    The text classification task has a wide range of application domains for distinct purposes, such as the classification of articles, social media posts, and sentiments. As a natural language processing application, machine learning and deep learning techniques are intensively utilized in solving such challenges. One common approach is employing the discriminative word features comprising Bag-of-Words and n-grams to conduct text classification experiments. The other powerful approach is exploiting neural network-based (specifically deep learning models) through either sentence, word, or character levels. In this study, we proposed a novel approach to classify documents with contextually enriched word embeddings powered by the neighbor words accessible through the trigram word series. In the experiments, a well-known web of science dataset is exploited to demonstrate the novelty of the models. Consequently, we built various models constructed with and without the proposed approach to monitor the models' performances. The experimental models showed that the proposed neighborhood-based word embedding enrichment has decent potential to use in further studies.
  • Article
    Citation - WoS: 5
    Citation - Scopus: 11
    Enhancing Sentiment Analysis in Stock Market Tweets Through Bert-Based Knowledge Transfer
    (Springer, 2025) Cicekyurt, Emre; Bakal, Gokhan
    One of the widely studied text classification efforts is sentiment analysis. It is a specific examination involving natural language processing and machine learning methods to understand semantic orientation from textual data. Working social media posts, such as tweets, for sentiment analysis, is quite common among researchers due to the speed of information dissemination. In this regard, forecasting stock market tweets is a widely studied research topic. Some studies have revealed a strong connection between sentiment and stock market performance, while others have not found any notable associations. The proposed work shows two distinct approaches to sentiment analysis over the stock market tweets. The first approach employs traditional machine learning algorithms, including logistic regression, random forest, and XGBoost. The second approach constructs deep learning (as a subfield of machine learning) models using LSTM and CNN algorithms to classify the test instances into positive, negative, or neutral classes through ten randomly shuffled data splits. In this study, the labeled data size is gradually increased utilizing a pre-trained model, FinBERT. It is exclusively employed to label unlabeled data instances to integrate them into the experiments. The goal is to monitor the effect of the additional newly-labeled examples on the sentiment analysis performance. The experiments showed that the average F1-score improved by 20% for the deep learning models and 17% for the machine learning models. In the end, the paper reveals a strong positive correlation between training data size and the classification performance of the experimental approaches.
  • Article
    Citation - Scopus: 5
    Building a Challenging Medical Dataset for Comparative Evaluation of Classifier Capabilities
    (Elsevier Ltd, 2024) Bozkurt, Berat; Coskun, Kerem; Bakal, Gokhan
    Since the 2000s, digitalization has been a crucial transformation in our lives. Nevertheless, digitalization brings a bulk of unstructured textual data to be processed, including articles, clinical records, web pages, and shared social media posts. As a critical analysis, the classification task classifies the given textual entities into correct categories. Categorizing documents from different domains is straightforward since the instances are unlikely to contain similar contexts. However, document classification in a single domain is more complicated due to sharing the same context. Thus, we aim to classify medical articles about four common cancer types (Leukemia, Non-Hodgkin Lymphoma, Bladder Cancer, and Thyroid Cancer) by constructing machine learning and deep learning models. We used 383,914 medical articles about four common cancer types collected by the PubMed API. To build classification models, we split the dataset into 70% as training, 20% as testing, and 10% as validation. We built widely used machine-learning (Logistic Regression, XGBoost, CatBoost, and Random Forest Classifiers) and modern deep-learning (convolutional neural networks - CNN, long short-term memory - LSTM, and gated recurrent unit - GRU) models. We computed the average classification performances (precision, recall, F-score) to evaluate the models over ten distinct dataset splits. The best-performing deep learning model(s) yielded a superior F1 score of 98%. However, traditional machine learning models also achieved reasonably high F1 scores, 95% for the worst-performing case. Ultimately, we constructed multiple models to classify articles, which compose a hard-to-classify dataset in the medical domain. © 2024 Elsevier B.V., All rights reserved.
  • Conference Object
    Citation - Scopus: 7
    A Comparative Analysis on Medical Article Classification Using Text Mining & Machine Learning Algorithms
    (Institute of Electrical and Electronics Engineers Inc., 2021) Kolukisa, Burak; Dedeturk, Bilge Kagan; Dedeturk, Beyhan Adanur; Gulsen, Abdulkadir; Bakal, Gokhan
    The document classification task is one of the widely studied research fields on multiple domains. The core motivation of the classification task is that the manual classification efforts are impractical due to the exponentially growing document volumes. Thus, we densely need to exploit automated computational approaches, such as machine learning models along with data & text mining techniques. In this study, we concentrated on the classification of medical articles specifically on common cancer types, due to the significance of the field and the decent number of available documents of interest. We deliberately targeted MEDLINE articles about common cancer types because most cancer types share a similar literature composition. Therefore, this situation makes the classification effort relatively more complicated. To this end, we built multiple machine learning models, including both traditional and deep learning architectures. We achieved the best performance (R¿82% F score) by the LSTM model. Overall, our results demonstrate a strong effect of exploiting both text mining and machine learning methods to distinguish medical articles on common cancer types. © 2022 Elsevier B.V., All rights reserved.
  • Conference Object
    Text Classification Experiments on Contextual Graphs Built by N-Gram Series
    (Springer International Publishing AG, 2025) Sen, Tarik Uveys; Yakit, Mehmet Can; Gumus, Mehmet Semih; Abar, Orhan; Bakal, Gokhan
    Traditional n-gram textual features, commonly employed in conventional machine learning models, offer lower performance rates on high-volume datasets compared to modern deep learning algorithms, which have been intensively studied for the past decade. The main reason for this performance disparity is that deep learning approaches handle textual data through the word vector space representation by catching the contextually hidden information in a better way. Nonetheless, the potential of the n-gram feature set to reflect the context is open to further investigation. In this sense, creating graphs using discriminative ngram series with high classification power has never been fully exploited by researchers. Hence, the main goal of this study is to contribute to the classification power by including the long-range neighborhood relationships for each word in the word embedding representations. To achieve this goal, we transformed the textual data by employing n-gram series into a graph structure and then trained a graph convolution network model. Consequently, we obtained contextually enriched word embeddings and observed F1-score performance improvements from 0.78 to 0.80 when we integrated those convolution-based word embeddings into an LSTM model. This research contributes to improving classification capabilities by leveraging graph structures derived from discriminative n-gram series.
  • Article
    Citation - WoS: 13
    Citation - Scopus: 9
    An Empirical Study of Sentiment Analysis Utilizing Machine Learning and Deep Learning Algorithms
    (Springernature, 2024) Erkantarci, Betul; Bakal, Gokhan
    Among text-mining studies, one of the most studied topics is the text classification task applied in various domains, including medicine, social media, and academia. As a sub-problem in text classification, sentiment analysis has been widely investigated to classify often opinion-based textual elements. Specifically, user reviews and experiential feedback for products or services have been employed as fundamental data sources for sentiment analysis efforts. As a result of rapidly emerging technological advancements, social media platforms such as Twitter, Facebook, and Reddit, have become central opinion-sharing mediums since the early 2000s. In this sense, we build various machine-learning models to solve the sentiment analysis problem on the Reddit comments dataset in this work. The experimental models we constructed achieve F1 scores within intervals of 73-76%. Consequently, we present comparative performance scores obtained by traditional machine learning and deep learning models and discuss the results.
  • Conference Object
    Citation - Scopus: 4
    A Transfer Learning Application on the Reliability of Psychological Drugs' Comments
    (Institute of Electrical and Electronics Engineers Inc., 2023) Sen, Tarik Uveys; Bakal, Gokhan
    As digitalization and the Internet stay emerging concepts by gaining popularity, the accuracy of personal reviews/opinions will be a critical issue. This circumstance also particularly applies to patients taking psychological drugs, where accurate information is crucial for other patients and medical professionals. In this study, we analyze drug reviews from drugs.com to determine the effectiveness of reviews for psychological drugs. Our dataset includes over 200,000 drug reviews, which we labeled as positive, negative, or neutral according to their rating scores. We apply machine learning (ML) models, including Logistic Regression, Recurrent Neural Network (RNN), and Long Short-Term Memory (LSTM) algorithms, to predict the sentiment class of each review. Our results demonstrate an F1-Weighted score of 85.3% for the LSTM model. However, by applying the transfer learning technique, we further improved the F1 score (nearly 3% increase) obtained by the LSTM model. Our findings proved that there is no contextual difference between the comments made by the patients suffering from psychological or other diseases. © 2023 Elsevier B.V., All rights reserved.