From Traditional to Deep: Evaluating Sentiment Analysis Models on a Large-Scale Tweet Dataset

Loading...

Journal Title

Journal ISSN

Volume Title

Open Access Color

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Average
Influence
Average
Popularity
Average

relationships.isProjectOf

relationships.isJournalIssueOf

Abstract

This study investigates the effectiveness of various machine learning (ML) and deep learning (DL) techniques for large-scale sentiment analysis on Twitter data. We leverage a publicly available dataset of one million tweets, annotated with four sentiment labels (positive, negative, uncertainty, and liti-gious), to train and evaluate a range of models. Our experiments demonstrate that traditional ML algorithms, particularly XG-Boost, achieve high performance, with the best F1 score reaching 95.81% using a combination of unigrams and bigrams. Among DL models, a hybrid CNN-BiGRU architecture yields the highest average F1 score of 95.42%. Our findings highlight the strengths of different approaches for sentiment analysis on Twitter data and emphasize the importance of data preprocessing and model selection for achieving optimal performance. © 2025 Elsevier B.V., All rights reserved.

Description

Keywords

Deep Learning, Machine Learning, Sentiment Analysis, Text Mining, Adversarial Machine Learning, Contrastive Learning, Deep Reinforcement Learning, Analysis Models, Deep Learning, F1 Scores, Large-Scales, Learning Techniques, Machine-Learning, Positive/Negative, Sentiment Analysis, Text-Mining, Uncertainty, Tweets

Fields of Science

Citation

WoS Q

Scopus Q

OpenCitations Logo
OpenCitations Citation Count
N/A

Volume

Issue

Start Page

451

End Page

456
PlumX Metrics
Citations

Scopus : 1

Captures

Mendeley Readers : 5

SCOPUS™ Citations

1

checked on Jun 03, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
0.36

Sustainable Development Goals

GOOD HEALTH AND WELL-BEING3
GOOD HEALTH AND WELL-BEING