Document Classification With Contextually Enriched Word Embeddings
Loading...
Date
2024
Authors
Journal Title
Journal ISSN
Volume Title
Publisher
Open Access Color
GOLD
Green Open Access
Yes
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
The text classification task has a wide range of application domains for distinct purposes, such as the classification of articles, social media posts, and sentiments. As a natural language processing application, machine learning and deep learning techniques are intensively utilized in solving such challenges. One common approach is employing the discriminative word features comprising Bag-of-Words and n-grams to conduct text classification experiments. The other powerful approach is exploiting neural network-based (specifically deep learning models) through either sentence, word, or character levels. In this study, we proposed a novel approach to classify documents with contextually enriched word embeddings powered by the neighbor words accessible through the trigram word series. In the experiments, a well-known web of science dataset is exploited to demonstrate the novelty of the models. Consequently, we built various models constructed with and without the proposed approach to monitor the models' performances. The experimental models showed that the proposed neighborhood-based word embedding enrichment has decent potential to use in further studies.
Description
Keywords
Deep Learning, Text classification, N-grams, Word2Vec, LSTM
Fields of Science
05 social sciences, 0202 electrical engineering, electronic engineering, information engineering, 0501 psychology and cognitive sciences, 02 engineering and technology
Citation
WoS Q
N/A
Scopus Q
N/A

OpenCitations Citation Count
N/A
Source
Balkan Journal of Electrical and Computer Engineering
Volume
12
Issue
1
Start Page
90
End Page
97
Collections
PlumX Metrics
Captures
Mendeley Readers : 4


