Document Classification with Contextually Enriched Word Embeddings

dc.contributor.author Mahmood, Raad Saadi
dc.contributor.author Bakal, Gokhan
dc.contributor.author Akbas, Ayhan
dc.contributor.authorID 0000-0003-2897-3894 en_US
dc.contributor.department AGÜ, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümü en_US
dc.contributor.institutionauthor Bakal, Mehmet Gokhan
dc.date.accessioned 2025-02-17T13:19:37Z
dc.date.available 2025-02-17T13:19:37Z
dc.date.issued 2024 en_US
dc.description.abstract The text classification task has a wide range of application domains for distinct purposes, such as the classification of articles, social media posts, and sentiments. As a natural language processing application, machine learning and deep learning techniques are intensively utilized in solving such challenges. One common approach is employing the discriminative word features comprising Bag-of-Words and n-grams to conduct text classification experiments. The other powerful approach is exploiting neural network-based (specifically deep learning models) through either sentence, word, or character levels. In this study, we proposed a novel approach to classify documents with contextually enriched word embeddings powered by the neighbor words accessible through the trigram word series. In the experiments, a well-known web of science dataset is exploited to demonstrate the novelty of the models. Consequently, we built various models constructed with and without the proposed approach to monitor the models' performances. The experimental models showed that the proposed neighborhood-based word embedding enrichment has decent potential to use in further studies. en_US
dc.identifier.endpage 97 en_US
dc.identifier.issn 2147-284X
dc.identifier.issue 1 en_US
dc.identifier.startpage 90 en_US
dc.identifier.uri https://doi.org/10.17694/bajece.1366812
dc.identifier.uri https://hdl.handle.net/20.500.12573/2435
dc.identifier.volume 12 en_US
dc.language.iso eng en_US
dc.publisher Bajece (İstanbul Teknik Ünv) en_US
dc.relation.isversionof 10.17694/bajece.1366812 en_US
dc.relation.journal Balkan Journal of Electrical and Computer Engineering en_US
dc.relation.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Text classification en_US
dc.subject Deep Learning en_US
dc.subject LSTM en_US
dc.subject Word2Vec en_US
dc.subject Word2Vec en_US
dc.subject N-grams en_US
dc.title Document Classification with Contextually Enriched Word Embeddings en_US
dc.type article en_US

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
tmp-273a713f-6ca4-4fd7-bb28-99e1562eb9db.a4bc6baf64f84244b0105296ffe6d4f1.pdf.pdf
Size:
912.11 KB
Format:
Adobe Portable Document Format
Description:
Makale Dosyası

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: