eTNT: Enhanced Textnettopics With Filtered LDA Topics and Sequential Forward / Backward Topic Scoring Approaches

Loading...
Publication Logo

Date

2024

Journal Title

Journal ISSN

Volume Title

Publisher

Science and Information Organization

Open Access Color

GOLD

Green Open Access

Yes

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Average
Influence
Average
Popularity
Average

Research Projects

Journal Issue

Abstract

TextNetTopics is a novel text classification-based topic modelling approach that focuses on topic selection rather than individual word selection to train a machine learning algorithm. However, one key limitation of TextNetTopics is its scoring component, which evaluates each topic in isolation and ranks them accordingly, ignoring the potential relationships between topics. In addition, the chosen topics may contain redundant or irrelevant features, potentially increasing the feature set size and introducing noise that can degrade the overall model performance. To address these limitations and improve the classification performance, this study introduces an enhancement to TextNetTopics. eTNT integrates two novel scoring approaches: Sequential Forward Topic Scoring (SFTS) and Sequential Backward Topic Scoring (SBTS), which consider topic interactions by assessing sets of topics simultaneously. Moreover, it incorporates a filtering component that aims to enhance topics' quality and discriminative power by removing non-informative features from each topic using Random Forest feature importance values. These integrations aim to streamline the topic selection process and enhance classifier efficiency for text classification. The results obtained from the WOS-5736, LitCovid, and MultiLabel datasets provide valuable insights into the superior effectiveness of eTNT compared to its counterpart, TextNetTopics. © 2024 Elsevier B.V., All rights reserved.

Description

Keywords

Machine Learning, Text Classification, Topic Modeling, Topic Scoring, Adversarial Machine Learning, Decision Trees, Federated Learning, Learning Algorithms, Wiener Filtering, Classification Performance, Features Sets, Machine Learning Algorithms, Machine-Learning, Modeling Approach, Modeling Performance, Overall-Model, Text Classification, Topic Modeling, Topic Scoring, Contrastive Learning, Text classification, Machine learning, Topic scoring, Topic modeling

Fields of Science

Citation

WoS Q

Q3

Scopus Q

Q3
OpenCitations Logo
OpenCitations Citation Count
1

Source

International Journal of Advanced Computer Science and Applications

Volume

15

Issue

7

Start Page

1135

End Page

1144
PlumX Metrics
Citations

Scopus : 1

Captures

Mendeley Readers : 1

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
0.3626

Sustainable Development Goals