TextNetTopics_TIS: Enhancing Textnettopics With Random Forest-Based Topic Importance Scoring

Loading...
Publication Logo

Date

2024

Journal Title

Journal ISSN

Volume Title

Publisher

Institute of Electrical and Electronics Engineers Inc.

Open Access Color

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Average
Influence
Average
Popularity
Average

Research Projects

Journal Issue

Abstract

TextNetTopics is an innovative Latent Dirichlet Allocation-based topic selection method for training text classification models. One main limitation is its computationally intensive scoring mechanism, especially when applied to many topics. This scoring mechanism involves training a machine learning model (i.e., Random Forest) on each topic using the Monte-Carlo Cross-Validation approach and assigning a score value based on a specific performance metric (e.g., accuracy or F1-score). Moreover, the measured score does not account for the interactions between all features residing in all topics. This paper presents a new topic-scoring mechanism called Topic Importance Scoring. This computationally efficient approach trains a Random Forest model on all topics simultaneously and leverages the extracted feature importance values to give each topic a score reflecting its classification potential. The experiments on three diverse datasets confirm that the proposed method's performance is superior to the Topic Performance Scoring, which was used in the original TextNetTopics method. © 2024 Elsevier B.V., All rights reserved.

Description

IEEE SMC; IEEE Turkiye Section

Keywords

Machine Learning, Text Classification, Topic Modeling, Topic Scoring, Topic Selection, Contrastive Learning, Decision Trees, Intelligent Systems, Random Forests, Latent Dirichlet Allocation, Machine-Learning, Performance, Selection Methods, Text Classification, Text Classification Models, Topic Modeling, Topic Scoring, Topic Selection, Adversarial Machine Learning

Fields of Science

Citation

WoS Q

N/A

Scopus Q

N/A
OpenCitations Logo
OpenCitations Citation Count
1

Source

-- 2024 Innovations in Intelligent Systems and Applications Conference, ASYU 2024 -- Ankara -- 204562

Volume

Issue

Start Page

1

End Page

6
PlumX Metrics
Citations

Scopus : 1

SCOPUS™ Citations

1

checked on Mar 04, 2026

Page Views

1

checked on Mar 04, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
0.3862

Sustainable Development Goals

SDG data is not available