TextNetTopics_TIS: Enhancing Textnettopics With Random Forest-Based Topic Importance Scoring
Loading...

Date
2024
Journal Title
Journal ISSN
Volume Title
Publisher
Institute of Electrical and Electronics Engineers Inc.
Open Access Color
Green Open Access
No
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
TextNetTopics is an innovative Latent Dirichlet Allocation-based topic selection method for training text classification models. One main limitation is its computationally intensive scoring mechanism, especially when applied to many topics. This scoring mechanism involves training a machine learning model (i.e., Random Forest) on each topic using the Monte-Carlo Cross-Validation approach and assigning a score value based on a specific performance metric (e.g., accuracy or F1-score). Moreover, the measured score does not account for the interactions between all features residing in all topics. This paper presents a new topic-scoring mechanism called Topic Importance Scoring. This computationally efficient approach trains a Random Forest model on all topics simultaneously and leverages the extracted feature importance values to give each topic a score reflecting its classification potential. The experiments on three diverse datasets confirm that the proposed method's performance is superior to the Topic Performance Scoring, which was used in the original TextNetTopics method. © 2024 Elsevier B.V., All rights reserved.
Description
IEEE SMC; IEEE Turkiye Section
Keywords
Machine Learning, Text Classification, Topic Modeling, Topic Scoring, Topic Selection, Contrastive Learning, Decision Trees, Intelligent Systems, Random Forests, Latent Dirichlet Allocation, Machine-Learning, Performance, Selection Methods, Text Classification, Text Classification Models, Topic Modeling, Topic Scoring, Topic Selection, Adversarial Machine Learning
Fields of Science
Citation
WoS Q
N/A
Scopus Q
N/A

OpenCitations Citation Count
1
Source
-- 2024 Innovations in Intelligent Systems and Applications Conference, ASYU 2024 -- Ankara -- 204562
Volume
Issue
Start Page
1
End Page
6
Collections
PlumX Metrics
Citations
Scopus : 1
SCOPUS™ Citations
1
checked on Mar 04, 2026
Page Views
1
checked on Mar 04, 2026
Google Scholar™


