TextNetTopics_TIS: Enhancing Textnettopics With Random Forest-Based Topic Importance Scoring

dc.contributor.author Voskergian, Daniel
dc.contributor.author Bakir-Güngör, Burcu
dc.contributor.author Yousef, Malik
dc.date.accessioned 2025-09-25T10:58:43Z
dc.date.available 2025-09-25T10:58:43Z
dc.date.issued 2024-10-16
dc.description IEEE SMC; IEEE Turkiye Section en_US
dc.description.abstract TextNetTopics is an innovative Latent Dirichlet Allocation-based topic selection method for training text classification models. One main limitation is its computationally intensive scoring mechanism, especially when applied to many topics. This scoring mechanism involves training a machine learning model (i.e., Random Forest) on each topic using the Monte-Carlo Cross-Validation approach and assigning a score value based on a specific performance metric (e.g., accuracy or F1-score). Moreover, the measured score does not account for the interactions between all features residing in all topics. This paper presents a new topic-scoring mechanism called Topic Importance Scoring. This computationally efficient approach trains a Random Forest model on all topics simultaneously and leverages the extracted feature importance values to give each topic a score reflecting its classification potential. The experiments on three diverse datasets confirm that the proposed method's performance is superior to the Topic Performance Scoring, which was used in the original TextNetTopics method. © 2024 Elsevier B.V., All rights reserved. en_US
dc.description.sponsorship IEEE SMC; IEEE Turkiye Section
dc.identifier.doi 10.1109/ASYU62119.2024.10757168
dc.identifier.isbn 9798350379433
dc.identifier.scopus 2-s2.0-85213388261
dc.identifier.uri https://doi.org/10.1109/ASYU62119.2024.10757168
dc.identifier.uri https://hdl.handle.net/20.500.12573/4762
dc.language.iso en en_US
dc.publisher Institute of Electrical and Electronics Engineers Inc. en_US
dc.relation.ispartof -- 2024 Innovations in Intelligent Systems and Applications Conference, ASYU 2024 -- Ankara -- 204562 en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Machine Learning en_US
dc.subject Text Classification en_US
dc.subject Topic Modeling en_US
dc.subject Topic Scoring en_US
dc.subject Topic Selection en_US
dc.subject Contrastive Learning en_US
dc.subject Decision Trees en_US
dc.subject Intelligent Systems en_US
dc.subject Random Forests en_US
dc.subject Latent Dirichlet Allocation en_US
dc.subject Machine-Learning en_US
dc.subject Performance en_US
dc.subject Selection Methods en_US
dc.subject Text Classification en_US
dc.subject Text Classification Models en_US
dc.subject Topic Modeling en_US
dc.subject Topic Scoring en_US
dc.subject Topic Selection en_US
dc.subject Adversarial Machine Learning en_US
dc.title TextNetTopics_TIS: Enhancing Textnettopics With Random Forest-Based Topic Importance Scoring en_US
dc.type Conference Object en_US
dspace.entity.type Publication
gdc.author.scopusid 57200259158
gdc.author.scopusid 25932029800
gdc.author.scopusid 14029389000
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C5
gdc.coar.access metadata only access
gdc.coar.type text::conference output
gdc.collaboration.industrial false
gdc.description.department Abdullah Gül University en_US
gdc.description.departmenttemp [Voskergian] Daniel, Department of Computer Engineering, Al-Quds University, Abu Dis, Palestine; [Bakir-Güngör] Burcu, Department of Computer Engineering, Abdullah Gül Üniversitesi, Kayseri, Turkey; [Yousef] Malik, Department of Information Systems, Zefat College, Safad, Israel en_US
gdc.description.endpage 6
gdc.description.publicationcategory Konferans Öğesi - Uluslararası - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality N/A
gdc.description.startpage 1
gdc.description.wosquality N/A
gdc.identifier.openalex W4406267498
gdc.index.type Scopus
gdc.oaire.diamondjournal false
gdc.oaire.impulse 0.0
gdc.oaire.influence 2.3811355E-9
gdc.oaire.isgreen false
gdc.oaire.popularity 2.2424942E-9
gdc.oaire.publicfunded false
gdc.openalex.collaboration International
gdc.openalex.fwci 0.39
gdc.openalex.normalizedpercentile 0.71
gdc.opencitations.count 1
gdc.plumx.scopuscites 1
gdc.scopus.citedcount 1
relation.isAuthorOfPublication.latestForDiscovery e17be1f8-1c9a-45f2-bf0d-f8b348d2dba0
relation.isOrgUnitOfPublication.latestForDiscovery 665d3039-05f8-4a25-9a3c-b9550bffecef

Files