A Comparative Analysis on Medical Article Classification Using Text Mining & Machine Learning Algorithms
Loading...
Date
2021
Journal Title
Journal ISSN
Volume Title
Publisher
Institute of Electrical and Electronics Engineers Inc.
Open Access Color
Green Open Access
No
OpenAIRE Downloads
OpenAIRE Views
Publicly Funded
No
Abstract
The document classification task is one of the widely studied research fields on multiple domains. The core motivation of the classification task is that the manual classification efforts are impractical due to the exponentially growing document volumes. Thus, we densely need to exploit automated computational approaches, such as machine learning models along with data & text mining techniques. In this study, we concentrated on the classification of medical articles specifically on common cancer types, due to the significance of the field and the decent number of available documents of interest. We deliberately targeted MEDLINE articles about common cancer types because most cancer types share a similar literature composition. Therefore, this situation makes the classification effort relatively more complicated. To this end, we built multiple machine learning models, including both traditional and deep learning architectures. We achieved the best performance (R¿82% F score) by the LSTM model. Overall, our results demonstrate a strong effect of exploiting both text mining and machine learning methods to distinguish medical articles on common cancer types. © 2022 Elsevier B.V., All rights reserved.
Description
Keywords
Deep Learning, Document Classification, Machine Learning, Text Mining, Data Mining, Information Retrieval Systems, Learning Algorithms, Long Short-Term Memory, Natural Language Processing Systems, Text Processing, Classification Tasks, Comparative Analyzes, Deep Learning, Document Classification, Machine Learning Algorithms, Machine Learning Models, Mining Machines, Multiple Domains, Research Fields, Text-Mining, Diseases
Fields of Science
0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology
Citation
WoS Q
N/A
Scopus Q
N/A

OpenCitations Citation Count
5
Source
-- 6th International Conference on Computer Science and Engineering, UBMK 2021 -- Ankara -- 176826
Volume
Issue
Start Page
360
End Page
365
Collections
PlumX Metrics
Citations
Scopus : 7
Captures
Mendeley Readers : 2
SCOPUS™ Citations
7
checked on Mar 28, 2026
Google Scholar™

OpenAlex FWCI
2.0751
Sustainable Development Goals
3
GOOD HEALTH AND WELL-BEING


