A Comparative Analysis on Medical Article Classification Using Text Mining & Machine Learning Algorithms

Loading...
Publication Logo

Date

2021

Journal Title

Journal ISSN

Volume Title

Publisher

Institute of Electrical and Electronics Engineers Inc.

Open Access Color

Green Open Access

No

OpenAIRE Downloads

OpenAIRE Views

Publicly Funded

No
Impulse
Average
Influence
Average
Popularity
Average

Research Projects

Journal Issue

Abstract

The document classification task is one of the widely studied research fields on multiple domains. The core motivation of the classification task is that the manual classification efforts are impractical due to the exponentially growing document volumes. Thus, we densely need to exploit automated computational approaches, such as machine learning models along with data & text mining techniques. In this study, we concentrated on the classification of medical articles specifically on common cancer types, due to the significance of the field and the decent number of available documents of interest. We deliberately targeted MEDLINE articles about common cancer types because most cancer types share a similar literature composition. Therefore, this situation makes the classification effort relatively more complicated. To this end, we built multiple machine learning models, including both traditional and deep learning architectures. We achieved the best performance (R¿82% F score) by the LSTM model. Overall, our results demonstrate a strong effect of exploiting both text mining and machine learning methods to distinguish medical articles on common cancer types. © 2022 Elsevier B.V., All rights reserved.

Description

Keywords

Deep Learning, Document Classification, Machine Learning, Text Mining, Data Mining, Information Retrieval Systems, Learning Algorithms, Long Short-Term Memory, Natural Language Processing Systems, Text Processing, Classification Tasks, Comparative Analyzes, Deep Learning, Document Classification, Machine Learning Algorithms, Machine Learning Models, Mining Machines, Multiple Domains, Research Fields, Text-Mining, Diseases

Fields of Science

0202 electrical engineering, electronic engineering, information engineering, 02 engineering and technology

Citation

WoS Q

N/A

Scopus Q

N/A
OpenCitations Logo
OpenCitations Citation Count
5

Source

-- 6th International Conference on Computer Science and Engineering, UBMK 2021 -- Ankara -- 176826

Volume

Issue

Start Page

360

End Page

365
PlumX Metrics
Citations

Scopus : 7

Captures

Mendeley Readers : 2

SCOPUS™ Citations

7

checked on Mar 28, 2026

Google Scholar Logo
Google Scholar™
OpenAlex Logo
OpenAlex FWCI
2.0751

Sustainable Development Goals

3

GOOD HEALTH AND WELL-BEING
GOOD HEALTH AND WELL-BEING Logo