Review of Feature Selection Approaches Based on Grouping of Features

dc.contributor.author Kuzudisli, Cihan
dc.contributor.author Bakir-Gungor, Burcu
dc.contributor.author Bulut, Nurten
dc.contributor.author Qaqish, Bahjat
dc.contributor.author Yousef, Malik
dc.date.accessioned 2025-09-25T10:56:31Z
dc.date.available 2025-09-25T10:56:31Z
dc.date.issued 2023
dc.description.abstract With the rapid development in technology, large amounts of high-dimensional data have been generated. This high dimensionality including redundancy and irrelevancy poses a great challenge in data analysis and decision making. Feature selection (FS) is an effective way to reduce dimensionality by eliminating redundant and irrelevant data. Most traditional FS approaches score and rank each feature individually; and then perform FS either by eliminating lower ranked features or by retaining highly -ranked features. In this review, we discuss an emerging approach to FS that is based on initially grouping features, then scoring groups of features rather than scoring individual features. Despite the presence of reviews on clustering and FS algorithms, to the best of our knowledge, this is the first review focusing on FS techniques based on grouping. The typical idea behind FS through grouping is to generate groups of similar features with dissimilarity between groups, then select representative features from each cluster. Approaches under supervised, unsupervised, semi supervised and integrative frameworks are explored. The comparison of experimental results indicates the effectiveness of sequential, optimization-based (i.e., fuzzy or evolutionary), hybrid and multi-method approaches. When it comes to biological data, the involvement of external biological sources can improve analysis results. We hope this work's findings can guide effective design of new FS approaches using feature grouping. en_US
dc.description.sponsorship Zefat Academic College; Abdullah Gul University Support Foundation (AGUV) en_US
dc.description.sponsorship & nbsp;This work has been supported by the Zefat Academic College. Burcu Bakir-Gungor's work has been supported by the Abdullah Gul University Support Foundation (AGUV) . The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. en_US
dc.description.sponsorship Abdullah Gul University Support Foundation; Zefat Academic College
dc.identifier.doi 10.7717/peerj.15666
dc.identifier.issn 2167-8359
dc.identifier.scopus 2-s2.0-85168542613
dc.identifier.uri https://doi.org/10.7717/peerj.15666
dc.identifier.uri https://hdl.handle.net/20.500.12573/4572
dc.language.iso en en_US
dc.publisher PeerJ Inc en_US
dc.relation.ispartof PeerJ en_US
dc.rights info:eu-repo/semantics/openAccess en_US
dc.subject Feature Selection en_US
dc.subject Feature Grouping en_US
dc.subject Supervised en_US
dc.subject Unsupervised en_US
dc.subject Integrative en_US
dc.title Review of Feature Selection Approaches Based on Grouping of Features en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.scopusid 57219838821
gdc.author.scopusid 25932029800
gdc.author.scopusid 58545209000
gdc.author.scopusid 6603889265
gdc.author.scopusid 14029389000
gdc.bip.impulseclass C3
gdc.bip.influenceclass C4
gdc.bip.popularityclass C3
gdc.coar.access open access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department Abdullah Gül University en_US
gdc.description.departmenttemp [Kuzudisli, Cihan] Hasan Kalyoncu Univ, Dept Comp Engn, Gaziantep, Turkiye; [Kuzudisli, Cihan] Abdullah Gul Univ, Dept Elect & Comp Engn, Kayseri, Turkiye; [Bakir-Gungor, Burcu; Bulut, Nurten] Abdullah Gul Univ, Dept Comp Engn, Kayseri, Turkiye; [Qaqish, Bahjat] Univ North Carolina Chapel Hill, Dept Biostat, Chapel Hill, NC USA; [Yousef, Malik] Zefat Acad Coll, Dept Informat Syst, Safed, Israel; [Yousef, Malik] Zefat Acad Coll, Galilee Digital Hlth Res Ctr, Safed, Israel en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q3
gdc.description.startpage e15666
gdc.description.volume 11 en_US
gdc.description.woscitationindex Science Citation Index Expanded
gdc.description.wosquality Q2
gdc.identifier.openalex W4384525165
gdc.identifier.pmid 37483989
gdc.identifier.wos WOS:001034479400005
gdc.index.type WoS
gdc.index.type Scopus
gdc.index.type PubMed
gdc.oaire.accesstype GOLD
gdc.oaire.diamondjournal false
gdc.oaire.downloads 20
gdc.oaire.impulse 52.0
gdc.oaire.influence 5.0605995E-9
gdc.oaire.isgreen true
gdc.oaire.keywords QH301-705.5
gdc.oaire.keywords Bioinformatics
gdc.oaire.keywords Feature selection
gdc.oaire.keywords Feature grouping
gdc.oaire.keywords Integrative
gdc.oaire.keywords R
gdc.oaire.keywords Medicine
gdc.oaire.keywords Biology (General)
gdc.oaire.keywords Supervised
gdc.oaire.keywords Algorithms
gdc.oaire.keywords Unsupervised
gdc.oaire.popularity 4.1169326E-8
gdc.oaire.publicfunded false
gdc.oaire.views 59
gdc.openalex.collaboration International
gdc.openalex.fwci 9.3956
gdc.openalex.normalizedpercentile 0.99
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 44
gdc.plumx.mendeley 77
gdc.plumx.newscount 1
gdc.plumx.pubmedcites 5
gdc.plumx.scopuscites 58
gdc.scopus.citedcount 58
gdc.virtual.author Güngör, Burcu
gdc.virtual.author Bulut, Nurten
gdc.wos.citedcount 44
relation.isAuthorOfPublication e17be1f8-1c9a-45f2-bf0d-f8b348d2dba0
relation.isAuthorOfPublication d8bc96b3-4d8e-4e9b-bf89-f835b37044fa
relation.isAuthorOfPublication.latestForDiscovery e17be1f8-1c9a-45f2-bf0d-f8b348d2dba0
relation.isOrgUnitOfPublication 665d3039-05f8-4a25-9a3c-b9550bffecef
relation.isOrgUnitOfPublication 52f507ab-f278-4a1f-824c-44da2a86bd51
relation.isOrgUnitOfPublication ef13a800-4c99-4124-81e0-3e25b33c0c2b
relation.isOrgUnitOfPublication.latestForDiscovery 665d3039-05f8-4a25-9a3c-b9550bffecef

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
peerj-15666.pdf
Size:
1.09 MB
Format:
Adobe Portable Document Format
Description:
Makale Dosyası

License bundle

Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: