Beyond Visual Cues: Emotion Recognition in Images With Text-Aware Fusion

dc.contributor.author Sungur, Kerim Serdar
dc.contributor.author Bakal, Gokhan
dc.date.accessioned 2025-09-25T10:41:36Z
dc.date.available 2025-09-25T10:41:36Z
dc.date.issued 2025-04
dc.description Bakal, Mehmet/0000-0003-2897-3894 en_US
dc.description.abstract Sentiment analysis is a widely studied problem for understanding human emotions and potential outcomes. As it can be performed over textual data, working on visual data elements is also critically substantial to examining the current emotional status. In this effort, the aim is to investigate any potential enhancements in sentiment analysis predictions through visual instances by integrating textual data as additional knowledge reflecting the contextual information of the images. Thus, two separate models have been developed as image-processing and text-processing models in which both models were trained on distinct datasets comprising the same five human emotions. Following, the outputs of the individual models' last dense layers are combined to construct the hybrid multimodel empowered by visual and textual components. The fundamental focus is to evaluate the performance of the hybrid model in which the textual knowledge is concatenated with visual data. Essentially, the hybrid model achieved nearly a 3% F1-score improvement compared to the plain image classification model utilizing convolutional neural network architecture. In essence, this research underscores the potency of fusing textual context with visual information to refine sentiment analysis predictions. The findings not only emphasize the potential of a multi-modal approach but also spotlight a promising avenue for future advancements in emotion analysis and understanding. en_US
dc.identifier.doi 10.1016/j.displa.2024.102958
dc.identifier.issn 0141-9382
dc.identifier.issn 1872-7387
dc.identifier.scopus 2-s2.0-85213972328
dc.identifier.uri https://doi.org/10.1016/j.displa.2024.102958
dc.identifier.uri https://hdl.handle.net/20.500.12573/3370
dc.language.iso en en_US
dc.publisher Elsevier en_US
dc.relation.ispartof Displays en_US
dc.rights info:eu-repo/semantics/closedAccess en_US
dc.subject Sentiment Analysis en_US
dc.subject Hybrid Model en_US
dc.subject Image & Text Processing en_US
dc.subject Deep Learning en_US
dc.subject Deep Learning en_US
dc.title Beyond Visual Cues: Emotion Recognition in Images With Text-Aware Fusion en_US
dc.type Article en_US
dspace.entity.type Publication
gdc.author.id Bakal, Mehmet/0000-0003-2897-3894
gdc.author.scopusid 59498595800
gdc.author.scopusid 57074041500
gdc.author.wosid Bakal, Mehmet Gokhan/Aat-2797-2020
gdc.bip.impulseclass C5
gdc.bip.influenceclass C5
gdc.bip.popularityclass C4
gdc.coar.access metadata only access
gdc.coar.type text::journal::journal article
gdc.collaboration.industrial false
gdc.description.department Abdullah Gül University en_US
gdc.description.departmenttemp [Sungur, Kerim Serdar; Bakal, Gokhan] Abdullah Gul Univ, Dept Comp Engn, Erkilet Blvd Sumer Campus, TR-38080 Kayseri, Turkiye en_US
gdc.description.publicationcategory Makale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanı en_US
gdc.description.scopusquality Q3
gdc.description.startpage 102958
gdc.description.volume 87 en_US
gdc.description.woscitationindex Science Citation Index Expanded
gdc.description.wosquality Q2
gdc.identifier.openalex W4406011794
gdc.identifier.wos WOS:001409063900001
gdc.index.type WoS
gdc.index.type Scopus
gdc.oaire.diamondjournal false
gdc.oaire.impulse 3.0
gdc.oaire.influence 2.6415303E-9
gdc.oaire.isgreen false
gdc.oaire.popularity 4.868141E-9
gdc.oaire.publicfunded false
gdc.openalex.collaboration National
gdc.openalex.fwci 4.53
gdc.openalex.normalizedpercentile 0.94
gdc.openalex.toppercent TOP 10%
gdc.opencitations.count 1
gdc.plumx.mendeley 5
gdc.plumx.scopuscites 5
gdc.scopus.citedcount 5
gdc.wos.citedcount 4
relation.isAuthorOfPublication.latestForDiscovery 53ed538c-20d9-45c8-af59-7fa4d1b90cf7
relation.isOrgUnitOfPublication.latestForDiscovery 665d3039-05f8-4a25-9a3c-b9550bffecef

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Name:
1-s2.0-S0141938224003226-main.pdf
Size:
1.08 MB
Format:
Adobe Portable Document Format
Description:
Watermarked PDF

License bundle

Now showing 1 - 1 of 1
Loading...
Name:
license.txt
Size:
1.44 KB
Format:
Item-specific license agreed upon to submission
Description: