[HTML][HTML] Navigating the Multimodal Landscape: A Review on Integration of Text and Image Data in Machine Learning Architectures

M Binte Rashid, MS Rahaman, P Rivas - Machine Learning and …, 2024 - mdpi.com
Images and text have become essential parts of the multimodal machine learning (MMML)
framework in today's world because data are always available, and technological …

[HTML][HTML] Deepfake tweets classification using stacked Bi-LSTM and words embedding

V Rupapara, F Rustam, A Amaar… - PeerJ Computer …, 2021 - peerj.com
The spread of altered media in the form of fake videos, audios, and images, has been
largely increased over the past few years. Advanced digital manipulation tools and …

Multimodal sentiment analysis using deep learning and fuzzy logic: A comprehensive survey

HN Do, HT Phan, NT Nguyen - Applied Soft Computing, 2024 - Elsevier
Multimodal sentiment analysis (MSA) is the process of identifying sentiment polarities that
users may simultaneously display in text, audio, and video data. Sentiment analysis …

Collaborative fine-grained interaction learning for image–text sentiment analysis

X Xiao, Y Pu, D Zhou, J Cao, J Gu, Z Zhao… - Knowledge-Based Systems, 2023 - Elsevier
Investigating interactions between image and text can effectively improve image–text
sentiment analysis, but most existing methods do not explore image–text interaction at fine …

Image–text sentiment analysis via context guided adaptive fine-tuning transformer

X Xiao, Y Pu, Z Zhao, R Nie, D Xu, W Qian… - Neural Processing …, 2023 - Springer
Compared with single-modal content, multimodal content conveys user's sentiments and
feelings more vividly. Thus, multimodal sentiment analysis has become a research hotspot …

Prediction of sentiment polarity in restaurant reviews using an ordinal regression approach based on evolutionary XGBoost

DA Al-Qudah, AZ Ala'M, AI Cristea… - PeerJ Computer …, 2025 - peerj.com
As the business world shifts to the web and tremendous amounts of data become available
on multilingual mobile applications, new business and research challenges and …

Dynamic Weighted Gating for Enhanced Cross-Modal Interaction in Multimodal Sentiment Analysis

N Wang, Q Wang - ACM Transactions on Multimedia Computing …, 2024 - dl.acm.org
Advancements in Multimodal Sentiment Analysis (MSA) have predominantly focused on
leveraging the interdependence of text, acoustic, and visual modalities to enhance …

CHEF: A Framework for Deploying Heterogeneous Models on Clusters With Heterogeneous FPGAs

Y Tang, Y Song, N Elango, SR Priya… - … on Computer-Aided …, 2024 - ieeexplore.ieee.org
Deep neural networks (DNNs) are rapidly evolving from streamlined single-modality single-
task (SMST) to multimodality multitask (MMMT) with large variations for different layers and …

BIT: Improving image-text sentiment analysis via learning bidirectional image-text interaction

X Xiao, Y Pu, Z Zhao, J Gu, D Xu - 2023 International Joint …, 2023 - ieeexplore.ieee.org
Exploring the interaction between image and text has a great strength for image-text
sentiment analysis. However, most methods only focus on learning forward interaction in …

M5: Multi-modal Multi-task Model Mapping on Multi-FPGA with Accelerator Configuration Search

AK Kamath, S Abi-Karam, A Bhat… - … Design, Automation & …, 2023 - ieeexplore.ieee.org
Recent machine learning (ML) models have advanced from single-modality single-task to
multi-modality multi-task (MMMT). MMMT models typically have multiple backbones of …