Multimodal data fusion: an overview of methods, challenges, and prospects

D Lahat, T Adali, C Jutten - Proceedings of the IEEE, 2015 - ieeexplore.ieee.org
In various disciplines, information about the same phenomenon can be acquired from
different types of detectors, at different conditions, in multiple experiments or subjects …

Audio surveillance: A systematic review

M Crocco, M Cristani, A Trucco, V Murino - ACM Computing Surveys …, 2016 - dl.acm.org
Despite surveillance systems becoming increasingly ubiquitous in our living environment,
automated surveillance, currently based on video sensory modality and machine …

Multimodal fusion for multimedia analysis: a survey

PK Atrey, MA Hossain, A El Saddik, MS Kankanhalli - Multimedia systems, 2010 - Springer
This survey aims at providing multimedia researchers with a state-of-the-art overview of
fusion strategies, which are used for combining multiple modalities in order to accomplish …

Multimodal fusion framework: A multiresolution approach for emotion classification and recognition from physiological signals

GK Verma, US Tiwary - NeuroImage, 2014 - Elsevier
The purpose of this paper is twofold:(i) to investigate the emotion representation models and
find out the possibility of a model with minimum number of continuous dimensions and (ii) to …

Video surveillance systems-current status and future trends

V Tsakanikas, T Dagiuklas - Computers & Electrical Engineering, 2018 - Elsevier
Within this survey an attempt is made to document the present status of video surveillance
systems. The main components of a surveillance system are presented and studied …

High-level event recognition in unconstrained videos

YG Jiang, S Bhattacharya, SF Chang… - International journal of …, 2013 - Springer
The goal of high-level event recognition is to automatically detect complex high-level events
in a given video sequence. This is a difficult task especially when videos are captured under …

Audiovisual fusion: Challenges and new approaches

AK Katsaggelos, S Bahaadini… - Proceedings of the …, 2015 - ieeexplore.ieee.org
In this paper, we review recent results on audiovisual (AV) fusion. We also discuss some of
the challenges and report on approaches to address them. One important issue in AV fusion …

Pixels that sound

E Kidron, YY Schechner, M Elad - 2005 IEEE Computer Society …, 2005 - ieeexplore.ieee.org
People and animals fuse auditory and visual information to obtain robust perception. A
particular benefit of such cross-modal analysis is the ability to localize visual events …

Audio-visual event recognition in surveillance video sequences

M Cristani, M Bicego, V Murino - IEEE Transactions on …, 2007 - ieeexplore.ieee.org
In the context of the automated surveillance field, automatic scene analysis and
understanding systems typically consider only visual information, whereas other modalities …

Multi-target DoA estimation with an audio-visual fusion mechanism

X Qian, M Madhavi, Z Pan, J Wang… - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
Most of the prior studies in the spatial Direction of Arrival (DoA) domain focus on a single
modality. However, humans use auditory and visual senses to detect the presence of sound …