Active Speaker Detection using Audio, Visual and Depth Modalities: A Survey

SNAM Robi, MAZM Ariffin, MAM Izhar, N Ahmad… - IEEE …, 2024 - ieeexplore.ieee.org
The rapid progress of multimodal signal processing in recent years has cleared the way for
novel applications in human-computer interaction, surveillance, and telecommunication …

Audio-video fusion strategies for active speaker detection in meetings

L Pibre, F Madrigal, C Equoy, F Lerasle… - Multimedia Tools and …, 2023 - Springer
Meetings are a common activity in professional contexts, and it remains challenging to
endow vocal assistants with advanced functionalities to facilitate meeting management. In …

Speech Diarization and ASR with GMM

AK Sharma, V Bhavikatti, A Nidawani… - arXiv preprint arXiv …, 2023 - arxiv.org
In this research paper, we delve into the topics of Speech Diarization and Automatic Speech
Recognition (ASR). Speech diarization involves the separation of individual speakers within …

Design and Development of an Integrated Internet of Audio and Video Sensors for COVID-19 Coughing and Sneezing Recognition

S Kiaei, S Honarparvar, S Saeedi… - 2021 IEEE 12th Annual …, 2021 - ieeexplore.ieee.org
There are a lot of ongoing efforts to combat the COVID-19 pandemic using different
combinations of low-cost sensing technologies, information/communication technologies …