Signing outside the studio: Benchmarking background robustness for continuous sign language...

Reviewing 25 years of continuous sign language recognition research: Advances, challenges, and prospects

S Alyami, H Luqman, M Hammoudeh - Information Processing & …, 2024 - Elsevier

Sign language is a form of visual communication employing hand gestures, body
movements, and facial expressions. The growing prevalence of hearing impairment has …

被引用次数：5 相关文章

[PDF] thecvf.com

CoSign: Exploring co-occurrence signals in skeleton-based continuous sign language recognition

P Jiao, Y Min, Y Li, X Wang, L Lei… - Proceedings of the …, 2023 - openaccess.thecvf.com

The co-occurrence signals (eg, hand shape, facial expression, and lip pattern) play a critical
role in Continuous Sign Language Recognition (CSLR). Compared to RGB data, skeleton …

被引用次数：14 相关文章所有 3 个版本

[PDF] thecvf.com

Generative bias for robust visual question answering

JW Cho, DJ Kim, H Ryu… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Abstract The task of Visual Question Answering (VQA) is known to be plagued by the issue
of VQA models exploiting biases within the dataset to make its final prediction. Various …

被引用次数：45 相关文章所有 10 个版本

A review on computational methods based automated sign language recognition system for hearing and speech impaired community

EJ Robert, HJ Duraisamy - Concurrency and Computation …, 2023 - Wiley Online Library

The recent advancements in computer vision and deep learning have led to promising
progress in various motion detection and gesture recognition methods. Thriving efforts in the …

被引用次数：9 相关文章

[PDF] arxiv.org

Self-sufficient framework for continuous sign language recognition

Y Jang, Y Oh, JW Cho, M Kim, DJ Kim… - ICASSP 2023-2023 …, 2023 - ieeexplore.ieee.org

The goal of this work is to develop self-sufficient framework for Continuous Sign Language
Recognition (CSLR) that addresses key issues of sign language recognition. These include …

被引用次数：13 相关文章所有 11 个版本

[PDF] arxiv.org

Systemic Biases in Sign Language AI Research: A Deaf-Led Call to Reevaluate Research Agendas

A Desai, M De Meulder, JA Hochgesang… - arXiv preprint arXiv …, 2024 - arxiv.org

Growing research in sign language recognition, generation, and translation AI has been
accompanied by calls for ethical development of such technologies. While these works are …

被引用次数：12 相关文章所有 4 个版本

[PDF] ieee.org

Semi-supervised image captioning by adversarially propagating labeled data

DJ Kim, TH Oh, J Choi, IS Kweon - IEEE Access, 2024 - ieeexplore.ieee.org

We present a novel data-efficient semi-supervised framework to improve the generalization
of image captioning models. Constructing a large-scale labeled image captioning dataset is …

被引用次数：6 相关文章所有 4 个版本

[PDF] arxiv.org

Slowfast Network for Continuous Sign Language Recognition

J Ahn, Y Jang, JS Chung - ICASSP 2024-2024 IEEE …, 2024 - ieeexplore.ieee.org

The objective of this work is the effective extraction of spatial and dynamic features for
Continuous Sign Language Recognition (CSLR). To accomplish this, we utilise a two …

被引用次数：9 相关文章所有 5 个版本

[PDF] acm.org

Let Me Finish My Sentence: Video Temporal Grounding with Holistic Text Understanding

J Woo, H Ryu, Y Jang, JW Cho, JS Chung - Proceedings of the 32nd …, 2024 - dl.acm.org

Video Temporal Grounding (VTG) aims to identify visual frames in a video clip that match
text queries. Recent studies in VTG employ cross-attention to correlate visual frames and …

Preserving Multi-Modal Capabilities of Pre-trained VLMs for Improving Vision-Linguistic Compositionality

Y Oh, JW Cho, DJ Kim, IS Kweon, J Kim - arXiv preprint arXiv:2410.05210, 2024 - arxiv.org

In this paper, we propose a new method to enhance compositional understanding in pre-
trained vision and language models (VLMs) without sacrificing performance in zero-shot …