BARTpho: pre-trained sequence-to-sequence models for Vietnamese

NL Tran, DM Le, DQ Nguyen - arXiv preprint arXiv:2109.09701, 2021 - arxiv.org
We present BARTpho with two versions, BARTpho-syllable and BARTpho-word, which are
the first public large-scale monolingual sequence-to-sequence models pre-trained for …

Social, environmental, and technical: Factors at play in the current use and future design of small-group captioning

EJ McDonnell, P Liu, SM Goodman… - Proceedings of the …, 2021 - dl.acm.org
Real-time captioning is a critical accessibility tool for many d/Deaf and hard of hearing
(DHH) people. While the vast majority of captioning work has focused on formal settings and …

Capitalization and punctuation restoration: a survey

V Păiş, D Tufiş - Artificial Intelligence Review, 2022 - Springer
Ensuring proper punctuation and letter casing is a key pre-processing step towards applying
complex natural language processing algorithms. This is especially significant for textual …

“Easier or Harder, Depending on Who the Hearing Person Is”: Codesigning Videoconferencing Tools for Small Groups with Mixed Hearing Status

EJ McDonnell, SH Moon, L Jiang… - Proceedings of the …, 2023 - dl.acm.org
With improvements in automated speech recognition and increased use of
videoconferencing, real-time captioning has changed significantly. This shift toward broadly …

Efficient automatic punctuation restoration using bidirectional transformers with robust inference

M Courtland, A Faulkner… - Proceedings of the 17th …, 2020 - aclanthology.org
Though people rarely speak in complete sentences, punctuation confers many benefits to
the readers of transcribed speech. Unfortunately, most ASR systems do not produce …

Visualization of Speech Prosody and Emotion in Captions: Accessibility for Deaf and Hard-of-Hearing Users

C de Lacerda Pataca, M Watkins, R Peiris… - Proceedings of the …, 2023 - dl.acm.org
Speech is expressive in ways that caption text does not capture, with emotion or emphasis
information not conveyed. We interviewed eight Deaf and Hard-of-Hearing (DHH) individuals …

Unified multimodal punctuation restoration framework for mixed-modality corpus

Y Zhu, L Wu, S Cheng, M Wang - ICASSP 2022-2022 IEEE …, 2022 - ieeexplore.ieee.org
The punctuation restoration task aims to correctly punctuate the output transcriptions of
automatic speech recognition systems. Previous punctuation models, either using text only …

Making punctuation restoration robust and fast with multi-task learning and knowledge distillation

M Hentschel, E Tsunoo, T Okuda - ICASSP 2021-2021 IEEE …, 2021 - ieeexplore.ieee.org
In punctuation restoration, we try to recover the missing punctuation from automatic speech
recognition output to improve understandability. Currently, large pre-trained transformers …

Deaf and hard-of-hearing users' prioritization of genres of online video content requiring accurate captions

L Berke, M Seita, M Huenerfauth - … of the 17th International Web for All …, 2020 - dl.acm.org
Online video is an important information source, yet its pace of growth, including
user-submitted content, is so rapid that automatic captioning technologies are needed to make …

Understanding Social and Environmental Factors to Enable Collective Access Approaches to the Design of Captioning Technology

E McDonnell - Proceedings of the 24th International ACM …, 2022 - dl.acm.org
Oftentimes human-computer interaction (HCI) accessibility research designs technology to
support Deaf and disabled people in their existing social contexts. I, instead, propose an …