Lecture presentations multimodal dataset: Towards understanding multimodality in educational videos

DW Lee, C Ahuja, PP Liang, S Natu… - Proceedings of the …, 2023 - openaccess.thecvf.com
Many educational videos use slide presentations, a sequence of visual pages that contain
text and figures accompanied by spoken language, which are constructed and presented …

GenAssist: Making image generation accessible

M Huh, YH Peng, A Pavel - Proceedings of the 36th Annual ACM …, 2023 - dl.acm.org
Blind and low vision (BLV) creators use images to communicate with sighted audiences.
However, creating or retrieving images is challenging for BLV creators as it is difficult to use …

Social, environmental, and technical: Factors at play in the current use and future design of small-group captioning

EJ McDonnell, P Liu, SM Goodman… - Proceedings of the …, 2021 - dl.acm.org
Real-time captioning is a critical accessibility tool for many d/Deaf and hard of hearing
(DHH) people. While the vast majority of captioning work has focused on formal settings and …

Diffscriber: Describing Visual Design Changes to Support Mixed-Ability Collaborative Presentation Authoring

YH Peng, J Wu, J Bigham, A Pavel - Proceedings of the 35th Annual …, 2022 - dl.acm.org
Visual slide-based presentations are ubiquitous, yet slide authoring tools are largely
inaccessible to people who are blind or visually impaired (BVI). When authoring …

Beyond Instructions: A Taxonomy of Information Types in How-to Videos

S Yang, S Kwak, J Lee, J Kim - Proceedings of the 2023 CHI Conference …, 2023 - dl.acm.org
How-to videos are rich in information—they not only give instructions but also provide
justifications or descriptions. People seek different information to meet their needs, and …

Slidecho: Flexible non-visual exploration of presentation videos

YH Peng, JP Bigham, A Pavel - Proceedings of the 23rd International …, 2021 - dl.acm.org
We present Slidecho, a system that enables non-visual access of the slide content in a
presentation video on-demand. Slidecho automatically extracts slides and their text and …

Slide Gestalt: Automatic Structure Extraction in Slide Decks for Non-Visual Access

YH Peng, P Chi, A Kannan, MR Morris… - Proceedings of the 2023 …, 2023 - dl.acm.org
Presentation slides commonly use visual patterns for structural navigation, such as titles,
dividers, and build slides. However, screen readers do not capture such intention, making it …

Supporting novices author audio descriptions via automatic feedback

R Natalie, J Tseng, H Kacorri, K Hara - … of the 2023 CHI Conference on …, 2023 - dl.acm.org
Audio descriptions (AD) make videos accessible to those who cannot see them. But many
videos lack AD and remain inaccessible as traditional approaches involve expensive …

HEAL: A knowledge graph for distress management conversations

A Welivita, P Pu - Proceedings of the AAAI Conference on Artificial …, 2022 - ojs.aaai.org
The demands of the modern world are increasingly responsible for causing psychological
burdens and bringing adverse impacts on our mental health. As a result, neural …

Exploring Community-Driven Descriptions for Making Livestreams Accessible

D Killough, A Pavel - Proceedings of the 25th International ACM …, 2023 - dl.acm.org
People watch livestreams to connect with others and learn about their hobbies. Livestreams
feature multiple visual streams including the main video, webcams, on-screen overlays, and …