End-to-end dependency parsing of spoken french

LeBenchmark 2.0: A standardized, replicable and enhanced framework for self-supervised representations of French speech

T Parcollet, H Nguyen, S Evain, MZ Boito… - Computer Speech & …, 2024 - Elsevier

Self-supervised learning (SSL) is at the origin of unprecedented improvements in many
different domains including computer vision and natural language processing. Speech …

被引用次数：20 相关文章所有 13 个版本

[PDF] arxiv.org

Audio-visual neural syntax acquisition

CIJ Lai, F Shi, P Peng, Y Kim, K Gimpel… - 2023 IEEE Automatic …, 2023 - ieeexplore.ieee.org

We study phrase structure induction from visually-grounded speech. The core idea is to first
segment the speech waveform into sequences of word segments, and subsequently induce …

被引用次数：4 相关文章所有 7 个版本

[PDF] arxiv.org

Cascading and direct approaches to unsupervised constituency parsing on spoken sentences

Y Tseng, CIJ Lai, H Lee - ICASSP 2023-2023 IEEE …, 2023 - ieeexplore.ieee.org

Past work on unsupervised parsing is constrained to written form. In this paper, we present
the first study on unsupervised spoken constituency parsing given unlabeled spoken …

被引用次数：5 相关文章所有 3 个版本

[PDF] arxiv.org

Learning Language Structures through Grounding

F Shi - arXiv preprint arXiv:2406.09662, 2024 - arxiv.org

Language is highly structured, with syntactic and semantic structures, to some extent,
agreed upon by speakers of the same language. With implicit or explicit awareness of such …

被引用次数：2 相关文章

被引用次数：1 相关文章

[PDF] univ-grenoble-alpes.fr

PROPICTO: Developing Speech‑to‑Pictograph Translation Systems to Enhance Communication Accessibility

L Ormaechea, P Bouillon… - … Conference of The …, 2023 - hal.univ-grenoble-alpes.fr

PROPICTO is a project funded by the French National Research Agency and the Swiss
National Science Foundation, that aims at creating Speech-to-Pictograph translation …