Bestow: Efficient and streamable speech language model with the best of two worlds in gpt and t5

文章

学术资源搜索

获得 2 条结果（用时0.02秒）

我的图书馆

Bestow: Efficient and streamable speech language model with the best of two worlds in gpt and t5

在引用文章中搜索

[PDF] arxiv.org

EMMeTT: Efficient Multimodal Machine Translation Training

P Żelasko, Z Chen, M Wang, D Galvez… - arXiv preprint arXiv …, 2024 - arxiv.org

A rising interest in the modality extension of foundation language models warrants
discussion on the most effective, and efficient, multimodal training approach. This work …

[PDF] arxiv.org

Language Model Can Listen While Speaking

Z Ma, Y Song, C Du, J Cong, Z Chen, Y Wang… - arXiv preprint arXiv …, 2024 - arxiv.org

Dialogue serves as the most natural manner of human-computer interaction (HCI). Recent
advancements in speech language models (SLM) have significantly enhanced speech …