EMMeTT: Efficient Multimodal Machine Translation Training
A rising interest in the modality extension of foundation language models warrants
discussion on the most effective, and efficient, multimodal training approach. This work …
Language Model Can Listen While Speaking
Dialogue serves as the most natural manner of human-computer interaction (HCI). Recent
advancements in speech language models (SLM) have significantly enhanced speech …