[HTML][HTML] Multi 3 WOZ: A Multilingual, Multi-Domain, Multi-Parallel Dataset for Training and Evaluating Culturally Adapted Task-Oriented Dialog Systems

S Hu, H Zhou, M Hergul, M Gritta, G Zhang… - Transactions of the …, 2023 - direct.mit.edu
Creating high-quality annotated data for task-oriented dialog (ToD) is known to be
notoriously difficult, and the challenges are amplified when the goal is to create equitable …

Analyzing and Adapting Large Language Models for Few-Shot Multilingual NLU: Are We There Yet?

E Razumovskaia, I Vulić, A Korhonen - arXiv preprint arXiv:2403.01929, 2024 - arxiv.org
Supervised fine-tuning (SFT), supervised instruction tuning (SIT) and in-context learning
(ICL) are three alternative, de facto standard approaches to few-shot learning. ICL has …

Sqatin: Supervised instruction tuning meets question answering for improved dialogue nlu

E Razumovskaia, G Glavaš, A Korhonen… - arXiv preprint arXiv …, 2023 - arxiv.org
Task-oriented dialogue (ToD) systems help users execute well-defined tasks across a
variety of domains (eg, $\textit {flight booking} $ or $\textit {food ordering} $), with their …

A Systematic Study of Performance Disparities in Multilingual Task-Oriented Dialogue Systems

S Hu, H Zhou, M Yuan, M Gritta, G Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Achieving robust language technologies that can perform well across the world's many
languages is a central goal of multilingual NLP. In this work, we take stock of and empirically …

Diaggpt: An llm-based chatbot with automatic topic management for task-oriented dialogue

L Cao - arXiv preprint arXiv:2308.08043, 2023 - arxiv.org
Large Language Models (LLMs), such as ChatGPT, are becoming increasingly
sophisticated, demonstrating capabilities that closely resemble those of humans. These AI …

OpenSLU: A Unified, Modularized, and Extensible Toolkit for Spoken Language Understanding

L Qin, Q Chen, X Xu, Y Feng, W Che - arXiv preprint arXiv:2305.10231, 2023 - arxiv.org
Spoken Language Understanding (SLU) is one of the core components of a task-oriented
dialogue system, which aims to extract the semantic meaning of user queries (eg, intents …

Romanization-based large-scale adaptation of multilingual language models

S Purkayastha, S Ruder, J Pfeiffer, I Gurevych… - arXiv preprint arXiv …, 2023 - arxiv.org
Large multilingual pretrained language models (mPLMs) have become the de facto state of
the art for cross-lingual transfer in NLP. However, their large-scale deployment to many …

NegotiationToM: A Benchmark for Stress-testing Machine Theory of Mind on Negotiation Surrounding

C Chan, C Jiayang, Y Yim, Z Deng, W Fan, H Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Large Language Models (LLMs) have sparked substantial interest and debate concerning
their potential emergence of Theory of Mind (ToM) ability. Theory of mind evaluations …

Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond

B Lee, I Calapodescu, M Gaido, M Negri… - arXiv preprint arXiv …, 2024 - arxiv.org
We present Speech-MASSIVE, a multilingual Spoken Language Understanding (SLU)
dataset comprising the speech counterpart for a portion of the MASSIVE textual corpus …

M2QA: Multi-domain Multilingual Question Answering

L Engländer, H Sterz, C Poth, J Pfeiffer… - arXiv preprint arXiv …, 2024 - arxiv.org
Generalization and robustness to input variation are core desiderata of machine learning
research. Language varies along several axes, most importantly, language instance (eg …