NLU++: A multi-label, slot-rich, generalisable dataset for natural language understanding in task-oriented dialogue

I Casanueva, I Vulić, GP Spithourakis… - arXiv preprint arXiv …, 2022 - arxiv.org
We present NLU++, a novel dataset for natural language understanding (NLU) in task-
oriented dialogue (ToD) systems, with the aim to provide a much more challenging …

Robustness testing of language understanding in task-oriented dialog

J Liu, R Takanobu, J Wen, D Wan, H Li, W Nie… - arXiv preprint arXiv …, 2020 - arxiv.org
Most language understanding models in task-oriented dialog systems are trained on a small
amount of annotated training data, and evaluated in a small set from the same distribution …

Ma-dst: Multi-attention-based scalable dialog state tracking

A Kumar, P Ku, A Goyal, A Metallinou… - Proceedings of the AAAI …, 2020 - aaai.org
Task oriented dialog agents provide a natural language interface for users to complete their
goal. Dialog State Tracking (DST), which is often a core component of these systems, tracks …

User utterance acquisition for training task-oriented bots: a review of challenges, techniques and opportunities

MA Yaghoub-Zadeh-Fard, B Benatallah… - IEEE Internet …, 2020 - ieeexplore.ieee.org
Building conversational task-oriented bots requires large and diverse sets of annotated user
utterances to learn mappings between natural language utterances and user intents. Given …

Outlier detection for improved data quality and diversity in dialog systems

S Larson, A Mahendran, A Lee, JK Kummerfeld… - arXiv preprint arXiv …, 2019 - arxiv.org
In a corpus of data, outliers are either errors: mistakes in the data that are counterproductive,
or are unique: informative samples that improve model robustness. Identifying outliers can …

Micromodels for efficient, explainable, and reusable systems: A case study on mental health

A Lee, JK Kummerfeld, LC An, R Mihalcea - arXiv preprint arXiv …, 2021 - arxiv.org
Many statistical models have high accuracy on test benchmarks, but are not explainable,
struggle in low-resource scenarios, cannot be reused for multiple tasks, and cannot easily …

A wizard of oz study simulating api usage dialogues with a virtual assistant

Z Eberhart, A Bansal, C Mcmillan - IEEE Transactions on …, 2020 - ieeexplore.ieee.org
Virtual Assistant technology is rapidly proliferating to improve productivity in a variety of
tasks. While several virtual assistants for everyday tasks are well-known (eg, Siri, Cortana …

Iterative feature mining for constraint-based data collection to increase data diversity and model robustness

S Larson, A Zheng, A Mahendran… - Proceedings of the …, 2020 - aclanthology.org
Diverse data is crucial for training robust models, but crowdsourced text often lacks diversity
as workers tend to write simple variations from prompts. We propose a general approach for …

One Agent To Rule Them All: Towards Multi-agent Conversational AI

C Clarke, JJ Peper, K Krishnamurthy… - arXiv preprint arXiv …, 2022 - arxiv.org
The increasing volume of commercially available conversational agents (CAs) on the market
has resulted in users being burdened with learning and adopting multiple agents to …

More diverse dialogue datasets via diversity-informed data collection

K Stasaski, GH Yang, MA Hearst - … of the 58th annual meeting of …, 2020 - aclanthology.org
Automated generation of conversational dialogue using modern neural architectures has
made notable advances. However, these models are known to have a drawback of often …