Tada: Task-agnostic dialect adapters for english

W Held, C Ziems, D Yang - arXiv preprint arXiv:2305.16651, 2023 - arxiv.org
Large Language Models, the dominant starting point for Natural Language Processing
(NLP) applications, fail at a higher rate for speakers of English dialects other than Standard …

DADA: Dialect adaptation via dynamic aggregation of linguistic rules

Y Liu, W Held, D Yang - Proceedings of the 2023 Conference on …, 2023 - aclanthology.org
Existing large language models (LLMs) that mainly focus on Standard American English
(SAE) often lead to significantly worse performance when being applied to other English …

Task-agnostic low-rank adapters for unseen English dialects

Z Xiao, W Held, Y Liu, D Yang - arXiv preprint arXiv:2311.00915, 2023 - arxiv.org
Large Language Models (LLMs) are trained on corpora disproportionally weighted in favor
of Standard American English. As a result, speakers of other dialects experience …

Multi-VALUE: A framework for cross-dialectal English NLP

C Ziems, W Held, J Yang, J Dhamala, R Gupta… - arXiv preprint arXiv …, 2022 - arxiv.org
Dialect differences caused by regional, social, and economic factors cause performance
discrepancies for many groups of language technology users. Inclusive and equitable …

VALUE: Understanding dialect disparity in NLU

C Ziems, J Chen, C Harris, J Anderson… - arXiv preprint arXiv …, 2022 - arxiv.org
English Natural Language Understanding (NLU) systems have achieved great
performances and even outperformed humans on benchmarks like GLUE and SuperGLUE …

DIALECTBENCH: A NLP Benchmark for Dialects, Varieties, and Closely-Related Languages

F Faisal, O Ahia, A Srivastava, K Ahuja… - arXiv preprint arXiv …, 2024 - arxiv.org
Language technologies should be judged on their usefulness in real-world use cases. An
often overlooked aspect in natural language processing (NLP) research and evaluation is …

Occam's adaptation: A comparison of interpolation of bases adaptation methods for multi-dialect acoustic modeling with LSTMs

M Grace, M Bastani, E Weinstein - 2018 IEEE Spoken …, 2018 - ieeexplore.ieee.org
Multidialectal languages can pose challenges for acoustic modeling. Past research has
shown that with a large training corpus but without explicit modeling of inter-dialect …

{PUZZLE}: Efficiently Aligning Large Language Models through {Light-Weight} Context Switch

K Lei, Y Jin, M Zhai, K Huang, H Ye, J Zhai - 2024 USENIX Annual …, 2024 - usenix.org
Aligning Large Language Models (LLMs) is currently the primary method to ensure AI
systems operate in an ethically responsible and socially beneficial manner. Its paradigm …

Towards a deep multi-layered dialectal language analysis: A case study of African-American English

J Dacon - arXiv preprint arXiv:2206.08978, 2022 - arxiv.org
Currently, natural language processing (NLP) models proliferate language discrimination
leading to potentially harmful societal impacts as a result of biased outcomes. For example …

Help from the neighbors: Estonian dialect normalization using a Finnish dialect generator

M Hämäläinen, K Alnajjar, T Tuisk - Proceedings of the Third …, 2022 - aclanthology.org
While standard Estonian is not a low-resourced language, the different dialects of the
language are under-resourced from the point of view of NLP, given that there are no vast …