Who validates the validators? aligning llm-assisted evaluation of llm outputs with human preferences

S Shankar, JD Zamfirescu-Pereira… - Proceedings of the 37th …, 2024 - dl.acm.org
Due to the cumbersome nature of human evaluation and limitations of code-based
evaluation, Large Language Models (LLMs) are increasingly being used to assist humans in …

Will you accept an imperfect ai? exploring designs for adjusting end-user expectations of ai systems

R Kocielnik, S Amershi, PN Bennett - … of the 2019 CHI Conference on …, 2019 - dl.acm.org
AI technologies have been incorporated into many end-user applications. However,
expectations of the capabilities of such systems vary among people. Furthermore, bloated …

Modeltracker: Redesigning performance analysis tools for machine learning

S Amershi, M Chickering, SM Drucker, B Lee… - Proceedings of the 33rd …, 2015 - dl.acm.org
Model building in machine learning is an iterative process. The performance analysis and
debugging step typically involves a disruptive cognitive switch from model building to error …

Putting humans in the natural language processing loop: A survey

ZJ Wang, D Choi, S Xu, D Yang - arXiv preprint arXiv:2103.04044, 2021 - arxiv.org
How can we design Natural Language Processing (NLP) systems that learn from human
feedback? There is a growing research body of Human-in-the-loop (HITL) NLP frameworks …

Scattershot: Interactive in-context example curation for text transformation

S Wu, H Shen, DS Weld, J Heer… - Proceedings of the 28th …, 2023 - dl.acm.org
The in-context learning capabilities of LLMs like GPT-3 allow annotators to customize an
LLM to their specific tasks with a small number of examples. However, users tend to include …

Making contextual decisions with low technical debt

A Agarwal, S Bird, M Cozowicz, L Hoang… - arXiv preprint arXiv …, 2016 - arxiv.org
Applications and systems are constantly faced with decisions that require picking from a set
of actions based on contextual information. Reinforcement-based learning algorithms such …

Local decision pitfalls in interactive machine learning: An investigation into feature selection in sentiment analysis

T Wu, DS Weld, J Heer - ACM Transactions on Computer-Human …, 2019 - dl.acm.org
Tools for Interactive Machine Learning (IML) enable end users to update models in a “rapid,
focused, and incremental”—yet local—manner. In this work, we study the question of local …

Machine guides, human supervises: Interactive learning with global explanations

T Popordanoska, M Kumar, S Teso - arXiv preprint arXiv:2009.09723, 2020 - arxiv.org
We introduce explanatory guided learning (XGL), a novel interactive learning strategy in
which a machine guides a human supervisor toward selecting informative examples for a …

Rapidly scaling dialog systems with interactive learning

JD Williams, NB Niraula, P Dasigi… - Natural language dialog …, 2015 - Springer
In personal assistant dialog systems, intent models are classifiers that identify the intent of a
user utterance, such as to add a meeting to a calendar or get the director of a stated movie …

Reasoning under uncertainty: Towards collaborative interactive machine learning

S Robert, S Büttner, C Röcker, A Holzinger - … : State-of-the-Art and Future …, 2016 - Springer
In this paper, we present the current state-of-the-art of decision making (DM) and machine
learning (ML) and bridge the two research domains to create an integrated approach of …