Who validates the validators? aligning llm-assisted evaluation of llm outputs with human preferences
S Shankar, JD Zamfirescu-Pereira… - Proceedings of the 37th …, 2024 - dl.acm.org
Due to the cumbersome nature of human evaluation and limitations of code-based
evaluation, Large Language Models (LLMs) are increasingly being used to assist humans in …
evaluation, Large Language Models (LLMs) are increasingly being used to assist humans in …
Will you accept an imperfect ai? exploring designs for adjusting end-user expectations of ai systems
AI technologies have been incorporated into many end-user applications. However,
expectations of the capabilities of such systems vary among people. Furthermore, bloated …
expectations of the capabilities of such systems vary among people. Furthermore, bloated …
Modeltracker: Redesigning performance analysis tools for machine learning
Model building in machine learning is an iterative process. The performance analysis and
debugging step typically involves a disruptive cognitive switch from model building to error …
debugging step typically involves a disruptive cognitive switch from model building to error …
Putting humans in the natural language processing loop: A survey
How can we design Natural Language Processing (NLP) systems that learn from human
feedback? There is a growing research body of Human-in-the-loop (HITL) NLP frameworks …
feedback? There is a growing research body of Human-in-the-loop (HITL) NLP frameworks …
Scattershot: Interactive in-context example curation for text transformation
The in-context learning capabilities of LLMs like GPT-3 allow annotators to customize an
LLM to their specific tasks with a small number of examples. However, users tend to include …
LLM to their specific tasks with a small number of examples. However, users tend to include …
Making contextual decisions with low technical debt
Applications and systems are constantly faced with decisions that require picking from a set
of actions based on contextual information. Reinforcement-based learning algorithms such …
of actions based on contextual information. Reinforcement-based learning algorithms such …
Local decision pitfalls in interactive machine learning: An investigation into feature selection in sentiment analysis
Tools for Interactive Machine Learning (IML) enable end users to update models in a “rapid,
focused, and incremental”—yet local—manner. In this work, we study the question of local …
focused, and incremental”—yet local—manner. In this work, we study the question of local …
Machine guides, human supervises: Interactive learning with global explanations
We introduce explanatory guided learning (XGL), a novel interactive learning strategy in
which a machine guides a human supervisor toward selecting informative examples for a …
which a machine guides a human supervisor toward selecting informative examples for a …
Rapidly scaling dialog systems with interactive learning
In personal assistant dialog systems, intent models are classifiers that identify the intent of a
user utterance, such as to add a meeting to a calendar or get the director of a stated movie …
user utterance, such as to add a meeting to a calendar or get the director of a stated movie …
Reasoning under uncertainty: Towards collaborative interactive machine learning
In this paper, we present the current state-of-the-art of decision making (DM) and machine
learning (ML) and bridge the two research domains to create an integrated approach of …
learning (ML) and bridge the two research domains to create an integrated approach of …