Augmented language models: a survey
This survey reviews works in which language models (LMs) are augmented with reasoning
skills and the ability to use tools. The former is defined as decomposing a potentially …
WebArena: A realistic web environment for building autonomous agents
With advances in generative AI, there is now potential for autonomous agents to manage
daily tasks via natural language commands. However, current agents are primarily created …
AgentBench: Evaluating LLMs as agents
Large Language Models (LLMs) are becoming increasingly smart and autonomous,
targeting real-world pragmatic missions beyond traditional NLP tasks. As a result, there has …
WebShop: Towards scalable real-world web interaction with grounded language agents
Most existing benchmarks for grounding language in interactive environments either lack
realistic linguistic elements, or prove difficult to scale up due to substantial human …
A data-driven approach for learning to control computers
It would be useful for machines to use computers as humans do so that they can aid us in
everyday tasks. This is a setting in which there is also the potential to leverage large-scale …
Understanding HTML with large language models
Large language models (LLMs) have shown exceptional performance on a variety of natural
language tasks. Yet, their capabilities for HTML understanding, i.e., parsing the raw HTML of …
Personal LLM agents: Insights and survey about the capability, efficiency and security
Since the advent of personal computing devices, intelligent personal assistants (IPAs) have
been one of the key technologies that researchers and engineers have focused on, aiming …
AssistGUI: Task-Oriented PC Graphical User Interface Automation
Graphical User Interface (GUI) automation holds significant promise for assisting
users with complex tasks, thereby boosting human productivity. Existing works leveraging …
Never-ending learning of user interfaces
Machine learning models have been trained to predict semantic information about user
interfaces (UIs) to make apps more accessible, easier to test, and to automate. Currently …
Synapse: Trajectory-as-exemplar prompting with memory for computer control
Building agents with large language models (LLMs) for computer control is a burgeoning
research area, where the agent receives computer states and performs actions to complete …