AssistGUI: Task-Oriented PC Graphical User Interface Automation

D Gao, L Ji, Z Bai, M Ouyang, P Li… - Proceedings of the …, 2024 - openaccess.thecvf.com
Graphical User Interface (GUI) automation holds significant promise for assisting
users with complex tasks, thereby boosting human productivity. Existing works leveraging …

LAVE: LLM-Powered Agent Assistance and Language Augmentation for Video Editing

B Wang, Y Li, Z Lv, H Xia, Y Xu, R Sodhi - Proceedings of the 29th …, 2024 - dl.acm.org
Video creation has become increasingly popular, yet the expertise and effort required for
editing often pose barriers to beginners. In this paper, we explore the integration of large …

AI Assistance for UX: A Literature Review Through Human-Centered AI

Y Lu, Y Yang, Q Zhao, C Zhang, TJJ Li - arXiv preprint arXiv:2402.06089, 2024 - arxiv.org
Recent advancements in HCI and AI research attempt to support user experience (UX)
practitioners with AI-enabled tools. Despite the potential of emerging models and new …

AssistGUI: Task-Oriented Desktop Graphical User Interface Automation

D Gao, L Ji, Z Bai, M Ouyang, P Li, D Mao, Q Wu… - arXiv preprint arXiv …, 2023 - arxiv.org
Graphical User Interface (GUI) automation holds significant promise for assisting users with
complex tasks, thereby boosting human productivity. Existing works leveraging Large …

AndroidWorld: A Dynamic Benchmarking Environment for Autonomous Agents

C Rawles, S Clinckemaillie, Y Chang, J Waltz… - arXiv preprint arXiv …, 2024 - arxiv.org
Autonomous agents that execute human tasks by controlling computers can enhance
human productivity and application accessibility. Yet, progress in this field will be driven by …

AutoTask: Executing Arbitrary Voice Commands by Exploring and Learning from Mobile GUI

L Pan, B Wang, C Yu, Y Chen, X Zhang… - arXiv preprint arXiv …, 2023 - arxiv.org
Voice command interfaces (VCIs) have gained increasing importance, enabling hands-free
and eyes-free interaction with digital devices. However, the inherent complexity in …

E-ANT: A Large-Scale Dataset for Efficient Automatic GUI NavigaTion

K Wang, T Xia, Z Gu, Y Zhao, S Shen, C Meng… - arXiv preprint arXiv …, 2024 - arxiv.org
Online GUI navigation on mobile devices has drawn a lot of attention in recent years since it
contributes to many real-world applications. With the rapid development of large language …

Devil's Advocate: Anticipatory Reflection for LLM Agents

H Wang, T Li, Z Deng, D Roth, Y Li - arXiv preprint arXiv:2405.16334, 2024 - arxiv.org
In this work, we introduce a novel approach that equips LLM agents with introspection,
enhancing consistency and adaptability in solving complex tasks. Our approach prompts …