WildQA: In-the-wild video question answering

S Castro, N Deng, P Huang, M Burzo… - arXiv preprint arXiv …, 2022 - arxiv.org
Existing video understanding datasets mostly focus on human interactions, with little
attention being paid to the "in the wild" settings, where the videos are recorded outdoors. We …

Voicify Your UI: Towards Android App Control with Voice Commands

MD Vu, H Wang, Z Li, G Haffari, Z Xing… - Proceedings of the ACM …, 2023 - dl.acm.org
Nowadays, voice assistants help users complete tasks on the smartphone with voice
commands, replacing traditional touchscreen interactions when such interactions are …

Bridging the gap between synthetic and natural questions via sentence decomposition for semantic parsing

Y Niu, F Huang, W Liu, J Cui, B Wang… - Transactions of the …, 2023 - direct.mit.edu
Semantic parsing maps natural language questions into logical forms, which can be
executed against a knowledge base for answers. In real-world applications, the performance …

Compositional generalization for multi-label text classification: A data-augmentation approach

Y Chai, Z Li, J Liu, L Chen, F Li, D Ji… - Proceedings of the AAAI …, 2024 - ojs.aaai.org
Despite significant advancements in multi-label text classification, the ability of existing
models to generalize to novel and seldom-encountered complex concepts, which are …

Total recall: a customized continual learning method for neural semantic parsers

Z Li, L Qu, G Haffari - arXiv preprint arXiv:2109.05186, 2021 - arxiv.org
This paper investigates continual learning for semantic parsing. In this setting, a neural
semantic parser learns tasks sequentially without accessing full training data from previous …

Paraphrasing techniques for maritime QA system

F Shiri, TY Zhuo, Z Li, S Pan, W Wang… - 2022 25th …, 2022 - ieeexplore.ieee.org
There has been an increasing interest in incorporating Artificial Intelligence (AI) into Defence
and military systems to complement and augment human intelligence and capabilities …

Towards Video Understanding through Language in Real-life Settings

S Castro - 2024 - deepblue.lib.umich.edu
Videos have become an integral part of our daily lives, with a rapidly growing number on
YouTube, Netflix, and TikTok serving as testimony to their widespread popularity. Behind the …

Modeling Meaning for Description and Interaction

E Stengel-Eskin - 2023 - jscholarship.library.jhu.edu
Language is a powerful tool for communication and coordination, allowing us to
share thoughts, ideas, and instructions with others. Accordingly, enabling people to …

Investigating Few-Shot Transfer Learning for Address Parsing: Fine-Tuning Multilingual Pre-Trained Language Models for Low-Resource Address Segmentation

H Heimisdóttir - 2022 - diva-portal.org
Address parsing is the process of splitting an address string into its different address
components, such as street name, street number, et cetera. Address parsing has been quite …