WildQA: In-the-wild video question answering
Existing video understanding datasets mostly focus on human interactions, with little
attention being paid to the" in the wild" settings, where the videos are recorded outdoors. We …
attention being paid to the" in the wild" settings, where the videos are recorded outdoors. We …
Voicify Your UI: Towards Android App Control with Voice Commands
Nowadays, voice assistants help users complete tasks on the smartphone with voice
commands, replacing traditional touchscreen interactions when such interactions are …
commands, replacing traditional touchscreen interactions when such interactions are …
Bridging the gap between synthetic and natural questions via sentence decomposition for semantic parsing
Semantic parsing maps natural language questions into logical forms, which can be
executed against a knowledge base for answers. In real-world applications, the performance …
executed against a knowledge base for answers. In real-world applications, the performance …
Compositional generalization for multi-label text classification: A data-augmentation approach
Despite significant advancements in multi-label text classification, the ability of existing
models to generalize to novel and seldom-encountered complex concepts, which are …
models to generalize to novel and seldom-encountered complex concepts, which are …
Total recall: a customized continual learning method for neural semantic parsers
This paper investigates continual learning for semantic parsing. In this setting, a neural
semantic parser learns tasks sequentially without accessing full training data from previous …
semantic parser learns tasks sequentially without accessing full training data from previous …
Paraphrasing techniques for maritime qa system
There has been an increasing interest in incorporating Artificial Intelligence (AI) into Defence
and military systems to complement and augment human intelligence and capabilities …
and military systems to complement and augment human intelligence and capabilities …
Towards Video Understanding through Language in Real-life Settings
S Castro - 2024 - deepblue.lib.umich.edu
Videos have become an integral part of our daily lives, with a rapidly growing number on
YouTube, Netflix, and TikTok serving as testimony to their widespread popularity. Behind the …
YouTube, Netflix, and TikTok serving as testimony to their widespread popularity. Behind the …
Modeling Meaning for Description and Interaction
E Stengel-Eskin - 2023 - jscholarship.library.jhu.edu
Abstract Language is a powerful tool for communication and coordination, allowing us to
share thoughts, ideas, and instructions with others. Accordingly, enabling people to …
share thoughts, ideas, and instructions with others. Accordingly, enabling people to …
Investigating Few-Shot Transfer Learning for Address Parsing: Fine-Tuning Multilingual Pre-Trained Language Models for Low-Resource Address Segmentation
H Heimisdóttir - 2022 - diva-portal.org
Address parsing is the process of splitting an address string into its different address
components, such as street name, street number, et cetera. Address parsing has been quite …
components, such as street name, street number, et cetera. Address parsing has been quite …