Video-of-thought: Step-by-step video reasoning from perception to cognition
Existing research of video understanding still struggles to achieve in-depth comprehension
and reasoning in complex videos, primarily due to the under-exploration of two key …
Self-Adaptive Fine-grained Multi-modal Data Augmentation for Semi-supervised Muti-modal Coreference Resolution
Coreference resolution, an essential task in natural language processing, is particularly
challenging in multi-modal scenarios where data comes in various forms and modalities …
Faithful Logical Reasoning via Symbolic Chain-of-Thought
While the recent Chain-of-Thought (CoT) technique enhances the reasoning ability of large
language models (LLMs) with the theory of mind, it might still struggle in handling logical …
Auto Graph of Thoughts: A Hands-free and Cost Effective Method for using Graph of Thoughts
As powerful generative pre-trained language models like GPT become more prevalent, it is
imperative to explore methods for customizing these models to suit downstream datasets …