Video-of-thought: Step-by-step video reasoning from perception to cognition

H Fei, S Wu, W Ji, H Zhang, M Zhang… - Forty-first International …, 2024 - openreview.net
Existing research of video understanding still struggles to achieve in-depth comprehension
and reasoning in complex videos, primarily due to the under-exploration of two key …

Self-Adaptive Fine-grained Multi-modal Data Augmentation for Semi-supervised Muti-modal Coreference Resolution

L Zheng, B Chen, H Fei, F Li, S Wu, L Liao… - ACM Multimedia …, 2024 - openreview.net
Coreference resolution, an essential task in natural language processing, is particularly
challenging in multi-modal scenarios where data comes in various forms and modalities …

Faithful Logical Reasoning via Symbolic Chain-of-Thought

J Xu, H Fei, L Pan, Q Liu, ML Lee, W Hsu - arXiv preprint arXiv:2405.18357, 2024 - arxiv.org
While the recent Chain-of-Thought (CoT) technique enhances the reasoning ability of large
language models (LLMs) with the theory of mind, it might still struggle in handling logical …

Auto Graph of Thoughts: A Hands-free and Cost Effective Method for using Graph of Thoughts

TL Ha, TB Ho, L Nguyen, D Dinh - Proceedings of the 2024 10th …, 2024 - dl.acm.org
As powerful generative pre-trained language models like GPT become more prevalent, it is
imperative to explore methods for customizing these models to suit downstream datasets …