Anticipative feature fusion transformer for multi-modal action anticipation

A Waqas, A Tripathi, RP Ramachandran… - Frontiers in Artificial …, 2024 - frontiersin.org

Cancer research encompasses data across various scales, modalities, and resolutions, from
screening and diagnostic imaging to digitized histopathology slides to various types of …

被引用次数：10 相关文章所有 2 个版本

[PDF] thecvf.com

Ma-lmm: Memory-augmented large multimodal model for long-term video understanding

B He, H Li, YK Jang, M Jia, X Cao… - Proceedings of the …, 2024 - openaccess.thecvf.com

With the success of large language models (LLMs) integrating the vision model into LLMs to
build vision-language foundation models has gained much more interest recently. However …

被引用次数：10 相关文章所有 3 个版本

[HTML] springer.com

[HTML][HTML] An outlook into the future of egocentric vision

C Plizzari, G Goletto, A Furnari, S Bansal… - International Journal of …, 2024 - Springer

What will the future be? We wonder! In this survey, we explore the gap between current
research in egocentric vision and the ever-anticipated future, where wearable computing …

被引用次数：15 相关文章所有 7 个版本

[PDF] thecvf.com

Latency matters: Real-time action forecasting transformer

H Girase, N Agarwal, C Choi… - Proceedings of the …, 2023 - openaccess.thecvf.com

We present RAFTformer, a real-time action forecasting transformer for latency aware real-
world action forecasting applications. RAFTformer is a two-stage fully transformer based …

被引用次数：12 相关文章所有 3 个版本

[PDF] thecvf.com

Uncertainty-aware Action Decoupling Transformer for Action Anticipation

H Guo, N Agarwal, SY Lo, K Lee… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Human action anticipation aims at predicting what people will do in the future based on past
observations. In this paper we introduce Uncertainty-aware Action Decoupling Transformer …

被引用次数：1 相关文章

[PDF] thecvf.com

Gepsan: Generative procedure step anticipation in cooking videos

MA Abdelsalam, SB Rangrej, I Hadji… - Proceedings of the …, 2023 - openaccess.thecvf.com

We study the problem of future step anticipation in procedural videos. Given a video of an
ongoing procedural activity, we predict a plausible next procedure step described in rich …

被引用次数：4 相关文章所有 4 个版本

[PDF] thecvf.com

Stillfast: An end-to-end approach for short-term object interaction anticipation

F Ragusa, GM Farinella… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com

Anticipation problem has been studied considering different aspects such as predicting
humans' locations, predicting hands and objects trajectories, and forecasting actions and …

被引用次数：14 相关文章所有 7 个版本

[PDF] thecvf.com

Interaction region visual transformer for egocentric action anticipation

D Roy, R Rajendiran… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com

Human-object interaction (HOI) and temporal dynamics along the motion paths are the most
important visual cues for egocentric action anticipation. Especially, interaction regions …

被引用次数：7 相关文章所有 3 个版本

[PDF] thecvf.com

Vlmah: Visual-linguistic modeling of action history for effective action anticipation

V Manousaki, K Bacharidis… - Proceedings of the …, 2023 - openaccess.thecvf.com

Although existing methods for action anticipation have shown considerably improved
performance on the predictability of future events in videos, the way they exploit information …

被引用次数：3 相关文章所有 5 个版本

[PDF] thecvf.com

Leveraging next-active objects for context-aware anticipation in egocentric videos

S Thakur, C Beyan, P Morerio… - Proceedings of the …, 2024 - openaccess.thecvf.com

Abstract Objects are crucial for understanding human-object interactions. By identifying the
relevant objects, one can also predict potential future interactions or actions that may occur …

被引用次数：6 相关文章所有 8 个版本