[HTML][HTML] Multimodal data integration for oncology in the era of deep neural networks: a review

A Waqas, A Tripathi, RP Ramachandran… - Frontiers in Artificial …, 2024 - frontiersin.org
Cancer research encompasses data across various scales, modalities, and resolutions, from
screening and diagnostic imaging to digitized histopathology slides to various types of …

Ma-lmm: Memory-augmented large multimodal model for long-term video understanding

B He, H Li, YK Jang, M Jia, X Cao… - Proceedings of the …, 2024 - openaccess.thecvf.com
With the success of large language models (LLMs) integrating the vision model into LLMs to
build vision-language foundation models has gained much more interest recently. However …

[HTML][HTML] An outlook into the future of egocentric vision

C Plizzari, G Goletto, A Furnari, S Bansal… - International Journal of …, 2024 - Springer
What will the future be? We wonder! In this survey, we explore the gap between current
research in egocentric vision and the ever-anticipated future, where wearable computing …

Latency matters: Real-time action forecasting transformer

H Girase, N Agarwal, C Choi… - Proceedings of the …, 2023 - openaccess.thecvf.com
We present RAFTformer, a real-time action forecasting transformer for latency aware real-
world action forecasting applications. RAFTformer is a two-stage fully transformer based …

Uncertainty-aware Action Decoupling Transformer for Action Anticipation

H Guo, N Agarwal, SY Lo, K Lee… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Human action anticipation aims at predicting what people will do in the future based on past
observations. In this paper we introduce Uncertainty-aware Action Decoupling Transformer …

Gepsan: Generative procedure step anticipation in cooking videos

MA Abdelsalam, SB Rangrej, I Hadji… - Proceedings of the …, 2023 - openaccess.thecvf.com
We study the problem of future step anticipation in procedural videos. Given a video of an
ongoing procedural activity, we predict a plausible next procedure step described in rich …

Stillfast: An end-to-end approach for short-term object interaction anticipation

F Ragusa, GM Farinella… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Anticipation problem has been studied considering different aspects such as predicting
humans' locations, predicting hands and objects trajectories, and forecasting actions and …

Interaction region visual transformer for egocentric action anticipation

D Roy, R Rajendiran… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Human-object interaction (HOI) and temporal dynamics along the motion paths are the most
important visual cues for egocentric action anticipation. Especially, interaction regions …

Vlmah: Visual-linguistic modeling of action history for effective action anticipation

V Manousaki, K Bacharidis… - Proceedings of the …, 2023 - openaccess.thecvf.com
Although existing methods for action anticipation have shown considerably improved
performance on the predictability of future events in videos, the way they exploit information …

Leveraging next-active objects for context-aware anticipation in egocentric videos

S Thakur, C Beyan, P Morerio… - Proceedings of the …, 2024 - openaccess.thecvf.com
Abstract Objects are crucial for understanding human-object interactions. By identifying the
relevant objects, one can also predict potential future interactions or actions that may occur …