[HTML][HTML] Multimodal data integration for oncology in the era of deep neural networks: a review
Cancer research encompasses data across various scales, modalities, and resolutions, from
screening and diagnostic imaging to digitized histopathology slides to various types of …
screening and diagnostic imaging to digitized histopathology slides to various types of …
Ma-lmm: Memory-augmented large multimodal model for long-term video understanding
With the success of large language models (LLMs) integrating the vision model into LLMs to
build vision-language foundation models has gained much more interest recently. However …
build vision-language foundation models has gained much more interest recently. However …
[HTML][HTML] An outlook into the future of egocentric vision
What will the future be? We wonder! In this survey, we explore the gap between current
research in egocentric vision and the ever-anticipated future, where wearable computing …
research in egocentric vision and the ever-anticipated future, where wearable computing …
Latency matters: Real-time action forecasting transformer
We present RAFTformer, a real-time action forecasting transformer for latency aware real-
world action forecasting applications. RAFTformer is a two-stage fully transformer based …
world action forecasting applications. RAFTformer is a two-stage fully transformer based …
Uncertainty-aware Action Decoupling Transformer for Action Anticipation
Human action anticipation aims at predicting what people will do in the future based on past
observations. In this paper we introduce Uncertainty-aware Action Decoupling Transformer …
observations. In this paper we introduce Uncertainty-aware Action Decoupling Transformer …
Gepsan: Generative procedure step anticipation in cooking videos
We study the problem of future step anticipation in procedural videos. Given a video of an
ongoing procedural activity, we predict a plausible next procedure step described in rich …
ongoing procedural activity, we predict a plausible next procedure step described in rich …
Stillfast: An end-to-end approach for short-term object interaction anticipation
F Ragusa, GM Farinella… - Proceedings of the IEEE …, 2023 - openaccess.thecvf.com
Anticipation problem has been studied considering different aspects such as predicting
humans' locations, predicting hands and objects trajectories, and forecasting actions and …
humans' locations, predicting hands and objects trajectories, and forecasting actions and …
Interaction region visual transformer for egocentric action anticipation
D Roy, R Rajendiran… - Proceedings of the IEEE …, 2024 - openaccess.thecvf.com
Human-object interaction (HOI) and temporal dynamics along the motion paths are the most
important visual cues for egocentric action anticipation. Especially, interaction regions …
important visual cues for egocentric action anticipation. Especially, interaction regions …
Vlmah: Visual-linguistic modeling of action history for effective action anticipation
V Manousaki, K Bacharidis… - Proceedings of the …, 2023 - openaccess.thecvf.com
Although existing methods for action anticipation have shown considerably improved
performance on the predictability of future events in videos, the way they exploit information …
performance on the predictability of future events in videos, the way they exploit information …
Leveraging next-active objects for context-aware anticipation in egocentric videos
Abstract Objects are crucial for understanding human-object interactions. By identifying the
relevant objects, one can also predict potential future interactions or actions that may occur …
relevant objects, one can also predict potential future interactions or actions that may occur …