过去一年中添加的文章,按日期排序

Context-Aware Relational Reasoning for Video Chunks and Frames Overlapping in Language-Based Moment Localization

HS Nawaz, D Shi, M Nawaz - Available at SSRN 4814690 - papers.ssrn.com
31 天前 - Temporal moment localization in videos via natural language, using a query that
speci… The issue of accessing the video-text pairs instead of the videos temporal scope was …

TRIP: Temporal Residual Learning with Image Noise Prior for Image-to-Video Diffusion Models

Z Zhang, F Long, Y Pan, Z Qiu, T Yao, Y Cao… - arXiv preprint arXiv …, 2024 - arxiv.org
68 天前 - … based on both static image and noised video latent codes. Next, TRIP executes a
… over noised video and static image latent codes to enable inter-frame relational reasoning, …

Appearance-Motion Dual-Stream Heterogeneous Network for VideoQA

F Xu, Z Zhong, Y Zhu, Y Zhou, G Li - International Conference on …, 2024 - Springer
126 天前 - … Then they are fed into the object-relational reasoning … and put forward a Video-Text
Symmetric Attention Network (… spatio-temporal information, we replace the video-text …

Object-based Appearance-Motion Heterogeneous Network for Video Question Answering

F Xu, Z Zhong, Y Zhu, G Li, Y Zhou… - 2023 IEEE 29th …, 2023 - ieeexplore.ieee.org
168 天前 - … Then they are fed into the object-relational reasoning … and put forward a Video-Text
Symmetric Attention Network (… spatio-temporal information, we replace the video-text …

Hierarchical Synergy-Enhanced Multimodal Relational Network for Video Question Answering

M Peng, X Shao, Y Shi, X Zhou - ACM Transactions on Multimedia …, 2023 - dl.acm.org
220 天前 - … capability of spatio-temporal relational reasoning at a speciic scale, and to answer
question (c), the model needs to capture spatial details in temporal relations. Our proposed …

[PDF][PDF] Combing Perception with Structured Knowledge for Rich Causal Reasoning in a Computational Cognitive Architecture

P Bello - 2023 - apps.dtic.mil
240 天前 - … that consist in visual/video stimuli in which we can explore the … will construct a variety
of video examples in which blame … video data to populate episodic memory with temporally

[图书][B] Image Analysis and Processing–ICIAP 2023: 22nd International Conference, ICIAP 2023, Udine, Italy, September 11–15, 2023, Proceedings, Part I

GL Foresti, A Fusiello, E Hancock - 2023 - books.google.com
272 天前 - … The conference focuses on video analysis and understanding; pattern recognition
and machine learning; deep learning; multi-view geometry and 3D computer vision; image …