Streaming dense video captioning- 学术资源搜索

Streaming dense video captioning

X Zhou, A Arnab, S Buch, S Yan… - Proceedings of the …, 2024 - openaccess.thecvf.com

… a streaming model for dense video captioning as shown in Fig. 1. Our streaming model does
not require access to all input frames concurrently in order to process the video thanks to a …

被引用次数：23 相关文章所有 3 个版本

[PDF] thecvf.com

Streamlined dense video captioning

J Mun, L Yang, Z Ren, N Xu… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com

… dense video captioning framework, which models temporal dependency across events in
a video … of event proposals to our sequential video captioning network, which is trained by …

被引用次数：175 相关文章所有 9 个版本

[PDF] thecvf.com

Multi-modal dense video captioning

V Iashin, E Rahtu - … of the IEEE/CVF conference on …, 2020 - openaccess.thecvf.com

… dense video captions for an example video sequence. Most recent works in dense video
captioning formulate the captioning … of features extracted from the video stream and the output is …

被引用次数：213 相关文章所有 9 个版本

[PDF] aaai.org

An efficient framework for dense video captioning

M Suin, AN Rajagopalan - Proceedings of the AAAI Conference on Artificial …, 2020 - aaai.org

… This is in part due to the huge size of raw video streams and the presence of redundant
information in the frames. Most of the existing frameworks, for every time step, need to pass the …

被引用次数：48 相关文章所有 4 个版本

[PDF] thecvf.com

End-to-end dense video captioning with parallel decoding

T Wang, R Zhang, Z Lu, F Zheng… - Proceedings of the …, 2021 - openaccess.thecvf.com

… In practice, we consider the dense video captioning task as a set prediction problem. The
proposed PDVC directly decodes the frame features, which are extracted from a Vision …

被引用次数：209 相关文章所有 6 个版本

[PDF] thecvf.com

Dense relational captioning: Triple-stream networks for relationship-based captioning

DJ Kim, J Choi, TH Oh… - Proceedings of the IEEE …, 2019 - openaccess.thecvf.com

… MTTSNet denotes our final model, multi-task triple-stream network with POS classifier. … for
dense captioning task [12], we suggest a new evaluation metric for relational dense captioning. …

被引用次数：111 相关文章所有 8 个版本

Environment-aware dense video captioning for IoT-enabled edge cameras

CH Lu, GY Fan - IEEE Internet of Things Journal, 2021 - ieeexplore.ieee.org

… dense-videocaptioning model based on the Transformer framework to improve execution
efficiency for video-caption … The source of ActivityNet’s videos is the streaming video platform …

被引用次数：13 相关文章

[PDF] thecvf.com

Dense-captioning events in videos

R Krishna, K Hata, F Ren, L Fei-Fei… - Proceedings of the …, 2017 - openaccess.thecvf.com

… In addition, we show a variant of our captioning module that can operate on streaming
videos by attending over only the past events. Our full model attends over both past as well as …

被引用次数：1495 相关文章所有 8 个版本

[PDF] ict.ac.cn

Attention-based densely connected LSTM for video captioning

Y Zhu, S Jiang - Proceedings of the 27th ACM international conference …, 2019 - dl.acm.org

… streams). To more effectively combine different modalities, they trained modalityspecific
LSTMs to capture the intrinsic representations of individual modalities. For video captioning, the …

被引用次数：41 相关文章所有 3 个版本

[PDF] thecvf.com

End-to-end dense video captioning with masked transformer

L Zhou, Y Zhou, JJ Corso… - Proceedings of the …, 2018 - openaccess.thecvf.com

… and the captioning modules … dense video captioning that is able to produce proposal and
description simultaneously. Also, our work directly incorporates the semantics from captions to …

被引用次数：691 相关文章所有 8 个版本