Shyamal Buch
Verified email at stanford.edu
Title
Cited by
Year
On the opportunities and risks of foundation models
R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ...
arXiv preprint arXiv:2108.07258, 2021
Cited by: 3236; Year: 2021
SST: Single-stream temporal action proposals
S Buch, V Escorcia, C Shen, B Ghanem, J Carlos Niebles
Proceedings of the IEEE conference on Computer Vision and Pattern …, 2017
Cited by: 502; Year: 2017
End-to-end, single-stream temporal action detection in untrimmed videos
S Buch, V Escorcia, B Ghanem, L Fei-Fei, JC Niebles
Proceedings of the British Machine Vision Conference (BMVC), 2017
Cited by: 267; Year: 2017
iGibson, a Simulation Environment for Interactive Tasks in Large Realistic Scenes
B Shen*, F Xia*, C Li*, R Martín-Martín*, L Fan, G Wang, S Buch, ...
arXiv preprint arXiv:2012.02924, 2020
Cited by: 137; Year: 2020
Revisiting the "Video" in Video-Language Understanding
S Buch, C Eyzaguirre, A Gaidon, J Wu, L Fei-Fei, JC Niebles
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
Cited by: 120; Year: 2022
Behavior: Benchmark for everyday household activities in virtual, interactive, and ecological environments
S Srivastava, C Li, M Lingelbach, R Martín-Martín, F Xia, KE Vainio, Z Lian, ...
Conference on robot learning, 477-490, 2022
Cited by: 117; Year: 2022
Finding "It": Weakly-Supervised Reference-Aware Visual Grounding in Instructional Videos
DA Huang*, S Buch*, L Dery, A Garg, L Fei-Fei, J Carlos Niebles
Proceedings of the IEEE Conference on Computer Vision and Pattern …, 2018
Cited by: 101; Year: 2018
On the opportunities and risks of foundation models. arXiv
R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ...
arXiv preprint arXiv:2108.07258, 2021
Cited by: 77; Year: 2021
RubiksNet: Learnable 3D-Shift for Efficient Video Action Recognition
L Fan*, S Buch*, G Wang, R Cao, Y Zhu, JC Niebles, L Fei-Fei
Proceedings of the European Conference on Computer Vision (ECCV), 2020
Cited by: 74; Year: 2020
On the opportunities and risks of foundation models. arXiv 2021
R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ...
arXiv preprint arXiv:2108.07258, 2023
Cited by: 67; Year: 2023
The activitynet large-scale activity recognition challenge 2018 summary
B Ghanem, JC Niebles, C Snoek, FC Heilbron, H Alwassel, V Escorcia, ...
arXiv preprint arXiv:1808.03766, 2018
Cited by: 66; Year: 2018
Activitynet challenge 2017 summary
B Ghanem, JC Niebles, C Snoek, FC Heilbron, H Alwassel, R Khrisna, ...
arXiv preprint arXiv:1710.08011, 2017
Cited by: 59; Year: 2017
On the opportunities and risks of foundation models (arXiv: 2108.07258). arXiv
R Bommasani, DA Hudson, E Adeli, R Altman, S Arora, S von Arx, ...
Cited by: 55; Year: 2022
End-to-end joint semantic segmentation of actors and actions in video
J Ji, S Buch, A Soto, JC Niebles
Proceedings of the European Conference on Computer Vision (ECCV), 702-717, 2018
Cited by: 50; Year: 2018
System and method for leveraging end-to-end driving models for improving driving task modules
SD Buch, AD Gaidon
US Patent 10,866,588, 2020
Cited by: 21; Year: 2020
Neural event semantics for grounded language understanding
S Buch, L Fei-Fei, ND Goodman
Transactions of the Association for Computational Linguistics 9, 875-890, 2021
Cited by: 8; Year: 2021
Streaming dense video captioning
X Zhou, A Arnab, S Buch, S Yan, A Myers, X Xiong, A Nagrani, C Schmid
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 4; Year: 2024
MoReVQA: Exploring Modular Reasoning Models for Video Question Answering
J Min, S Buch, A Nagrani, M Cho, C Schmid
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
Cited by: 1; Year: 2024
Efficient Event Understanding in Videos and Language
SD Buch
Stanford University, 2022
Year: 2022
Language identification and accent variation detection in spoken language recordings
S Buch, J Gauthier, A Tsang