关注
Yifei Huang
Yifei Huang
The University of Tokyo
在 ut-vision.org 的电子邮件经过验证 - 首页
标题
引用次数
引用次数
年份
Ego4d: Around the world in 3,000 hours of egocentric video
K Grauman, A Westbury, E Byrne, Z Chavis, A Furnari, R Girdhar, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2022
6552022
Semantic aware attention based deep object co-segmentation
H Chen, Y Huang, H Nakayama
Asian Conference on Computer Vision, 435-450, 2018
1542018
Predicting Gaze in Egocentric Video by Learning Task-dependent Attention Transition
Y Huang, M Cai, Z Li, Y Sato
Oral presentation, European Conference on Computer Vision (ECCV), 789-804, 2018
1412018
Improving action segmentation via graph-based temporal reasoning
Y Huang, Y Sugano, Y Sato
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2020
1242020
Goal-oriented gaze estimation for zero-shot learning
Y Liu, L Zhou, X Bai, Y Huang, L Gu, J Zhou, T Harada
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2021
1222021
Mutual context network for jointly estimating egocentric gaze and action
Y Huang, M Cai, Z Li, F Lu, Y Sato
IEEE Transactions on Image Processing 29, 7795-7806, 2020
672020
Manipulation-skill assessment from videos with spatial attention network
Z Li, Y Huang, M Cai, Y Sato
Proceedings of the IEEE International Conference on Computer Vision Workshops, 2019
632019
Videollm: Modeling video sequence with large language models
G Chen, YD Zheng, J Wang, J Xu, Y Huang, J Pan, Y Wang, Y Wang, ...
arXiv preprint arXiv:2305.13292, 2023
532023
Deep convolutional neural network-aided detection of portal hypertension in patients with cirrhosis
Y Liu, Z Ning, N Örmeci, W An, Q Yu, K Han, Y Huang, D Liu, F Liu, Z Li, ...
Clinical Gastroenterology and Hepatology 18 (13), 2998-3007. e5, 2020
382020
Internvideo-ego4d: A pack of champion solutions to ego4d challenges
G Chen, S Xing, Z Chen, Y Wang, K Li, Y Li, Y Liu, J Wang, YD Zheng, ...
arXiv preprint arXiv:2211.09529, 2022
362022
Commonsense knowledge aware concept selection for diverse and informative visual storytelling
H Chen, Y Huang, H Takamura, H Nakayama
Proceedings of the AAAI Conference on Artificial Intelligence 35 (2), 999-1008, 2021
362021
Ego-exo4d: Understanding skilled human activity from first-and third-person perspectives
K Grauman, A Westbury, L Torresani, K Kitani, J Malik, T Afouras, ...
Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern …, 2024
342024
Interact before align: Leveraging cross-modal knowledge for domain adaptive action recognition
L Yang, Y Huang, Y Sugano, Y Sato
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2022
342022
Handling missing sensors in topology-aware iot applications with gated graph neural network
S Liu, S Yao, Y Huang, D Liu, H Shao, Y Zhao, J Li, T Wang, R Wang, ...
Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous …, 2020
322020
Towards visually explaining video understanding networks with perturbation
Z Li, W Wang, Z Li, Y Huang, Y Sato
Proceedings of the IEEE/CVF Winter Conference on Applications of Computer …, 2021
302021
Precise multi-modal in-hand pose estimation using low-precision sensors for robotic assembly
F von Drigalski, K Hayashi, Y Huang, R Yonetani, M Hamaya, K Tanaka, ...
2021 IEEE International Conference on Robotics and Automation (ICRA), 968-974, 2021
282021
Compound Prototype Matching for Few-Shot Action Recognition
Y Huang, L Yang, Y Sato
European Conference on Computer Vision, 351-368, 2022
272022
Video mamba suite: State space model as a versatile alternative for video understanding
G Chen, Y Huang, J Xu, B Pei, Z Chen, Z Li, J Wang, K Li, T Lu, L Wang
arXiv preprint arXiv:2403.09626, 2024
212024
Internvideo2: Scaling video foundation models for multimodal video understanding
Y Wang, K Li, X Li, J Yu, Y He, G Chen, B Pei, R Zheng, J Xu, Z Wang, ...
arXiv preprint arXiv:2403.15377, 2024
162024
Weakly supervised temporal sentence grounding with uncertainty-guided self-training
Y Huang, L Yang, Y Sato
Proceedings of the IEEE/CVF conference on computer vision and pattern …, 2023
162023
系统目前无法执行此操作,请稍后再试。
文章 1–20