作者
Joonseok Lee, Sami Abu-El-Haija, Balakrishnan Varadarajan, Apostol Natsev
发表日期
2018/7/19
图书
Proceedings of the 24th ACM SIGKDD International conference on knowledge discovery & data mining
页码范围
481-490
简介
The goal of video understanding is to develop algorithms that enable machines understand videos at the level of human experts. Researchers have tackled various domains including video classification, search, personalized recommendation, and more. However, there is a research gap in combining these domains in one unified learning framework. Towards that, we propose a deep network that embeds videos using their audio-visual content, onto a metric space which preserves video-to-video relationships. Then, we use the trained embedding network to tackle various domains including video classification and recommendation, showing significant improvements over state-of-the-art baselines. The proposed approach is highly scalable to deploy on large-scale video sharing platforms like YouTube.
引用总数
2018201920202021202220232024214161512168
学术搜索中的文章
J Lee, S Abu-El-Haija, B Varadarajan, A Natsev - Proceedings of the 24th ACM SIGKDD International …, 2018