GAC-GAN: A general method for appearance-controllable human video motion transfer

N Cauli, D Reforgiato Recupero - Future Internet, 2022 - mdpi.com

In most Computer Vision applications, Deep Learning models achieve state-of-the-art
performances. One drawback of Deep Learning is the large amount of data needed to train …

被引用次数：43 相关文章所有 6 个版本

[PDF] thecvf.com

Signing at scale: Learning to co-articulate signs for large-scale photo-realistic sign language production

B Saunders, NC Camgoz… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com

Sign languages are visual languages, with vocabularies as rich as their spoken language
counterparts. However, current deep-learning based Sign Language Production (SLP) …

被引用次数：54 相关文章所有 10 个版本

[PDF] arxiv.org

Everybody sign now: Translating spoken language to photo realistic sign language video

B Saunders, NC Camgoz, R Bowden - arXiv preprint arXiv:2011.09846, 2020 - arxiv.org

To be truly understandable and accepted by Deaf communities, an automatic Sign
Language Production (SLP) system must generate a photo-realistic signer. Prior …

被引用次数：60 相关文章所有 6 个版本

[PDF] arxiv.org

Magicdance: Realistic human dance video generation with motions & facial expressions transfer

D Chang, Y Shi, Q Gao, J Fu, H Xu, G Song… - arXiv preprint arXiv …, 2023 - arxiv.org

In this work, we propose MagicDance, a diffusion-based model for 2D human motion and
facial expression transfer on challenging human dance videos. Specifically, we aim to …

被引用次数：22 相关文章所有 2 个版本

[PDF] thecvf.com

Markov game video augmentation for action segmentation

N Aziere, S Todorovic - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com

This paper addresses data augmentation for action segmentation. Our key novelty is that we
augment the original training videos in the deep feature space, not in the visual …

被引用次数：2 相关文章所有 4 个版本

[PDF] arxiv.org

A feature-space multimodal data augmentation technique for text-video retrieval

A Falcon, G Serra, O Lanz - Proceedings of the 30th ACM International …, 2022 - dl.acm.org

Every hour, huge amounts of visual contents are posted on social media and user-
generated content platforms. To find relevant videos by means of a natural language query …

被引用次数：21 相关文章所有 6 个版本

[PDF] thecvf.com

Fashionmirror: Co-attention feature-remapping virtual try-on with sequential template poses

CY Chen, L Lo, PJ Huang, HH Shuai… - Proceedings of the …, 2021 - openaccess.thecvf.com

Virtual try-on tasks have drawn increased attention. Prior arts focus on tackling this task via
warping clothes and fusing the information at the pixel level with the help of semantic …

被引用次数：27 相关文章所有 5 个版本

[PDF] mdpi.com

A hybrid approach based on gan and cnn-lstm for aerial activity recognition

A Bousmina, M Selmi, MA Ben Rhaiem, IR Farah - Remote Sensing, 2023 - mdpi.com

Unmanned aerial vehicles (UAVs), known as drones, have played a significant role in recent
years in creating resilient smart cities. UAVs can be used for a wide range of applications …

被引用次数：11 相关文章所有 4 个版本

[PDF] nowpublishers.com

[PDF][PDF] Deepfake and its enabling techniques: a review

R Brooks, Y Yuan, Y Liu, H Chen - APSIPA Transactions on …, 2022 - nowpublishers.com

Deepfake technology has been undoubtedly growing at a rapid pace since 2017.
Particularly since using GAN architecture was popularized, research in this area has grown …

被引用次数：5 相关文章所有 3 个版本

[PDF] thecvf.com

Flow guided transformable bottleneck networks for motion retargeting

J Ren, M Chai, OJ Woodford… - Proceedings of the …, 2021 - openaccess.thecvf.com

Human motion retargeting aims to transfer the motion of one person in a driving video or set
of images to another person. Existing efforts leverage a long training video from each target …

被引用次数：27 相关文章所有 5 个版本