Survey on videos data augmentation for deep learning models

N Cauli, D Reforgiato Recupero - Future Internet, 2022 - mdpi.com
In most Computer Vision applications, Deep Learning models achieve state-of-the-art
performances. One drawback of Deep Learning is the large amount of data needed to train …

Signing at scale: Learning to co-articulate signs for large-scale photo-realistic sign language production

B Saunders, NC Camgoz… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Sign languages are visual languages, with vocabularies as rich as their spoken language
counterparts. However, current deep-learning based Sign Language Production (SLP) …

Everybody sign now: Translating spoken language to photo realistic sign language video

B Saunders, NC Camgoz, R Bowden - arXiv preprint arXiv:2011.09846, 2020 - arxiv.org
To be truly understandable and accepted by Deaf communities, an automatic Sign
Language Production (SLP) system must generate a photo-realistic signer. Prior …

Magicdance: Realistic human dance video generation with motions & facial expressions transfer

D Chang, Y Shi, Q Gao, J Fu, H Xu, G Song… - arXiv preprint arXiv …, 2023 - arxiv.org
In this work, we propose MagicDance, a diffusion-based model for 2D human motion and
facial expression transfer on challenging human dance videos. Specifically, we aim to …

Markov game video augmentation for action segmentation

N Aziere, S Todorovic - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
This paper addresses data augmentation for action segmentation. Our key novelty is that we
augment the original training videos in the deep feature space, not in the visual …

A feature-space multimodal data augmentation technique for text-video retrieval

A Falcon, G Serra, O Lanz - Proceedings of the 30th ACM International …, 2022 - dl.acm.org
Every hour, huge amounts of visual contents are posted on social media and user-
generated content platforms. To find relevant videos by means of a natural language query …

Fashionmirror: Co-attention feature-remapping virtual try-on with sequential template poses

CY Chen, L Lo, PJ Huang, HH Shuai… - Proceedings of the …, 2021 - openaccess.thecvf.com
Virtual try-on tasks have drawn increased attention. Prior arts focus on tackling this task via
warping clothes and fusing the information at the pixel level with the help of semantic …

A hybrid approach based on gan and cnn-lstm for aerial activity recognition

A Bousmina, M Selmi, MA Ben Rhaiem, IR Farah - Remote Sensing, 2023 - mdpi.com
Unmanned aerial vehicles (UAVs), known as drones, have played a significant role in recent
years in creating resilient smart cities. UAVs can be used for a wide range of applications …

[PDF][PDF] Deepfake and its enabling techniques: a review

R Brooks, Y Yuan, Y Liu, H Chen - APSIPA Transactions on …, 2022 - nowpublishers.com
Deepfake technology has been undoubtedly growing at a rapid pace since 2017.
Particularly since using GAN architecture was popularized, research in this area has grown …

Flow guided transformable bottleneck networks for motion retargeting

J Ren, M Chai, OJ Woodford… - Proceedings of the …, 2021 - openaccess.thecvf.com
Human motion retargeting aims to transfer the motion of one person in a driving video or set
of images to another person. Existing efforts leverage a long training video from each target …