Survey on videos data augmentation for deep learning models
N Cauli, D Reforgiato Recupero - Future Internet, 2022 - mdpi.com
In most Computer Vision applications, Deep Learning models achieve state-of-the-art
performances. One drawback of Deep Learning is the large amount of data needed to train …
performances. One drawback of Deep Learning is the large amount of data needed to train …
Signing at scale: Learning to co-articulate signs for large-scale photo-realistic sign language production
B Saunders, NC Camgoz… - Proceedings of the IEEE …, 2022 - openaccess.thecvf.com
Sign languages are visual languages, with vocabularies as rich as their spoken language
counterparts. However, current deep-learning based Sign Language Production (SLP) …
counterparts. However, current deep-learning based Sign Language Production (SLP) …
Everybody sign now: Translating spoken language to photo realistic sign language video
To be truly understandable and accepted by Deaf communities, an automatic Sign
Language Production (SLP) system must generate a photo-realistic signer. Prior …
Language Production (SLP) system must generate a photo-realistic signer. Prior …
Magicdance: Realistic human dance video generation with motions & facial expressions transfer
In this work, we propose MagicDance, a diffusion-based model for 2D human motion and
facial expression transfer on challenging human dance videos. Specifically, we aim to …
facial expression transfer on challenging human dance videos. Specifically, we aim to …
Markov game video augmentation for action segmentation
N Aziere, S Todorovic - Proceedings of the IEEE/CVF …, 2023 - openaccess.thecvf.com
This paper addresses data augmentation for action segmentation. Our key novelty is that we
augment the original training videos in the deep feature space, not in the visual …
augment the original training videos in the deep feature space, not in the visual …
A feature-space multimodal data augmentation technique for text-video retrieval
Every hour, huge amounts of visual contents are posted on social media and user-
generated content platforms. To find relevant videos by means of a natural language query …
generated content platforms. To find relevant videos by means of a natural language query …
Fashionmirror: Co-attention feature-remapping virtual try-on with sequential template poses
Virtual try-on tasks have drawn increased attention. Prior arts focus on tackling this task via
warping clothes and fusing the information at the pixel level with the help of semantic …
warping clothes and fusing the information at the pixel level with the help of semantic …
A hybrid approach based on gan and cnn-lstm for aerial activity recognition
A Bousmina, M Selmi, MA Ben Rhaiem, IR Farah - Remote Sensing, 2023 - mdpi.com
Unmanned aerial vehicles (UAVs), known as drones, have played a significant role in recent
years in creating resilient smart cities. UAVs can be used for a wide range of applications …
years in creating resilient smart cities. UAVs can be used for a wide range of applications …
[PDF][PDF] Deepfake and its enabling techniques: a review
R Brooks, Y Yuan, Y Liu, H Chen - APSIPA Transactions on …, 2022 - nowpublishers.com
Deepfake technology has been undoubtedly growing at a rapid pace since 2017.
Particularly since using GAN architecture was popularized, research in this area has grown …
Particularly since using GAN architecture was popularized, research in this area has grown …
Flow guided transformable bottleneck networks for motion retargeting
Human motion retargeting aims to transfer the motion of one person in a driving video or set
of images to another person. Existing efforts leverage a long training video from each target …
of images to another person. Existing efforts leverage a long training video from each target …