[HTML][HTML] Video and audio deepfake datasets and open issues in deepfake technology: being ahead of the curve
Z Akhtar, TL Pendyala, VS Athmakuri - Forensic Sciences, 2024 - mdpi.com
The revolutionary breakthroughs in Machine Learning (ML) and Artificial Intelligence (AI) are
extensively being harnessed across a diverse range of domains, eg, forensic science …
extensively being harnessed across a diverse range of domains, eg, forensic science …
MVOC: a training-free multiple video object composition method with diffusion models
Video composition is the core task of video editing. Although image composition based on
diffusion models has been highly successful, it is not straightforward to extend the …
diffusion models has been highly successful, it is not straightforward to extend the …
T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation
Text-to-video (T2V) generation models have advanced significantly, yet their ability to
compose different objects, attributes, actions, and motions into a video remains unexplored …
compose different objects, attributes, actions, and motions into a video remains unexplored …
OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model
L Chen, Z Li, B Lin, B Zhu, Q Wang, S Yuan… - arXiv preprint arXiv …, 2024 - arxiv.org
Variational Autoencoder (VAE), compressing videos into latent representations, is a crucial
preceding component of Latent Video Diffusion Models (LVDMs). With the same …
preceding component of Latent Video Diffusion Models (LVDMs). With the same …
A Large-scale Universal Evaluation Benchmark For Face Forgery Detection
With the rapid development of AI-generated content (AIGC) technology, the production of
realistic fake facial images and videos that deceive human visual perception has become …
realistic fake facial images and videos that deceive human visual perception has become …
Towards Understanding Unsafe Video Generation
Video generation models (VGMs) have demonstrated the capability to synthesize high-
quality output. It is important to understand their potential to produce unsafe content, such as …
quality output. It is important to understand their potential to produce unsafe content, such as …
DreamCinema: Cinematic Transfer with Free Camera and 3D Character
W Chen, F Liu, D Wu, H Sun, H Song… - arXiv preprint arXiv …, 2024 - arxiv.org
We are living in a flourishing era of digital media, where everyone has the potential to
become a personal filmmaker. Current research on cinematic transfer empowers filmmakers …
become a personal filmmaker. Current research on cinematic transfer empowers filmmakers …
[PDF][PDF] Conditional Video Generation Guided by Multimodal Inputs: A Comprehensive Survey
The field of video generation is rapidly evolving, driven by advancements in generative
models. This survey provides a comprehensive analysis of the diverse methodologies …
models. This survey provides a comprehensive analysis of the diverse methodologies …