Video and audio deepfake datasets and open issues in deepfake technology: being ahead of the curve

Z Akhtar, TL Pendyala, VS Athmakuri - Forensic Sciences, 2024 - mdpi.com
The revolutionary breakthroughs in Machine Learning (ML) and Artificial Intelligence (AI) are
extensively being harnessed across a diverse range of domains, e.g., forensic science …

MVOC: a training-free multiple video object composition method with diffusion models

W Wang, Y Chen, Y Liu, Q Yuan, S Yang… - arXiv preprint arXiv …, 2024 - arxiv.org
Video composition is the core task of video editing. Although image composition based on
diffusion models has been highly successful, it is not straightforward to extend the …

T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation

K Sun, K Huang, X Liu, Y Wu, Z Xu, Z Li… - arXiv preprint arXiv …, 2024 - arxiv.org
Text-to-video (T2V) generation models have advanced significantly, yet their ability to
compose different objects, attributes, actions, and motions into a video remains unexplored …

OD-VAE: An Omni-dimensional Video Compressor for Improving Latent Video Diffusion Model

L Chen, Z Li, B Lin, B Zhu, Q Wang, S Yuan… - arXiv preprint arXiv …, 2024 - arxiv.org
Variational Autoencoder (VAE), compressing videos into latent representations, is a crucial
preceding component of Latent Video Diffusion Models (LVDMs). With the same …

A Large-scale Universal Evaluation Benchmark For Face Forgery Detection

Y Bei, H Lou, J Geng, E Liu, L Cheng, J Song… - arXiv preprint arXiv …, 2024 - arxiv.org
With the rapid development of AI-generated content (AIGC) technology, the production of
realistic fake facial images and videos that deceive human visual perception has become …

Towards Understanding Unsafe Video Generation

Y Pang, A Xiong, Y Zhang, T Wang - arXiv preprint arXiv:2407.12581, 2024 - arxiv.org
Video generation models (VGMs) have demonstrated the capability to synthesize high-quality
output. It is important to understand their potential to produce unsafe content, such as …

DreamCinema: Cinematic Transfer with Free Camera and 3D Character

W Chen, F Liu, D Wu, H Sun, H Song… - arXiv preprint arXiv …, 2024 - arxiv.org
We are living in a flourishing era of digital media, where everyone has the potential to
become a personal filmmaker. Current research on cinematic transfer empowers filmmakers …

Conditional Video Generation Guided by Multimodal Inputs: A Comprehensive Survey

K Niu, W Liu, N Sharif, D Zhu - 2024 - researchgate.net
The field of video generation is rapidly evolving, driven by advancements in generative
models. This survey provides a comprehensive analysis of the diverse methodologies …