Survey of hallucination in natural language generation
Natural Language Generation (NLG) has improved exponentially in recent years thanks to
the development of sequence-to-sequence deep learning technologies such as Transformer …
the development of sequence-to-sequence deep learning technologies such as Transformer …
Generative adversarial networks in time series: A systematic literature review
Generative adversarial network (GAN) studies have grown exponentially in the past few
years. Their impact has been seen mainly in the computer vision field with realistic image …
years. Their impact has been seen mainly in the computer vision field with realistic image …
Generating diverse and natural 3d human motions from text
Automated generation of 3D human motions from text is a challenging problem. The
generated motions are expected to be sufficiently diverse to explore the text-grounded …
generated motions are expected to be sufficiently diverse to explore the text-grounded …
Coderl: Mastering code generation through pretrained models and deep reinforcement learning
Program synthesis or code generation aims to generate a program that satisfies a problem
specification. Recent approaches using large-scale pretrained language models (LMs) have …
specification. Recent approaches using large-scale pretrained language models (LMs) have …
Cascaded diffusion models for high fidelity image generation
We show that cascaded diffusion models are capable of generating high fidelity images on
the class-conditional ImageNet generation benchmark, without any assistance from auxiliary …
the class-conditional ImageNet generation benchmark, without any assistance from auxiliary …
Vectormapnet: End-to-end vectorized hd map learning
Autonomous driving systems require High-Definition (HD) semantic maps to navigate
around urban roads. Existing solutions approach the semantic mapping problem by offline …
around urban roads. Existing solutions approach the semantic mapping problem by offline …
A survey on neural speech synthesis
Text to speech (TTS), or speech synthesis, which aims to synthesize intelligible and natural
speech given text, is a hot research topic in speech, language, and machine learning …
speech given text, is a hot research topic in speech, language, and machine learning …
Autoregressive image generation using residual quantization
For autoregressive (AR) modeling of high-resolution images, vector quantization (VQ)
represents an image as a sequence of discrete codes. A short sequence length is important …
represents an image as a sequence of discrete codes. A short sequence length is important …
Graph neural networks for natural language processing: A survey
Deep learning has become the dominant approach in addressing various tasks in Natural
Language Processing (NLP). Although text inputs are typically represented as a sequence …
Language Processing (NLP). Although text inputs are typically represented as a sequence …
Ai choreographer: Music conditioned 3d dance generation with aist++
We present AIST++, a new multi-modal dataset of 3D dance motion and music, along with
FACT, a Full-Attention Cross-modal Transformer network for generating 3D dance motion …
FACT, a Full-Attention Cross-modal Transformer network for generating 3D dance motion …