Dreamllm: Synergistic multimodal comprehension and creation
This paper presents DreamLLM, a learning framework that first achieves versatile
Multimodal Large Language Models (MLLMs) empowered with frequently overlooked …
Multimodal Large Language Models (MLLMs) empowered with frequently overlooked …
Mi-gan: A simple baseline for image inpainting on mobile devices
In recent years, many deep learning based image inpainting methods have been developed
by the research community. Some of those methods have shown impressive image …
by the research community. Some of those methods have shown impressive image …
Shapellm: Universal 3d object understanding for embodied interaction
This paper presents ShapeLLM, the first 3D Multimodal Large Language Model (LLM)
designed for embodied interaction, exploring a universal 3D object understanding with 3D …
designed for embodied interaction, exploring a universal 3D object understanding with 3D …
Compressing image-to-image translation gans using local density structures on their learned manifold
Generative Adversarial Networks (GANs) have shown remarkable success in modeling
complex data distributions for image-to-image translation. Still, their high computational …
complex data distributions for image-to-image translation. Still, their high computational …
Label-guided auxiliary training improves 3d object detector
Detecting 3D objects from point clouds is a practical yet challenging task that has attracted
increasing attention recently. In this paper, we propose a Label-Guided auxiliary training …
increasing attention recently. In this paper, we propose a Label-Guided auxiliary training …
Dreambench++: A human-aligned benchmark for personalized image generation
Personalized image generation holds great promise in assisting humans in everyday work
and life due to its impressive function in creatively generating personalized content …
and life due to its impressive function in creatively generating personalized content …
CoroNetGAN: Controlled Pruning of GANs via Hypernetworks
Abstract Generative Adversarial Networks (GANs) have proven to exhibit remarkable
performance and are widely used across many generative computer vision applications …
performance and are widely used across many generative computer vision applications …
Structured Knowledge Distillation Towards Efficient and Compact Multi-View 3D Detection
Detecting 3D objects from multi-view images is a fundamental problem in 3D computer
vision. Recently, significant breakthrough has been made in multi-view 3D detection tasks …
vision. Recently, significant breakthrough has been made in multi-view 3D detection tasks …
Masked Discriminators for Content-Consistent Unpaired Image-to-Image Translation
A common goal of unpaired image-to-image translation is to preserve content consistency
between source images and translated images while mimicking the style of the target …
between source images and translated images while mimicking the style of the target …
[PDF][PDF] Structured Knowledge Distillation Towards Efficient Multi-View 3D Object Detection.
Detecting 3D objects from multi-view images is a fundamental problem in 3D computer
vision. Recently, significant breakthrough has been made in multi-view 3D detection tasks …
vision. Recently, significant breakthrough has been made in multi-view 3D detection tasks …