Imagen Editor and EditBench: Advancing and evaluating text-guided image inpainting
Text-guided image editing can have a transformative impact in supporting creative
applications. A key challenge is to generate edits that are faithful to the input text prompt …
MuseChat: A conversational music recommendation system for videos
Music recommendation for videos attracts growing interest in multi-modal research.
However, existing systems focus primarily on content compatibility, often ignoring the users' …
Tuning-free inversion-enhanced control for consistent image editing
Consistent editing of real images is a challenging task, as it requires performing non-rigid
edits (e.g., changing postures) to the main objects in the input image without changing their …
CoralStyleCLIP: Co-optimized region and layer selection for image editing
Edit fidelity is a significant issue in open-world controllable generative image editing.
Recently, CLIP-based approaches have traded off simplicity to alleviate these problems by …
Negative Pre-aware for Noisy Cross-Modal Matching
Cross-modal noise-robust learning is a challenging task since noisy correspondence is hard
to recognize and rectify. Due to the cumulative and unavoidable negative impact of …
Towards language-guided interactive 3D generation: LLMs as layout interpreter with generative feedback
Generating and editing a 3D scene guided by natural language poses a challenge, primarily
due to the complexity of specifying the positional relations and volumetric changes within the …
Instilling Multi-round Thinking to Text-guided Image Generation
In this paper, we study the text-guided image generation task. Our focus lies in the
modification of a reference image, given user text feedback, to imbue it with specific desired …
ChatEdit: Towards multi-turn interactive facial image editing via dialogue
This paper explores interactive facial image editing via dialogue and introduces the ChatEdit
benchmark dataset for evaluating image editing and conversation abilities in this context …
Tell your story: task-oriented dialogs for interactive content creation
People capture photos and videos to relive and share memories of personal significance.
Recently, media montages (stories) have become a popular mode of sharing these …
PColorizor: Re-coloring Ancient Chinese Paintings with Ideorealm-congruent Poems
Color restoration of ancient Chinese paintings plays a significant role in Chinese culture
protection and inheritance. However, traditional color restoration is challenging and time …