Learning Pyramid-Context Encoder Network for High-Quality Image Inpainting Paper • 1904.07475 • Published Apr 16, 2019
Learning Joint Spatial-Temporal Transformations for Video Inpainting Paper • 2007.10247 • Published Jul 20, 2020
A Task is Worth One Word: Learning with Task Prompts for High-Quality Versatile Image Inpainting Paper • 2312.03594 • Published Dec 6, 2023 • 2
Aggregated Contextual Transformations for High-Resolution Image Inpainting Paper • 2104.01431 • Published Apr 3, 2021
Make-It-Vivid: Dressing Your Animatable Biped Cartoon Characters from Text Paper • 2403.16897 • Published Mar 25, 2024
WORLDMEM: Long-term Consistent World Simulation with Memory Paper • 2504.12369 • Published Apr 16 • 35
Advancing High-Resolution Video-Language Representation with Large-Scale Video Transcriptions Paper • 2111.10337 • Published Nov 19, 2021
AniCrafter: Customizing Realistic Human-Centric Animation via Avatar-Background Conditioning in Video Diffusion Models Paper • 2505.20255 • Published May 26 • 1
CharacterShot: Controllable and Consistent 4D Character Animation Paper • 2508.07409 • Published Aug 10 • 39
Scaling Instruction-Based Video Editing with a High-Quality Synthetic Dataset Paper • 2510.15742 • Published Oct 17 • 50
HoloCine: Holistic Generation of Cinematic Multi-Shot Long Video Narratives Paper • 2510.20822 • Published Oct 23 • 40
MagicQuillV2: Precise and Interactive Image Editing with Layered Visual Cues Paper • 2512.03046 • Published 7 days ago • 11
LEGO-Puzzles: How Good Are MLLMs at Multi-Step Spatial Reasoning? Paper • 2503.19990 • Published Mar 25 • 35
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation Paper • 2412.07589 • Published Dec 10, 2024 • 48
HumanVid: Demystifying Training Data for Camera-controllable Human Image Animation Paper • 2407.17438 • Published Jul 24, 2024 • 26
Live2Diff: Live Stream Translation via Uni-directional Attention in Video Diffusion Models Paper • 2407.08701 • Published Jul 11, 2024 • 13
Auto Cherry-Picker: Learning from High-quality Generative Data Driven by Language Paper • 2406.20085 • Published Jun 28, 2024 • 13
FoleyCrafter: Bring Silent Videos to Life with Lifelike and Synchronized Sounds Paper • 2407.01494 • Published Jul 1, 2024 • 15
MotionBooth: Motion-Aware Customized Text-to-Video Generation Paper • 2406.17758 • Published Jun 25, 2024 • 19