ConsistEdit: Highly Consistent and Precise Training-free Visual Editing Paper • 2510.17803 • Published Oct 20 • 13
LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence Paper • 2509.12203 • Published Sep 15 • 19
LazyDrag: Enabling Stable Drag-Based Editing on Multi-Modal Diffusion Transformers via Explicit Correspondence Paper • 2509.12203 • Published Sep 15 • 19
Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis Paper • 2211.14506 • Published Nov 26, 2022 • 1
Progressive Disentangled Representation Learning for Fine-Grained Controllable Talking Head Synthesis Paper • 2211.14506 • Published Nov 26, 2022 • 1
UniVerse-1: Unified Audio-Video Generation via Stitching of Experts Paper • 2509.06155 • Published Sep 7 • 13
UniVerse-1: Unified Audio-Video Generation via Stitching of Experts Paper • 2509.06155 • Published Sep 7 • 13
UniVerse-1: Unified Audio-Video Generation via Stitching of Experts Paper • 2509.06155 • Published Sep 7 • 13 • 2
RecA Collection Unlocking the Massive Zero-shot Potential in Unified Multimodal Models through Self-supervised Learning! • 8 items • Updated Sep 22 • 13