Hierarchical Text-Conditional Image Generation with CLIP Latents • Paper • 2204.06125 • Published Apr 13, 2022
I Think, Therefore I Diffuse: Enabling Multimodal In-Context Reasoning in Diffusion Models • Paper • 2502.10458 • Published Feb 12, 2025