Safetensors
agent

LiveTalk: Real-Time Multimodal Interactive Video Diffusion via Improved On-Policy Distillation

LiveTalk icon

arXiv   Hugging Face Model

This is the model for https://github.com/GAIR-NLP/LiveTalk

LiveTalk enables real-time multimodal interactive avatar video generation through an improved on-policy distillation approach. By distilling bidirectional diffusion models into causal, few-step autoregressive models, LiveTalk achieves over 20ร— speedup, enabling seamless real-time interactive experience.

LiveTalk System Overview

Downloads last month
31
Inference Providers NEW
This model isn't deployed by any Inference Provider. ๐Ÿ™‹ Ask for provider support