DreamOmni2: Multimodal Instruction-based Editing and Generation Paper • 2510.06679 • Published Oct 8, 2025 • 73
Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition Paper • 2412.09501 • Published Dec 12, 2024 • 48
LLMGA: Multimodal Large Language Model based Generation Assistant Paper • 2311.16500 • Published Nov 27, 2023 • 1