
ltl

2793145003

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted a paper 2 days ago

Live Avatar: Streaming Real-time Audio-Driven Avatar Generation with Infinite Length

Paper • 2512.04677 • Published 3 days ago • 138

liked a model 5 days ago

deepseek-ai/DeepSeek-V3.2

Text Generation • 685B • Updated 6 days ago • 25.5k • 766

upvoted a paper 20 days ago

Uni-MoE-2.0-Omni: Scaling Language-Centric Omnimodal Large Model with Advanced MoE, Training and Data

Paper • 2511.12609 • Published 21 days ago • 102

upvoted 2 papers 26 days ago

Thinking with Video: Video Generation as a Promising Multimodal Reasoning Paradigm

Paper • 2511.04570 • Published about 1 month ago • 208

Too Good to be Bad: On the Failure of LLMs to Role-Play Villains

Paper • 2511.04962 • Published about 1 month ago • 52

upvoted 2 papers about 2 months ago

FlashWorld: High-quality 3D Scene Generation within Seconds

Paper • 2510.13678 • Published Oct 15 • 71

The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain

Paper • 2509.26507 • Published Sep 30 • 535

upvoted 2 papers 3 months ago

Set Block Decoding is a Language Model Inference Accelerator

Paper • 2509.04185 • Published Sep 4 • 52

Why Language Models Hallucinate

Paper • 2509.04664 • Published Sep 4 • 193

upvoted 2 papers 4 months ago

PixNerd: Pixel Neural Field Diffusion

Paper • 2507.23268 • Published Jul 31 • 51

HunyuanWorld 1.0: Generating Immersive, Explorable, and Interactive 3D Worlds from Words or Pixels

Paper • 2507.21809 • Published Jul 29 • 135

upvoted 4 papers 6 months ago

Vision-Guided Chunking Is All You Need: Enhancing RAG with Multimodal Document Understanding

Paper • 2506.16035 • Published Jun 19 • 88

Drag-and-Drop LLMs: Zero-Shot Prompt-to-Weights

Paper • 2506.16406 • Published Jun 19 • 127

Stream-Omni: Simultaneous Multimodal Interactions with Large Language-Vision-Speech Model

Paper • 2506.13642 • Published Jun 16 • 26

Paper2Poster: Towards Multimodal Poster Automation from Scientific Papers

Paper • 2505.21497 • Published May 27 • 109

liked a model 7 months ago

ByteDance-Seed/BAGEL-7B-MoT

Any-to-Any • 15B • Updated 1 day ago • 834 • 1.16k

upvoted 3 papers 7 months ago

MMaDA: Multimodal Large Diffusion Language Models

Paper • 2505.15809 • Published May 21 • 97

VisRAG: Vision-based Retrieval-augmented Generation on Multi-modality Documents

Paper • 2410.10594 • Published Oct 14, 2024 • 28

Skywork-VL Reward: An Effective Reward Model for Multimodal Understanding and Reasoning

Paper • 2505.07263 • Published May 12 • 30

liked a model 7 months ago

Skywork/Skywork-VL-Reward-7B

Image-Text-to-Text • 8B • Updated Jun 10 • 17.9k • 46
