SyncMV4D: Synchronized Multi-view Joint Diffusion of Appearance and Motion for Hand-Object Interaction Synthesis Paper • 2511.19319 • Published 12 days ago • 1
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper • 2510.08697 • Published Oct 9 • 35
UP2You: Fast Reconstruction of Yourself from Unconstrained Photo Collections Paper • 2509.24817 • Published Sep 29 • 8
See, Point, Fly: A Learning-Free VLM Framework for Universal Unmanned Aerial Navigation Paper • 2509.22653 • Published Sep 26 • 23
Post 562: Qwen 3 Coder is a personal attack to k2, and I love it. It achieves near SOTA on LCB while not having reasoning. Finally people are understanding that reasoning isn't necessary for high benches... Qwen ftw! DECENTRALIZE DECENTRALIZE DECENTRALIZE
Towards Video Thinking Test: A Holistic Benchmark for Advanced Video Reasoning and Understanding Paper • 2507.15028 • Published Jul 20 • 21
SplArt: Articulation Estimation and Part-Level Reconstruction with 3D Gaussian Splatting Paper • 2506.03594 • Published Jun 4
Post 3072: deepseek-ai/DeepSeek-R1-0528 This is the end
Feat2GS: Probing Visual Foundation Models with Gaussian Splatting Paper • 2412.09606 • Published Dec 12, 2024 • 2
ChatGarment: Garment Estimation, Generation and Editing via Large Language Models Paper • 2412.17811 • Published Dec 23, 2024
ETCH: Generalizing Body Fitting to Clothed Humans via Equivariant Tightness Paper • 2503.10624 • Published Mar 13 • 10
Easi3R: Estimating Disentangled Motion from DUSt3R Without Training Paper • 2503.24391 • Published Mar 31 • 6
Tuning-Free Image Editing with Fidelity and Editability via Unified Latent Diffusion Model Paper • 2504.05594 • Published Apr 8 • 11
Free4D: Tuning-free 4D Scene Generation with Spatial-Temporal Consistency Paper • 2503.20785 • Published Mar 26 • 22