V-Bridge: Bridging Video Generative Priors to Versatile Few-shot Image Restoration Paper • 2603.13089 • Published 4 days ago • 12 • 2
SDF-Net: Structure-Aware Disentangled Feature Learning for Opticall-SAR Ship Re-identification Paper • 2603.12588 • Published 4 days ago • 1 • 2
SimRecon: SimReady Compositional Scene Reconstruction from Real Videos Paper • 2603.02133 • Published 14 days ago • 3 • 2
Cheers: Decoupling Patch Details from Semantic Representations Enables Unified Multimodal Comprehension and Generation Paper • 2603.12793 • Published 4 days ago • 28 • 3
Spend Less, Reason Better: Budget-Aware Value Tree Search for LLM Agents Paper • 2603.12634 • Published 4 days ago • 6 • 1
NanoVDR: Distilling a 2B Vision-Language Retriever into a 70M Text-Only Encoder for Visual Document Retrieval Paper • 2603.12824 • Published 4 days ago • 4 • 2
From Sparse to Dense: Multi-View GRPO for Flow Models via Augmented Condition Space Paper • 2603.12648 • Published 4 days ago • 10 • 1
LookaheadKV: Fast and Accurate KV Cache Eviction by Glimpsing into the Future without Generation Paper • 2603.10899 • Published 6 days ago • 6 • 2
daVinci-Env: Open SWE Environment Synthesis at Scale Paper • 2603.13023 • Published 4 days ago • 22 • 3
Visual-ERM: Reward Modeling for Visual Equivalence Paper • 2603.13224 • Published 3 days ago • 19 • 1
EvoScientist: Towards Multi-Agent Evolving AI Scientists for End-to-End Scientific Discovery Paper • 2603.08127 • Published 8 days ago • 7 • 5
Compression Favors Consistency, Not Truth: When and Why Language Models Prefer Correct Information Paper • 2603.11749 • Published 5 days ago • 3 • 2
CreativeBench: Benchmarking and Enhancing Machine Creativity via Self-Evolving Challenges Paper • 2603.11863 • Published 5 days ago • 5 • 1
Do You See What I Am Pointing At? Gesture-Based Egocentric Video Question Answering Paper • 2603.12533 • Published 4 days ago • 2
Video Streaming Thinking: VideoLLMs Can Watch and Think Simultaneously Paper • 2603.12262 • Published 4 days ago • 16 • 2
OmniForcing: Unleashing Real-time Joint Audio-Visual Generation Paper • 2603.11647 • Published 5 days ago • 20 • 4
Detecting Intrinsic and Instrumental Self-Preservation in Autonomous Agents: The Unified Continuation-Interest Protocol Paper • 2603.11382 • Published 5 days ago • 2
Can Fairness Be Prompted? Prompt-Based Debiasing Strategies in High-Stakes Recommendations Paper • 2603.12935 • Published 4 days ago • 3 • 2