Enhancing Spatial Understanding in Image Generation via Reward Modeling Paper • 2602.24233 • Published 5 days ago • 46
Recovered in Translation: Efficient Pipeline for Automated Translation of Benchmarks and Datasets Paper • 2602.22207 • Published 7 days ago • 38
CUDA Agent: Large-Scale Agentic RL for High-Performance CUDA Kernel Generation Paper • 2602.24286 • Published 5 days ago • 70
Mode Seeking meets Mean Seeking for Fast Long Video Generation Paper • 2602.24289 • Published 5 days ago • 33
Compositional Generalization Requires Linear, Orthogonal Representations in Vision Embedding Models Paper • 2602.24264 • Published 5 days ago • 14
LK Losses: Direct Acceptance Rate Optimization for Speculative Decoding Paper • 2602.23881 • Published 6 days ago • 18
CiteAudit: You Cited It, But Did You Read It? A Benchmark for Verifying Scientific References in the LLM Era Paper • 2602.23452 • Published 6 days ago • 16
Accelerating Masked Image Generation by Learning Latent Controlled Dynamics Paper • 2602.23996 • Published 5 days ago • 8
Ref-Adv: Exploring MLLM Visual Reasoning in Referring Expression Tasks Paper • 2602.23898 • Published 6 days ago • 10
LongVideo-R1: Smart Navigation for Low-cost Long Video Understanding Paper • 2602.20913 • Published 8 days ago • 10
SenCache: Accelerating Diffusion Model Inference via Sensitivity-Aware Caching Paper • 2602.24208 • Published 5 days ago • 7
Vectorizing the Trie: Efficient Constrained Decoding for LLM-based Generative Retrieval on Accelerators Paper • 2602.22647 • Published 7 days ago • 2
DLEBench: Evaluating Small-scale Object Editing Ability for Instruction-based Image Editing Model Paper • 2602.23622 • Published 6 days ago • 3
Decoding as Optimisation on the Probability Simplex: From Top-K to Top-P (Nucleus) to Best-of-K Samplers Paper • 2602.18292 • Published 12 days ago • 10 • 6
Multi-Task GRPO: Reliable LLM Reasoning Across Tasks Paper • 2602.05547 • Published 28 days ago • 12 • 5