RubricBench: Aligning Model-Generated Rubrics with Human Standards Paper • 2603.01562 • Published 11 days ago • 57
Imagination Helps Visual Reasoning, But Not Yet in Latent Space Paper • 2602.22766 • Published 14 days ago • 40
DualPath: Breaking the Storage Bandwidth Bottleneck in Agentic LLM Inference Paper • 2602.21548 • Published 16 days ago • 43
view article Article OpenEnv in Practice: Evaluating Tool-Using Agents in Real-World Environments +3 29 days ago • 31
Step 3.5 Flash: Open Frontier-Level Intelligence with 11B Active Parameters Paper • 2602.10604 • Published 30 days ago • 189
Towards Pixel-Level VLM Perception via Simple Points Prediction Paper • 2601.19228 • Published Jan 27 • 18
view article Article Unlocking Agentic RL Training for GPT-OSS: A Practical Retrospective Jan 27 • 64
VISTA-PATH: An interactive foundation model for pathology image segmentation and quantitative analysis in computational pathology Paper • 2601.16451 • Published Jan 23 • 3
Jet-RL: Enabling On-Policy FP8 Reinforcement Learning with Unified Training and Rollout Precision Flow Paper • 2601.14243 • Published Jan 20 • 23
Think-Then-Generate: Reasoning-Aware Text-to-Image Diffusion with LLM Encoders Paper • 2601.10332 • Published Jan 15 • 30
VAR RL Done Right: Tackling Asynchronous Policy Conflicts in Visual Autoregressive Generation Paper • 2601.02256 • Published Jan 5 • 33
Enhancing Inflation Nowcasting with LLM: Sentiment Analysis on News Paper • 2410.20198 • Published Oct 26, 2024 • 1
DiffThinker: Towards Generative Multimodal Reasoning with Diffusion Models Paper • 2512.24165 • Published Dec 30, 2025 • 51
Differences That Matter: Auditing Models for Capability Gap Discovery and Rectification Paper • 2512.16921 • Published Dec 18, 2025 • 8
Next-Embedding Prediction Makes Strong Vision Learners Paper • 2512.16922 • Published Dec 18, 2025 • 87
Insight Miner: A Time Series Analysis Dataset for Cross-Domain Alignment with Natural Language Paper • 2512.11251 • Published Dec 12, 2025 • 8
DiffusionVL: Translating Any Autoregressive Models into Diffusion Vision Language Models Paper • 2512.15713 • Published Dec 17, 2025 • 18