Meta-Reinforcement Learning with Self-Reflection for Agentic Search Paper • 2603.11327 • Published 2 days ago • 2
SSDD: Single-Step Diffusion Decoder for Efficient Image Tokenization Paper • 2510.04961 • Published Oct 6, 2025 • 5
RoboBrain-Dex Collection Dexterous VLA utilizing human ego data training • 2 items • Updated about 13 hours ago • 2
Dripper: Token-Efficient Main HTML Extraction with a Lightweight LM Paper • 2511.23119 • Published Nov 28, 2025 • 1
AICC: Parse HTML Finer, Make Models Better -- A 7.3T AI-Ready Corpus Built by a Model-Based HTML Parser Paper • 2511.16397 • Published Nov 20, 2025 • 11
Examining Reasoning LLMs-as-Judges in Non-Verifiable LLM Post-Training Paper • 2603.12246 • Published 1 day ago • 4
Multi-Task Reinforcement Learning for Enhanced Multimodal LLM-as-a-Judge Paper • 2603.11665 • Published 1 day ago • 2
Beyond Closed-Pool Video Retrieval: A Benchmark and Agent Framework for Real-World Video Search and Moment Localization Paper • 2602.10159 • Published Feb 10 • 1