Bolmo: Byteifying the Next Generation of Language Models Paper • 2512.15586 • Published 16 days ago • 13
Bootstrapping World Models from Dynamics Models in Multimodal Foundation Models Paper • 2506.06006 • Published Jun 6, 2025 • 14
Inference-Time Hyper-Scaling with KV Cache Compression Paper • 2506.05345 • Published Jun 5, 2025 • 27
notpaulmartin/OpenR1-Math-220k_decontaminated_correct_only Viewer • Updated Mar 26, 2025 • 64.2k • 16
notpaulmartin/OpenR1-Math-220k_decontaminated_correct_only Viewer • Updated Mar 26, 2025 • 64.2k • 16
RedPajama: an Open Dataset for Training Large Language Models Paper • 2411.12372 • Published Nov 19, 2024 • 56