DR Tulu Collection Models and data associated with DR Tulu, http://allenai-web/papers/drtulu β’ 5 items β’ Updated 16 days ago β’ 31
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks β’ 8 items β’ Updated Jul 31 β’ 28
Simulating the Visual World with Artificial Intelligence: A Roadmap Paper β’ 2511.08585 β’ Published 29 days ago β’ 29
LoopTool: Closing the Data-Training Loop for Robust LLM Tool Calls Paper β’ 2511.09148 β’ Published 28 days ago β’ 16
BigCodeArena: Unveiling More Reliable Human Preferences in Code Generation via Execution Paper β’ 2510.08697 β’ Published Oct 9 β’ 36
The Dragon Hatchling: The Missing Link between the Transformer and Models of the Brain Paper β’ 2509.26507 β’ Published Sep 30 β’ 535
INTELLECT-2: A Reasoning Model Trained Through Globally Decentralized Reinforcement Learning Paper β’ 2505.07291 β’ Published May 12 β’ 14
LINC: A Neurosymbolic Approach for Logical Reasoning by Combining Language Models with First-Order Logic Provers Paper β’ 2310.15164 β’ Published Oct 23, 2023 β’ 3
MoCha: Towards Movie-Grade Talking Character Synthesis Paper β’ 2503.23307 β’ Published Mar 30 β’ 138
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper β’ 2412.13663 β’ Published Dec 18, 2024 β’ 158