Instella: Fully Open Language Models with Stellar Performance (AMD), submitted by Prakamya Mishra
SAND-Math: Using LLMs to Generate Novel, Difficult and Useful Mathematics Questions and Answers (AMD), submitted by Prakamya Mishra
TTT-Bench: A Benchmark for Evaluating Reasoning Ability with Simple and Novel Tic-Tac-Toe-style Games (AMD), submitted by Prakamya Mishra