sujayshahare's Collections

Reasoning LLMs

updated Jun 8

Collection of research papers I find interesting on reasoning models

  • Learning to Reason without External Rewards

    Paper • 2505.19590 • Published May 26 • 29

  • Scalable Best-of-N Selection for Large Language Models via Self-Certainty

    Paper • 2502.18581 • Published Feb 25

  • Training Large Language Models to Reason in a Continuous Latent Space

    Paper • 2412.06769 • Published Dec 9, 2024 • 90

  • Fractured Chain-of-Thought Reasoning

    Paper • 2505.12992 • Published May 19 • 23

  • OpenThoughts: Data Recipes for Reasoning Models

    Paper • 2506.04178 • Published Jun 4 • 48

  • SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training

    Paper • 2501.17161 • Published Jan 28 • 123

  • Kimi k1.5: Scaling Reinforcement Learning with LLMs

    Paper • 2501.12599 • Published Jan 22 • 125

  • TÜLU 3: Pushing Frontiers in Open Language Model Post-Training

    Paper • 2411.15124 • Published Nov 22, 2024 • 67