Sleeping 16 νκ΅μ΄ TTS μλ λ π€ 16 νκ΅μ΄ TTS λͺ¨λΈμ λΈλΌμΈλ ν μ€νΈλ‘ λΉκ΅ νκ°νμΈμ!
Running on CPU Upgrade Featured 2.54k The Smol Training Playbook π 2.54k The secrets to building world-class LLMs
A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning Paper β’ 2510.15444 β’ Published Oct 17 β’ 147
MCP-Bench: Benchmarking Tool-Using LLM Agents with Complex Real-World Tasks via MCP Servers Paper β’ 2508.20453 β’ Published Aug 28 β’ 63
SamilPwC-AXNode-GenAI/PwC-Embedding_expr Sentence Similarity β’ 0.6B β’ Updated Aug 18 β’ 976 β’ 5
On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification Paper β’ 2508.05629 β’ Published Aug 7 β’ 180