- Training Large Language Models To Reason In Parallel With Global Forking Tokens
  Paper • 2510.05132 • Published • 1
- shengjia-toronto/ssft32b_grpo_bs256_step10
  Updated • 398
- shengjia-toronto/ssft-32B-N6
  Text Generation • 4B • Updated • 1.51k
- shengjia-toronto/grpo-test-ssft-32B
  Text Generation • 33B • Updated • 17
Sheng Jia PRO
shengjia-toronto
Recent Activity
updated a model 3 days ago: shengjia-toronto/grpo-test-ssft-32B
updated a model 3 days ago: shengjia-toronto/ssft-32B-N6
updated a collection 3 days ago: SSFT