Sheng Jia PRO

shengjia-toronto

AI & ML interests

None yet

Recent Activity

updated a model 3 days ago

shengjia-toronto/grpo-test-ssft-32B

updated a model 3 days ago

shengjia-toronto/ssft-32B-N6

updated a collection 3 days ago

SSFT

View all activity

Organizations

None yet

Collections 1

Papers 1

arxiv:2510.05132

models 5

datasets 5

shengjia-toronto/ssft1Kcode-v2-Sonnet4-5-High-Run2-temp1-max29000

Viewer • Updated 24 days ago • 1k • 31 • 1

shengjia-toronto/ssft1Kcode-v2-Sonnet4-5-High-Run1-temp1-max29000

Viewer • Updated 24 days ago • 1k • 30

shengjia-toronto/ssft1Kcode-v2-OSS-High-Run1-temp1-max32768

Viewer • Updated 24 days ago • 1k • 23

shengjia-toronto/ssft1Kcode_v2

Viewer • Updated 24 days ago • 1k • 40

shengjia-toronto/dapo-math-17k-promptprocessed

Viewer • Updated about 1 month ago • 14.1k • 46

Sheng Jia PRO

AI & ML interests

Recent Activity

Organizations

Collections 1

Training Large Language Models To Reason In Parallel With Global Forking Tokens

shengjia-toronto/ssft32b_grpo_bs256_step10

shengjia-toronto/ssft-32B-N6

shengjia-toronto/grpo-test-ssft-32B

Training Large Language Models To Reason In Parallel With Global Forking Tokens

shengjia-toronto/ssft32b_grpo_bs256_step10

shengjia-toronto/ssft-32B-N6

shengjia-toronto/grpo-test-ssft-32B

Papers 1

models 5

shengjia-toronto/grpo-test-ssft-32B

shengjia-toronto/ssft-32B-N6

shengjia-toronto/ssft32b_grpo_bs256_step10

shengjia-toronto/ssft32b_grpo_bs256_step11_H200

shengjia-toronto/sft_mixed_32b_thinktags_GAS4

datasets 5

shengjia-toronto/ssft1Kcode-v2-Sonnet4-5-High-Run2-temp1-max29000

shengjia-toronto/ssft1Kcode-v2-Sonnet4-5-High-Run1-temp1-max29000

shengjia-toronto/ssft1Kcode-v2-OSS-High-Run1-temp1-max32768

shengjia-toronto/ssft1Kcode_v2

shengjia-toronto/dapo-math-17k-promptprocessed

Sheng Jia PRO

AI & ML interests

Recent Activity

Organizations

Collections 1

Papers 1

models 5 Sort: Recently updated

datasets 5 Sort: Recently updated

models 5

datasets 5