AzalKhan
/

Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L2048_new

Reinforcement Learning

text-generation

text-generation-inference

Model card Files Files and versions

Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L2048_new

2.39 kB

1 contributor

History: 2 commits

AzalKhan's picture

Upload README.md with huggingface_hub

a611ce3 verified about 2 months ago