AzalKhan
/

Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L2048_new

Reinforcement Learning

text-generation

text-generation-inference

Model card Files Files and versions

Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_1176_FlashRL_G4-L2048_new / README.md

Commit History

Upload README.md with huggingface_hub

a611ce3
verified

AzalKhan commited on Oct 24, 2025