AzalKhan/Qwen2.5-1.5B-Instruct_BF16_open-r1-DAPO-Math-17k-Processed_882_FlashRL_G4-L2048_new

Reinforcement Learning

text-generation

text-generation-inference

Commit History

Upload folder using huggingface_hub

b83e337
verified

AzalKhan committed on Oct 24, 2025

Upload folder using huggingface_hub

c11f135
verified

AzalKhan committed on Oct 24, 2025

Upload README.md with huggingface_hub

4cea059
verified

AzalKhan committed on Oct 24, 2025

initial commit

f8bb6a2
verified

AzalKhan committed on Oct 24, 2025