AzalKhan
/

Qwen2.5-1.5B-Instruct_open-r1-DAPO-Math-17k-Processed_882

Reinforcement Learning

text-generation

text-generation-inference

Model card Files Files and versions

Qwen2.5-1.5B-Instruct_open-r1-DAPO-Math-17k-Processed_882 / merges.txt

AzalKhan's picture

Upload folder using huggingface_hub

83aeac3 verified 2 months ago

history contribute delete

1.67 MB

File too large to display, you can check the raw version instead.