R1-Fuzz-7B / README.md

Update README.md

f85bc7a verified 3 months ago

297 Bytes

metadata

base_model:
  - Qwen/Qwen2.5-7B-Instruct

R1-Fuzz-7B

R1-Fuzz-7B is a model fine-tuned for the task of fuzzing input generation. It's trained based on Qwen2.5-7B-Instruct by GRPO.