base_model:
- Qwen/Qwen2.5-7B-Instruct
R1-Fuzz-7B
R1-Fuzz-7B is a model fine-tuned to generate fuzzing inputs. It was trained from Qwen2.5-7B-Instruct using GRPO (Group Relative Policy Optimization).
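For readers unfamiliar with GRPO, the sketch below illustrates only the group-relative advantage step that gives the method its name: several outputs are sampled for the same prompt, and each reward is normalized against the mean and standard deviation of its group. This card does not describe the reward function or training configuration used for R1-Fuzz-7B, so the function name, the example rewards, and the choice of population standard deviation are assumptions made for illustration.

```python
from statistics import mean, pstdev


def group_relative_advantages(rewards, eps=1e-8):
    """Normalize rewards within one group of samples drawn for the same prompt.

    Hypothetical helper for illustration; not taken from the R1-Fuzz code base.
    """
    mu = mean(rewards)
    # Population standard deviation is an assumption; implementations differ.
    sigma = pstdev(rewards)
    # eps avoids division by zero when every sample gets the same reward.
    return [(r - mu) / (sigma + eps) for r in rewards]


# Made-up rewards for four candidate fuzzing inputs generated from one prompt.
rewards = [0.0, 1.0, 1.0, 2.0]
advantages = group_relative_advantages(rewards)
```

Samples that score above their group's average receive a positive advantage and are reinforced; those below it receive a negative one. No separate value model is needed, since the group itself serves as the baseline.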
Project code: https://github.com/HKU-System-Security-Lab/R1-Fuzz