| base_model: | |
| - Qwen/Qwen2.5-7B-Instruct | |
| # R1-Fuzz-7B | |
| R1-Fuzz-7B is a model fine-tuned for the task of **fuzzing input generation**. | |
| It's trained based on Qwen2.5-7B-Instruct by GRPO. | |
| Project code: https://github.com/HKU-System-Security-Lab/R1-Fuzz | |
| Paper: https://arxiv.org/abs/2509.20384 |