ZHLiu627/verl_agent_alfworld-GRPO-int-reward_False-Llama-3.1-8B-Instruct-100step
Safetensors
llama
No model card
Downloads last month: 6
Safetensors
Model size: 8B params
Tensor type: BF16
Chat template
Inference Providers
This model isn't deployed by any Inference Provider.