jmajkutewicz/Llama-3.1-Tulu-3-8B-DPO_PKU-SafeRLHF
Tags: Text Generation, PEFT, Safetensors, PKU-Alignment/PKU-SafeRLHF, English, llama, lora, dpo, alignment, conversational
License: llama3.1
Branch: main
Repository size: 345 MB
1 contributor
History: 2 commits
Latest commit: 1ecb77d (verified) by jmajkutewicz, "Upload folder using huggingface_hub", 2 months ago
File                        Size       Flags  Last commit message                  Updated
.gitattributes              1.52 kB    Safe   initial commit                       2 months ago
README.md                   1.48 kB    -      Upload folder using huggingface_hub  2 months ago
adapter_config.json         734 Bytes  -      Upload folder using huggingface_hub  2 months ago
adapter_model.safetensors   336 MB     xet    Upload folder using huggingface_hub  2 months ago
config.json                 875 Bytes  -      Upload folder using huggingface_hub  2 months ago
special_tokens_map.json     439 Bytes  Safe   Upload folder using huggingface_hub  2 months ago
tokenizer.json              9.09 MB    Safe   Upload folder using huggingface_hub  2 months ago
tokenizer_config.json       51.2 kB    -      Upload folder using huggingface_hub  2 months ago