THU-KEG
/

LLaDA-8B-BGPO-countdown

Reinforcement Learning

Model card Files Files and versions

LLaDA-8B-BGPO-countdown

16 GB

2 contributors

History: 5 commits

nielsr's picture

nielsr HF Staff

Improve model card: Add pipeline tag, library name, and enrich content

cf0ca01 verified 2 months ago

.gitattributes
1.52 kB

initial commit 2 months ago
README.md
2.79 kB

Improve model card: Add pipeline tag, library name, and enrich content 2 months ago
config.json
1.44 kB

Upload folder using huggingface_hub 2 months ago
configuration_llada.py
12.4 kB

Upload folder using huggingface_hub 2 months ago
generation_config.json
143 Bytes

Upload folder using huggingface_hub 2 months ago
model-00001-of-00004.safetensors
4.5 GB
xet

Upload folder using huggingface_hub 2 months ago
model-00002-of-00004.safetensors
4.99 GB
xet

Upload folder using huggingface_hub 2 months ago
model-00003-of-00004.safetensors
5 GB
xet

Upload folder using huggingface_hub 2 months ago
model-00004-of-00004.safetensors
1.54 GB
xet

Upload folder using huggingface_hub 2 months ago
model.safetensors.index.json
25 kB

Upload folder using huggingface_hub 2 months ago
modeling_llada.py
68.9 kB

Upload folder using huggingface_hub 2 months ago
tokenizer.json
6.1 MB

Upload folder using huggingface_hub 2 months ago
tokenizer_config.json
51.7 kB

Upload folder using huggingface_hub 2 months ago