NhatHoang2002
/

llama3.1-8b-instruct-step-dpo

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Model card Files Files and versions

llama3.1-8b-instruct-step-dpo

16.1 GB

1 contributor

History: 2 commits

This model has 1 file scanned as unsafe.

NhatHoang2002's picture

Upload folder using huggingface_hub

034b55e verified 27 days ago