|
|
--- |
|
|
license: mit |
|
|
language: |
|
|
- vi |
|
|
base_model: |
|
|
- sail/Sailor2-8B-Chat |
|
|
--- |
|
|
<div align="center"> |
|
|
<img src="https://github.com/bloomifycafe/blossomsAI/blob/main/assets/logo.png?raw=true" alt="Logo"/> |
|
|
</div> |
|
|
</br> |
|
|
<div align="center"> |
|
|
|
|
|
# π BloomVN-8B-chat π |
|
|
|
|
|
</div> |
|
|
|
|
|
### A fine-tuned multilingual model for Vietnamese language |
|
|
|
|
|
## π Overview |
|
|
|
|
|
- A bilingual text generation model with strong capabilities in both Vietnamese and English languages. |
|
|
- This base model can handle a wide range of text generation tasks while maintaining high quality output in both languages, making it particularly valuable for Vietnamese-English content creation and language processing applications. |
|
|
|
|
|
## π§ Method |
|
|
|
|
|
The training process consists of three main steps: |
|
|
|
|
|
- Continuous Pre-training (CPT) from Sailor2-8B-Chat using [unsloth](https://github.com/unslothai/unsloth) |
|
|
- Fine-tuning with [Vietnamese instruction dataset](https://huggingface.co/datasets/BlossomsAI/reduced_vietnamese_instruction_dataset) |
|
|
- Applied refusal direction tuning based on ["Refusal in LLMs is Mediated by a Single Direction"](https://www.alignmentforum.org/posts/jGuXSZgv6qfdhMCuJ/refusal-in-llms-is-mediated-by-a-single-direction) |
|
|
|
|
|
## π VLMU Benchmark |
|
|
|
|
|
| EVALUATION DATE | STEM π¬ | SOCIAL SCIENCE π | HUMANITIES π | OTHERS π― | AVG β | |
|
|
|----------------|--------|------------------|---------------|-----------|--------| |
|
|
| 07/02/2025 | 50.72 | 62.81 | 60.47 | 55.4 | 56.56 | |
|
|
|
|
|
|
|
|
## π« Quantization |
|
|
|
|
|
- Coming Soon! |
|
|
|
|
|
## π€ Contributors |
|
|
|
|
|
Developed with β€οΈ by [BlossomAI](https://github.com/BlossomAI) |
|
|
|
|
|
<p align="left"> |
|
|
<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/made with unsloth.png" width="200" /> |
|
|
</p> |
|
|
|
|
|
--- |
|
|
<div align="center"> |
|
|
<sub>Star βοΈ this repo if you find it valuable!</sub> |
|
|
</div> |
|
|
|