This is a ChatML model.

merge

This is a merge of pre-trained language models created using mergekit.

Merge Details

Merge Method

This model was merged using the Model Stock merge method using Elizezen/Himeyuri-v0.1-12B as a base.

Models Merged

The following models were included in the merge:

Configuration

The following YAML configuration was used to produce this model:

base_model: Elizezen/Himeyuri-v0.1-12B
models:
  - model: anthracite-org/magnum-v2-12b
  - model: anthracite-org/magnum-v2.5-12b-kto
merge_method: model_stock
dtype: bfloat16
parameters:
  normalize: true

Downloads last month: 26

Safetensors

Model size

12B params

Tensor type

BF16

Model tree for yamatazen/Himeyuri-Magnum-12B

Elizezen/Himeyuri-v0.1-12B

anthracite-org/magnum-v2-12b

anthracite-org/magnum-v2.5-12b-kto

Merge model

this model

Merges

8 models

Quantizations

3 models

Paper for yamatazen/Himeyuri-Magnum-12B

Model Stock: All we need is just a few fine-tuned models

Paper • 2403.19522 • Published Mar 28, 2024 • 14