MBZUAI
/

bactrian-x-llama-7b-lora

Model card Files Files and versions

lmlmcat commited on May 4, 2023

Commit

cbe5dbd

·

1 Parent(s): 67496fc

update model

Files changed (2) hide show

README.md +7 -5
adapter_model.bin +2 -2

README.md CHANGED Viewed

@@ -21,25 +21,27 @@ and [databricks-dolly-15k](https://github.com/databrickslabs/dolly/tree/master/d
 The code for training the model is provided in our [github](https://github.com/mbzuai-nlp/Bactrian-X), which is adapted from [Alpaca-LoRA](https://github.com/tloen/alpaca-lora).
 This version of the weights was trained with the following hyperparameters:
-- Epochs: 2
-- Batch size: 256
 - Cutoff length: 512
 - Learning rate: 3e-4
 - Lora _r_: 64
 - Lora target modules: q_proj, k_proj, v_proj, o_proj
 That is:
 ```
 python finetune.py \
     --base_model='decapoda-research/llama-7b-hf' \
-    --num_epochs=2 \
-    --batch_size=256 \
     --cutoff_len=512 \
     --group_by_length \
     --output_dir='./bactrian-x-7b-lora' \
-    --lora_target_modules='[q_proj,k_proj,v_proj,o_proj]' \
     --lora_r=64 \
     --micro_batch_size=32
 ```

 The code for training the model is provided in our [github](https://github.com/mbzuai-nlp/Bactrian-X), which is adapted from [Alpaca-LoRA](https://github.com/tloen/alpaca-lora).
 This version of the weights was trained with the following hyperparameters:
+- Epochs: 10
+- Batch size: 128
 - Cutoff length: 512
 - Learning rate: 3e-4
 - Lora _r_: 64
 - Lora target modules: q_proj, k_proj, v_proj, o_proj
+#### Current Training Steps: 21000
 That is:
 ```
 python finetune.py \
     --base_model='decapoda-research/llama-7b-hf' \
+    --num_epochs=10 \
+    --batch_size=128 \
     --cutoff_len=512 \
     --group_by_length \
     --output_dir='./bactrian-x-7b-lora' \
+    --lora_target_modules='q_proj,k_proj,v_proj,o_proj' \
     --lora_r=64 \
     --micro_batch_size=32
 ```

adapter_model.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:8dd9a8cd9145346cde8a3b356dd402e6221e020e98e0587791c581215661591d
-size 134263757

 version https://git-lfs.github.com/spec/v1
+oid sha256:a79d74d6cfed583c0a176438158a983133a2016ed923d25347e4335d95c7aab8
+size 268527949