sambanovasystems
/

SambaCoder-nsql-llama-2-70b

Text Generation

text-generation-inference

Model card Files Files and versions

bol20162021 commited on Feb 13, 2024

Commit

5f5bfaa

·

verified ·

1 Parent(s): 8f2efda

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -32,7 +32,7 @@ We evaluate our models on three text-to-SQL benchmarks: Spider, Bird, and text2s
 ## Training Procedure
-NSQL was trained using cross-entropy loss to maximize the likelihood of sequential inputs. For finetuning on text-to-SQL pairs, we only compute the loss over the SQL portion of the pair. The model is trained using SambaNova's in-house Reconfigurable Dataflow Unit (RDU), leveraging data and model parallelism. We pre-trained for 2 epochs and fine-tuned for 10 epochs.
 ### Hyperparameters

 ## Training Procedure
+SambaCoder-nsql-llama-2-70b was trained using cross-entropy loss to maximize the likelihood of sequential inputs. For finetuning on text-to-SQL pairs, we only compute the loss over the SQL portion of the pair. The model is trained using SambaNova's in-house Reconfigurable Dataflow Unit (RDU), leveraging data and model parallelism. We pre-trained for 2 epochs and fine-tuned for 10 epochs.
 ### Hyperparameters