[Train] pad_token_id set as "!"

by eungizoa - opened Feb 9, 2025

Feb 9, 2025

According to the special_tokens_map, the pad_token is set as "!".
I want to fine-tune this model to perform specific task, and rom the trl.SFTTrainer document, it says that pad_token_id should be set differently as eos_token_id.

"Make sure to have a pad_token_id which is different from eos_token_id which can result in the model not properly predicting EOS (End of Sentence) tokens during generation." (https://huggingface.co/docs/trl/sft_trainer)

But the string "!" is a very common string, so I wonder if I can use this pad_token during training.

Can I train this model as below?
"tokenizer.pad_token_id = tokenizer.eos_token_id"

Or should I use this pad_token ("!") without any modification?

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment