💬 Fix setup_chat_format and add clone_chat_template #3404

Merged
merged 23 commits into main from fix-setup-chat-format on Jun 15, 2025

Conversation

qgallouedec
Member

@qgallouedec qgallouedec commented May 2, 2025

This PR does two things:

  1. It fixes the resizing of the embedding dimension in setup_chat_format.
  2. It introduces a new function, clone_chat_template.

Both functions essentially serve the same purpose. The key difference is that clone_chat_template takes as input any tokenizer that already includes a chat template and copies that template over. I think this new function is more convenient and flexible in practice.

Also, if you look at the implementations, you'll notice that unlike setup_chat_format, clone_chat_template doesn't handle BOS or PAD tokens. Correct me if I’m wrong, but I don’t believe that’s necessary when setting up a chat template.

It also copies all the added tokens from the source tokenizer, ensuring tokens like <think>, <|eot|>, ... are properly set.
Note that practice varies a lot regarding these added tokens, in particular whether they are marked as "special" or not. The current implementation fully relies on the source tokenizer: if a token is special in the source tokenizer, it will be special in the target, and vice versa.

In the long term, I think we should deprecate setup_chat_format in favor of clone_chat_template, but this can be done in a future PR.
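
For illustration, here is a minimal usage sketch of the new helper, mirroring the call this PR adds to trl/scripts/sft.py; the base-model name below is just a placeholder:

```python
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import clone_chat_template

# A base checkpoint whose tokenizer defines no chat template ("my-base-model" is a placeholder).
model = AutoModelForCausalLM.from_pretrained("my-base-model")
tokenizer = AutoTokenizer.from_pretrained("my-base-model")

# Copy the chat template and added tokens from a tokenizer that already ships one,
# resizing the embeddings to cover any newly added tokens.
model, tokenizer = clone_chat_template(model, tokenizer, "Qwen/Qwen3-0.6B")
```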

Comment on lines 119 to 124
-    len(tokenizer), pad_to_multiple_of=resize_to_multiple_of if resize_to_multiple_of is not None else None
+    new_num_tokens=tokenizer.vocab_size + len(tokenizer.added_tokens_encoder.keys()),
+    pad_to_multiple_of=resize_to_multiple_of if resize_to_multiple_of is not None else None,
Member Author

fix the new embedding size
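
For context, a rough reconstruction of what the corrected hunk amounts to inside setup_chat_format (a sketch, not the verbatim file; model, tokenizer, and resize_to_multiple_of are the function's arguments):

```python
# Reconstruction for context; the real code lives in trl/models/utils.py.
# Resize the embedding matrix so it covers the base vocabulary plus every added token,
# optionally padding the new size to a multiple (e.g. 64) for hardware efficiency.
model.resize_token_embeddings(
    new_num_tokens=tokenizer.vocab_size + len(tokenizer.added_tokens_encoder.keys()),
    pad_to_multiple_of=resize_to_multiple_of if resize_to_multiple_of is not None else None,
)
```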

@qgallouedec qgallouedec marked this pull request as ready for review May 31, 2025 19:02
@qgallouedec qgallouedec changed the title fix setup chat format Fix setup_chat_format and add setup_chat_template May 31, 2025
@qgallouedec qgallouedec changed the title Fix setup_chat_format and add setup_chat_template 💬 Fix setup_chat_format and add setup_chat_template May 31, 2025
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

Contributor

@Copilot Copilot AI left a comment

Pull Request Overview

This PR fixes the embedding resize logic in setup_chat_format and introduces a new helper, setup_chat_template, which copies a chat template from an existing pretrained tokenizer.

  • Replaces calls to setup_chat_format with setup_chat_template in the SFT script.
  • Updates setup_chat_format to use explicit new_num_tokens and adds the setup_chat_template function.
  • Adjusts exports, tests, and documentation to reference and cover setup_chat_template.

Reviewed Changes

Copilot reviewed 7 out of 7 changed files in this pull request and generated 2 comments.

File | Description
trl/scripts/sft.py | Swap setup_chat_format for setup_chat_template (hardcoded source) and note passing the source as an arg
trl/models/utils.py | Fix resize args in setup_chat_format, import AutoTokenizer, and add setup_chat_template
trl/models/__init__.py | Export setup_chat_template
trl/__init__.py | Export setup_chat_template
tests/test_dataset_formatting.py | Import and test setup_chat_template; update resize modulus in old test
docs/source/sft_trainer.md | Change docs to use setup_chat_template and update references
docs/source/model_utils.md | Add autodoc entry for setup_chat_template
Comments suppressed due to low confidence (2)

docs/source/sft_trainer.md:63

  • Update the section heading to reflect setup_chat_template (e.g. "Add Special Tokens for Chat Template") to match the examples below.
### Add Special Tokens for Chat Format

trl/models/utils.py:82

  • [nitpick] Since this function is slated for deprecation, consider emitting a FutureWarning to notify users to migrate to setup_chat_template.
def setup_chat_format(
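
A minimal sketch of what that suggestion could look like (not part of this PR; shown with the helper's final name, clone_chat_template):

```python
import warnings

def setup_chat_format(model, tokenizer, format="chatml", resize_to_multiple_of=None):
    # Sketch of the suggested deprecation notice only; the rest of the function is elided.
    warnings.warn(
        "setup_chat_format is deprecated; use clone_chat_template instead.",
        FutureWarning,
    )
    ...
```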

@qgallouedec qgallouedec requested a review from shirinyamani May 31, 2025 21:46
@@ -132,6 +137,69 @@ def setup_chat_format(
return model, tokenizer


def setup_chat_template(
Collaborator

Would clone_chat_template be a more suitable name?

Member Author

sounds good

@qgallouedec qgallouedec changed the title 💬 Fix setup_chat_format and add setup_chat_template 💬 Fix setup_chat_format and add clone_chat_template Jun 7, 2025
@qgallouedec qgallouedec requested a review from edbeeching June 7, 2025 01:09
Member

@lewtun lewtun left a comment

Thanks for the improvements to chat template formatting! For context, setup_chat_format() was designed to let users apply a default chat template (ChatML) to base / pretrained models, where a chat template is usually not predefined (Qwen models are the main exception).

The intention was to support a few popular chat templates which could then (potentially) be set via the model config.

If I understand correctly, this PR is about scenarios where one would like to post-train a model that already has an existing chat template? In that case, I'm not sure I understand why we need to add special tokens, etc.

@lewtun
Member

lewtun commented Jun 11, 2025

> Also, if you look at the implementations, you'll notice that unlike setup_chat_format, clone_chat_template doesn't handle BOS or PAD tokens. Correct me if I'm wrong, but I don't believe that's necessary when setting up a chat template.

The reason we did this in setup_chat_format() is that some base models like Llama / Qwen don't always define a BOS or PAD token, and these need to be set explicitly if you want to run SFT.
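
As an aside for readers, a common fallback when a base tokenizer ships without a PAD token (illustration only, not this PR's code; the model name is a placeholder):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("my-base-model")  # placeholder name

# Many SFT setups simply reuse the EOS token as PAD when none is defined;
# setup_chat_format goes further and registers its own special tokens, then resizes the embeddings.
if tokenizer.pad_token is None:
    tokenizer.pad_token = tokenizer.eos_token
```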

@qgallouedec
Member Author

qgallouedec commented Jun 11, 2025

For popular chat templates, there's usually at least one model on the Hugging Face Hub that already implements that template, right?
The goal is to simplify SFT setup by letting users specify "Use the same chat template as [reference_model]" instead of manually configuring the template in trl. In any case, we currently support only one, very simple chat template, so it's not very satisfying.

@qgallouedec qgallouedec merged commit 4126803 into main Jun 15, 2025
11 checks passed
@qgallouedec qgallouedec deleted the fix-setup-chat-format branch June 15, 2025 13:59
@@ -104,7 +104,8 @@ def main(script_args, training_args, model_args):

# Set default chat template if needed
if tokenizer.chat_template is None:
-model, tokenizer = setup_chat_format(model, tokenizer, format="chatml")
+# TODO: source should be passed as an argument
+model, tokenizer = clone_chat_template(model, tokenizer, "Qwen/Qwen3-0.6B")
Member

We have found it useful internally to expose a chat_template arg in SFTConfig which allows one to define a custom template or copy-paste one from an existing model. Perhaps we could expose this along with a chat_template_clone arg (or something similar), now that I better understand what your intent was in this PR?
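
For concreteness, a rough sketch of what that suggestion might look like; both chat_template and chat_template_clone are hypothetical argument names here and do not exist in SFTConfig as of this PR:

```python
from trl import SFTConfig

training_args = SFTConfig(
    output_dir="sft-output",
    # Hypothetical arguments sketching the suggestion above (not real SFTConfig fields yet):
    # chat_template="{% for message in messages %}...{% endfor %}",  # paste a custom template
    # chat_template_clone="Qwen/Qwen3-0.6B",                         # or clone one from an existing model
)
```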

Member Author

sounds good!
