[`chat`] generate parameterization powered by `GenerationConfig` and UX-related changes #38047

gante · 2025-05-09T14:59:58Z

What does this PR do?

The main goal of this PR is to enable user-friendly generate parameterization. This also facilitates performance-related customization, which will be the focus of a follow-up PR.

After the deprecation cycle, new users typing `transformers chat -h` will be redirected to a (new) docs intro section on generation arguments, instead of seeing a wall of CLI arguments. For transformers power users, chat is now usable and can be parameterized without prior knowledge of the CLI. These changes were inspired by the idea that CLIs should be a conversation and that they shouldn't drown users in information.

More specifically, with this PR:

- We can accept almost any `generate` flag as a positional argument, present and future, as opposed to being limited to a set of hardcoded flags.
- We can pass a `generation_config.json`, so power users can specify complex `generate` arguments that may be difficult to express in a CLI.
- User chat commands are clearly distinguished from potential chat entries: they now start with `!`.
- `!status`, a new command, can be used to print state-related information, such as the current `generate` flags.
- `!set` can now be used to set arbitrary `generate` flags.
- `!reset` was removed: it provided minimal benefit (relaunching the CLI with the previous command is equivalent) but required us to maintain and pass the input state around.
- Help is now printed if there is a typo in a user command (e.g. `!stats` -> not a valid command -> prints error and help).
- (Non-chat specific) There is a new intro section to `generate` args in the docs, allowing a soft landing into the parameterization of the text generation universe.
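The command handling described in the list above can be sketched as follows. This is an illustrative sketch only, not the code from this PR: `handle_user_input` and `HELP_TEXT` are hypothetical names, only the `!status` and `!set` commands from the description are modeled, and `!set` values are kept as plain strings for brevity.

```python
# Hypothetical help text; the real CLI's help output is not reproduced here.
HELP_TEXT = "Commands: !status (show generate flags), !set key=value [...] (update generate flags)"


def handle_user_input(text, generate_flags):
    """Classify one line of input.

    Returns ("chat", text) for a normal chat entry, or ("command", message)
    for input starting with "!". `generate_flags` is updated in place by !set.
    """
    # Anything that does not start with "!" is a regular chat message.
    if not text.startswith("!"):
        return "chat", text

    parts = text[1:].split()
    command, args = (parts[0], parts[1:]) if parts else ("", [])

    if command == "status":
        return "command", f"Current generate flags: {generate_flags}"

    if command == "set":
        for arg in args:
            key, sep, value = arg.partition("=")
            if not sep or not key:
                return "command", f"Expected key=value, got '{arg}'.\n{HELP_TEXT}"
            generate_flags[key] = value  # assumption: no type conversion in this sketch
        return "command", f"Updated generate flags: {generate_flags}"

    # Typos such as "!stats" fall through to an error followed by the help text.
    return "command", f"'!{command}' is not a valid command.\n{HELP_TEXT}"
```

Because commands are recognized purely by the `!` prefix, any other input is treated as a chat message, which is the distinction the description aims for.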

Example usage:

transformers chat Qwen/Qwen2.5-0.5B-Instruct do_sample=False max_new_tokens=10
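The `key=value` arguments in the example above could be turned into typed keyword arguments roughly as follows. This is a minimal sketch, not the parsing code from this PR: `parse_generate_flags` is a hypothetical helper, and the type-inference rules (JSON literals plus Python-style `True`/`False`/`None`) are assumptions.

```python
import json


def parse_generate_flags(args):
    """Convert ["do_sample=False", "max_new_tokens=10"] into a dict of typed values."""
    flags = {}
    for arg in args:
        key, sep, raw = arg.partition("=")
        if not sep or not key:
            raise ValueError(f"Expected key=value, got '{arg}'")
        try:
            # JSON covers ints, floats, lists, and lowercase true/false/null.
            value = json.loads(raw)
        except json.JSONDecodeError:
            lowered = raw.lower()
            if lowered in ("true", "false"):
                # Accept Python-style booleans such as "False".
                value = lowered == "true"
            elif lowered == "none":
                value = None
            else:
                # Anything else is kept as a plain string.
                value = raw
        flags[key] = value
    return flags


if __name__ == "__main__":
    print(parse_generate_flags(["do_sample=False", "max_new_tokens=10"]))
```

The resulting dictionary is the kind of input that `GenerationConfig(**flags)` accepts in transformers; validation of unknown or conflicting flags is left out of this sketch.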

github-actions · 2025-05-09T15:00:12Z

Hi 👋, thank you for opening this pull request! The pull request is converted to draft by default. The CI will be paused while the PR is in draft mode. When it is ready for review, please click the Ready for review button (at the bottom of the PR page). This will assign reviewers and trigger CI.

HuggingFaceDocBuilderDev · 2025-05-09T15:20:07Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

LysandreJik

Very cool! Played around with it locally, really like it 👌

LysandreJik · 2025-05-12T11:05:15Z

README.md

 > [!TIP]
 > You can also chat with a model directly from the command line.
 > ```shell
-> transformers chat --model_name_or_path Qwen/Qwen2.5-0.5B-Instruct
 > ```


gante added 3 commits May 8, 2025 16:39

accept arbitrary kwargs

631a933

move user commands to a separate fn

6ca3886

work with generation config files

b915abd

github-actions bot marked this pull request as draft May 9, 2025 15:00

gante changed the title ~~[chat] generate parameterization powered by generation config and UX-friendly changes~~ [chat] generate parameterization powered by generation config and UX-related changes May 9, 2025

gante added 2 commits May 9, 2025 15:00

rm cmmt

3e17965

docs

ad817f9

gante marked this pull request as ready for review May 9, 2025 15:06

gante changed the title ~~[chat] generate parameterization powered by generation config and UX-related changes~~ [chat] generate parameterization powered by GenerationConfig and UX-related changes May 9, 2025

gante added 6 commits May 9, 2025 15:40

base generate flag doc section

9f02f85

nits

0454c14

nits

43b03da

nits

3be4ff6

no <br>

65d1657

better basic args description

020ae01

gante requested a review from Rocketknight1 May 9, 2025 16:29

Merge branch 'main' into chat_generation_config

561e7f8

gante requested a review from LysandreJik May 12, 2025 08:29

LysandreJik approved these changes May 12, 2025

gante merged commit 8efe3a9 into huggingface:main May 12, 2025
10 checks passed

gante deleted the chat_generation_config branch May 12, 2025 13:04
