`tool-call`: allow `--chat-template chatml` w/ `--jinja`, default to chatml upon parsing issue, avoid double bos #11616

ochafik · 2025-02-03T10:42:34Z

Couple of fixes in common_chat_templates_from_model

allow --chat-template chatml when --jinja enabled:
- Cheap way to force generic tool call format + crude template onto models (doesn't bode well w/ gemma, which thinks of itself as a model, not an assistant; testing other models w/ it in slow tool call server tests).
catch any exceptions in jinja parsing (e.g. Eval bug: Release b4524 breaks serving of granite-code models #11500) and default to chatml
Incidentally, avoid double BOS issue w/ jinja (just pass empty bos/eos tokens to the template)

ochafik · 2025-02-03T17:39:27Z

Sorry somehow had forgotten half of the changes when I undrafted, should look better now.

ngxson · 2025-02-03T22:39:12Z

common/common.cpp

+        LOG_ERR("%s: failed to parse chat template: %s\n", __func__, e.what());
+        return {
+            has_explicit_template,
+            std::make_unique<minja::chat_template>(CHATML_TEMPLATE_SRC, token_bos, token_eos),


I think at some point we should no longer fallback to chatml. The fallback to chatml was a temporary solution when chat templates was not a common thing.

For example, in such case, we can return an error message like: Chat template is not supported, you must specify a custom template using --chat-template ... when user uses /chat/completions endpoint.

either way, it's surprising all the things we can have chatml do with a few "polyfills" (in minja)

…chatml upon parsing issue, avoid double bos (ggml-org#11616) * tool-call: allow `--jinja --chat-template chatml` * fix double bos issue (drop bos/eos tokens from jinja template) * add missing try catch around jinja parsing to default to chatml * Simplify default chatml logic

ochafik added 2 commits February 3, 2025 04:07

tool-call: allow --jinja --chat-template chatml

1e9acd2

Update test_tool_call.py

77ae97e

github-actions bot added examples python python script changes server labels Feb 3, 2025

ochafik changed the title ~~tool-call: allow --chat-template chatml when --jinja enabled~~ tool-call: allow --chat-template chatml w/ --jinja, default to chatml upon parsing issue, avoid double bos Feb 3, 2025

This was referenced Feb 3, 2025

server: fix tool-call of DeepSeek R1 Qwen, return reasoning_content (Command 7RB & DeepSeek R1) unless --reasoning-format none #11607

Merged

Eval bug: trivial grammar crashes (DeepSeek R1 Distill Llama 8B) #11591

Closed

Olivier Chafik added 2 commits February 3, 2025 13:58

fix typo

cf83623

fix double bos issue (drop bos/eos tokens from jinja template)

5d18d76

ochafik marked this pull request as ready for review February 3, 2025 14:01

ochafik requested a review from ngxson as a code owner February 3, 2025 14:01

Olivier Chafik added 2 commits February 3, 2025 14:01

fix bad merge

aa98e59

add missing try catch around jinja parsing to default to chatml

b2dd490

Simplify default chatml logic

d73448d

ngxson approved these changes Feb 3, 2025

View reviewed changes

ochafik merged commit cde3833 into ggml-org:master Feb 3, 2025
48 checks passed

ochafik mentioned this pull request Feb 16, 2025

tool-call: refactor common chat / tool-call api (+ tests / fixes) #11900

Merged

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

`tool-call`: allow `--chat-template chatml` w/ `--jinja`, default to chatml upon parsing issue, avoid double bos #11616

`tool-call`: allow `--chat-template chatml` w/ `--jinja`, default to chatml upon parsing issue, avoid double bos #11616

Uh oh!

ochafik commented Feb 3, 2025 •

edited

Loading

Uh oh!

ochafik commented Feb 3, 2025

Uh oh!

ngxson Feb 3, 2025

Uh oh!

ochafik Feb 3, 2025

Uh oh!

Uh oh!

Uh oh!

tool-call: allow --chat-template chatml w/ --jinja, default to chatml upon parsing issue, avoid double bos #11616

tool-call: allow --chat-template chatml w/ --jinja, default to chatml upon parsing issue, avoid double bos #11616

Uh oh!

Conversation

ochafik commented Feb 3, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

ochafik commented Feb 3, 2025

Uh oh!

ngxson Feb 3, 2025

Choose a reason for hiding this comment

Uh oh!

ochafik Feb 3, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

`tool-call`: allow `--chat-template chatml` w/ `--jinja`, default to chatml upon parsing issue, avoid double bos #11616

`tool-call`: allow `--chat-template chatml` w/ `--jinja`, default to chatml upon parsing issue, avoid double bos #11616

ochafik commented Feb 3, 2025 •

edited

Loading