✨ feat: Add support for Qwen3 model #286


Merged · 8 commits · Apr 29, 2025

Conversation

@johnmai-dev (Contributor) opened this pull request

@awni (Member) commented Apr 28, 2025

Looks great!!

Can you add a model configuration for Qwen3 1.7B:

Assume it will be here: mlx-community/Qwen3-1.7B-4bit

And can you update the default model in the LLM app to use that config (as opposed to the dated older qwen of a similar size)?
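
For illustration, such a registration might look roughly like this (a sketch assuming the repository's existing `ModelConfiguration` API; the prompt text is made up):

```swift
// A sketch, not the PR's actual code: registering the Qwen3 1.7B 4-bit
// model alongside the other presets. The defaultPrompt text is made up.
public static let qwen3_1_7b_4bit = ModelConfiguration(
    id: "mlx-community/Qwen3-1.7B-4bit",
    defaultPrompt: "Why is the sky blue?"
)
```

The LLM app's default model could then point at this preset instead of the older Qwen preset of a similar size.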

@johnmai-dev (Contributor, Author)

> Looks great!!
>
> Can you add a model configuration for Qwen3 1.7B:
>
> Assume it will be here: mlx-community/Qwen3-1.7B-4bit
>
> And can you update the default model in the LLM app to use that config (as opposed to the dated older qwen of a similar size)?

Received, sir.🫡🫡🫡

@johnmai-dev johnmai-dev marked this pull request as ready for review April 28, 2025 17:23
@awni (Member) commented Apr 28, 2025

Thanks for the changes @johnmai-dev. The model runs and produces OK output, but it looks like there may be something off with the chat template 🤔

[Screenshot 2025-04-28 at 4:31:30 PM]

@johnmai-dev (Contributor, Author)

> Thanks for the changes @johnmai-dev. The model runs and produces OK output, but it looks like there may be something off with the chat template 🤔 [screenshot]

Yes, it is indeed an issue with Jinja. I will only have time to fix it after work this evening. The error message is: `Runtime: Cannot call something that is not a function: got UndefinedValue`

@DePasqualeOrg (Contributor) commented Apr 29, 2025

I added some missing string methods here: johnmai-dev/Jinja#15

This seems to fix the error.

@awni (Member) commented Apr 29, 2025

Awesome, thanks @DePasqualeOrg. Let us know when it lands. We may need to make a PR on swift-transformers to bump the Jinja package version as well.

@DePasqualeOrg (Contributor) commented Apr 29, 2025

The update is available now. Because swift-transformers uses `upToNextMinor` for the dependencies, and this was a patch update, mlx-swift-examples should pick up the latest version when you update to the latest package versions in Xcode.
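
For reference, the version arithmetic works out like this (a sketch; the exact lower bound in swift-transformers' manifest is an assumption):

```swift
// Package.swift sketch; the lower bound shown here is an assumption.
// .upToNextMinor(from: "1.1.0") resolves any 1.1.x release, so the
// Jinja 1.1.2 patch is picked up without a manifest change.
dependencies: [
    .package(
        url: "https://github.com/johnmai-dev/Jinja",
        .upToNextMinor(from: "1.1.0")
    )
]
```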

@johnmai-dev (Contributor, Author)

> Awesome, thanks @DePasqualeOrg. Let us know when it lands. We may need to make a PR on swift-transformers to bump the Jinja package version as well.

Jinja version 1.1.2 has been released. Just update the package 😉

[screenshot]

@johnmai-dev (Contributor, Author)

Thinking mode:

[screenshot]

Non-thinking mode:

[Screenshot: LLMEval 2025-04-29 21:37:04]
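
For illustration, such a toggle might be forwarded to the template roughly like this (a sketch; the parameter names are assumptions, not necessarily this PR's exact API):

```swift
// Illustrative only; the UserInput parameter names are assumptions.
// Qwen3's chat template reads an `enable_thinking` flag, so a UI
// toggle can be forwarded as extra template context.
let input = UserInput(
    messages: [["role": "user", "content": "Why is the sky blue?"]],
    additionalContext: ["enable_thinking": false]  // non-thinking mode
)
```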

@awni (Member) commented Apr 29, 2025

Awesome! Tested locally and it works well now.

Regarding the thinking toggle: I think we should find a way to keep it, because it's fun to play with.

But there are a couple of issues to think through there:

1. It will always be there, even for non-thinking models, which doesn't really make sense.
2. In the past I've seen other models use different tokenizer flags for that. So how does one generalize that to other models?

@DePasqualeOrg (Contributor)

I don't know if it will be possible to reliably determine this based on the tokenizer config files. My app Local Chat adds a lot of metadata about the models, such as whether they use thinking tags, whether the response starts with the start thinking tag, etc. To work reliably, this should probably be part of the model preset configurations, and more options may need to be added over time as models become more diverse.

@johnmai-dev (Contributor, Author)

> Awesome! Tested locally and it works well now.
>
> Regarding the thinking toggle: I think we should find a way to keep it, because it's fun to play with.
>
> But there are a couple of issues to think through there:
>
> 1. It will always be there, even for non-thinking models, which doesn't really make sense.
> 2. In the past I've seen other models use different tokenizer flags for that. So how does one generalize that to other models?

Yes, I'm also debating whether to add a non-thinking mode.😂 But if we don't add it, we'll have to default to thinking mode.

*(Merge commit resolving a conflict in mlx-swift-examples.xcodeproj/project.xcworkspace/xcshareddata/swiftpm/Package.resolved)*
@awni (Member) commented Apr 29, 2025

> To work reliably, this should probably be part of the model preset configurations

Indeed, I was thinking something similar. Models may need optional additional config info, including which toggles they have (tools, thinking), and we can dynamically set that based on the config.

I'm ok keeping it the way it is for now. But with the intention of keeping an eye on it and refactoring / redesigning if it becomes more important.

@johnmai-dev (Contributor, Author)

> > To work reliably, this should probably be part of the model preset configurations
>
> Indeed, I was thinking something similar. Models may need optional additional config info, including which toggles they have (tools, thinking), and we can dynamically set that based on the config.
>
> I'm ok keeping it the way it is for now. But with the intention of keeping an eye on it and refactoring / redesigning if it becomes more important.

I had planned to expose `modelType` and only display the thinking toggle when `modelType` is Qwen3.

@johnmai-dev (Contributor, Author)

> I'm ok keeping it the way it is for now. But with the intention of keeping an eye on it and refactoring / redesigning if it becomes more important.

If we keep it, this PR is ready for review & merge. 🍻
@awni @davidkoski

@awni (Member) commented Apr 29, 2025

> I had planned to expose `modelType` and only display the thinking toggle when `modelType` is Qwen3.

Yes, something like that could also work. Though I think it may be better to read it as an optional field in the config rather than hard-code the model types that should have different features.

But let's do this type of stuff in a follow-on PR so we can prioritize merging this without the extra complication.

@awni (Member) left a review comment

LGTM!

@davidkoski (Collaborator)

Yeah, I think we can keep the toggle there in the example (just like we have the tool piece showing how to integrate that). If there is some metadata in the tokenizer config we can hook that up -- maybe there is something similar for tools?

@davidkoski (Collaborator)

Looks great, thank you for the contribution! @johnmai-dev and @DePasqualeOrg

@davidkoski merged commit a6cb969 into ml-explore:main on Apr 29, 2025 (1 check passed)

@davidkoski mentioned this pull request on Apr 29, 2025
@mzbac (Contributor) commented Apr 30, 2025

Thank you all for the awesome work; I can't wait to integrate it into my MLX Swift projects.

@DePasqualeOrg (Contributor)

> I had planned to expose `modelType` and only display the thinking toggle when `modelType` is Qwen3.

It might be easier to maintain if you use a field related to the functionality you want to control. For this type of additional functionality that is specific to some models, my app has a field that is a list of features, one of which could be an enum case `toggleThinking`.
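
For illustration, such a preset might look roughly like this (a sketch, not code from either app; all names are made up):

```swift
// An illustrative sketch: a feature list on the preset lets the UI
// show a toggle only when the model declares support for it.
enum ModelFeature: String, Codable {
    case toggleThinking
    case tools
}

struct ModelPreset: Codable {
    let id: String
    let features: [ModelFeature]
}

let qwen3 = ModelPreset(
    id: "mlx-community/Qwen3-1.7B-4bit",
    features: [.toggleThinking, .tools]
)
let showThinkingToggle = qwen3.features.contains(.toggleThinking)
```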
