Model replacement to Qwen3-32B #2189

Merged: k8s-ci-robot merged 4 commits into kubernetes-sigs:main from sats-23:issue2151 on Feb 20, 2026

sats-23 commented Jan 21, 2026

What type of PR is this?
/kind documentation
/kind cleanup

What this PR does / why we need it:
- Replaces the llama3-8b model with qwen3-32b across all references in the guides
- Propagates the changes to the manifests referenced in the guides
- For the fine-tuned example, replaces the adapters with https://huggingface.co/nicoboss/Qwen3-32B-Uncensored
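
A replacement like this is easy to leave incomplete, so a quick scan for leftover references can help before review. The following is only a minimal sketch, not part of this PR: the function name `find_stale_references`, the default search string, and the file suffixes are assumptions chosen for illustration.

```python
from pathlib import Path


def find_stale_references(root, needle="llama3-8b", suffixes=(".md", ".yaml", ".yml")):
    """Return (path, line_number, line) for every line that still mentions `needle`.

    Hypothetical helper: the default needle and suffixes are illustrative
    assumptions, not values taken from the repository.
    """
    hits = []
    for path in sorted(Path(root).rglob("*")):
        # Only inspect regular files with a guide or manifest extension.
        if not path.is_file() or path.suffix not in suffixes:
            continue
        text = path.read_text(encoding="utf-8", errors="replace")
        for number, line in enumerate(text.splitlines(), start=1):
            # Case-insensitive match so "Llama3-8B" is caught as well.
            if needle.lower() in line.lower():
                hits.append((str(path), number, line.strip()))
    return hits


if __name__ == "__main__":
    for path, number, line in find_stale_references("."):
        print(f"{path}:{number}: {line}")
```

Run from the repository root, this prints one line per remaining mention; an empty output suggests the guides and manifests no longer reference the old name under the assumed search string.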

Which issue(s) this PR fixes:

Issue #2151 (more PRs to follow)

Does this PR introduce a user-facing change?: Yes

Update manifests to use Qwen3-32B

k8s-ci-robot added the kind/documentation and kind/cleanup labels on Jan 21, 2026

netlify bot commented Jan 21, 2026

Deploy Preview for gateway-api-inference-extension ready!

Latest commit: 394d49c
Latest deploy log: https://app.netlify.com/projects/gateway-api-inference-extension/deploys/69981e885e6b0c0008c32230
Deploy Preview: https://deploy-preview-2189--gateway-api-inference-extension.netlify.app

k8s-ci-robot added the cncf-cla: yes and needs-ok-to-test labels on Jan 21, 2026

k8s-ci-robot commented:

Hi @sats-23. Thanks for your PR.

I'm waiting for a kubernetes-sigs member to verify that this patch is reasonable to test. If it is, they should reply with /ok-to-test on its own line. Until that is done, I will not automatically test new commits in this PR, but the usual testing commands by org members will still work. Regular contributors should join the org to skip this step.

Once the patch is verified, the new status will be reflected by the ok-to-test label.

I understand the commands that are listed here.

Instructions for interacting with me using PR comments are available here. If you have questions or suggestions related to my behavior, please file an issue against the kubernetes-sigs/prow repository.

k8s-ci-robot added the size/L label on Jan 21, 2026
sats-23 force-pushed the issue2151 branch 2 times, most recently from 40dfba9 to 82a6a88, on January 21, 2026 11:14
k8s-ci-robot requested a review from ahg-g on January 22, 2026 12:47

sats-23 commented Jan 22, 2026

/cc @ahg-g
/cc @nirrozenbaum

ahg-g commented Jan 22, 2026

/ok-to-test

k8s-ci-robot added the ok-to-test label and removed the needs-ok-to-test label on Jan 22, 2026

ahg-g left a comment

Thanks a lot!

Contributor:

needs updating as well

Contributor Author:

Done

Contributor:

Not done. Please revert the changes in this file and update only the model name here.

sats-23 force-pushed the issue2151 branch 2 times, most recently from ee2aad6 to 026bc39, on January 22, 2026 13:31
k8s-ci-robot added the size/XL label and removed the size/L label on Jan 22, 2026

sats-23 commented Jan 22, 2026

Because the tests are tightly coupled to the docs and manifests, I would prefer that all changes related to the model port go into this single PR. Converting the PR to a draft until all models are ported.

sats-23 marked this pull request as draft on January 22, 2026 13:49
k8s-ci-robot added the do-not-merge/work-in-progress label on Jan 22, 2026

sats-23 commented Jan 22, 2026

/test pull-gateway-api-inference-extension-test-unit-main

ahg-g commented Jan 23, 2026

it seems a unit test is failing

sats-23 commented Jan 23, 2026

> it seems a unit test is failing

Yep @ahg-g, I will look into it soon

sats-23 force-pushed the issue2151 branch 3 times, most recently from 034203f to fc42c33, on February 12, 2026 11:14
k8s-ci-robot removed the needs-rebase label on Feb 12, 2026

sats-23 commented Feb 12, 2026

@ahg-g All comments have been addressed, PTAL

k8s-ci-robot added the needs-rebase label on Feb 12, 2026

ahg-g left a comment

Contributor:

Not done. Please revert the changes in this file and update only the model name here.

sats-23 force-pushed the issue2151 branch 5 times, most recently from f50ce0a to b6ec9db, on February 19, 2026 06:19
k8s-ci-robot added and removed the needs-rebase label on Feb 19, 2026
Signed-off-by: Sathvik <Sathvik.S@ibm.com>
k8s-ci-robot removed the needs-rebase label on Feb 20, 2026
sats-23 requested a review from ahg-g on February 20, 2026 06:36
add back sglang engine type label
add back vllm engine type label
add back vllm engine type label

ahg-g commented Feb 20, 2026

Thanks a lot!

/lgtm
/approve

k8s-ci-robot added the lgtm label on Feb 20, 2026

k8s-ci-robot commented:

[APPROVALNOTIFIER] This PR is APPROVED

This pull-request has been approved by: ahg-g, sats-23

The full list of commands accepted by this bot can be found here.

The pull request process is described here

Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

k8s-ci-robot added the approved label on Feb 20, 2026
k8s-ci-robot merged commit 3aa604b into kubernetes-sigs:main on Feb 20, 2026
11 checks passed
kaushikmitr pushed a commit to tomatillo-and-multiverse/gateway-api-inference-extension-merged that referenced this pull request Feb 24, 2026
* Model replacement to Qwen3-32B

Signed-off-by: Sathvik <Sathvik.S@ibm.com>

* Update config/manifests/sglang/gpu-deployment.yaml

add back sglang engine type label

* Update config/manifests/vllm/cpu-deployment.yaml

add back vllm engine type label

* Update config/manifests/vllm/gpu-deployment.yaml

add back vllm engine type label

---------

Signed-off-by: Sathvik <Sathvik.S@ibm.com>
Co-authored-by: Abdullah Gharaibeh <40361897+ahg-g@users.noreply.github.com>
RyanRosario pushed a commit to RyanRosario/gateway-api-inference-extension that referenced this pull request Mar 9, 2026
Labels

- approved: Indicates a PR has been approved by an approver from all required OWNERS files.
- cncf-cla: yes: Indicates the PR's author has signed the CNCF CLA.
- kind/cleanup: Categorizes issue or PR as related to cleaning up code, process, or technical debt.
- kind/documentation: Categorizes issue or PR as related to documentation.
- lgtm: "Looks good to me", indicates that a PR is ready to be merged.
- ok-to-test: Indicates a non-member PR verified by an org member that is safe to test.
- size/L: Denotes a PR that changes 100-499 lines, ignoring generated files.

4 participants