feat: Allow overriding default LLM provider endpoints #11282
Conversation
I will add E2E tests soon...

model := "gpt-4"
//TODO: Verify path override is correctly applied in transformation template
Yep, a unit test at the plugin level would be good to add here. You can also test the output envoy config by adding a setup test similar to how this deepseek test is structured: https://github.com/kgateway-dev/kgateway/blob/main/internal/kgateway/setup/testdata/standard/ai-deepseek-prompt-guard.yaml
If you run the setup tests with the input config defined, it will write the output file for you. This can be a quick way to validate the path replacement is correct!
Ah! Nevermind I see you've already created internal/kgateway/setup/testdata/standard/ai-custom-url-out.yaml 🎉
Changed the title: PathOverride field for AI Backend custom API path → FullPathOverride field for custom LLM provider endpoints
Force-pushed 4d6e317 to 6f0f689
Force-pushed 6f0f689 to 5015b12
path := "/api/v1/chat/completions"
prefix := "Bearer"
header := "Authorization"
//TODO: Verify path override is correctly applied in transformation template
With the current tests, it seems there is no way to validate these new inputs.
We can confirm this at the setup_test level by checking the path is correctly replaced!
Force-pushed f0bdb92 to 8f6e334
Force-pushed 8f6e334 to b0c7c3c
Force-pushed fb4e7c0 to 1ac7470
api/v1alpha1/ai_backend.go (Outdated)
// Only supported for OpenAI and Anthropic compatible APIs for now.
FullPath *string `json:"path"`
Suggested change:
  // Only supported for OpenAI and Anthropic compatible APIs for now.
- FullPath *string `json:"path"`
+ FullPath *string `json:"fullPath"`
base_url=TEST_OPENAI_BASE_URL,
default_headers={"custom-header":"custom-prefix"} if overrideProvider else None,
api_key=None if overrideProvider else "passthrough" if passthrough else "FAKE",
base_url=TEST_OPENAI_BASE_URL if overrideProvider else TEST_OVERRIDE_BASE_URL,
I believe I made an error; it should be base_url = TEST_OVERRIDE_BASE_URL if overrideProvider else TEST_OPENAI_BASE_URL. 😳
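For reference, a minimal sketch of the corrected selection (the TEST_* values below are placeholders standing in for the constants defined in the e2e test module, not the real test configuration):

```python
# Placeholder values for the constants defined in the e2e test module.
TEST_OPENAI_BASE_URL = "http://ai-gateway.test/openai"
TEST_OVERRIDE_BASE_URL = "http://ai-gateway.test/override"

def select_base_url(overrideProvider: bool) -> str:
    # Corrected order: the override URL is only used when overrideProvider is set.
    return TEST_OVERRIDE_BASE_URL if overrideProvider else TEST_OPENAI_BASE_URL

assert select_base_url(True) == TEST_OVERRIDE_BASE_URL
assert select_base_url(False) == TEST_OPENAI_BASE_URL
```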
@npolshakova @andy-fong Additionally, should we also test Routing, Streaming, PromptGuard, etc. when TEST_OVERRIDE_PROVIDER is set? 🤔 Please let me know. Thank you!
None of the PromptGuard settings should change the path, so it's fine to skip adding a test there.
Gemini and VertexAI change the path depending on whether streaming is enabled or not, but OpenAI and other providers just use stream: true in the body. Since we're now always overriding the path if the user sets the override here, let's add a small streaming test with gemini to show that the override path is set correctly in that case.
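As a sketch of why Gemini needs its own streaming check (the paths follow the public OpenAI and Gemini APIs; the model name and paths are illustrative placeholders, not kgateway config):

```python
# OpenAI-compatible providers keep the same path and flag streaming in the body.
openai_streaming_request = {
    "path": "/v1/chat/completions",
    "body": {"model": "gpt-4", "stream": True, "messages": [{"role": "user", "content": "hi"}]},
}

# Gemini selects a different *path* for streaming, so a user-supplied path
# override needs to be exercised for both the streaming and non-streaming cases.
gemini_paths = {
    "non_streaming": "/v1beta/models/gemini-1.5-flash:generateContent",
    "streaming": "/v1beta/models/gemini-1.5-flash:streamGenerateContent",
}
```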
@zhengkezhou1 again, thank you for taking the time to contribute. That's a good question, and I understand why you used an env var to set an override base URL; however, the various TEST_*_BASE_URL values are meant to test the different providers across the various features. Adding a TEST_OVERRIDE_BASE_URL and following the test "pattern" leads to the question of whether you should also add tests for Routing, Streaming, PromptGuard, etc. You're thinking along the right lines, but it's also a red flag because it would basically double all the tests we need to write for every existing and future feature.
So, step back and think about it. This override applies to all providers, but it won't change the response from the LLM providers for the various features. So, instead of adding path override tests to all existing test suites, it might be better to test this as a separate feature once per provider. That way, you would just follow the existing test pattern, use TEST_OPENAI_BASE_URL for example, and add a new yaml file under testdata to turn on the path override feature for each provider. For the upstream, you don't need a mock LLM response; you can use something like http-echo to return the request headers and path as a JSON object, and the test then just verifies the request path is the one we expect.
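A rough sketch of that shape (the gateway URL, the echo response fields, and the expected override path are assumptions for illustration, not the existing test harness):

```python
import requests

# Hypothetical address of the gateway route whose provider has the path override set.
GATEWAY_URL = "http://localhost:8080/v1/chat/completions"

def test_path_override_is_applied():
    # Upstream is an http-echo style server that returns the request it received
    # (path and headers) as JSON instead of a real LLM completion.
    resp = requests.post(
        GATEWAY_URL,
        json={"model": "gpt-4", "messages": [{"role": "user", "content": "hi"}]},
    )
    echoed = resp.json()
    # The gateway should have rewritten the upstream path to the configured override.
    assert echoed["path"] == "/api/v1/chat/completions"
```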
Actually, I agree with Andy. Let's keep the current approach with TEST_OVERRIDE_BASE_URL for now, for just the routing e2e test; then, as a follow-up, let's clean up the tests and create a standalone test suite for the path override.
The follow-up test PR should have a separate test for each provider once (keep the original TEST_OPENAI_BASE_URL and create a new testdata file with values to use).
Force-pushed 1307393 to 359a03b
Force-pushed 359a03b to c8a4d90
Looks good! Let's just add the streaming test and this looks good to go!
Let's handle this in a follow up PR: #11282 (comment)

Description
Add PathOverride and AuthHeaderOverride fields to LLMProvider.
Fixes: hostOverride setting causes incorrect requests in AI Gateway #11225

Change Type
Changelog
Additional Notes