Add Multi model support to java-cfenv #299

cpage-pivotal · 2025-07-04T16:08:29Z

These changes fully support both the single-model and multi-model plans in the Tanzu Platform GenAI tile.

It uses the /v1/models endpoint for model discovery as described here:
https://techdocs.broadcom.com/us/en/vmware-tanzu/platform-services/genai-on-tanzu-platform-for-cloud-foundry/10-2/ai-cf/how-to-guides-discover-models-and-send-openai-requests-to-them.html

garethjevans · 2025-07-08T07:57:16Z

java-cfenv-boot/src/main/java/io/pivotal/cfenv/spring/boot/GenAIModelInfo.java

+ */
+public class GenAIModelInfo {
+
+    public enum Capability {


I personally wouldn't make this an enum. We're expecting the capabilities returned by the ai-server to increase over time. If this is an enum we need to ensure this is updated in lock-step with ai-server.

OK, I have made Capability a non-enum class.

garethjevans · 2025-07-08T11:04:06Z

java-cfenv-boot/src/main/java/io/pivotal/cfenv/spring/boot/GenAIModelSelector.java

+
+        return models.stream()
+                .filter(model -> model.hasCapability(requiredCapability))
+                .findFirst();


With this approach, its only possible to get the first model that matches

This is true. I originally experimented with allowing for other matching options (getting the last model that matches, preferring a certain model, etc), but in practice this doesn't work well. It's not intuitive how an end user configures that sort of preference, and for first/last model matching, the user can't reliably predict the ordering of models anyway.

I don't think it's best practice for a multi-model plan to publish multiple models with the same capability. If this does happen, and an end user needs to exert precise control over which model is selected, I think that needs to be done through code, not through an auto-config library like java-cfenv.

We're actually assuming that multiple models within an endpoint with the same capabilities will be used quite frequently, especially when using things like mcp sampling where we want to select a model based on labels rather than an exact model name.

We're also expecting these models to change over time and we want to allow apps to be able to handle this without restarting - hence the introduction of the GenaiLocator which can be used to dynamically be query the models available rather than statically wiring in a model that requires a restart to pick up any changes.

This is all good, but again, I don't think MCP Sampling or the selection of individual models is handled elegantly within an autoconfig startup library like java-cfenv.

I think java-cfenv is for developers who want an easy button default option on startup. Fine-grained model selection (and certainly runtime model selection) needs to be handled in other code.

Similarly, java-cfenv doesn't provide a mechanism for selecting a specific RabbitMQ server if multiple instances are bound on startup. That's outside the scope of java-cfenv.

The only other option would be to refuse to bind any models if multiple models with matching capabilities are found. I can implement that if you like.

Corby added 12 commits July 3, 2025 15:00

Adding support for multi-models

50554eb

Adding support for multi-models

deda97d

Adding support for multi-models

41f7cd5

Adding support for multi-models

6a0a2e6

Adding support for multi-models

c153807

Fix newlines per checkstyle

3ea71a4

Fix imports for checkstyle

b9d1162

Fix imports and newlines for checkstyle

77e9d3b

Fix import for checkstyle

1c628a4

Simplified constructors

3709c96

Make service names consistent with existing code

94c0ff6

Fix issue with binding multiple single-model GenAI service instances

9fd8fc1

garethjevans reviewed Jul 8, 2025

View reviewed changes

Defined Capability as a class rather than an enum

c1cc392

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add Multi model support to java-cfenv #299

Add Multi model support to java-cfenv #299

cpage-pivotal commented Jul 4, 2025 •

edited

Loading

Uh oh!

garethjevans Jul 8, 2025

Uh oh!

cpage-pivotal Jul 8, 2025

Uh oh!

garethjevans Jul 8, 2025

Uh oh!

cpage-pivotal Jul 8, 2025

Uh oh!

garethjevans Jul 8, 2025

Uh oh!

cpage-pivotal Jul 8, 2025 •

edited

Loading

Uh oh!

cpage-pivotal Jul 8, 2025

Uh oh!

Uh oh!

Add Multi model support to java-cfenv #299

Are you sure you want to change the base?

Add Multi model support to java-cfenv #299

Conversation

cpage-pivotal commented Jul 4, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

garethjevans Jul 8, 2025

Choose a reason for hiding this comment

Uh oh!

cpage-pivotal Jul 8, 2025

Choose a reason for hiding this comment

Uh oh!

garethjevans Jul 8, 2025

Choose a reason for hiding this comment

Uh oh!

cpage-pivotal Jul 8, 2025

Choose a reason for hiding this comment

Uh oh!

garethjevans Jul 8, 2025

Choose a reason for hiding this comment

Uh oh!

cpage-pivotal Jul 8, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

cpage-pivotal Jul 8, 2025

Choose a reason for hiding this comment

Uh oh!

Uh oh!

cpage-pivotal commented Jul 4, 2025 •

edited

Loading

cpage-pivotal Jul 8, 2025 •

edited

Loading