Skip to content
This repository was archived by the owner on Jun 5, 2025. It is now read-only.
This repository was archived by the owner on Jun 5, 2025. It is now read-only.

Implement pipelines for FIM #87

Closed
Closed
@aponcedeleonch

Description

@aponcedeleonch

We can call directly the endpoint in our gcloud instance from vscode with the following config

"tabAutocompleteModel": {
    "title": "Stacklok Hosted - Non Instruct",
    "provider": "openai",
    "model": "Qwen/Qwen2.5-Coder-14B",
    "apiKey": "$(token)",
    "apiBase": "https://inference.codegate.ai/v1"
  },
  "tabAutocompleteOptions": {
    "debounceDelay": 500
  }

Make the necessary code changes so that the request first passes through CodeGate and then goes to gcloud

Metadata

Metadata

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions