v0.1.4

Latest
@kerthcet released this 10 Jun 16:03
· 20 commits to main since this release
f159c34

What's Changed

🚀 Major Features:

Features:

  • feat: add preStop hook for llamacpp and tgi in the BackendRuntime by @cr7258 in #381
  • feat: support speculative decoding for llamacpp by @cr7258 in #402
  • Add global configmap by @kerthcet in #431
  • Add dispatcher & memoryStore & latencyAwarePlugin by @kerthcet in #440
  • feat: support runai streamer for vllm by @cr7258 in #423

🐛 Bugs:

  • feat: update sglang version to v0.4.5 to fix /health_generate endpoint 404 error by @cr7258 in #383
  • fix: remove trailing slashes from envoyproxy repository URLs in Chart.yaml by @OKevinoo in #407

♻️ Cleanups:

New Contributors

Full Changelog: v0.1.3...v0.1.4