Releases · google-ai-edge/LiteRT-LM · GitHub

03 Apr 01:51

advaitjain

v0.10.1 Latest

🔥 Gemma 4 support

Deploy Gemma 4 across a broad range of hardware with stellar performance (see the blog post).

👉 Try it on Linux, macOS, Windows (WSL), or Raspberry Pi with the LiteRT-LM CLI:

litert-lm run  \
   --from-huggingface-repo=litert-community/gemma-4-E2B-it-litert-lm \
   gemma-4-E2B-it.litertlm \
   --prompt="What is the capital of France?"
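
The same invocation can be scripted. The following is a minimal sketch, assuming the `litert-lm` executable is on `PATH` and that it writes the model response to standard output (neither is stated in these notes). The helper names `build_command` and `run_prompt` are hypothetical and are not part of LiteRT-LM; only the subcommand and flags shown above are used.

```python
import shlex
import shutil
import subprocess
from typing import List


def build_command(repo: str, model_file: str, prompt: str) -> List[str]:
    """Build the argv list for the `litert-lm run` command shown above."""
    return [
        "litert-lm",
        "run",
        f"--from-huggingface-repo={repo}",
        model_file,
        f"--prompt={prompt}",
    ]


def run_prompt(repo: str, model_file: str, prompt: str) -> str:
    """Run the CLI and return its stdout (assumption: the reply goes to stdout)."""
    if shutil.which("litert-lm") is None:
        raise FileNotFoundError("litert-lm was not found on PATH")
    result = subprocess.run(
        build_command(repo, model_file, prompt),
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return result.stdout


# Build (but do not execute) the command from the release notes.
command = build_command(
    "litert-community/gemma-4-E2B-it-litert-lm",
    "gemma-4-E2B-it.litertlm",
    "What is the capital of France?",
)
print(shlex.join(command))
```

Passing the arguments as a list avoids shell quoting problems with prompts that contain spaces or quotes.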

Release Notes

CLI Enhancements & Migration: Migrated the CLI from fire to click, adding features like --verbose, --version, improved help formatting, and enhanced terminal output styling (#1784, #1733, #1791, #1792).
Hugging Face Integration: Added support for importing models directly from Hugging Face and implemented auto-conversion for missing models during "run" commands (#1797, #1735).
Core Performance & Features: Introduced a LiteRT-based KV cache implementation, speculative decoding support, and improved context merging for conversation history (#1601, #1793, #1742).
Platform & Build Improvements: Refactored CMake for better Android/cross-compilation support, updated the Windows build with a CPU sampler workaround, and transitioned nightly releases to Ubuntu-22.04 (#1741, #1734, #1772).
API & Documentation: Expanded the Kotlin API for response channel configuration and launched new Python API resources, including a "Getting Started" guide and a Colab notebook (#1724, #1737, #1757).
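
For readers unfamiliar with speculative decoding, mentioned in the notes above, here is a toy sketch of the greedy variant of the general technique. It is not LiteRT-LM's implementation: the `target` and `draft` functions are made-up stand-ins for a large and a small model, and every name here is hypothetical. The point it illustrates is that a cheap draft model proposes several tokens, the target model checks them, and the result is identical to decoding with the target model alone.

```python
from typing import Callable, List

NextToken = Callable[[List[int]], int]


def target(seq: List[int]) -> int:
    """Toy stand-in for the large model: deterministic next-token rule."""
    return (sum(seq) * 31 + len(seq)) % 50


def draft(seq: List[int]) -> int:
    """Toy stand-in for the small model: agrees with target most of the time."""
    guess = target(seq)
    return (guess + 1) % 50 if len(seq) % 3 == 0 else guess


def greedy_decode(model: NextToken, prompt: List[int], n: int) -> List[int]:
    """Plain greedy decoding: one model call per generated token."""
    seq = list(prompt)
    for _ in range(n):
        seq.append(model(seq))
    return seq[len(prompt):]


def speculative_decode(
    target_model: NextToken,
    draft_model: NextToken,
    prompt: List[int],
    n: int,
    k: int = 4,
) -> List[int]:
    """Greedy speculative decoding: draft proposes up to k tokens, target verifies."""
    seq = list(prompt)
    goal = len(prompt) + n
    while len(seq) < goal:
        # 1. The draft model proposes a short continuation.
        proposal: List[int] = []
        for _ in range(min(k, goal - len(seq))):
            proposal.append(draft_model(seq + proposal))
        # 2. The target model checks each proposed position. A real system
        #    would score all positions in one batched pass; this loop is
        #    sequential only to keep the sketch short.
        for tok in proposal:
            expected = target_model(seq)
            seq.append(expected)  # accepted token, or the target's correction
            if expected != tok:
                break  # discard the rest of the proposal after a mismatch
    return seq[len(prompt):]


prompt_tokens = [1, 2, 3]
baseline = greedy_decode(target, prompt_tokens, 12)
speculated = speculative_decode(target, draft, prompt_tokens, 12)
print(baseline == speculated)
```

The speed-up in practice comes from the batched verification step, which this sketch does not model; it only demonstrates the accept-or-correct logic.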

27 Mar 20:44

ztenghui

v0.9.0

Android & iOS Update

Performance Optimizations: Significant improvements to app initialization speed and memory management.
Bug Fixes: General stability enhancements for a smoother user experience.

25 Mar 21:26

ztenghui

v0.9.0-rc Pre-release

Android / iOS release with many bug fixes and performance improvements.

13 Mar 00:18

ztenghui

v0.9.0-beta Pre-release

Beta release for v0.9.0.

06 Mar 01:13

whhone

v0.9.0-alpha03 Pre-release

Update to alpha03 for 0.9.0 release

12 Feb 23:58

ztenghui

v0.9.0-alpha02 Pre-release

Update to alpha02 for 0.9.0 release

30 Jan 06:38

ztenghui

v0.9.0-alpha Pre-release

Support multi-session and session-cloning for compiled model executor.

LiteRT-LM-PiperOrigin-RevId: 862916621

02 Dec 17:17

ztenghui

v0.8.1

Bugfix:

Fix the Windows compilation error when using GPU

25 Nov 18:06

ztenghui

v0.8.0

Desktop GPU support.
Simple CLI for Desktop: see the Quick Start section.
Multi-Modality support: Vision and audio input are supported when the model supports them.
Kotlin API for Android and JVM (Linux, macOS, Windows): see the LiteRT-LM Kotlin API.
Conversation API: see Conversation API.
Function calling support: see Tool Use.

24 Jun 18:53

v0.7.0

Added Qualcomm & MediaTek NPU support for the Gemma3 1B model!
Fixed Apple M4 build issue.
Other minor refactoring and cleanups.
