Kotlin-first llama.cpp integration for on-device and remote LLM inference.
Kotlin. LLMs. On your terms.
- ✅ Kotlin Multiplatform: shared code across Android, iOS, and desktop
- ✅ Offline inference via llama.cpp (compiled with Kotlin/Native bindings)
- ✅ Remote inference via optional HTTP client (e.g. llamatik-server)
- ✅ Embeddings support for vector search & retrieval
- ✅ Text generation (non-streaming and streaming)
- ✅ Context-aware generation (system + conversation history)
- ✅ Works with GGUF models (e.g. Mistral, Phi, LLaMA)
- ✅ Lightweight and dependency-free runtime
- 🧠 On-device chatbots
- 📚 Local RAG systems
- 🛰️ Hybrid AI apps with fallback to remote LLMs
- 🎮 Game AI, assistants, and dialogue generators
Llamatik provides three core modules:
- llamatik-core: Native C++ llama.cpp integration via Kotlin/Native
- llamatik-client: Lightweight HTTP client to connect to remote llama.cpp-compatible backends
- llamatik-backend: Lightweight llama.cpp HTTP server
All backed by a shared Kotlin API so you can switch between local and remote seamlessly.
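Because local and remote inference share the same Kotlin surface, an app can hide the choice behind a small facade. The sketch below is illustrative only: `LlamaBridge.generate()` is the real on-device call, while `RemoteLlamaClient` is a hypothetical placeholder for whatever llamatik-client actually exposes.

```kotlin
// Illustrative sketch only: a tiny app-level facade over local vs. remote inference.
// LlamaBridge.generate() is Llamatik's real local API; RemoteLlamaClient below is a
// hypothetical placeholder, not llamatik-client's actual interface.
interface TextGenerator {
    fun generate(prompt: String): String
}

// Hypothetical remote client; swap in the real llamatik-client type here.
class RemoteLlamaClient(private val baseUrl: String) {
    fun generate(prompt: String): String = TODO("POST the prompt to $baseUrl")
}

class LocalGenerator : TextGenerator {
    override fun generate(prompt: String): String =
        LlamaBridge.generate(prompt) // on-device llama.cpp
}

class RemoteGenerator(private val client: RemoteLlamaClient) : TextGenerator {
    override fun generate(prompt: String): String = client.generate(prompt)
}

// Callers depend only on TextGenerator, so switching local/remote is a one-line change.
fun makeGenerator(offline: Boolean): TextGenerator =
    if (offline) LocalGenerator()
    else RemoteGenerator(RemoteLlamaClient("https://your-backend.example"))
```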
- iOS Deployment Target 16.6
Llamatik is published on Maven Central.
- Add to your `settings.gradle.kts`:

```kotlin
dependencyResolutionManagement {
repositories {
google()
mavenCentral()
}
}
```

- Add to your `build.gradle.kts`:

```kotlin
commonMain.dependencies {
implementation("com.llamatik:library:0.8.1")
}
```

The public Kotlin API is defined in `LlamaBridge` (an `expect object` with platform-specific `actual` implementations).

```kotlin
@Suppress("EXPECT_ACTUAL_CLASSIFIERS_ARE_IN_BETA_WARNING")
expect object LlamaBridge {
// Utilities
@Composable
fun getModelPath(modelFileName: String): String // copy asset/bundle model to app files dir and return absolute path
fun shutdown() // free native resources
// Embeddings
fun initModel(modelPath: String): Boolean // load embeddings model
fun embed(input: String): FloatArray // return embedding vector
// Text generation (non-streaming)
fun initGenerateModel(modelPath: String): Boolean // load generation model
fun generate(prompt: String): String
fun generateWithContext(
systemPrompt: String,
contextBlock: String,
userPrompt: String
): String
// Text generation (streaming)
fun generateStream(prompt: String, callback: GenStream)
fun generateStreamWithContext(
systemPrompt: String,
contextBlock: String,
userPrompt: String,
callback: GenStream
)
// Convenience streaming overload (callbacks)
fun generateStreamWithContext(
system: String,
context: String,
user: String,
onDelta: (String) -> Unit,
onDone: () -> Unit,
onError: (String) -> Unit
)
}
interface GenStream {
fun onDelta(text: String)
fun onComplete()
fun onError(message: String)
}
```

```kotlin
// 1) Resolve model paths (place GGUF in androidMain/assets)
val embPath = LlamaBridge.getModelPath("mistral-embed.Q4_0.gguf")
val genPath = LlamaBridge.getModelPath("phi-2.Q4_0.gguf")
// 2) Load models
LlamaBridge.initModel(embPath)
LlamaBridge.initGenerateModel(genPath)
// 3a) Embeddings
val vec: FloatArray = LlamaBridge.embed("Kotlin ❤️ llama.cpp")
// 3b) Non-streamed generation
val reply: String = LlamaBridge.generate("Write a haiku about Kotlin.")
// 3c) Streaming generation with callbacks
LlamaBridge.generateStreamWithContext(
system = "You are a concise assistant.",
context = "Project: Llamatik readme refresh.",
user = "List 3 key features.",
onDelta = { delta -> /* append to UI */ },
onDone = { /* enable send */ },
onError = { err -> /* show error */ }
)
```

- Call `shutdown()` on app teardown to release native resources.
- `getModelPath()` is `@Composable` to allow platform-specific asset access where needed.
- Use GGUF models compatible with your build of llama.cpp (quantization, context size, etc.).
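As a worked example of the embeddings API, here is a minimal retrieval sketch on top of `LlamaBridge.embed()`. The cosine-similarity scoring and brute-force ranking are illustrative choices rather than anything Llamatik prescribes; in a real app you would precompute and cache the document embeddings.

```kotlin
import kotlin.math.sqrt

// Cosine similarity between two embedding vectors.
fun cosine(a: FloatArray, b: FloatArray): Float {
    var dot = 0f
    var normA = 0f
    var normB = 0f
    for (i in a.indices) {
        dot += a[i] * b[i]
        normA += a[i] * a[i]
        normB += b[i] * b[i]
    }
    return dot / (sqrt(normA) * sqrt(normB))
}

// Rank a small document set against a query using LlamaBridge.embed().
// Brute force for clarity; cache document embeddings in practice.
fun topMatches(query: String, docs: List<String>, k: Int = 3): List<String> {
    val queryVec = LlamaBridge.embed(query)
    return docs
        .map { doc -> doc to cosine(queryVec, LlamaBridge.embed(doc)) }
        .sortedByDescending { (_, score) -> score }
        .take(k)
        .map { (doc, _) -> doc }
}
```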
Please go to the Backend README.md for more information.
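For a rough idea of what talking to the backend over HTTP can look like, here is a minimal JVM-only sketch. It assumes the backend exposes llama.cpp's standard `POST /completion` endpoint (JSON body with `prompt` and `n_predict`); check the Backend README for the routes and payloads llamatik-backend actually serves.

```kotlin
import java.net.URI
import java.net.http.HttpClient
import java.net.http.HttpRequest
import java.net.http.HttpResponse

// Assumption: the backend speaks llama.cpp's /completion protocol. Adjust to match
// the real llamatik-backend API as documented in its README.
fun completeRemotely(baseUrl: String, prompt: String): String {
    val escaped = prompt.replace("\\", "\\\\").replace("\"", "\\\"") // naive JSON escaping
    val json = """{"prompt": "$escaped", "n_predict": 128}"""
    val request = HttpRequest.newBuilder()
        .uri(URI.create("$baseUrl/completion"))
        .header("Content-Type", "application/json")
        .POST(HttpRequest.BodyPublishers.ofString(json))
        .build()
    val response = HttpClient.newHttpClient()
        .send(request, HttpResponse.BodyHandlers.ofString())
    return response.body() // raw JSON; parse the "content" field with your JSON library of choice
}
```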
If you want to try Llamatik, you can download the app from the App Store or Google Play Store.
The following is a list of some public apps that use Llamatik, published on the Google Play Store and App Store.
Want to add your app? Found an app that no longer works or no longer uses Llamatik? Please submit a pull request on GitHub to update this page!
Llamatik is 100% open-source and actively developed.
Contributions, bug reports, and feature suggestions are welcome!
Built with ❤️ for the Kotlin community.
