lookahead-prompt : add example #4226
Comments
@apoorvumang FYI
I just implemented this for tabby in https://github.com/TabbyML/tabby/pull/916/files - it's a slightly more complicated implementation (since tabby runs on continuous batching), but it should be something that can be used as a reference.
I'd love to give this a try, first time contributing.
Somewhat related: https://arxiv.org/abs/2312.11462. It seems someone looked at lookup (n-gram) decoding and speculative decoding and asked themselves: "Why not both?" I'm still reading through the paper.
Add an example implementing the "Prompt Lookup Decoding" technique:
https://github.com/apoorvumang/prompt-lookup-decoding
This should be a great exercise for people looking to become familiar with llama.cpp's KV cache management and batched decoding API. Looking for contributions. The following examples can be used as starting points (a rough sketch of the n-gram lookup step follows the list):
- speculative
- lookahead
- batched
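
For anyone picking this up, here is a minimal sketch of the core idea (not from the issue itself): prompt lookup decoding searches the tokens processed so far for an earlier occurrence of the last few tokens and, if it finds one, proposes the tokens that followed that occurrence as a draft continuation, which the target model then verifies in a single batched decode, as in the `speculative` example. The names below (`token_t`, `find_draft`, `ngram_size`, `num_draft`) are illustrative placeholders, not the actual llama.cpp API.

```cpp
#include <cstdint>
#include <cstdio>
#include <vector>

// stand-in for llama_token; all names here are illustrative only
using token_t = int32_t;

// Search the tokens seen so far for an earlier occurrence of the last
// `ngram_size` tokens. If one is found, return up to `num_draft` tokens that
// followed it as a draft continuation; otherwise return an empty vector and
// the caller falls back to ordinary one-token-at-a-time decoding.
static std::vector<token_t> find_draft(const std::vector<token_t> & tokens,
                                       size_t ngram_size, size_t num_draft) {
    if (tokens.size() <= ngram_size) {
        return {};
    }

    const size_t suffix = tokens.size() - ngram_size; // start of the n-gram to match

    // scan backwards so the most recent earlier occurrence wins
    for (size_t i = suffix; i-- > 0; ) {
        bool match = true;
        for (size_t j = 0; j < ngram_size; ++j) {
            if (tokens[i + j] != tokens[suffix + j]) {
                match = false;
                break;
            }
        }
        if (!match) {
            continue;
        }

        // collect the tokens that followed the matched n-gram as the draft
        std::vector<token_t> draft;
        for (size_t k = i + ngram_size; k < tokens.size() && draft.size() < num_draft; ++k) {
            draft.push_back(tokens[k]);
        }
        if (!draft.empty()) {
            return draft;
        }
    }

    return {}; // no usable match
}

int main() {
    // toy token stream with a repeated phrase: the last 3 tokens (7 8 9)
    // also appear at the start, so the tokens that followed them are drafted
    const std::vector<token_t> tokens = { 7, 8, 9, 10, 11, 3, 4, 7, 8, 9 };

    for (token_t t : find_draft(tokens, /*ngram_size=*/3, /*num_draft=*/4)) {
        printf("draft token: %d\n", t); // prints 10, 11, 3, 4
    }
    return 0;
}
```

In an actual example, the returned draft would be placed into a `llama_batch` after the current position, decoded in one call, and accepted token by token until the model's sampled token disagrees with the draft, mirroring the verification loop in the `speculative` example.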