add missing kv clear in llama_beam_search #6664

dwrensha · 2024-04-13T18:46:05Z

Adds a call to llama_kv_cache_seq_rm() in llama_beam_search_data::fill_next_beams_by_top_probabilities().

This seems to be necessary for the subsequent llama_decode() calls, which can act on the same sequence at the same position with different candidate tokens, to return reasonable results.
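To illustrate why stale cells could matter here, the following is a minimal sketch built on a toy cache, not the llama.cpp implementation. ToyCell, ToyKvCache, decode, seq_rm, count_at and eval_candidates are hypothetical names invented for this illustration. The toy seq_rm only loosely mirrors the [p0, p1) range semantics of llama_kv_cache_seq_rm(), with a negative p1 assumed to mean "no upper bound".

```cpp
#include <algorithm>
#include <cstdint>
#include <limits>
#include <vector>

// Toy stand-in for one KV cache cell; NOT the llama.cpp data structure.
struct ToyCell {
    int32_t pos;
    int32_t seq_id;
    int32_t token;
};

// Hypothetical toy cache used only to illustrate the stale-cell problem.
struct ToyKvCache {
    std::vector<ToyCell> cells;

    // Models decoding one token: a new cell is appended at the given position.
    void decode(int32_t seq_id, int32_t pos, int32_t token) {
        cells.push_back(ToyCell{pos, seq_id, token});
    }

    // Drops cells of seq_id whose position lies in [p0, p1).
    // Assumption: p1 < 0 means "no upper bound".
    void seq_rm(int32_t seq_id, int32_t p0, int32_t p1) {
        if (p1 < 0) {
            p1 = std::numeric_limits<int32_t>::max();
        }
        cells.erase(
            std::remove_if(cells.begin(), cells.end(),
                [seq_id, p0, p1](const ToyCell & c) {
                    return c.seq_id == seq_id && c.pos >= p0 && c.pos < p1;
                }),
            cells.end());
    }

    // Counts the cells stored for seq_id at exactly this position.
    int count_at(int32_t seq_id, int32_t pos) const {
        return (int) std::count_if(cells.begin(), cells.end(),
            [seq_id, pos](const ToyCell & c) {
                return c.seq_id == seq_id && c.pos == pos;
            });
    }
};

// Evaluates each candidate token at position n_past on sequence 0.
// Returns the largest number of cells ever observed at n_past;
// any value above 1 means a later candidate saw leftovers from an earlier one.
int eval_candidates(ToyKvCache & kv, int32_t n_past,
                    const std::vector<int32_t> & candidates, bool clear_first) {
    int worst = 0;
    for (int32_t tok : candidates) {
        if (clear_first) {
            kv.seq_rm(0, n_past, -1); // discard whatever an earlier candidate left behind
        }
        kv.decode(0, n_past, tok);
        worst = std::max(worst, kv.count_at(0, n_past));
    }
    return worst;
}
```

In this toy model, evaluating three candidates at position 2 without clearing leaves three cells at that position, whereas clearing before each evaluation leaves exactly one.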

Before this change:

$ bin/beam-search ~/models/mistral-7b-v0.1.Q8_0.gguf 3 "fibonacci: 1, 1, 2, 3, 5, 8, 13,"
...
 21, 341, 610, 987, 1597, 233

After this change:

$ bin/beam-search ~/models/mistral-7b-v0.1.Q8_0.gguf 3 "fibonacci: 1, 1, 2, 3, 5, 8, 13,"
...
 21, 34, 55, 89, 144, 233, 377, 610, 987, 1597, 2584, 4181, 6765, 10946, 17711, 28657, 46368, 75025, 121393, 196418, 317811, 514229, 832040, 1346269, 2178309, 3524578, 5702887, 9227465, 14930352, 24157817, 39088169, 63245986, 102334155, 165580141, 26791429

dwrensha · 2024-04-13T18:46:43Z

cc @mattpulver, who added beam search in #2267.

compilade

Nice catch!

Throwing an idea here for later: I think the beam search should eventually be adapted to use one seq_id per beam to facilitate the logic for sharing KV cells between beams, and to allow parallel beam search.
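One way to picture the one-seq_id-per-beam idea is the sketch below. It uses a toy cache in which a cell can belong to several sequences at once, so beams share their common prefix instead of duplicating it. SharedCell, SharedKvCache, decode, seq_cp and visible are hypothetical names for this illustration only; the toy seq_cp is loosely inspired by llama_kv_cache_seq_cp() and is not a description of how llama.cpp implements it.

```cpp
#include <cstdint>
#include <set>
#include <vector>

// Toy cell that may be shared by several sequences (beams).
struct SharedCell {
    int32_t pos;
    int32_t token;
    std::set<int32_t> seq_ids;
};

// Hypothetical toy cache; NOT the llama.cpp data structure.
struct SharedKvCache {
    std::vector<SharedCell> cells;

    // Models decoding one token for a single sequence.
    void decode(int32_t seq_id, int32_t pos, int32_t token) {
        SharedCell c{pos, token, {seq_id}};
        cells.push_back(c);
    }

    // Tags every cell of seq_src with seq_dst as well.
    // No cell is duplicated, so the prefix is shared between the two sequences.
    void seq_cp(int32_t seq_src, int32_t seq_dst) {
        for (SharedCell & c : cells) {
            if (c.seq_ids.count(seq_src) > 0) {
                c.seq_ids.insert(seq_dst);
            }
        }
    }

    // Tokens visible to seq_id, in the order they were decoded.
    std::vector<int32_t> visible(int32_t seq_id) const {
        std::vector<int32_t> out;
        for (const SharedCell & c : cells) {
            if (c.seq_ids.count(seq_id) > 0) {
                out.push_back(c.token);
            }
        }
        return out;
    }
};
```

With a two-token prompt on sequence 0 shared to sequences 1 and 2, each beam can then decode its own token at position 2 without disturbing the others, and the toy cache holds five cells in total rather than nine.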

dwrensha · 2024-04-14T18:56:07Z

> I think the beam search should eventually be adapted to use one seq_id per beam to facilitate the logic for sharing KV cells between beams, and to allow parallel beam search.

I agree!

add missing kv clear in llama_beam_search

e969ada

compilade approved these changes Apr 14, 2024

compilade merged commit 1958f7e into ggml-org:master Apr 14, 2024
53 of 59 checks passed

dwrensha deleted the fix-beam-search branch April 14, 2024 19:26

cebtenzzre mentioned this pull request Apr 15, 2024

Add beam search abetlen/llama-cpp-python#631

Open

tybalex pushed a commit to rubra-ai/tools.cpp that referenced this pull request Apr 17, 2024

llama : add missing kv clear in llama_beam_search (ggml-org#6664)

f4f85b3
