I was playing around with the server example and wanted to expose the probabilities of the generated tokens to the server client, to implement custom stopping sequences and criteria (similar to OpenAI's API here).
All it should take is creating a variant of "llama_sample_token" and "llama_sample_token_greedy" that returns an object containing the top X tokens and their probabilities.
The only related issue/PR/discussion I was able to find is this PR about logging probabilities. Please give me pointers if similar requests have been discussed elsewhere.
Since I'm relatively new to the repo, what is the protocol here? Should I just make a PR?
Yes. I realized the parameters passed into the sampling functions are references, so there is no need to change the core logic to get the probabilities: just reading directly from the candidate-list object after passing it through the sampling function is enough.
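For reference, a minimal sketch of that approach, assuming the llama.cpp C sampling API of this era (llama_get_logits, llama_token_data_array, llama_sample_softmax, llama_sample_token; exact signatures may differ between versions). This is not the server example's actual code path, just an illustration:

```cpp
#include "llama.h"
#include <vector>
#include <cstdio>

// Sample one token and print the top n_probs candidates with their
// probabilities, read back from the candidates array the sampler mutated.
// Assumes ctx has already evaluated a prompt so llama_get_logits is valid.
static llama_token sample_with_probs(llama_context * ctx, int n_probs) {
    const int     n_vocab = llama_n_vocab(ctx);
    const float * logits  = llama_get_logits(ctx);

    // Build the candidate list from the raw logits of the last evaluated token.
    std::vector<llama_token_data> candidates;
    candidates.reserve(n_vocab);
    for (llama_token id = 0; id < n_vocab; ++id) {
        candidates.push_back({ id, logits[id], 0.0f });
    }
    llama_token_data_array candidates_p = { candidates.data(), candidates.size(), false };

    // llama_sample_softmax sorts the array by probability and fills the .p
    // fields in place, so the top candidates can simply be read afterwards.
    llama_sample_softmax(ctx, &candidates_p);
    const llama_token picked = llama_sample_token(ctx, &candidates_p);

    for (int i = 0; i < n_probs && i < (int) candidates_p.size; ++i) {
        printf("token %d  p=%.4f\n", candidates_p.data[i].id, candidates_p.data[i].p);
    }
    return picked;
}
```

So the existing sampling functions don't need to return anything extra; the per-token probabilities are already available in the candidates array after sampling, and the server example would only need to copy the top X entries into its response.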