
examples : switch retrieval to llama_encode #13685


Merged
merged 2 commits into master from cisc/retrieval-encode on May 21, 2025
Conversation

CISC
Collaborator

@CISC commented May 21, 2025

Also enable --no-warmup option for retrieval.

Warmup calls llama_decode; would it make sense to disable this for embedding models somehow?

@CISC requested a review from ggerganov May 21, 2025 13:17
Member

@ggerganov left a comment


> Warmup calls llama_decode; would it make sense to disable this for embedding models somehow?

There isn't a simple criterion for doing it. Is it causing issues?

@CISC
Collaborator Author

CISC commented May 21, 2025

> Warmup calls llama_decode; would it make sense to disable this for embedding models somehow?

> There isn't a simple criterion for doing it. Is it causing issues?

No, just the `decode: cannot decode batches with this context (use llama_encode() instead)` log message.
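
For context, a minimal sketch of what the decode-to-encode switch looks like on the API side. This is not the exact diff from this PR; the `embed_chunk` helper is made up for illustration, and it assumes a pooled-embedding context. `llama_batch_get_one`, `llama_encode`, and `llama_get_embeddings_seq` are existing llama.h calls.

```cpp
// Sketch only: embed one tokenized chunk with an embedding-only context.
// Going through llama_encode() avoids the
// "cannot decode batches with this context (use llama_encode() instead)"
// log that llama_decode() emits for such contexts.
#include "llama.h"

#include <vector>

// hypothetical helper, for illustration
static bool embed_chunk(llama_context * ctx, std::vector<llama_token> & tokens) {
    // single-sequence batch holding the chunk's tokens
    llama_batch batch = llama_batch_get_one(tokens.data(), (int32_t) tokens.size());

    // encode instead of decode for embedding models
    if (llama_encode(ctx, batch) != 0) {
        return false;
    }

    // pooled embedding for sequence 0 (assumes pooling is enabled on the context)
    const float * emb = llama_get_embeddings_seq(ctx, 0);
    return emb != nullptr;
}
```

The retrieval example batches several chunks per call rather than one at a time, but the decode-to-encode swap above is the gist of the first commit.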

@CISC merged commit 2aa777d into master May 21, 2025
46 checks passed
@CISC deleted the cisc/retrieval-encode branch May 21, 2025 14:57
infil00p pushed a commit to baseweight/llama.cpp that referenced this pull request May 22, 2025
* switch retrieval to llama_encode

* enable --no-warmup for retrieval