Skip to content

b5554

Compare
Choose a tag to compare
@github-actions github-actions released this 31 May 13:22
3600cc2
llama : use n_swa + n_ubatch cells for SWA cache (#13833)

* llama : use n_swa + n_ubatch cells for SWA cache

ggml-ci

* llama : add warning about multi-sqeuence SWA contexts