Skip to content

server : improve error reporting #13680

New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Merged
merged 1 commit into from
May 21, 2025
Merged

server : improve error reporting #13680

merged 1 commit into from
May 21, 2025

Conversation

ggerganov
Copy link
Member

On master even when the computation fails due to lack of system resources, we send an error to the client that states "Input prompt is too big ..." which is confusing.

We now send error messages that correspond better to the error codes of llama_decode().

@ggerganov ggerganov requested a review from ngxson as a code owner May 21, 2025 10:01
@ggerganov
Copy link
Member Author

cc @erusev

@ggerganov ggerganov merged commit 5fbfe38 into master May 21, 2025
46 checks passed
@ggerganov ggerganov deleted the gg/server-fix-error-msg branch May 21, 2025 16:47
infil00p pushed a commit to baseweight/llama.cpp that referenced this pull request May 22, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants