Added cached_tokens to the usage monitoring. #555

Christianm9000 · 2025-04-20T11:04:44Z

1.: Added cached tokens to the Usage class and "add" function.
2.: Extracted cached_tokens from the LLM response and appended it to the Usage object.
Note: This change is only made when pairing the Agents SDK with the responses API (not the chat-completions).

cached_tokens is now accessible via the llm_output.raw_responses[n].usage, where n is the response index.

rm-openai

@Christianm9000 - this looks great, but any reason you didn't add it to ChatCompletions (and also the litellm implementation)?

Added cached_tokens to the usage monitoring.

959d4fb

rm-openai reviewed Apr 21, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Added cached_tokens to the usage monitoring. #555

Added cached_tokens to the usage monitoring. #555

Christianm9000 commented Apr 20, 2025

rm-openai left a comment

Added cached_tokens to the usage monitoring. #555

Are you sure you want to change the base?

Added cached_tokens to the usage monitoring. #555

Conversation

Christianm9000 commented Apr 20, 2025

rm-openai left a comment

Choose a reason for hiding this comment