Support for OpenAI’s Flex processing #10695
jannikmaierhoefer started this conversation in Ideas
Replies: 1 comment
Hello! It's not only related to flex processing but to responses with webhooks in general.
Describe the feature or potential improvement
OpenAI’s Flex processing makes responses.create() return immediately and defers the real completion (or failure) to a later responses.wait() / responses.retrieve() call. Our current OpenAI integration only wraps create/parse, so the Langfuse observation ends before the Flex job actually finishes; no final output, usage, latency, or error state is captured.
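For concreteness, a minimal sketch of the call pattern described above, assuming the OpenAI Python SDK with background mode; the model name and prompt are placeholders, and the responses.wait() helper mentioned above is left out since plain retrieve() polling is enough to show the gap:

```python
import time
from openai import OpenAI  # or the langfuse.openai drop-in wrapper

client = OpenAI()

# create() returns almost immediately with status "queued"/"in_progress";
# today the wrapped call (and therefore the Langfuse observation) ends here.
resp = client.responses.create(
    model="o3",                          # placeholder model
    input="Summarize these deployment logs ...",
    service_tier="flex",                 # Flex processing tier
    background=True,                     # assumption: background mode, so the result arrives later
)

# The real output, usage, and errors only materialize on later polls,
# which the current integration does not wrap.
while resp.status in ("queued", "in_progress"):
    time.sleep(2)
    resp = client.responses.retrieve(resp.id)

print(resp.status)  # "completed" or "failed" -- invisible to Langfuse today
```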
Request: add Flex-aware instrumentation for responses.wait, responses.retrieve, and their async counterparts (or otherwise defer closing the observation while a Flex job is still "in_progress"). That would let Langfuse emit correct results and error states for non-streaming Flex calls.
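One possible shape for the deferred-close behavior, as a rough sketch rather than a concrete API proposal: the existing create() wrapper would park the still-open observation in a registry keyed by response ID, and a wrapped retrieve() would close it once the job leaves "queued"/"in_progress". The pending_observations registry and the observation's end() method are illustrative names, not existing Langfuse APIs:

```python
import functools
from typing import Any

from openai import OpenAI

# Illustrative registry: the existing create() wrapper would leave the
# still-open observation here instead of ending it, keyed by response ID.
pending_observations: dict[str, Any] = {}

def patch_retrieve(client: OpenAI) -> None:
    original = client.responses.retrieve

    @functools.wraps(original)
    def retrieve(response_id: str, **kwargs):
        resp = original(response_id, **kwargs)
        obs = pending_observations.get(response_id)
        # Only close the observation once the Flex job reaches a terminal state.
        if obs is not None and resp.status not in ("queued", "in_progress"):
            obs.end(  # hypothetical: map output/usage/error onto the observation
                output=resp.output_text if resp.status == "completed" else None,
            )
            pending_observations.pop(response_id, None)
        return resp

    client.responses.retrieve = retrieve
```

The async client would need the same treatment, and some timeout or TTL on the registry would keep abandoned Flex jobs from leaking open observations.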
Additional information
No response