OpenAI component doesn't setup telemetry correctly #5451

samsp-msft · 2024-08-26T21:45:50Z

I have been fiddling with the playground sample for the OpenAI component, trying to get it to emit telemetry. I think the following things are missing:

Updating Directory.Packages.props to use beta3 for the Azure.AI.OpenAI package

<PackageVersion Include="Azure.AI.OpenAI" Version="2.0.0-beta.3" />

I also had to add a NuGet source, as that version pulls in a more recent OpenAI SDK version than is currently available internally.

In the Aspire.Azure.AI.OpenAI component:
- Update the activity source names to include OpenAI.*

protected virtual string[] ActivitySourceNames => new[] { $"{typeof(TClient).Namespace}.*", "OpenAI.*" };

- Set the app context switch for the OpenAI instrumentation

        if (GetTracingEnabled(settings))
        {
            // Opt in to the experimental OpenAI instrumentation, then register the activity sources.
            AppContext.SetSwitch("OpenAI.Experimental.EnableOpenTelemetry", true);
            builder.Services.AddOpenTelemetry()
                .WithTracing(traceBuilder => traceBuilder.AddSource(ActivitySourceNames));
        }

When these are all in place, we get the metrics and telemetry that @lmolkova added to OpenAI recently.

I believe we should include the app context switch - it's there because the semantic conventions are not stable, but IMHO for Aspire we should be showing what's available rather than hiding it.
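To make the two-step enablement above concrete, here is a minimal sketch that uses only `System.Diagnostics`, with no OpenTelemetry packages. The switch name comes from this issue; the source name `Demo.OpenAI` and the `StartCall` helper are hypothetical, and the gate shown is an assumption about how such a check could work, not the OpenAI library's actual code.

```csharp
using System;
using System.Diagnostics;

// Switch name taken from this issue; "Demo.OpenAI" is a hypothetical source name.
const string SwitchName = "OpenAI.Experimental.EnableOpenTelemetry";
var source = new ActivitySource("Demo.OpenAI");

// Step 1: subscribe to the source. This is roughly what AddSource(...) arranges
// in OpenTelemetry; a bare ActivityListener stands in for it here.
using var listener = new ActivityListener
{
    ShouldListenTo = s => s.Name == "Demo.OpenAI",
    Sample = (ref ActivityCreationOptions<ActivityContext> _) => ActivitySamplingResult.AllData,
};
ActivitySource.AddActivityListener(listener);

// Hypothetical library-side gate: start an activity only when the switch is on.
Activity? StartCall() =>
    AppContext.TryGetSwitch(SwitchName, out bool on) && on
        ? source.StartActivity("chat")
        : null;

Activity? before = StartCall();          // subscribed, but switch not set yet

// Step 2: flip the app context switch.
AppContext.SetSwitch(SwitchName, true);
Activity? after = StartCall();           // subscribed and switch set

Console.WriteLine($"before switch: {before is not null}, after switch: {after is not null}");
after?.Dispose();
```

With a gate like this, subscribing to the source alone produces nothing, which is why a user who only calls `AddSource` could conclude that telemetry is not supported.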

eerhardt · 2024-08-26T21:49:46Z

@lmolkova @annelo-msft - are OpenAI and OpenTelemetry piped all the way through yet?

lmolkova · 2024-08-26T22:02:30Z

@eerhardt OpenAI is instrumented with OTel, but partially (not all APIs).

I was thinking about slightly changing how we let users opt into experimental semconv and wanted to get your and @samsp-msft's opinions.

So today OpenAI does what Azure SDKs do: app context switch + AddSource(name).

What if we did AddSource("Experimental.OpenAI") instead?

Pros:

- it's one-step enablement instead of two
- it's obvious and explicit that things are experimental

Cons:

- a code change is necessary when telemetry goes stable, but we can enable both sources right away: AddSource("Experimental.OpenAI*").AddSource("OpenAI*")

wdyt?
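A rough sketch of what subscribing to both source-name patterns could look like, again using only `System.Diagnostics`. The two source names are hypothetical and follow the naming proposed above; the prefix check in `ShouldListenTo` is a stand-in for the wildcard patterns passed to `AddSource`, not OpenTelemetry's own matching code.

```csharp
using System;
using System.Collections.Generic;
using System.Diagnostics;

// Hypothetical source names following the proposal: an experimental source today,
// and a stable one that might exist once the semantic conventions settle.
var experimental = new ActivitySource("Experimental.OpenAI.ChatClient");
var stable = new ActivitySource("OpenAI.ChatClient");

var seen = new List<string>();

// Stand-in for AddSource("Experimental.OpenAI*").AddSource("OpenAI*"):
// accept any source whose name starts with one of the two prefixes.
string[] prefixes = { "Experimental.OpenAI", "OpenAI" };
using var listener = new ActivityListener
{
    ShouldListenTo = s => Array.Exists(prefixes, p => s.Name.StartsWith(p, StringComparison.Ordinal)),
    Sample = (ref ActivityCreationOptions<ActivityContext> _) => ActivitySamplingResult.AllData,
    ActivityStopped = a => seen.Add(a.Source.Name),
};
ActivitySource.AddActivityListener(listener);

// Each activity is recorded when it stops, i.e. when the using block disposes it.
using (experimental.StartActivity("chat")) { }
using (stable.StartActivity("chat")) { }

Console.WriteLine(string.Join(", ", seen));
```

Dropping the `"Experimental.OpenAI"` prefix from the list would leave only the stable source subscribed, so in this model the experimental name itself acts as the explicit opt-in.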

eerhardt · 2024-08-27T15:50:54Z

> What if we did AddSource("Experimental.OpenAI") instead?

It would definitely make it easier for us. And still keeps the signal to users that this is experimental. So I'd be supportive of the change.

samsp-msft · 2024-08-27T18:11:10Z

I think it's one less thing that the user needs to configure. TBH, I think we may be being overly cautious - there are so many moving pieces involved in enabling telemetry that I wonder if we are just making it too hard to get working. Getting some telemetry that may change over time is probably better than not getting any because you didn't find the docs and therefore missed some semi-hidden configuration parameter.

For the above scenario, I only got it working because I know what @lmolkova had checked in, and followed the dependency graph to see what was actually being used. Most customers won't do that, and they'll just assume that telemetry isn't enabled.

I almost wonder if the component should just emit a log message once per process about telemetry being in preview, and not have any flags at all.

lmolkova · 2024-08-27T19:57:23Z

As someone who gets "why did this attribute on a span change 2 years ago" support tickets every once in a while, I want to have an explicit opt-in to experimental stuff. Also, OTel has some opinions on what telemetry stability is (i.e. if I had an alert on a stable thing, it should keep working; if it broke and I lost $10B because of it, that's a terrible issue).

It sounds like you both support my proposal - I'll send the PR to OpenAI to change it and remove app-context-switch. We can totally keep a bigger discussion open on what are the stability guarantees on telemetry.

samsp-msft · 2024-08-29T20:35:20Z

There is still work required in the Aspire component to pick up the new version and push the right strings for metrics and tracing.

Fixes #5451

sebastienros · 2024-10-15T22:00:40Z

Since this issue was created, we now have telemetry supported by default in both the Azure AI OpenAI and OpenAI client integrations. This still requires the app context switch (or the equivalent environment variable) to be set, but as soon as that's done the telemetry flows without any other intervention. It can then be disabled using settings, like any other OTel integration in Aspire. https://github.com/dotnet/aspire/blob/main/src/Components/Aspire.Azure.AI.OpenAI/README.md#experimental-telemetry

samsp-msft assigned sebastienros and eerhardt Aug 26, 2024

ghost added the area-integrations Issues pertaining to Aspire Integrations packages label Aug 26, 2024

lmolkova mentioned this issue Sep 7, 2024

Telemetry: update enablement (experimental source instead of app context switch) and docs improvements openai/openai-dotnet#187

Open

davidfowl added the bug label Sep 7, 2024

sebastienros added a commit that referenced this issue Sep 27, 2024

Add metrics and improve telemetry support in Azure.AI.OpenAI

cc4d785

Fixes #5451

sebastienros mentioned this issue Sep 27, 2024

Add metrics and improve telemetry support in Azure.AI.OpenAI #5999

Merged

16 tasks

joperezr added the untriaged label Oct 15, 2024

davidfowl removed the bug label Oct 16, 2024

eerhardt added this to the Backlog milestone Jan 14, 2025

eerhardt removed the untriaged label Jan 14, 2025

davidfowl added the ai label May 28, 2025
