feat(ai-proxy): Adjust the streaming response structure to keep it consistent with the openai #2391

daixijun · 2025-06-10T08:28:59Z

…nsistent with the openai

Ⅰ. Describe what this PR did

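Before this change, an empty usage was emitted as an empty JSON object, and the logprobs and finish_reason fields were omitted when empty.

After this change, the usage, finish_reason, and logprobs fields are emitted as null when empty, consistent with the official API.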
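The before/after behaviour described here follows from how Go's encoding/json treats pointer fields: a nil pointer without the omitempty tag option is encoded as null. The sketch below is illustrative only. The type names streamChunk, streamChoice, chunkUsage, and tokenLogprobs are hypothetical stand-ins, not the structs defined in model.go.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// Hypothetical, trimmed-down stand-ins for the plugin's response structs.
type chunkUsage struct {
	PromptTokens     int `json:"prompt_tokens"`
	CompletionTokens int `json:"completion_tokens"`
	TotalTokens      int `json:"total_tokens"`
}

type tokenLogprobs struct {
	Content []string `json:"content"`
}

type streamChoice struct {
	Index int `json:"index"`
	// Pointer fields without omitempty: a nil value is encoded as null
	// instead of being dropped from the output.
	FinishReason *string        `json:"finish_reason"`
	Logprobs     *tokenLogprobs `json:"logprobs"`
}

type streamChunk struct {
	Choices []streamChoice `json:"choices"`
	// A nil *chunkUsage is encoded as null; a non-pointer struct value
	// would always be encoded as an object.
	Usage *chunkUsage `json:"usage"`
}

// encodeChunk marshals a chunk and returns the JSON text ("" on error).
func encodeChunk(c streamChunk) string {
	b, err := json.Marshal(c)
	if err != nil {
		return ""
	}
	return string(b)
}

func main() {
	// An intermediate chunk with no finish reason, logprobs, or usage yet.
	fmt.Println(encodeChunk(streamChunk{Choices: []streamChoice{{Index: 0}}}))
	// prints {"choices":[{"index":0,"finish_reason":null,"logprobs":null}],"usage":null}
}
```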

Ⅱ. Does this pull request fix one issue?

Ⅲ. Why don't you add test cases (unit test/integration test)?

Ⅳ. Describe how to verify it

Ⅴ. Special notes for reviews

…nsistent with the openai Signed-off-by: Xijun Dai <[email protected]>

lingma-agents · 2025-06-10T08:29:50Z

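feat(ai-proxy): Adjust the streaming response structure to keep it consistent with the OpenAI API

Changed files

File path	Change description
plugins/wasm-go/extensions/ai-proxy/provider/bedrock.go	Changes the FinishReason and Usage fields to pointer types and uses util.Ptr to handle empty values
plugins/wasm-go/extensions/ai-proxy/provider/claude.go	Fixes pointer handling of FinishReason and unifies the pointer form of the Usage struct
plugins/wasm-go/extensions/ai-proxy/provider/dify.go	Forces FinishReason to finish_reason_stop as a pointer and fixes how the Usage struct is referenced
plugins/wasm-go/extensions/ai-proxy/provider/gemini.go	Makes the Usage field a pointer and handles pointer conversion of the candidate response's FinishReason
plugins/wasm-go/extensions/ai-proxy/provider/model.go	Changes FinishReason to *string and Usage to *usage, and adds a pointer-typed Logprobs field
plugins/wasm-go/extensions/ai-proxy/util/ptr.go	Implements a generic Ptr function for safely converting a value into a pointer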
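The body of util/ptr.go is not shown in this conversation. A generic pointer helper of the kind described in the last row usually looks like the sketch below, which assumes Go 1.18 or later for type parameters. The real implementation may differ.

```go
package main

import "fmt"

// Ptr returns a pointer to a copy of v. Because v is a function parameter,
// it is addressable even when the caller passes a constant or a call result.
// This is an assumed shape for the helper, not the code from util/ptr.go.
func Ptr[T any](v T) *T {
	return &v
}

func main() {
	reason := Ptr("stop") // *string pointing at "stop"
	count := Ptr(42)      // *int pointing at 42
	fmt.Println(*reason, *count)
}
```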

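💡 Tips

How to interact with lingma-agents

📜 Reply to this comment directly
Reply directly to this comment and lingma-agents will handle your request automatically. For example:

Add detailed comments to the current code.
Explain the LRU refactoring plan you mentioned in detail, with pseudocode.

📜 Mark a line of code
Create a comment at a specific location in a file and mention @lingma-agents. For example:

@Lingma-Agent Analyze the performance bottlenecks of this method and suggest optimizations.
@Lingma-Agent Generate optimized code for this method.

📜 Ask in a discussion
Mention @lingma-agents in any discussion to get help. For example:

@Lingma-Agent Summarize the discussion above and propose a solution.
@Lingma-Agent Generate optimized code based on the discussion.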

lingma-agents

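🔍 Code Review Report

🎯 Review Summary

Severity	Count	Description
🔴 Blocker	0	Blocking issue that must be fixed immediately, e.g. a system crash, a key feature being unavailable, or a severe security vulnerability.
🟠 Critical	5	Serious issue to fix with high priority, e.g. a core feature misbehaving or a performance bottleneck that affects user experience.
🟡 Major	1	Significant issue that should be fixed, e.g. a defect in a non-core feature or poor maintainability.
🟢 Minor	0	Minor issue to optimize at your discretion, e.g. inconsistent formatting or missing comments.

Total: 6 issues

📋 Review Details

💡 Per-file suggestions

The following file-level suggestions focus on readability, maintainability, and potential problems.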

🔹 plugins/wasm-go/extensions/ai-proxy/provider/dify.go (2 💬)

Fix duplicated FinishReason assignments in the switch statement (L119-L135)
Fix the pointer declaration of the responseUsage variable (L214)

🔹 plugins/wasm-go/extensions/ai-proxy/provider/hunyuan.go (1 💬)

Fix the pointer type of the Usage field (L390)

🔹 plugins/wasm-go/extensions/ai-proxy/provider/model.go (1 💬)

Ensure every FinishReason assignment uses the pointer-wrapping function correctly (L148)

🔹 plugins/wasm-go/extensions/ai-proxy/provider/spark.go (1 💬)

Fix the type mismatch of the Usage field (L153)

🔹 plugins/wasm-go/extensions/ai-proxy/provider/vertex.go (1 💬)

Fix the pointer type in the Usage field assignment (L247)

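🚀 Cross-file suggestions

The following is a combined analysis of architecture and design, focusing on cross-file interactions, system consistency, and room for optimization.

🔍 1. Pointer conversion of fields is not applied everywhere, risking interface inconsistency

Several files change the Usage field from a value type to a pointer type (usage → *usage), but the conversion is not handled uniformly. For example, bedrock.go constructs the pointer explicitly with openAIFormattedChunk.Usage = &usage{...}, and vertex.go follows the same pattern with Usage: &usage{...}, whereas dify.go directly references a nested pointer with Usage: &response.MetaData.Usage. Make sure every Usage assignment wraps the value in a pointer, and check for leftover call sites that were not converted. In addition, FinishReason is wrapped with the Ptr function in some places (e.g. FinishReason: util.Ptr(qwenChoice.FinishReason) in qwen.go), and vertex.go likewise uses FinishReason: util.Ptr(candidate.FinishReason); confirm that every assignment to these fields uses the same pointer-wrapping approach.

📌 Key code:

plugins/wasm-go/extensions/ai-proxy/provider/dify.go (146-146)

Usage:             &response.MetaData.Usage

plugins/wasm-go/extensions/ai-proxy/provider/qwen.go (420-420)

usageResponse.Usage = &usage{...}

⚠️ Potential risk: Interface inconsistency could cause serialization/deserialization failures, or runtime errors in downstream services due to type mismatches.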

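🔍 2. The Ptr helper is not used everywhere it applies, leading to redundant code

The new util.Ptr helper is meant to unify pointer wrapping, but some call sites still construct pointers by hand. For example, dify.go uses the & operator directly in responseUsage = &usage{...}, while other files such as bedrock.go already use the helper, as in FinishReason: util.Ptr(...). Consider replacing all explicit pointer construction with util.Ptr to reduce fragmentation and improve maintainability.

📌 Key code:

plugins/wasm-go/extensions/ai-proxy/provider/dify.go (214-214)

responseUsage = &usage{...}

⚠️ Potential risk: Inconsistent code style raises maintenance cost and may lead to bugs from confusing pointer and value types.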
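To make the two styles compared in this item concrete, the sketch below shows that, in Go, &T{...} on a composite literal and a generic pointer helper yield equivalent pointers, and that the helper is only strictly required for values that are not addressable, such as a function's return value. The usage fields, the Ptr body, and upstreamReason are assumptions for illustration, not code from the PR.

```go
package main

import "fmt"

// usage is a hypothetical stand-in for the struct in model.go.
type usage struct {
	PromptTokens     int
	CompletionTokens int
	TotalTokens      int
}

// Ptr is an assumed shape for util.Ptr: it returns a pointer to a copy of v.
func Ptr[T any](v T) *T { return &v }

// upstreamReason stands in for any call whose result is not addressable.
func upstreamReason() string { return "stop" }

// viaLiteral takes the address of a composite literal, which Go allows.
func viaLiteral() *usage {
	return &usage{PromptTokens: 3, CompletionTokens: 5, TotalTokens: 8}
}

// viaPtr wraps the same value with the helper instead.
func viaPtr() *usage {
	return Ptr(usage{PromptTokens: 3, CompletionTokens: 5, TotalTokens: 8})
}

func main() {
	fmt.Println(*viaLiteral() == *viaPtr()) // both point at equal values

	// &upstreamReason() does not compile, so a helper (or a temporary
	// variable) is needed to obtain a *string from a call result.
	reason := Ptr(upstreamReason())
	fmt.Println(*reason)
}
```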

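🔍 3. The usage struct is not declared as an exported type, which hinders reuse

The usage struct is defined in model.go as an unexported type (its name does not start with an uppercase letter), yet several provider files (such as bedrock.go and claude.go) instantiate it directly. This violates Go's encapsulation principles and may lead to type exposure and maintenance risk. The usage struct should either be exported (capitalized) or be instantiated through a factory method.

📌 Key code:

plugins/wasm-go/extensions/ai-proxy/provider/model.go (152-152)

type usage struct { ... }

⚠️ Potential risk: Direct access to an unexported struct from outside the package could allow unintended modification and break data encapsulation.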

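🔍 4. Usage calculation differs between streaming and complete responses

In streaming responses (e.g. convertChunkFromHunyuanToOpenAI in hunyuan.go), the Usage field is assigned independently on every chunk, whereas complete responses (e.g. buildChatCompletionResponse) use the total usage. The sum over chunks may therefore disagree with the value in the final response. The accumulation logic for per-chunk usage and the final settlement should be unified.

📌 Key code:

plugins/wasm-go/extensions/ai-proxy/provider/hunyuan.go (390-390)

Usage: &usage{ PromptTokens: hunyuanFormattedChunk.Usage.PromptTokens ... }

⚠️ Potential risk: Inaccurate usage statistics could cause billing errors or distorted resource-monitoring data.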

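🔍 5. FinishReason conversion lacks unified error handling

Each provider implements its own stopReason conversion function (e.g. stopReasonBedrock2OpenAI), and some call sites (e.g. candidate.FinishReason in gemini.go) assign the value directly without conversion. Consider centralizing the conversion logic in a shared function with a default fallback, so that undefined finish_reason values are never emitted.

📌 Key code:

plugins/wasm-go/extensions/ai-proxy/provider/gemini.go (527-527)

choice.FinishReason = util.Ptr(candidate.FinishReason)

⚠️ Potential risk: Inconsistent enum values could cause client-side parsing failures or incorrect state checks.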
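One possible shape for the shared conversion function suggested in this item is sketched below. The name normalizeFinishReason, the mapping entries, and the choice of "stop" as the fallback are all assumptions for illustration. The PR does not define such a function, and a real mapping would need to be checked against each provider's documented values.

```go
package main

import (
	"fmt"
	"strings"
)

// normalizeFinishReason is a hypothetical shared helper. It maps a
// provider-specific stop reason to an OpenAI-style finish_reason and
// returns nil for an empty input so the field serializes as null.
func normalizeFinishReason(raw string) *string {
	if raw == "" {
		return nil // no reason reported yet, e.g. on an intermediate chunk
	}
	var reason string
	switch strings.ToUpper(raw) {
	case "STOP":
		reason = "stop"
	case "MAX_TOKENS", "LENGTH":
		reason = "length"
	case "SAFETY", "CONTENT_FILTER":
		reason = "content_filter"
	default:
		// Assumed fallback: never forward an unknown upstream value.
		reason = "stop"
	}
	return &reason
}

// describe renders a possibly-nil reason for printing.
func describe(p *string) string {
	if p == nil {
		return "null"
	}
	return *p
}

func main() {
	for _, raw := range []string{"", "STOP", "MAX_TOKENS", "SOMETHING_NEW"} {
		fmt.Printf("%q -> %s\n", raw, describe(normalizeFinishReason(raw)))
	}
}
```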

codecov-commenter · 2025-06-10T08:33:16Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 46.03%. Comparing base (ef31e09) to head (072bde4).
Report is 556 commits behind head on main.

Additional details and impacted files

@@             Coverage Diff             @@
##             main    #2391       +/-   ##
===========================================
+ Coverage   35.91%   46.03%   +10.12%     
===========================================
  Files          69       81       +12     
  Lines       11576    13018     +1442     
===========================================
+ Hits         4157     5993     +1836     
+ Misses       7104     6679      -425     
- Partials      315      346       +31

see 78 files with indirect coverage changes

Signed-off-by: Xijun Dai <[email protected]>

johnlanni

LGTM

…nsistent with the openai (alibaba#2391) Signed-off-by: Xijun Dai <[email protected]>

feat(ai-proxy): Adjust the streaming response structure to keep it co…

7a0ee8e

…nsistent with the openai Signed-off-by: Xijun Dai <[email protected]>

daixijun requested review from CH3CHO, cr7258 and rinfx as code owners June 10, 2025 08:28

lingma-agents bot reviewed Jun 10, 2025

test(ai-proxy): Adjust the structure of streaming response

072bde4

Signed-off-by: Xijun Dai <[email protected]>

daixijun requested review from 2456868764 and Xunzhuo as code owners June 12, 2025 02:21

johnlanni approved these changes Jun 12, 2025

johnlanni merged commit fb8dd81 into alibaba:main Jun 12, 2025
12 checks passed

daixijun deleted the feat/chatcompleteions-streaming branch June 12, 2025 09:10

ink-hz pushed a commit to ink-hz/higress-ai-capability-auth that referenced this pull request Nov 5, 2025

feat(ai-proxy): Adjust the streaming response structure to keep it co…

3c8f7ec

…nsistent with the openai (alibaba#2391) Signed-off-by: Xijun Dai <[email protected]>
