Problem found: inference with the original transformers, vLLM offline, and vLLM serve gives very different results #76


1. transformers
2. vLLM local (offline) method: https://github.com/QwenLM/Qwen3-VL-Embedding/blob/main/examples/reranker_vllm.ipynb
3. vLLM OpenAI-compatible (serve) method: https://github.com/vllm-project/vllm/blob/main/examples/pooling/score/vision_rerank_api_online.py
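Taking method 1 as the reference, I found that method 2 differs from it only slightly, but the results from method 3 are very strange: the scores all cluster around 0.5. Does anyone have any suggestions? Both the queries and the documents are images.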
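
For anyone trying to reproduce this, a minimal sketch of how the three sets of scores could be compared is given below. The helper name `compare_scores`, the tolerance `tol`, and all score values are hypothetical placeholders for illustration, not numbers from this issue; the real scores would come from running the same image pairs through each method.

```python
from statistics import mean, pstdev


def compare_scores(reference, candidate, tol=0.05):
    """Summarize how far candidate scores deviate from reference scores.

    `reference` and `candidate` are lists of relevance scores for the same
    query/document pairs, in the same order.
    """
    if len(reference) != len(candidate):
        raise ValueError("score lists must have the same length")
    diffs = [abs(r - c) for r, c in zip(reference, candidate)]
    return {
        "max_abs_diff": max(diffs),
        "mean_abs_diff": mean(diffs),
        # A spread close to zero means the backend barely separates the pairs.
        "candidate_spread": pstdev(candidate),
        # True when every candidate score lies within `tol` of 0.5.
        "collapsed_near_half": all(abs(c - 0.5) <= tol for c in candidate),
    }


# Hypothetical placeholder scores, NOT taken from the issue.
transformers_scores = [0.91, 0.12, 0.78]  # method 1 (reference)
vllm_offline_scores = [0.90, 0.13, 0.77]  # method 2
vllm_serve_scores = [0.51, 0.49, 0.50]    # method 3

for name, scores in [("vllm offline", vllm_offline_scores),
                     ("vllm serve", vllm_serve_scores)]:
    print(name, compare_scores(transformers_scores, scores))
```

Reporting the per-pair differences together with the `collapsed_near_half` flag makes it easier to tell a small numerical drift (as with method 2) from a backend that returns almost the same score for every pair (as described for method 3).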

A similar problem has also been reported in the vLLM issue tracker:
https://github.com/vllm-project/vllm/issues/34502
