
[Feature Request]: Support LLMs, embeddings & reranking models served through vLLM #4316

@K-Mistele

Description

Is there an existing issue for the same feature request?

  • I have checked the existing issues.

Is your feature request related to a problem?

No response

Describe the feature you'd like

It would be great if RAGflow could support LLMs served through vLLM's OpenAI-compatible API server, as well as embedding models and re-ranking models served through vLLM.

Describe implementation you've considered

vLLM is the most popular open-source framework for high-performance, GPU-accelerated serving of LLMs and other AI models, and it would be great to have support for it.

vLLM has a well-documented, OpenAI-compatible API for LLM serving and embeddings, plus a custom API for serving reranking models.
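To illustrate, here is a minimal sketch of how an integration could call a vLLM server for all three model types. The base URL, model names, and the rerank endpoint path/payload are assumptions for illustration only; the exact rerank/score API shape should be checked against the vLLM documentation for the deployed version.

```python
# Sketch: calling a vLLM OpenAI-compatible server for chat, embeddings, and reranking.
# Assumptions: server at localhost:8000, example model names, "/rerank" endpoint path.
import requests
from openai import OpenAI

VLLM_BASE_URL = "http://localhost:8000/v1"  # assumed vLLM server address

# vLLM accepts any API key unless one is configured on the server.
client = OpenAI(base_url=VLLM_BASE_URL, api_key="EMPTY")

# Chat completion via the OpenAI-compatible endpoint.
chat = client.chat.completions.create(
    model="meta-llama/Llama-3.1-8B-Instruct",  # whichever model vLLM is serving
    messages=[{"role": "user", "content": "Summarize RAG in one sentence."}],
)
print(chat.choices[0].message.content)

# Embeddings via the OpenAI-compatible endpoint (requires an embedding model in vLLM).
emb = client.embeddings.create(
    model="BAAI/bge-m3",  # example embedding model
    input=["retrieval augmented generation"],
)
print(len(emb.data[0].embedding))

# Reranking: vLLM exposes a non-OpenAI endpoint for cross-encoder scoring.
# The path and payload below are assumptions to verify against the vLLM docs.
resp = requests.post(
    f"{VLLM_BASE_URL}/rerank",
    json={
        "model": "BAAI/bge-reranker-v2-m3",  # example reranker model
        "query": "what is RAG?",
        "documents": [
            "RAG combines retrieval with generation.",
            "Unrelated text.",
        ],
    },
    timeout=30,
)
print(resp.json())
```

Because the chat and embedding endpoints follow the OpenAI format, RAGflow could likely reuse its existing OpenAI-style provider with a configurable base URL; only the reranking endpoint would need a dedicated client.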

Documentation, adoption, use case

No response

Additional information

No response
