Adding files to deploy CodeTrans application on ROCm vLLM #1545
Merged: chensuyue merged 33 commits into opea-project:main from chyundunovDatamonsters:feature/Codetrans_vLLM on Mar 24, 2025
097dcaf Add a new section to change LLM model such as deepseek based on valid… (louie-tsai)
aa5a93e CodeTrans - add deploy app with vLLM ROCm
488aee3 CodeTrans - add deploy app with vLLM ROCm
3243f4b CodeTrans - add deploy app with vLLM ROCm
f5430e5 CodeTrans - add deploy app with vLLM ROCm
1106d0e CodeTrans - fix Dockerfile for vLLM
265d67b CodeTrans - fix Dockerfile for vLLM
bea90fa CodeTrans - fix files for deploy with ROCm vLLM
9090354 CodeTrans - fix files for deploy with ROCm vLLM
470d88b CodeTrans - fix files for deploy with ROCm vLLM
a3fe495 CodeTrans - fix files for deploy with ROCm vLLM
cc3ab59 CodeTrans - fix files for deploy with ROCm vLLM
26a443c CodeTrans - fix ROCm docker compose file
7a317ec CodeTrans - fix files for deploy on ROCm
19bc3c5 CodeTrans - fix files for deploy on ROCm
9feb285 Merge branch 'main' into feature/Codetrans_vLLM (chyundunovDatamonsters)
2d0602c [pre-commit.ci] auto fixes from pre-commit.com hooks (pre-commit-ci[bot])
061db6c CodeTrans - fix files for deploy on ROCm
3b0e35e [pre-commit.ci] auto fixes from pre-commit.com hooks (pre-commit-ci[bot])
0d92a20 CodeTrans - fix files for deploy on ROCm
acf55d9 Merge remote-tracking branch 'origin/feature/Codetrans_vLLM' into fea…
3db4ac4 CodeTrans - fix files for deploy on ROCm
204e3a7 CodeTrans - fix files for deploy on ROCm
f597a85 CodeTrans - fix files for deploy on ROCm
4f577ad [pre-commit.ci] auto fixes from pre-commit.com hooks (pre-commit-ci[bot])
92a8034 CodeTrans - fix files for deploy on ROCm
524421a Merge remote-tracking branch 'origin/feature/Codetrans_vLLM' into fea…
289cffd [pre-commit.ci] auto fixes from pre-commit.com hooks (pre-commit-ci[bot])
7415b59 CodeTrans - fix files for deploy on ROCm
cf68881 CodeTrans - fix files for deploy on ROCm
e011a9b CodeTrans - fix files for deploy on ROCm
4693488 [pre-commit.ci] auto fixes from pre-commit.com hooks (pre-commit-ci[bot])
b21ed34 Merge branch 'main' into feature/Codetrans_vLLM (chyundunovDatamonsters)
113 changes: 113 additions & 0 deletions
CodeTrans/docker_compose/amd/gpu/rocm/compose_vllm.yaml
@@ -0,0 +1,113 @@
# Copyright (C) 2024 Intel Corporation
# Copyright (c) 2024 Advanced Micro Devices, Inc.
# SPDX-License-Identifier: Apache-2.0

services:
  codetrans-vllm-service:
    image: ${REGISTRY:-opea}/vllm-rocm:${TAG:-latest}
    container_name: codetrans-vllm-service
    ports:
      - "${CODETRANS_VLLM_SERVICE_PORT:-8081}:8011"
    environment:
      no_proxy: ${no_proxy}
      http_proxy: ${http_proxy}
      https_proxy: ${https_proxy}
      HUGGINGFACEHUB_API_TOKEN: ${CODETRANS_HUGGINGFACEHUB_API_TOKEN}
      HF_TOKEN: ${CODETRANS_HUGGINGFACEHUB_API_TOKEN}
      HF_HUB_DISABLE_PROGRESS_BARS: 1
      HF_HUB_ENABLE_HF_TRANSFER: 0
      VLLM_USE_TRITON_FLASH_ATTN: 0
      PYTORCH_JIT: 0
    healthcheck:
      test: [ "CMD-SHELL", "curl -f http://${HOST_IP}:${CODETRANS_VLLM_SERVICE_PORT:-8081}/health || exit 1" ]
      interval: 10s
      timeout: 10s
      retries: 100
    volumes:
      - "./data:/data"
    shm_size: 20G
    devices:
      - /dev/kfd:/dev/kfd
      - /dev/dri/:/dev/dri/
    cap_add:
      - SYS_PTRACE
    group_add:
      - video
    security_opt:
      - seccomp:unconfined
      - apparmor=unconfined
    command: "--model ${CODETRANS_LLM_MODEL_ID} --swap-space 16 --disable-log-requests --dtype float16 --tensor-parallel-size 4 --host 0.0.0.0 --port 8011 --num-scheduler-steps 1 --distributed-executor-backend \"mp\""
    ipc: host
  codetrans-llm-server:
    image: ${REGISTRY:-opea}/llm-textgen:${TAG:-latest}
    container_name: codetrans-llm-server
    depends_on:
      codetrans-vllm-service:
        condition: service_healthy
    ports:
      - "${CODETRANS_LLM_SERVICE_PORT:-9000}:9000"
    ipc: host
    environment:
      no_proxy: ${no_proxy}
      http_proxy: ${http_proxy}
      https_proxy: ${https_proxy}
      LLM_ENDPOINT: ${CODETRANS_LLM_ENDPOINT}
      LLM_MODEL_ID: ${CODETRANS_LLM_MODEL_ID}
      HUGGINGFACEHUB_API_TOKEN: ${CODETRANS_HUGGINGFACEHUB_API_TOKEN}
      HF_TOKEN: ${CODETRANS_HUGGINGFACEHUB_API_TOKEN}
      LLM_COMPONENT_NAME: "OpeaTextGenService"
    restart: unless-stopped
  codetrans-backend-server:
    image: ${REGISTRY:-opea}/codetrans:${TAG:-latest}
    container_name: codetrans-backend-server
    depends_on:
      - codetrans-llm-server
    ports:
      - "${CODETRANS_BACKEND_SERVICE_PORT:-7777}:7777"
    environment:
      no_proxy: ${no_proxy}
      https_proxy: ${https_proxy}
      http_proxy: ${http_proxy}
      MEGA_SERVICE_HOST_IP: ${HOST_IP}
      LLM_SERVICE_HOST_IP: ${HOST_IP}
      LLM_SERVICE_PORT: ${CODETRANS_LLM_SERVICE_PORT}
    ipc: host
    restart: always
  codetrans-ui-server:
    image: ${REGISTRY:-opea}/codetrans-ui:${TAG:-latest}
    container_name: codetrans-ui-server
    depends_on:
      - codetrans-backend-server
    ports:
      - "${CODETRANS_FRONTEND_SERVICE_PORT:-5173}:5173"
    environment:
      no_proxy: ${no_proxy}
      https_proxy: ${https_proxy}
      http_proxy: ${http_proxy}
      BASE_URL: ${CODETRANS_BACKEND_SERVICE_URL}
      BASIC_URL: ${CODETRANS_BACKEND_SERVICE_URL}
    ipc: host
    restart: always
  codetrans-nginx-server:
    image: ${REGISTRY:-opea}/nginx:${TAG:-latest}
    container_name: codetrans-nginx-server
    depends_on:
      - codetrans-backend-server
      - codetrans-ui-server
    ports:
      - "${CODETRANS_NGINX_PORT:-80}:80"
    environment:
      - no_proxy=${no_proxy}
      - https_proxy=${https_proxy}
      - http_proxy=${http_proxy}
      - FRONTEND_SERVICE_IP=${CODETRANS_FRONTEND_SERVICE_IP}
      - FRONTEND_SERVICE_PORT=${CODETRANS_FRONTEND_SERVICE_PORT}
      - BACKEND_SERVICE_NAME=${CODETRANS_BACKEND_SERVICE_NAME}
      - BACKEND_SERVICE_IP=${CODETRANS_BACKEND_SERVICE_IP}
      - BACKEND_SERVICE_PORT=${CODETRANS_BACKEND_SERVICE_PORT}
    ipc: host
    restart: always

networks:
  default:
    driver: bridge
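The compose file above publishes the vLLM service with Compose's `${VAR:-default}` interpolation, which behaves like the same expansion in a POSIX shell. As a minimal sketch, assuming the variable names from the compose file and the 8081 default of the published port, the health URL can be rebuilt on the host to probe the service once the stack is up. The helper name `vllm_health_url` and the `localhost` fallback are assumptions for illustration and are not part of the PR.

```shell
# Hypothetical helper (not part of the PR): build the vLLM health URL from the
# same variables the compose file uses. Falls back to port 8081 when
# CODETRANS_VLLM_SERVICE_PORT is unset or empty, and to localhost (an
# assumption) when HOST_IP is unset or empty.
vllm_health_url() {
  local host="${HOST_IP:-localhost}"
  local port="${CODETRANS_VLLM_SERVICE_PORT:-8081}"
  printf 'http://%s:%s/health\n' "$host" "$port"
}
```

A call such as `curl -f "$(vllm_health_url)"` should then succeed once the model has finished loading, which is the same condition the `service_healthy` dependency of `codetrans-llm-server` waits for.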
@@ -0,0 +1,54 @@
#!/usr/bin/env bash

# Copyright (c) 2025 Advanced Micro Devices, Inc.
# SPDX-License-Identifier: Apache-2.0

### The IP address or domain name of the server on which the application is running
# If your server is behind a firewall or proxy, specify its external address,
# i.e. the one used to reach the server from the Internet, in the EXTERNAL_HOST_IP variable.
# If the server is used only on the internal network or has a direct external address,
# specify that address in both HOST_IP and EXTERNAL_HOST_IP.
export HOST_IP=''
export EXTERNAL_HOST_IP=''

### Model ID
export CODETRANS_LLM_MODEL_ID="Qwen/Qwen2.5-Coder-7B-Instruct"

### The port of the vLLM service. On this port, the vLLM service will accept connections
export CODETRANS_VLLM_SERVICE_PORT=18156

### The endpoint of the vLLM service to which the LLM service sends requests (formed from previously set variables)
export CODETRANS_LLM_ENDPOINT="http://${HOST_IP}:${CODETRANS_VLLM_SERVICE_PORT}"

### A token for accessing repositories with models
export CODETRANS_HUGGINGFACEHUB_API_TOKEN=${HUGGINGFACEHUB_API_TOKEN}

### The port of the LLM service. On this port, the LLM service will accept connections
export CODETRANS_LLM_SERVICE_PORT=18157

### The IP address or domain name of the server for CodeTrans MegaService
export CODETRANS_MEGA_SERVICE_HOST_IP=${HOST_IP}

### The IP address or domain name of the host on which the LLM service is running
export CODETRANS_LLM_SERVICE_HOST_IP=${HOST_IP}

### The IP address of the host on which the container with the frontend service is running
export CODETRANS_FRONTEND_SERVICE_IP=${HOST_IP}

### The port of the frontend service
export CODETRANS_FRONTEND_SERVICE_PORT=18155

### Name of the GenAI service used to route requests to the application
export CODETRANS_BACKEND_SERVICE_NAME=codetrans

### The IP address of the host on which the container with the backend service is running
export CODETRANS_BACKEND_SERVICE_IP=${HOST_IP}

### The port of the backend service
export CODETRANS_BACKEND_SERVICE_PORT=18154

### The port of the Nginx reverse proxy for the application
export CODETRANS_NGINX_PORT=18153

### Endpoint of the backend service
export CODETRANS_BACKEND_SERVICE_URL="http://${EXTERNAL_HOST_IP}:${CODETRANS_BACKEND_SERVICE_PORT}/v1/codetrans"
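The script above ships with `HOST_IP` and `EXTERNAL_HOST_IP` empty, so the URLs derived from them (for example `CODETRANS_LLM_ENDPOINT`) are malformed until both are filled in. A minimal sketch of a pre-flight check that could be run after sourcing the script follows; the function name `require_vars` is hypothetical and not part of the PR.

```shell
# Hypothetical pre-flight check (not part of the PR): for every variable name
# passed as an argument, report it on stderr if it is unset or empty.
# Returns 0 when all variables are set, 1 otherwise.
require_vars() {
  local name value missing=0
  for name in "$@"; do
    # Portable indirect lookup: read the variable whose name is stored in $name.
    eval "value=\${${name}:-}"
    if [ -z "$value" ]; then
      echo "missing: ${name}" >&2
      missing=1
    fi
  done
  return "$missing"
}
```

One possible use, assuming the environment script has already been sourced, is `require_vars HOST_IP EXTERNAL_HOST_IP HUGGINGFACEHUB_API_TOKEN && docker compose -f compose_vllm.yaml up -d`, so the stack is only started when the addresses and the token are present.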