Skip to content

Commit 5f52a10

Browse files
authored
update tgi version (#378)
* try with tgi-gaudi:2.0.0 Signed-off-by: chensuyue <[email protected]> * try with text-generation-inference:2.1.0 Signed-off-by: chensuyue <[email protected]> * add params for tgi-gaudi:2.0.0 Signed-off-by: chensuyue <[email protected]> --------- Signed-off-by: chensuyue <[email protected]>
1 parent 450efcc commit 5f52a10

File tree

3 files changed

+4
-4
lines changed

3 files changed

+4
-4
lines changed

ChatQnA/docker/gaudi/docker_compose.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -114,7 +114,7 @@ services:
114114
LANGCHAIN_PROJECT: "opea-reranking-service"
115115
restart: unless-stopped
116116
tgi-service:
117-
image: ghcr.io/huggingface/tgi-gaudi:1.2.1
117+
image: ghcr.io/huggingface/tgi-gaudi:2.0.0
118118
container_name: tgi-gaudi-server
119119
ports:
120120
- "8008:80"
@@ -133,7 +133,7 @@ services:
133133
cap_add:
134134
- SYS_NICE
135135
ipc: host
136-
command: --model-id ${LLM_MODEL_ID}
136+
command: --model-id ${LLM_MODEL_ID} --max-input-length 1024 --max-total-tokens 2048
137137
llm:
138138
image: opea/llm-tgi:latest
139139
container_name: llm-tgi-gaudi-server

ChatQnA/docker/xeon/docker_compose.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -110,7 +110,7 @@ services:
110110
LANGCHAIN_PROJECT: "opea-reranking-service"
111111
restart: unless-stopped
112112
tgi_service:
113-
image: ghcr.io/huggingface/text-generation-inference:1.4
113+
image: ghcr.io/huggingface/text-generation-inference:2.1.0
114114
container_name: tgi-service
115115
ports:
116116
- "9009:80"

ChatQnA/tests/test_chatqna_on_gaudi.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -25,7 +25,7 @@ function build_docker_images() {
2525
cd tei-gaudi/
2626
docker build --no-cache -f Dockerfile-hpu -t opea/tei-gaudi:latest .
2727

28-
docker pull ghcr.io/huggingface/tgi-gaudi:1.2.1
28+
docker pull ghcr.io/huggingface/tgi-gaudi:2.0.0
2929
docker pull ghcr.io/huggingface/text-embeddings-inference:cpu-1.2
3030

3131
cd $WORKPATH/docker

0 commit comments

Comments
 (0)