Skip to content

Commit bd6f76c

Browse files
authored
Update the tgi service images for gaudi (#451)
Update the tgi-gaudi image version to 2.0.5 Signed-off-by: zhlsunshine <[email protected]>
1 parent cdd3585 commit bd6f76c

File tree

8 files changed

+9
-9
lines changed

8 files changed

+9
-9
lines changed

helm-charts/chatqna/gaudi-values.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -17,7 +17,7 @@ tgi:
1717
accelDevice: "gaudi"
1818
image:
1919
repository: ghcr.io/huggingface/tgi-gaudi
20-
tag: "2.0.1"
20+
tag: "2.0.5"
2121
resources:
2222
limits:
2323
habana.ai/gaudi: 1

helm-charts/chatqna/guardrails-gaudi-values.yaml

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ tgi:
2727
accelDevice: "gaudi"
2828
image:
2929
repository: ghcr.io/huggingface/tgi-gaudi
30-
tag: "2.0.1"
30+
tag: "2.0.5"
3131
resources:
3232
limits:
3333
habana.ai/gaudi: 1
@@ -40,7 +40,7 @@ tgi-guardrails:
4040
LLM_MODEL_ID: "meta-llama/Meta-Llama-Guard-2-8B"
4141
image:
4242
repository: ghcr.io/huggingface/tgi-gaudi
43-
tag: "2.0.1"
43+
tag: "2.0.5"
4444
resources:
4545
limits:
4646
habana.ai/gaudi: 1

helm-charts/codegen/gaudi-values.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ tgi:
55
accelDevice: "gaudi"
66
image:
77
repository: ghcr.io/huggingface/tgi-gaudi
8-
tag: "2.0.1"
8+
tag: "2.0.5"
99
resources:
1010
limits:
1111
habana.ai/gaudi: 1

helm-charts/codetrans/gaudi-values.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ tgi:
55
accelDevice: "gaudi"
66
image:
77
repository: ghcr.io/huggingface/tgi-gaudi
8-
tag: "2.0.1"
8+
tag: "2.0.5"
99
resources:
1010
limits:
1111
habana.ai/gaudi: 1

helm-charts/common/tgi/gaudi-values.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,7 +9,7 @@ accelDevice: "gaudi"
99

1010
image:
1111
repository: ghcr.io/huggingface/tgi-gaudi
12-
tag: "2.0.1"
12+
tag: "2.0.5"
1313

1414
MAX_INPUT_LENGTH: "1024"
1515
MAX_TOTAL_TOKENS: "2048"

helm-charts/docsum/gaudi-values.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -5,7 +5,7 @@ tgi:
55
accelDevice: "gaudi"
66
image:
77
repository: ghcr.io/huggingface/tgi-gaudi
8-
tag: "2.0.1"
8+
tag: "2.0.5"
99
resources:
1010
limits:
1111
habana.ai/gaudi: 1

microservices-connector/config/manifests/tgi_gaudi.yaml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -88,7 +88,7 @@ spec:
8888
optional: true
8989
securityContext:
9090
{}
91-
image: "ghcr.io/huggingface/tgi-gaudi:2.0.1"
91+
image: "ghcr.io/huggingface/tgi-gaudi:2.0.5"
9292
imagePullPolicy: IfNotPresent
9393
volumeMounts:
9494
- mountPath: /data

microservices-connector/config/samples/ChatQnA/use_cases.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ Should you desire to use the Gaudi accelerator, two alternate images are used fo
2626
For Gaudi:
2727

2828
- tei-embedding-service: ghcr.io/huggingface/tei-gaudi:synapse_1.16
29-
- tgi-service: ghcr.io/huggingface/tgi-gaudi:2.0.1
29+
- tgi-service: ghcr.io/huggingface/tgi-gaudi:2.0.5
3030

3131
## Deploy ChatQnA pipeline
3232

0 commit comments

Comments
 (0)