Commit 665c46f

Update Kubernetes manifest files for deploying ChatQnA (#445)
Update Kubernetes manifest files for deploying ChatQnA without GMC. Signed-off-by: Lianhao Lu <[email protected]>
1 parent 6e797fa commit 665c46f

25 files changed, 2283 additions and 901 deletions

ChatQnA/README.md

Lines changed: 6 additions & 2 deletions
@@ -105,9 +105,13 @@ docker compose -f docker_compose.yaml up -d

 Refer to the [NVIDIA GPU Guide](./docker/gpu/README.md) for more instructions on building docker images from source.

-## Deploy ChatQnA into Kubernetes on Xeon & Gaudi
+## Deploy ChatQnA into Kubernetes on Xeon & Gaudi with GMC

-Refer to the [Kubernetes Guide](./kubernetes/manifests/README.md) for instructions on deploying ChatQnA into Kubernetes on Xeon & Gaudi.
+Refer to the [Kubernetes Guide](./kubernetes/README.md) for instructions on deploying ChatQnA into Kubernetes on Xeon & Gaudi with GMC.
+
+## Deploy ChatQnA into Kubernetes on Xeon & Gaudi without GMC
+
+Refer to the [Kubernetes Guide](./kubernetes/manifests/README.md) for instructions on deploying ChatQnA into Kubernetes on Xeon & Gaudi without GMC.

 ## Deploy ChatQnA into Kubernetes using Helm Chart

Lines changed: 41 additions & 0 deletions
@@ -0,0 +1,41 @@
+<h1 align="center" id="title">Deploy ChatQnA in Kubernetes Cluster</h1>
+
+> [NOTE]
+> The following values must be set before you can deploy:
+> HUGGINGFACEHUB_API_TOKEN
+
+> You can also customize the "MODEL_ID" if needed.
+
+> You need to make sure you have created the directory `/mnt/opea-models` to save the cached model on the node where the ChatQnA workload is running. Otherwise, you need to modify the `chatqna.yaml` file to change the `model-volume` to a directory that exists on the node.
+
+## Deploy On Xeon
+
+```
+cd GenAIExamples/ChatQnA/kubernetes/manifests/xeon
+export HUGGINGFACEHUB_API_TOKEN="YourOwnToken"
+sed -i "s/insert-your-huggingface-token-here/${HUGGINGFACEHUB_API_TOKEN}/g" chatqna.yaml
+kubectl apply -f chatqna.yaml
+```
+
+## Deploy On Gaudi
+
+```
+cd GenAIExamples/ChatQnA/kubernetes/manifests/gaudi
+export HUGGINGFACEHUB_API_TOKEN="YourOwnToken"
+sed -i "s/insert-your-huggingface-token-here/${HUGGINGFACEHUB_API_TOKEN}/g" chatqna.yaml
+kubectl apply -f chatqna.yaml
+```
+
+## Verify Services
+
+To verify the installation, run the command `kubectl get pod` to make sure all pods are running.
+
+Then run the command `kubectl port-forward svc/chatqna 8888:8888` to expose the ChatQnA service for access.
+
+Open another terminal and run the following command to verify the service is working:
+
+```console
+curl http://localhost:8888/v1/chatqna \
+-H 'Content-Type: application/json' \
+-d '{"messages": "What is the revenue of Nike in 2023?"}'
+```
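
The verification steps in the README above are manual and need two terminals. As a rough sketch, they could be combined into one script. This assumes the pods run in the current namespace and that the service is `chatqna` on port 8888, as the README states. The function names `chatqna_payload` and `verify_chatqna`, the 600-second default timeout, and the 3-second pause are illustrative choices, not part of this commit.

```shell
# Sketch only: helper names, timeout, and sleep duration are assumptions.

# Build the JSON request body from a plain question string.
# Backslashes and double quotes are escaped so the body stays valid JSON.
chatqna_payload() {
  escaped=$(printf '%s' "$1" | sed 's/\\/\\\\/g; s/"/\\"/g')
  printf '{"messages": "%s"}' "$escaped"
}

# Wait for all pods to become Ready, forward the service port in the
# background, send one test request, then stop the port-forward.
verify_chatqna() {
  timeout=${1:-600s}
  port=${2:-8888}

  # Blocks until every pod in the current namespace reports Ready.
  kubectl wait --for=condition=Ready pod --all --timeout="$timeout" || return 1

  kubectl port-forward svc/chatqna "$port:8888" >/dev/null 2>&1 &
  pf_pid=$!
  sleep 3  # crude pause so port-forward can bind before the request

  status=0
  curl --fail --silent --show-error "http://localhost:$port/v1/chatqna" \
    -H 'Content-Type: application/json' \
    -d "$(chatqna_payload 'What is the revenue of Nike in 2023?')" || status=$?

  kill "$pf_pid" 2>/dev/null || true
  return "$status"
}
```

After sourcing the file, `verify_chatqna` runs with the defaults, and `verify_chatqna 900s 18888` uses a longer timeout and a different local port. The function returns non-zero if the pods never become Ready or if the request fails, which makes it usable as a CI smoke test.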

ChatQnA/kubernetes/manifests/chaqna-xeon-backend-server.yaml

Lines changed: 0 additions & 45 deletions
This file was deleted.

ChatQnA/kubernetes/manifests/docsum_gaudi_llm.yaml

Lines changed: 0 additions & 74 deletions
This file was deleted.

ChatQnA/kubernetes/manifests/docsum_llm.yaml

Lines changed: 0 additions & 74 deletions
This file was deleted.

ChatQnA/kubernetes/manifests/embedding.yaml

Lines changed: 0 additions & 45 deletions
This file was deleted.
