-
Notifications
You must be signed in to change notification settings - Fork 201
Description
Priority
P3-Medium
OS type
Ubuntu
Hardware type
Xeon-GNR
Installation method
- Pull docker images from hub.docker.com
- Build docker images from source
- Other
Deploy method
- Docker
- Docker Compose
- Kubernetes Helm Charts
- Other
Running nodes
Single Node
What's the version?
NA
Description
Not all containers from CI tests are removed for subsequent CI tests e.g., tests/llms/test_llms_* . The impact is there might be container or port conflicts, OR some tests use containers from previous run.
I think the problem is the function stop_docker
may not down all the orphan containers when using the ${service_name}:
function stop_docker() {
cd $WORKPATH/comps/llms/deployment/docker_compose
docker compose -f compose_text-generation.yaml down ${service_name} --remove-orphans
}
Usually it is the include containers from compose_text-generation.yaml
that don't get taken down:
include:
- ../../../third_parties/tgi/deployment/docker_compose/compose.yaml
- ../../../third_parties/vllm/deployment/docker_compose/compose.yaml
- ../../../third_parties/ollama/deployment/docker_compose/compose.yaml
- ../../../third_parties/llamacpp/deployment/docker_compose/compose.yaml
I added docker ps -a to see this here example run. Please search logs 'Docker containers before stop_docker' and 'Docker containers after stop_docker' to see that containers are not taken down..
I did try leaving out the ${service_name}
, below and it does help when I try locally i.e.:
docker compose -f compose_text-generation.yaml down --remove-orphans
To fix, we may need to amend the stop_docker() in the tests? Happy to discuss or clarify further.
Reproduce steps
example run (please search logs 'Docker containers before stop_docker' and 'Docker containers after stop_docker'
Can add docker ps -a before and after stop_docker.
Raw log
Attachments
No response