[Bug] Orphaned Containers in GitHub Runners that Run Tests #1323

@edlee123

Priority

P3-Medium

OS type

Ubuntu

Hardware type

Xeon-GNR

Installation method

  • Pull docker images from hub.docker.com
  • Build docker images from source
  • Other

Deploy method

  • Docker
  • Docker Compose
  • Kubernetes Helm Charts
  • Other

Running nodes

Single Node

What's the version?

NA

Description

Not all containers started by CI tests (e.g., tests/llms/test_llms_*) are removed before subsequent CI tests run. As a result, later tests may hit container name or port conflicts, or may silently use containers left over from a previous run.

I think the problem is that stop_docker may not take down all the orphaned containers when ${service_name} is passed to docker compose down:

function stop_docker() {
    cd $WORKPATH/comps/llms/deployment/docker_compose
    docker compose -f compose_text-generation.yaml down ${service_name} --remove-orphans
}

Usually it is the containers pulled in through the include section of compose_text-generation.yaml that are not taken down:

include:
  - ../../../third_parties/tgi/deployment/docker_compose/compose.yaml
  - ../../../third_parties/vllm/deployment/docker_compose/compose.yaml
  - ../../../third_parties/ollama/deployment/docker_compose/compose.yaml
  - ../../../third_parties/llamacpp/deployment/docker_compose/compose.yaml

I added docker ps -a to show this in an example run. Search the logs for 'Docker containers before stop_docker' and 'Docker containers after stop_docker' to see that the containers are not taken down.

I tried leaving out ${service_name}, as below, and it does help when I run it locally:

docker compose -f compose_text-generation.yaml down --remove-orphans

To fix this, we may need to amend stop_docker() in the tests. Happy to discuss or clarify further.
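A minimal sketch of what the amended function could look like, assuming WORKPATH is set by the test script as in the original. The only behavioral change is dropping ${service_name} from the down command; the quoting and the early return on a failed cd are defensive additions of mine, not something taken from the existing tests. My understanding is that passing a service name limits down to that service, which would explain why containers from the included compose files survive, but I have not confirmed this against every Compose version.

```shell
# Sketch only: amended stop_docker without the ${service_name} argument.
# Assumes WORKPATH is exported by the calling test script.
stop_docker() {
    # Bail out if the compose directory is missing instead of running
    # docker compose from the wrong working directory.
    cd "$WORKPATH/comps/llms/deployment/docker_compose" || return 1

    # No service name: take down everything defined in this compose file,
    # including services from the files listed under include.
    docker compose -f compose_text-generation.yaml down --remove-orphans
}
```

One trade-off: this removes every service in the compose project, so a test that expects another service from the same file to keep running between steps would need adjusting.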

Reproduce steps

See the example run (search the logs for 'Docker containers before stop_docker' and 'Docker containers after stop_docker').

To reproduce, add docker ps -a before and after stop_docker.
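The check described above could be wrapped roughly like this. stop_docker_with_report is a hypothetical helper name, not something in the repository; it assumes stop_docker is already defined by the test script and prints the same log markers used in the example run. It treats any remaining container as a leftover, which is only reasonable on a runner that is not shared with other workloads.

```shell
# Hypothetical helper: wraps the existing stop_docker with the log markers
# from the example run, then reports any containers still present.
stop_docker_with_report() {
    echo "Docker containers before stop_docker"
    docker ps -a

    stop_docker

    echo "Docker containers after stop_docker"
    docker ps -a

    # Names of all remaining containers, running or stopped.
    leftover=$(docker ps -a --format '{{.Names}}')
    if [ -n "$leftover" ]; then
        echo "Containers still present after stop_docker:"
        echo "$leftover"
        return 1
    fi
}
```

Calling stop_docker_with_report in place of stop_docker makes a test fail loudly when containers survive, rather than leaving the conflict for the next test to hit.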

Raw log

Attachments

No response

Metadata

Labels

A0 (need to scrub), bug (Something isn't working)
