Job pods can end up with status "OutOfcpu" when many pods are created and completed in a short time on the same node, scheduled by batch.volcano.sh/v1alpha1. #2700

@GEEKZX

Description

What happened?
Pods can end up with status "OutOfcpu" when many jobs are created and completed in a short time on the same node, scheduled by batch.volcano.sh/v1alpha1.

What did you expect to happen?
Pod status should be "Running", "Pending" or "Completed". When a node does not have enough resources, the job's pods should not be scheduled onto it.

Anything else we need to know?
We found this issue in Volcano v1.7.0 on Kubernetes v1.23.15.
The pods' QoS class is Burstable.
Error message:
status:
  message: 'Pod Node didn''t have enough resource: cpu, requested: 56033, used: 34036, capacity: 90000'
  phase: Failed
  reason: OutOfcpu
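The numbers in the message can be checked by hand. The following is a minimal sketch of the arithmetic only, not the kubelet's actual code; the function names are hypothetical, and it assumes all three values use the same unit (most likely CPU millicores, which the report does not state).

```python
def fits(requested: int, used: int, capacity: int) -> bool:
    """Return True if the new request fits into the remaining node capacity."""
    return used + requested <= capacity


def shortfall(requested: int, used: int, capacity: int) -> int:
    """Return by how much the request exceeds the remaining capacity (0 if it fits)."""
    return max(0, used + requested - capacity)


# Values copied from the error message above.
requested, used, capacity = 56033, 34036, 90000

print(fits(requested, used, capacity))       # False
print(shortfall(requested, used, capacity))  # 69
```

With these values, requested plus used is 90069, which exceeds the capacity of 90000 by 69, so the pod is rejected on the node even though it had already been scheduled there.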
Environment:

  • Volcano Version: v1.7.0
  • Kubernetes version (use kubectl version): v1.23.15
  • Cloud provider or hardware configuration:
  • OS (e.g. from /etc/os-release): Ubuntu 20.04.3 LTS
  • Kernel (e.g. uname -a): Linux k8s-master 5.15.0-43-generic #46-Ubuntu SMP Tue Jul 12 10:30:17 UTC 2022 x86_64 x86_64 x86_64 GNU/Linux
  • Install tools:
  • Others:

Metadata

Assignees

No one assigned

    Labels

    kind/bug: Categorizes issue or PR as related to a bug.
    lifecycle/stale: Denotes an issue or PR has remained open with no activity and has become stale.

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests
