Pipeline task if task's request resource less than the releasing resource of node during performing allocate action#541
Conversation
|
Hey @sivanzcw, TravisBuddy Request Identifier: 3aa881f0-0b81-11ea-9cd6-8f216fa7db85 |
| if err := stmt.Pipeline(task, node.Name); err != nil { | ||
| glog.Errorf("Failed to pipeline Task %v on %v", | ||
| task.UID, node.Name) | ||
| if err := ssn.Pipeline(task, node.Name); err != nil { |
There was a problem hiding this comment.
any more info on why change it from stmt to ssn ?
There was a problem hiding this comment.
If reclaim action is enabled after allocate action. The cluter situation is as below
| serial | node name | resource |
|---|---|---|
| 1 | node1 | 4c8g |
| 2 | node2 | 4c8g |
| serial | queue name | weight | quota | status |
|---|---|---|---|---|
| 1 | default | 1 | 0.8c 1.5M | overused |
| 2 | queue1 | 100000 | 7c 8g | active |
| serial | job name | pods number | minA | queue | status |
|---|---|---|---|---|---|
| 1 | joba | 7 | 1 | default | all running |
| 2 | jobb | 7 | 7 | queue1 | all pending |
There are two jobs in the cluster, joba and jobb. Joba was placed in default queue. Jobb was placed in queue1 queue. Joba has 7pods Running. Jobb has 7pods pending. default queue was overused. Pods in queue1 will try to reclaim resource from defualt queue.
-
In
reclaimaction,podb-1injobbevicted podpoda-1injoba,poda-1was originally at nodenode1, the scheduling loop ends. -
In the next scheduling loop. In
allocateaction,podb-1want to be pipelined to nodenode1, but gang-restriction ofjobbwas not meet, the pipeline action will be discard. Inallocateaction no pod was pipelined, though there are releasing resources in cluster. -
In
relcaimaction of this scheduling loop,podb-1injobbwill try to evicted other pods injoba. -
Finally,
podb-1will evicted 6 pods fromjoba. -
So if there are releasing resources in cluster, pod who has the higher priority, may should be
pipelinedto the node, regardless of whether the gang restriction of job of the pod was meet, in case that, the pod will evict other pods in subsequent actions.
|
/approve |
|
Hey @sivanzcw, TravisCI finished with status TravisBuddy Request Identifier: 53495a40-1fd6-11ea-ba47-7f442aed9c1e |
|
Hey @sivanzcw, TravisCI finished with status TravisBuddy Request Identifier: 3fc95e60-2077-11ea-830b-038034041c48 |
|
[APPROVALNOTIFIER] This PR is APPROVED This pull-request has been approved by: k82cn, sivanzcw The full list of commands accepted by this bot can be found here. The pull request process is described here DetailsNeeds approval from an approver in each of these files:
Approvers can indicate their approval by writing |
|
Hey @sivanzcw, TravisBuddy Request Identifier: b9b37c50-207d-11ea-830b-038034041c48 |
…urce of node during performing allocate action
|
Hey @sivanzcw, TravisCI finished with status TravisBuddy Request Identifier: 39863f60-2085-11ea-830b-038034041c48 |
|
Hey @sivanzcw, TravisBuddy Request Identifier: 03bebdc0-2086-11ea-830b-038034041c48 |
|
/lgtm |
#540