scheduler: don't invoke SnapshotSharedLister in parallizer #2625
Merged
koordinator-bot[bot] merged 1 commit into koordinator-sh:main on Sep 23, 2025
Signed-off-by: 乔普 <wangjianyu.wjy@alibaba-inc.com>
Codecov Report
✅ All modified and coverable lines are covered by tests.
@@           Coverage Diff           @@
##             main    #2625   +/-   ##
=======================================
  Coverage   66.49%   66.49%
=======================================
  Files         491      491
  Lines       59103    59104    +1
=======================================
+ Hits        39298    39300    +2
+ Misses      16944    16942    -2
- Partials     2861     2862    +1
/approve
[APPROVALNOTIFIER] This PR is APPROVED
This pull-request has been approved by: saintube, ZiMengSheng
qinfustu pushed a commit to qinfustu/koordinator that referenced this pull request on Sep 23, 2025
…or-sh#2625) Signed-off-by: 乔普 <wangjianyu.wjy@alibaba-inc.com> Co-authored-by: 乔普 <wangjianyu.wjy@alibaba-inc.com>
qinfustu added a commit to qinfustu/koordinator that referenced this pull request on Sep 23, 2025
Signed-off-by: qinfustu <30459241+qinfustu@users.noreply.github.com>
scheduler: don't invoke SnapshotSharedLister in parallizer (koordinator-sh#2625) Signed-off-by: 乔普 <wangjianyu.wjy@alibaba-inc.com> Co-authored-by: 乔普 <wangjianyu.wjy@alibaba-inc.com>
scheduler: support ignore nominatedPods of same job (koordinator-sh#2628) Signed-off-by: 乔普 <wangjianyu.wjy@alibaba-inc.com> Co-authored-by: 乔普 <wangjianyu.wjy@alibaba-inc.com>
scheduler: distinguish preemption failure&success in nominatingInfo (koordinator-sh#2629) Signed-off-by: 乔普 <wangjianyu.wjy@alibaba-inc.com> Co-authored-by: 乔普 <wangjianyu.wjy@alibaba-inc.com>
koordlet: add mainline kernel support for IsCoreSchedSupported() (koordinator-sh#2621) Signed-off-by: hwenwur <hwenwur@gmail.com>
scheduler: fix deviceShare UT (koordinator-sh#2631) Signed-off-by: 乔普 <wangjianyu.wjy@alibaba-inc.com> Co-authored-by: 乔普 <wangjianyu.wjy@alibaba-inc.com>
qinfustu added a commit to qinfustu/koordinator that referenced this pull request on Sep 24, 2025
Signed-off-by: qinfustu <fu_qin_stu@163.com>
provides vGPU allocation and utilization metrics Signed-off-by: qinfustu <fu_qin_stu@163.com>
scheduler: during scheduling, it must consider whether hami-core is installed on the nodes (koordinator-sh#2577) Signed-off-by: qinfustu <fu_qin_stu@163.com>
fix: podGroup not add to queue when pg not fount in PodGroupControlle… (koordinator-sh#2580) Signed-off-by: yangchao10 <yangchao10@xiaomi.com> Co-authored-by: wangjianyu <zmsjianyu@gmail.com>
chore: fix typo for RegisterTypeNodeMetadata (koordinator-sh#2584) Signed-off-by: qingyuan.zheng <460189852@qq.com>
webhook: fix elasticquota validation error for min > max (koordinator-sh#2586) Signed-off-by: zheng-weihao <zheng-weihao@outlook.com>
koordlet: support enhanced group identity for gpu (koordinator-sh#2583) Signed-off-by: saintube <saintube@foxmail.com> Co-authored-by: shenxin <rougang.hrg@alibaba-inc.com>
scheduler: preprocess unmatch reservation's allocated (koordinator-sh#2589) Signed-off-by: saintube <saintube@foxmail.com> Co-authored-by: shenxin <rougang.hrg@alibaba-inc.com>
manager: support batch resource limit of nodeCapacity (koordinator-sh#2588) Signed-off-by: lijunxin <lijunxin.ljx@alibaba-inc.com>
apis: adapt to both v1alpha1&v1alpha2 noderesourcetopology (koordinator-sh#2593) Signed-off-by: wangjianyu.wjy <wangjianyu.wjy@alibaba-inc.com> Co-authored-by: wangjianyu.wjy <wangjianyu.wjy@alibaba-inc.com>
scheduler: collect schedule pod result in metrics (koordinator-sh#2585) Signed-off-by: zheng-weihao <zheng-weihao@outlook.com> Co-authored-by: wangjianyu <zmsjianyu@gmail.com>
scheduler: add pre-allocation nominator (koordinator-sh#2592) Signed-off-by: saintube <saintube@foxmail.com> Co-authored-by: shenxin <rougang.hrg@alibaba-inc.com>
koordlet: fix typo and add memory-ratio resource for buildXPUDevice() (koordinator-sh#2599) Signed-off-by: ZhuZhezz <zzhuzju@163.com>
scheduler: don't invoke SnapshotSharedLister in parallizer (koordinator-sh#2600) Signed-off-by: wangjianyu.wjy <wangjianyu.wjy@alibaba-inc.com> Co-authored-by: wangjianyu.wjy <wangjianyu.wjy@alibaba-inc.com>
scheduler: add questionedObjectKet and topologyKeyToExplain (koordinator-sh#2601) Signed-off-by: wangjianyu.wjy <wangjianyu.wjy@alibaba-inc.com> Co-authored-by: wangjianyu.wjy <wangjianyu.wjy@alibaba-inc.com>
apis: add scheduleExplanation CRD (koordinator-sh#2602) Signed-off-by: wangjianyu.wjy <wangjianyu.wjy@alibaba-inc.com> Co-authored-by: wangjianyu.wjy <wangjianyu.wjy@alibaba-inc.com>
scheduler: improve load aware perf by resources cache and vectorization (koordinator-sh#2582) Signed-off-by: zheng-weihao <zheng-weihao@outlook.com> Co-authored-by: wangjianyu <zmsjianyu@gmail.com>
koordlet: fix koordlet panic randomly,caused by node info not ready (koordinator-sh#2597) Signed-off-by: zhengj5 <zhengj5@trip.com> Co-authored-by: zhengj5 <zhengj5@trip.com>
scheduler: loadaware support dominantResourceWeight (koordinator-sh#2603) Signed-off-by: zheng-weihao <zheng-weihao@outlook.com> Co-authored-by: wangjianyu <zmsjianyu@gmail.com>
koordlet: fix path for sched_idle_saver_wmark (koordinator-sh#2611) Signed-off-by: saintube <saintube@foxmail.com> Co-authored-by: shenxin <rougang.hrg@alibaba-inc.com>
provides GPU allocation and utilization metrics Signed-off-by: qinfustu <fu_qin_stu@163.com>
revert Signed-off-by: qinfustu <fu_qin_stu@163.com>
revert2 Signed-off-by: qinfustu <fu_qin_stu@163.com>
provides vGPU allocation and utilization metrics Signed-off-by: qinfustu <30459241+qinfustu@users.noreply.github.com>
scheduler: don't invoke SnapshotSharedLister in parallizer (koordinator-sh#2625) Signed-off-by: 乔普 <wangjianyu.wjy@alibaba-inc.com> Co-authored-by: 乔普 <wangjianyu.wjy@alibaba-inc.com>
scheduler: support ignore nominatedPods of same job (koordinator-sh#2628) Signed-off-by: 乔普 <wangjianyu.wjy@alibaba-inc.com> Co-authored-by: 乔普 <wangjianyu.wjy@alibaba-inc.com>
scheduler: distinguish preemption failure&success in nominatingInfo (koordinator-sh#2629) Signed-off-by: 乔普 <wangjianyu.wjy@alibaba-inc.com> Co-authored-by: 乔普 <wangjianyu.wjy@alibaba-inc.com>
koordlet: add mainline kernel support for IsCoreSchedSupported() (koordinator-sh#2621) Signed-off-by: hwenwur <hwenwur@gmail.com>
scheduler: fix deviceShare UT (koordinator-sh#2631) Signed-off-by: 乔普 <wangjianyu.wjy@alibaba-inc.com> Co-authored-by: 乔普 <wangjianyu.wjy@alibaba-inc.com>
all: provides vGPU allocation and utilization metrics Signed-off-by: qinfustu <30459241+qinfustu@users.noreply.github.com>
scheduler: support customize preemption diagnosis (koordinator-sh#2632) Signed-off-by: 乔普 <wangjianyu.wjy@alibaba-inc.com> Co-authored-by: 乔普 <wangjianyu.wjy@alibaba-inc.com>
Ⅰ. Describe what this PR does
Ⅱ. Does this pull request fix one issue?
Ⅲ. Describe how to verify it
Ⅳ. Special notes for reviews
Ⅴ. Checklist
make test