Releases: koordinator-sh/koordinator
Releases · koordinator-sh/koordinator
v1.0.0
What's Changed
- feat(deps): bump gorm.io/gorm from 1.23.8 to 1.23.10 by @dependabot in #654
- feat(deps): bump github.com/google/go-cmp from 0.5.8 to 0.5.9 by @dependabot in #619
- feat(deps): bump github.com/jedib0t/go-pretty/v6 from 6.3.8 to 6.3.9 by @dependabot in #663
- koord-manager: support calculate batch resource based on memory request by @saintube in #652
- chore: update bug issue template by @WeizhouRen in #666
- fix: correct pod informer delete event by @jasonliu747 in #669
- fix: use pointer for minor in device crd by @jasonliu747 in #656
- koordlet: unset cfs_quota for LSR/LSE pods by @saintube in #662
- koord-manager: modify quota-webhook by @xulinfei1996 in #660
- slo-controller report koordinator-gpu resource dimension by @buptcozy in #673
- koordlet: fix cpuset hook for unset cfs by @saintube in #680
- koord-scheduler: modify cpu and memory quantity formats and scales by @xulinfei1996 in #675
- fix gang by annotation create pg crd fail by @buptcozy in #683
- feat(deps): bump github.com/jedib0t/go-pretty/v6 from 6.3.9 to 6.4.0 by @dependabot in #671
- koordlet: enable group identity by sysctl when cpu qos enabled by @saintube in #687
- apis: support customizing different priority ranges by @eahydra in #691
- koord-scheduler: add GetQuotaSummaries/GetGangSummaries debug API by @buptcozy in #686
- koordlet: supply cpushare inject logic in runtimehooks protocols by @ZiMengSheng in #688
- koord-scheduler: abstract nodenumaresource plugin allowUseCPUSet by @ZiMengSheng in #690
- apis: update PriorityMidValueMax from 7099 to 7999 by @eahydra in #693
- chore: update ClusterColocationProfile UT by @eahydra in #694
- koord-scheduler: fix deviceshare onPodAdd when its device is not loaded by @xulinfei1996 in #695
- koordlet: divide states informer into plugins by @zwzhang0107 in #676
- apis: refactor scheduling config layout for release apis by @eahydra in #699
- chore: use openkruise/kruise-api instead of openkruise/kruise by @eahydra in #700
- fix pleg init err by @huiwq1990 in #702
- Fix #705 for the error of "io/ioutil" is deprecated by @denverdino in #706
- koord-scheduler: fix nodeNUMAResource allowUseCPUSet by @ZiMengSheng in #710
- koord-scheduler: fix NodeNUMAResource sortCPUsByRefCount by @ZiMengSheng in #709
- fix pleg ut for darwin by @zwzhang0107 in #713
- add webhook and node topo feature gates control by @zwzhang0107 in #715
- reducing nesting by @lucming in #719
- fix typo in pkg/koordlet/metriccache/api.go by @jasonliu747 in #725
- koord-scheduler: fix failed scheduling since missing CPUTopology by @eahydra in #721
- koord-scheduler: make Coscheduling/ElasticQuota/DeviceShare plugins more flexible by @xulinfei1996 in #704
- koord-scheduler: optimize the use of GroupQuotaManager's hierarchyLock by @xulinfei1996 in #681
- add E2E framework by @eahydra in #727
- koordlet: fix throw Can't get node error during initialization by @JasonRD in #732
- bugfix: append origin env when inject gpu env by @ZYecho in #689
- apis: fix resctrl qos defaults by @saintube in #731
- making code easier to read by @lucming in #653
- code collation by @lucming in #651
- koord-scheduler: fix the lack of gang function conversion by @xulinfei1996 in #734
- fix:exit the loop when enough pods have been found to be evicted by @lucming in #735
- fix: delete unused return args by @lucming in #724
- chore: update make deploy by @eahydra in #736
- chore: fix image registry in Kustomization by @jasonliu747 in #738
- koordlet: fix for cpu suppress when node topology reporting disabled by @saintube in #737
- koordlet: rm kubebuilder default tag for ResourceThresholdStrategy by @zwzhang0107 in #741
- support webhook debug api by @buptcozy in #740
- fix:failed to start nodeTopoInformer when tcp6 is not find in node by @lucming in #744
- RuntimeProxy: Support hook server deployed by k8s pod by @honpey in #718
- koordlet: add pprof flag by @zwzhang0107 in #747
- koordlet report gpu model and driver version by @buptcozy in #745
- koordlet: CPI collector for Interference Detection by @songtao98 in #622
- koord-scheduler: optimize ElasticQuota plugin's update logic by @xulinfei1996 in #729
- fix: error to init option(SysRootDir) by @lucming in #750
- apis: change the required fields in ClusterColocationProfile to optional by @eahydra in #751
- fix: remove unnecessary reassignments by @lucming in #753
- RuntimeProxy: return err when FailurePolicyType set to Fail by @honpey in #754
- koordlet: fix kubelet pid not found in tcp4 socks by @saintube in #761
- koordlet: add gc logs for metric cache by @zwzhang0107 in #749
- koord-scheduler: export client-go metrics by @ZiMengSheng in #770
- koord-manager: remove the quota webhook's dependency on koord-scheduler by @xulinfei1996 in #763
- feature: support pod skip runtime hook with specified label by @ZYecho in #682
- no processing for successfully exited pods by @lucming in #765
- koord-manager: add containerFilterFunc in requestLimitValidator by @ZiMengSheng in #772
- koord-scheduler: only allow the leader to initialize controllers by @xulinfei1996 in #764
- koord-scheduler: fix typo error in framework_extender.go by @KunWuLuan in #775
- koordlet: query kubelet config via HTTP API by @eahydra in #776
- koord-manager: update RBAC by @eahydra in #779
New Contributors
- @WeizhouRen made their first contribution in #666
- @denverdino made their first contribution in #706
- @KunWuLuan made their first contribution in #775
Full Changelog: v0.7.0...v1.0.0
v0.7.0
What's Changed
- refactor extended resource with
kubernetes.ionamespace by @zwzhang0107 in #350 - add multi-hierarchy-elastic-quota-management by @buptcozy in #398
- ci: rolling back to golangci-lint 1.47.3 by @jasonliu747 in #449
- adjusted the description of elastic quota by @hormes in #448
- koord-scheduler: adapt to different cpuset scheduling protocols by @ZiMengSheng in #452
- apis: add allocateOnce in reservation by @saintube in #447
- rename prefix of extended resource by @zwzhang0107 in #453
- koord-descheduler: complete configuration by @eahydra in #456
- ci: extend golangci-lint timeout to 10m by @jasonliu747 in #461
- fix: use uint32 to record process id by @jasonliu747 in #455
- ci: remove cache when setup go by @jasonliu747 in #463
- ci: add codeql analysis in workflows by @jasonliu747 in #240
- feat(deps): bump github.com/prometheus/client_golang from 1.12.2 to 1.13.0 by @dependabot in #454
- ADOPTERS: add iQIYI by @wangxiaoq in #465
- koord-scheduler: export NodeNUMAInfoCache and getAvailableCPUs by @ZiMengSheng in #469
- koord-scheduler: abstract GetPodQoSClass by @ZiMengSheng in #471
- add extended resource schedule plugin for compatibility by @zwzhang0107 in #472
- set kubernetes.io/batch-cpu as milli-core by @zwzhang0107 in #473
- koord-descheduler: init evictor plugin first by @eahydra in #458
- feature: support read args from KubeletConfiguration by @ZYecho in #470
- koord-scheduler: fix stale reservation status by @saintube in #474
- bugfix: fix exec exit with 126 by @ZYecho in #446
- feat(deps): bump github.com/spf13/cobra from 1.1.3 to 1.5.0 by @dependabot in #279
- koord-scheduler: fix typo by @jasonliu747 in #476
- koord-descheduler: add tests for framework runtime by @eahydra in #475
- ci: remove codeql from workflows by @jasonliu747 in #479
- docs: format markdown by @jasonliu747 in #480
- ci: add codeql in pipeline workflows by @jasonliu747 in #481
- koord-scheduler: reservation support allocateOnce by @saintube in #478
- feat(deps): bump go.uber.org/atomic from 1.9.0 to 1.10.0 by @dependabot in #483
- koord-runtime-proxy: regard empty string for CpusetCpus and CpusetMem… by @honpey in #484
- refactor batch node resource update by @zwzhang0107 in #485
- ci: use fast-test to skip redundant steps in workflows by @jasonliu747 in #486
- mv container extend resource to independent file by @zwzhang0107 in #488
- koord-descheduler: enable allocateOnce when creating Reservation by @eahydra in #494
- support node extended resource plugin by @zwzhang0107 in #491
- fix metric-expire-seconds usage by @huiwq1990 in #495
- adopters: add Quwan by @zshmmm in #497
- koord-descheduler: fix incorrect handling of Reservation's status by @eahydra in #500
- koord-scheduler: fix succeeded reservation GC and unschedulable retry by @saintube in #501
- koord-runtime-proxy: support inject env in proxy by @ZYecho in #503
- refactor: move GeneratePodPatch under util package by @jasonliu747 in #504
- fix: correct misspells in pkg/koordlet/metriccache by @huiwq1990 in #506
- koord-runtime-proxy: add more test cases by @honpey in #511
- apis: add DeviceAllocations and methods by @jasonliu747 in #513
- koord-scheduler: fix succeeded reservation in scheduler cache by @saintube in #516
- koord-descheduler: stop migration when reserved on the same node by @eahydra in #519
- koord-descheduler: support evict flow control to ensure availability by @eahydra in #520
- feat: not update status field when create NodeMetric by @huiwq1990 in #514
- koord-manager: record configmap and pod qos unmarshal failed events by @songtao98 in #518
- koord-scheduler: refactor NodeNUMAResource plugin by @eahydra in #507
- koord-descheduler: control the maximum number of migrations to ensure availability by @eahydra in #522
- apis: update Reservation kubebuilder:printcolumn comments by @eahydra in #524
- feature: support inject NVIDIA_VISIBLE_DEVICES env by @ZYecho in #517
- fix: Walkfunc should check the err by @j4ckstraw in #460
- runtime-hooks: add auto register hooks by @songtao98 in #505
- qos grep control extension by @zwzhang0107 in #529
- Add runtime hooks stage config flag by @songtao98 in #526
- docs: update fine grained device proposal by @jasonliu747 in #531
- chore: update tools version by @jasonliu747 in #535
- docs: reformat adopters list by @jasonliu747 in #537
- add quotaTreeWrapper and elasticQuota API by @xulinfei1996 in #515
- koord-scheduler: optimize the process after reservation scheduling fails by @eahydra in #532
- fix bug: modify unit test file path to temp dir by @songtao98 in #540
- ci: support aliyun hangzhou registry by @jasonliu747 in #542
- koord-scheduler: add coscheduling plugin args by @Wenshiqi222 in #538
- Update 20220629-fine-grained-device-scheduling.md by @hormes in #543
- koord-scheduler: export getDefaultNUMAAllocateStrategy by @ZiMengSheng in #547
- koord-scheduler: support API Services by @eahydra in #545
- chore: downgrade CUDA version to 11.2.2 by @jasonliu747 in #548
- koord-scheduler: support custom SharedLister by @ZiMengSheng in #549
- fix container cgroup comment by @huiwq1990 in #553
- koord-scheduler: fix CPUTopologyManager get&update race condition by @eahydra in #555
- koord-scheduler: CPU allocate algorithm supports maxRefCount by @ZiMengSheng in #551
- apis: update Device CRD fields by @jasonliu747 in #554
- koord-scheduler: fix coSchedulingArgs json name by @Wenshiqi222 in #558
- support Reservation select order by @eahydra in #552
- koord-descheduler: fix failed evict with StatefulSet Pod by @eahydra in #559
- koord-descheduler: fix UT conflict by @eahydra in #563
- koord-scheduler: improve reservation compatibility by @saintube in #550
- kood-scheduler: fix Reservation Score divide by zero by @eahydra in #564
- koord-scheduler: fix Reservation Score divide by zero with some empty… by @eahydra in https://github.com/koor...
v0.6.2
Changelog
- 54ed9a5 Add pod uid to pod meta when failover (#344)
- 492a861 Update ci.yaml
- 1328009 Use the structure as the key of the map instead of string. (#349)
- f81c89c [koord-runtime-proxy]: fix panic when no hook registered (#355)
- 42d695f add PodMigrationJob CRD proposal (#358)
- d1fb8c5 add descheduler framework proposal (#371)
- 7d46fad add fine-grained device scheduling proposal (#322)
- 82dc2ac add koord-descheduler (#425)
- 37a3aec add logs for proxy server (#329)
- 05a8c11 add pod annotations and labels to container request and cache (#362)
- 827bd6b add reservation plugin (#353)
- 78a4ebb add schedule gang md (#333)
- 993fc21 add scheduling framework extender (#365)
- 1cf37d0 add xiaohongshu as koordinator adopter (#424)
- c9cf1a4 api: add PodMigrationJob API (#375)
- 91cacc4 api: add device crd in scheduling group (#376)
- dab5a92 api: add device info into NodeMetric CRD (#378)
- 47e7189 api: update PodMigrationJob and Reservation CRD (#399)
- 74de8bd api: update reservation api (#384)
- bb3065a apis: add Gang api definition (#409)
- 0faf65e bugfix: always need to reset cpuset when cpu supress (#403)
- 9626a21 bugfix: avoid pod terminating in docker (#445)
- 24ecb9f bugfix: fix exec exit with 126 (#446)
- f133061 bugfix: skip when pod sandbox not found (#444)
- fbf4d97 change qos func name for old format adaption reason (#418)
- 5b1ce9d clear cpuset of BE container to avoid conflict with kubelet static policy, using the value of besteffort dir (#412)
- 6e0d88f cri-runtime-proxy: fix containerErr error when failOver pods and containers (#414)
- 6918290 feat(deps): bump github.com/stretchr/testify from 1.7.5 to 1.8.0 (#326)
- 3fce836 feat(deps): bump google.golang.org/protobuf from 1.28.0 to 1.28.1 (#419)
- d763879 feat(deps): bump gorm.io/driver/sqlite from 1.3.4 to 1.3.6 (#347)
- f32a0ba feat(deps): bump gorm.io/gorm from 1.23.6 to 1.23.8 (#351)
- bed2191 feat(deps): bump sigs.k8s.io/yaml from 1.2.0 to 1.3.0 (#427)
- 5b320c0 feat: add gpu metrics to crd (#397)
- 4301cc9 feat: collect gpu metrics (#361)
- 488f8d5 feature: report pod alloc of Guaranteed pod and cpu manager policy (#386)
- 9cf28bb feature: support read args from KubeletConfiguration (#470)
- b54bb0c fix auditor test in MacOS (#379)
- cdc41a0 fix go mod
- 5bcb7a7 fix koord-descheduler initialize profile error (#432)
- ecead7c fix reservation on mutil-scheduler (#431)
- 9e8fc01 fix reservation on pod patch failed (#428)
- b2fcc22 fix the loss of new updated resources from UpdateContainerResources request (#363)
- 0523d60 fix: consider lse/lsr when cpu suppress (#234) (#372)
- bf308ed fix: remove inline tag for corev1.ResourceList to fix #390 (#391)
- 6ac04d4 improve koordlet log verbosity (#338)
- a89cd98 koord-descheduler: implement PodMigrationJob controller (#404)
- 78afa0a koord-descheduler: implement descheduling configuration (#422)
- 49fa42c koord-descheduler: implement descheduling framework (#423)
- 3ed131c koord-descheduler: release Reservation when PodMigrationJob completes or is deleted (#438)
- 535bc21 koord-runtime-proxy: regard empty string for CpusetCpus and CpusetMem… (#487)
- 9eb7b7d koord-scheduler: compatible with Pods using kubelet static CPU manager policy (#433)
- c9ad604 koord-scheduler: improve reservation validation (#442)
- b78243b koord-scheduler: support CPU exclusive policy (#359)
- 8179245 koord-scheduler: support Node CPU orchestration API (#360)
- 1ab5c99 koord-scheduler: support default preferredCPUBindPolicy for LSE/LSR Pod if not specified (#354)
- 1e77f1f koord-scheduler: support kubelet cpu manager policy (#434)
- 171ad3e koordlet: define GPU metric struct (#343)
- 7442bc5 koordlet: fix build error on macOS caused by GPU (#413)
- 779ac80 koordlet: introduce Accelerators feature gate for GPU related features (#393)
- 91d2a4b koordlet: optimize auditor UT with httptest.Server (#382)
- 283c883 koordlet: refine initJiffies with default value (#367)
- 7510a3a make slo configmap name configurable (#415)
- b8dd567 rename resourceQoS to resourceQOS (#339)
- 0d9d9d4 style: unify the command parameter style of koordlet (#348)
- d0194b2 turn on pleg (#394)
v0.6.1
Changelog
- 54ed9a5 Add pod uid to pod meta when failover (#344)
- 1328009 Use the structure as the key of the map instead of string. (#349)
- f81c89c [koord-runtime-proxy]: fix panic when no hook registered (#355)
- 42d695f add PodMigrationJob CRD proposal (#358)
- d1fb8c5 add descheduler framework proposal (#371)
- 7d46fad add fine-grained device scheduling proposal (#322)
- 82dc2ac add koord-descheduler (#425)
- 37a3aec add logs for proxy server (#329)
- 05a8c11 add pod annotations and labels to container request and cache (#362)
- 827bd6b add reservation plugin (#353)
- 78a4ebb add schedule gang md (#333)
- 993fc21 add scheduling framework extender (#365)
- 1cf37d0 add xiaohongshu as koordinator adopter (#424)
- c9cf1a4 api: add PodMigrationJob API (#375)
- 91cacc4 api: add device crd in scheduling group (#376)
- dab5a92 api: add device info into NodeMetric CRD (#378)
- 47e7189 api: update PodMigrationJob and Reservation CRD (#399)
- 74de8bd api: update reservation api (#384)
- bb3065a apis: add Gang api definition (#409)
- 0faf65e bugfix: always need to reset cpuset when cpu supress (#403)
- f0daee1 bugfix: avoid pod terminating in docker (#445)
- 1c44a0a bugfix: skip when pod sandbox not found (#444)
- fbf4d97 change qos func name for old format adaption reason (#418)
- 5b1ce9d clear cpuset of BE container to avoid conflict with kubelet static policy, using the value of besteffort dir (#412)
- 6e0d88f cri-runtime-proxy: fix containerErr error when failOver pods and containers (#414)
- 6918290 feat(deps): bump github.com/stretchr/testify from 1.7.5 to 1.8.0 (#326)
- 3fce836 feat(deps): bump google.golang.org/protobuf from 1.28.0 to 1.28.1 (#419)
- d763879 feat(deps): bump gorm.io/driver/sqlite from 1.3.4 to 1.3.6 (#347)
- f32a0ba feat(deps): bump gorm.io/gorm from 1.23.6 to 1.23.8 (#351)
- bed2191 feat(deps): bump sigs.k8s.io/yaml from 1.2.0 to 1.3.0 (#427)
- 5b320c0 feat: add gpu metrics to crd (#397)
- 4301cc9 feat: collect gpu metrics (#361)
- 488f8d5 feature: report pod alloc of Guaranteed pod and cpu manager policy (#386)
- b54bb0c fix auditor test in MacOS (#379)
- 5bcb7a7 fix koord-descheduler initialize profile error (#432)
- ecead7c fix reservation on mutil-scheduler (#431)
- 9e8fc01 fix reservation on pod patch failed (#428)
- b2fcc22 fix the loss of new updated resources from UpdateContainerResources request (#363)
- 0523d60 fix: consider lse/lsr when cpu suppress (#234) (#372)
- bf308ed fix: remove inline tag for corev1.ResourceList to fix #390 (#391)
- 6ac04d4 improve koordlet log verbosity (#338)
- a89cd98 koord-descheduler: implement PodMigrationJob controller (#404)
- 78afa0a koord-descheduler: implement descheduling configuration (#422)
- 49fa42c koord-descheduler: implement descheduling framework (#423)
- 3ed131c koord-descheduler: release Reservation when PodMigrationJob completes or is deleted (#438)
- 9eb7b7d koord-scheduler: compatible with Pods using kubelet static CPU manager policy (#433)
- c9ad604 koord-scheduler: improve reservation validation (#442)
- b78243b koord-scheduler: support CPU exclusive policy (#359)
- 8179245 koord-scheduler: support Node CPU orchestration API (#360)
- 1ab5c99 koord-scheduler: support default preferredCPUBindPolicy for LSE/LSR Pod if not specified (#354)
- 1e77f1f koord-scheduler: support kubelet cpu manager policy (#434)
- 171ad3e koordlet: define GPU metric struct (#343)
- 7442bc5 koordlet: fix build error on macOS caused by GPU (#413)
- 779ac80 koordlet: introduce Accelerators feature gate for GPU related features (#393)
- 91d2a4b koordlet: optimize auditor UT with httptest.Server (#382)
- 283c883 koordlet: refine initJiffies with default value (#367)
- 7510a3a make slo configmap name configurable (#415)
- b8dd567 rename resourceQoS to resourceQOS (#339)
- 0d9d9d4 style: unify the command parameter style of koordlet (#348)
- d0194b2 turn on pleg (#394)
v0.6.0
What's Changed
- add logs for proxy server by @zwzhang0107 in #329
- chore: remove useless feature-gates by @saintube in #336
- ci: enable CGO when GoReleaser compiles binaries by @jasonliu747 in #334
- rename resourceQoS to resourceQOS by @zwzhang0107 in #339
- improve koordlet log verbosity by @saintube in #338
- Add pod uid to pod meta when failover by @cheimu in #344
- cleanup: Use the structure as the key of the map instead of string by @novahe in #349
- koordlet: define GPU metric struct by @jasonliu747 in #343
- koord-scheduler: support default preferredCPUBindPolicy for LSE/LSR P… by @eahydra in #354
- style: unify the command parameter style of koordlet by @jasonliu747 in #348
- add fine-grained device scheduling proposal by @buptcozy in #322
- [koord-runtime-proxy]: fix panic when no hook registered by @cheimu in #355
- koord-scheduler: support CPU exclusive policy by @eahydra in #359
- [koord-runtime-proxy] Add pod annotations and labels to container request and cache by @cheimu in #362
- [koord-runtime-proxy] fix the loss of new updated resources from UpdateContainerResources request by @cheimu in #363
- add scheduling framework extender by @saintube in #365
- koordlet: refine initJiffies with default value by @jasonliu747 in #367
- add PodMigrationJob CRD proposal by @eahydra in #358
- add proposal for gang scheduling by @buptcozy in #333
- Support node cpu orchestration api by @eahydra in #360
- chore: update dockerfile for each module by @jasonliu747 in #364
- feat(deps): bump github.com/stretchr/testify from 1.7.5 to 1.8.0 by @dependabot in #326
- feat(deps): bump gorm.io/driver/sqlite from 1.3.4 to 1.3.6 by @dependabot in #347
- chore: supply UT for pkg/util and pkg/util/system by @ZiMengSheng in #374
- api: add PodMigrationJob API by @eahydra in #375
- docs: remove redundant field in Device CRD by @jasonliu747 in #377
- api: add device CRD in scheduling group by @jasonliu747 in #376
- fix auditor test in MacOS by @hormes in #379
- koordlet: optimize auditor UT with httptest.Server by @ZiMengSheng in #382
- docs: add chinese version readme.md by @ZiMengSheng in #380
- fix: consider lse/lsr when cpu suppress (#234) by @ZYecho in #372
- api: add device info into NodeMetric CRD by @jasonliu747 in #378
- koordlet: support collecting GPU metrics from node/pod/container by @LambdaHJ in #361
- chore: cleanup resmanager by @saintube in #383
- api: update reservation api by @saintube in #384
- add descheduler framework proposal by @eahydra in #371
- feat(deps): bump gorm.io/gorm from 1.23.6 to 1.23.8 by @dependabot in #351
- fix: remove inline tag for corev1.ResourceList to fix #390 by @jasonliu747 in #391
- koordlet: Turn on pleg by @cheimu in #394
- feat: update GPU metrics in NodeMetric CRD by @LambdaHJ in #397
- bugfix: always need to reset cpuset when cpu supress by @ZYecho in #403
- feature: report pod alloc of Guaranteed pod and cpu manager policy by @ZYecho in #386
- api: update PodMigrationJob and Reservation CRD by @eahydra in #399
- koordlet: introduce
Acceleratorsfeature gate for GPU related features by @jasonliu747 in #393 - koordlet: fix build error caused by GPU by @eahydra in #413
- cri-runtime-proxy: fix containerErr error when failOver pods and cont… by @lx1036 in #414
- make slo configmap name configurable by @zwzhang0107 in #415
- clear cpuset of BE container to avoid conflict with kubelet static po… by @zwzhang0107 in #412
- change qos func name for old format adaption reason by @zwzhang0107 in #418
- docs: add ADOPTERS.md of Koordinator by @jasonliu747 in #392
- koord-descheduler: implement descheduling configuration by @eahydra in #422
- chore: execute staticcheck instead of github action by running golang… by @eahydra in #421
- koord-scheduler: add reservation plugin by @saintube in #353
- koord-descheduler: implement descheduling framework by @eahydra in #423
- [adopter] add xiaohongshu as koordinator adopter by @cheimu in #424
- add koord-descheduler by @eahydra in #425
- fix reservation on pod patch failed by @saintube in #428
- koord-descheduler: implement PodMigrationJob controller by @eahydra in #404
- fix reservation on mutil-scheduler by @saintube in #431
- fix koord-descheduler initialize profile error by @eahydra in #432
- api: add Gang api by @Wenshiqi222 in #409
- koord-scheduler: compatible with Pods using kubelet static CPU manager policy by @eahydra in #433
- koord-scheduler: support kubelet cpu manager policy by @eahydra in #434
- docs: add maturity level in adopters.md by @jasonliu747 in #426
- feat(deps): bump google.golang.org/protobuf from 1.28.0 to 1.28.1 by @dependabot in #419
- feat(deps): bump sigs.k8s.io/yaml from 1.2.0 to 1.3.0 by @dependabot in #427
- koord-descheduler: release Reservation when PodMigrationJob completes or is deleted by @eahydra in #438
- koord-scheduler: improve reservation validation by @saintube in #442
New Contributors
- @buptcozy made their first contribution in #322
- @ZiMengSheng made their first contribution in #374
- @lx1036 made their first contribution in #414
- @Wenshiqi222 made their first contribution in #409
Full Changelog: v0.5.0...v0.6.0
v0.5.0
Changelog
- c4e2272 ci: use matrix and cache to speed up the build (#282) @jasonliu747
- 5a69a22 Add PreCreateContainerHook and PostStopSandboxHook interfaces and update their parameters (#231) @cheimu
- 1220b23 Add container id to ContainerResourceHookRequest (#243) @cheimu
- 1fa6ec8 Fix wrong cgroup path for PLEG (#325) @cheimu
- 108ad50 Implement cri scenario PreCreateContainerHook and PostStopPodSandboxHook (#239) @cheimu
- c063d0c Proposal QoS Manager (#262) @stormgbs
- 5d624d7 add container id to container info (#251) @cheimu
- a6b005f add cpuset allocator (#324) @zwzhang0107
- 8910d29 add defer os.Remove (#247) @cheimu
- 927483a add koordlet running mode design doc (#306) @zwzhang0107
- 1035de1 add more tests (#272) @hormes
- 178f086 add more tests for docker-proxy (#287) @ZYecho
- a49ab45 add reconciler for runtime hook standalone work mode (#319) @zwzhang0107
- 1f07f09 add resource reservation proposal (#241) @saintube
- 1caf1cd api: add reservation API (#276) @saintube
- 8289720 apis: update cpu scheduling plugin args and apis (#308) @eahydra
- 4bf748b bugfix: fix noderesource-controller, reporter reconcile on node deletion (#309) @saintube
- bff21a8 change cpuacct.stat to cpuacct.usage (#248) @j4ckstraw
- 8b99047 chore(deps): bump goreleaser/goreleaser-action from 2 to 3 (#299)
- d84b497 defines CPU orchestration APIs (#263) @eahydra
- 493a1ec docker-proxy: ensure cgroup parent for docker is valid (#281) @cheimu
- cef1028 docker-proxy: support createContainer and stopSandbox hook (#236) @ZYecho
- fec29f9 feat(deps): bump github.com/docker/docker (#223)
- 00b1657 feat(deps): bump github.com/fsnotify/fsnotify from 1.4.9 to 1.5.4 (#183)
- c5ee5fa feat(deps): bump github.com/google/uuid from 1.2.0 to 1.3.0 (#256)
- 852df5f feat(deps): bump github.com/prometheus/client_golang (#184)
- 74b8f03 feat(deps): bump github.com/stretchr/testify from 1.7.0 to 1.7.4 (#293)
- 6e097f9 feat(deps): bump github.com/stretchr/testify from 1.7.4 to 1.7.5 (#313)
- 2e35be6 feat(deps): bump go.uber.org/atomic from 1.7.0 to 1.9.0 (#278)
- 4ff7b4f feat(deps): bump google.golang.org/protobuf from 1.26.0 to 1.28.0 (#284)
- 46fd995 feat(deps): bump gorm.io/driver/sqlite from 1.3.1 to 1.3.4 (#257)
- 444d598 feat(deps): bump gorm.io/gorm from 1.23.3 to 1.23.6 (#245)
- 360fc88 feat(deps): bump k8s.io/klog/v2 from 2.9.0 to 2.10.0 (#291)
- 444b82d feat(deps): bump sigs.k8s.io/controller-runtime from 0.10.2 to 0.10.3 (#290)
- c6bc07f feat: add accessing kubelet with http option for koordlet (#304) @jasonliu747
- b21ea49 feat: add read only port support for koordlet (#320) @LambdaHJ
- 86097cc feature: report cpu info to noderesoucetopology (#312) @ZYecho
- 975bb93 fix: incorrect coversion of an integer (#242) @jasonliu747
- 92a862b fixed setup kubelet docs, which caused kubelet startup exception (#274) @JasonRD
- 5da05b3 koord-scheduler: compatible with Kubernetes v1.18 ~ v1.20 (#315) @eahydra
- ea82b20 koord-scheduler: implement NodeNUMAResource plugin with CPUSet scheduling (#289) @eahydra
- 503cd9d koord-scheduler: loadAwareScheduling skip node without NodeMetric in filter/score phase (#317) @eahydra
- d331a10 koord-scheduler: refactor framework extender init function (#307) @eahydra
- 9ae9b77 proposal: design fine-grained CPU orchestration (#209) @eahydra
- 5834b42 refactor scheduling config code path (#275) @eahydra
- 9287683 refactor: move runtime module under util (#266) @jasonliu747
- 4fa8400 refactor: prune socket before launching runtime-proxy (#250) @jasonliu747
- 690618e refactor: remove useless field in NodeSLOReconciler (#259) @jasonliu747
- d139458 refactor: replace reinvented wheel with Get() in standard library (#244) @jasonliu747
- 3362924 refactor: slo-controller use cache manage configmap (#305) @chzhj
- 9cbe6a9 regenerate runtime api to fix typo (#238) @cheimu
- 5a0c927 remove macos tests (#269) @hormes
- 8a5fee9 rename runtime-hook working mode (Bypass->Standalone) (#318) @novahe
- 4d7aa91 update podInfo and containerInfo with hook resp cgroupParent (#252) @cheimu
- 2d23602 use unified internal protocol for running hooks plugin (#283) @zwzhang0107
New Contributors
- @j4ckstraw made their first contribution in #248
- @dependabot made their first contribution in #183
- @JasonRD made their first contribution in #274
- @chzhj made their first contribution in #305
- @novahe made their first contribution in #318
Full Changelog: v0.4.1...v0.5.0
v0.4.1
What's Changed
- chore: update codecov.yaml by @saintube in #208
- fix: change directory of generate-runtime.sh by @jasonliu747 in #215
- Fix bug where runtime proxy cannot decode annotations in Docker config by @cheimu in #220
- feat: add kubelet http2 support by @LambdaHJ in #180
New Contributors
Full Changelog: v0.4.0...v0.4.1
v0.4.0
✨ Features and improvements:
- Introduce main for runtime-manager by @honpey in #171
- feature: support docker proxy by @ZYecho in #128
- feat(koordlet): support memoryEvictLowerPercent in memory evict by @shinytang6 in #132
- proposal load-aware scheduling plugin by @eahydra in #135
- koordlet: support cpu evict feature by @jasonliu747 in #169
- add group identity plugin by @zwzhang0107 in #166
🐛 Fixed bugs:
- fix(koordlet): fix be container memory request by @shinytang6 in #129
⏫ Merged pull requests:
- chore: add cache for staticcheck by @jasonliu747 in #130
- add koordlet runtime design by @zwzhang0107 in #123
- 🌱 add validation for CRD by @jasonliu747 in #133
- test(controller): add unit test for resource_calculator by @jasonliu747 in #137
- Modify memqos wmark ratio doc desc by @tianzichenone in #142
- test(controller): add unit test for
config/config.goby @jasonliu747 in #134 - test(controller): add unit test for noderesource by @jasonliu747 in #138
- add scaffold of runtime hooks by @zwzhang0107 in #122
- test: use
T.TempDirto create temporary test directory by @Juneezee in #151 - update LoadAwareScheduling proposal by @eahydra in #155
- chore: fix test tempdir generation by @saintube in #156
- koordlet: support NodeMetricCollectPolicy by @eahydra in #157
- add cpu qos and mv nodeslo informer to states informer by @zwzhang0107 in #153
- update codecov configuration by @saintube in #131
- koordlet: support collect BE CPU metric by @jasonliu747 in #158
- apis: introduce cpu evict fields in NodeSLO by @jasonliu747 in #161
- Add pod annotations/labels for container level hook by @honpey in #165
- fix build errors by @hormes in #160
- ci: support running unit test on multiple os by @jasonliu747 in #162
- style: format header to fix ci errors by @jasonliu747 in #167
- Introduce image service proxy under cri scenario by @honpey in #168
- runtime-manager: refactor codes about store and resource-exectutor by @honpey in #170
- Support load aware scheduling by @eahydra in #159
- koord-scheduler: update scheduler apis groupName by @eahydra in #173
- test: add ut for configmap_event_handler by @jasonliu747 in #176
- refactor tests in nodemetric package by @hormes in #175
- vendor: goodbye vendor by @jasonliu747 in #149
- test: add ut for node_event_handler by @jasonliu747 in #177
- fix: add CPU Evict check in isFeatureDisabled by @jasonliu747 in #179
- Add the runtime-manager design doc by @honpey in #178
- chore: introduce dependabot by @jasonliu747 in #181
- add more tests by @hormes in #182
- chore: remove additional cache in golangci-lint by @jasonliu747 in #192
- api: remove deprecated field in NodeSLO by @jasonliu747 in #191
- Rename runtime-manager to koord-runtime-proxy by @honpey in #195
- add more tests by @hormes in #194
- koord-runtime-proxy: add installation manual by @honpey in #198
- add ChangeLog for v0.4.0 by @eahydra in #200
🎉 New Contributors:
- @shinytang6 made their first contribution in #129
- @tianzichenone made their first contribution in #142
- @Juneezee made their first contribution in #151
- @ZYecho made their first contribution in #128
Full Changelog: v0.3.1...0.4.0
v0.3.1
v0.3.0
✨ Features and improvements :
- Support CPU burst strategy #52
- Support Memory QoS strategy #55
- Support LLC and MBA isolation strategy #56
- Protocol design between runtime-manager and hook server #62
- Improve overall code coverage from 39% to 56% #69
🐛 Fixed bugs:
- when deploy on ACK 1.18.1 koord-manager pod always crash #49
- Handle unexpected CPU info in case of koordlet panic #90
⏫ Merged pull requests:
- New feature: cpu burst strategy #73 (stormgbs)
- Introduce protocol between RuntimeManager and RuntimeHookServer #76 (honpey)
- Improve readme #88 (hormes)
- update image file format #92 (zwzhang0107)
- 🌱 add expire cache #93 (jasonliu747)
- ✨ support LLC & MBA isolation #94 (jasonliu747)
- fix cpuinfo panic on arm64 #97 (saintube)
- 📖 fix typo in docs #98 (jasonliu747)
- Introduce HookServer config loading from /etc/runtime/hookserver.d/ #100 (honpey)
- add memory qos strategy #101 (saintube)
- add an issue template and rename feature request to proposal #108 (hormes)
- Introduce cri request parsing/generate-hook-request/checkpoing logic #110 (honpey)
- 🌱 add unit test for resmanager #111 (jasonliu747)
- Add cpu suppress test and revise memory qos #112 (saintube)
- ✨ Remove deprecated go get from Makefile #116 (jasonliu747)
- 🌱 add license checker in workflow #117 (jasonliu747)
- Support cpu burst strategy #118 (stormgbs)
- 🌱 add unit test for memory evict feature #119 (jasonliu747)
- add UTs for runtime handler #125 (saintube)
- 📖 add changelog for v0.3 #126 (jasonliu747)