Skip to content

Optimize the handling of the return messages of the extender scheduler#908

Open
learner0810 wants to merge 874 commits intoProject-HAMi:masterfrom
learner0810:enhance-http-response
Open

Optimize the handling of the return messages of the extender scheduler#908
learner0810 wants to merge 874 commits intoProject-HAMi:masterfrom
learner0810:enhance-http-response

Conversation

@learner0810
Copy link
Contributor

What type of PR is this?

/kind flake

What this PR does / why we need it:

Which issue(s) this PR fixes:
Fixes #

Special notes for your reviewer:

Optimize the handling of the return messages of the extender scheduler

Does this PR introduce a user-facing change?:

lengrongfu and others added 30 commits September 2, 2024 11:05
Signed-off-by: rongfu.leng <lenronfu@gmail.com>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Bumps [github.com/opencontainers/runc](https://github.com/opencontainers/runc) from 1.1.12 to 1.1.14.
- [Release notes](https://github.com/opencontainers/runc/releases)
- [Changelog](https://github.com/opencontainers/runc/blob/main/CHANGELOG.md)
- [Commits](opencontainers/runc@v1.1.12...v1.1.14)

---
updated-dependencies:
- dependency-name: github.com/opencontainers/runc
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Bumps [docker/build-push-action](https://github.com/docker/build-push-action) from 6.6.1 to 6.7.0.
- [Release notes](https://github.com/docker/build-push-action/releases)
- [Commits](docker/build-push-action@v6.6.1...v6.7.0)

---
updated-dependencies:
- dependency-name: docker/build-push-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Signed-off-by: limengxuan <391013634@qq.com>
Signed-off-by: limengxuan <391013634@qq.com>
Signed-off-by: rongfu.leng <lenronfu@gmail.com>
* fix golang-lint

Signed-off-by: limengxuan <391013634@qq.com>

* update CodeQL CI

Signed-off-by: limengxuan <391013634@qq.com>

* update ci and remove unused binary in order to support codeQL

Signed-off-by: limengxuan <391013634@qq.com>

---------

Signed-off-by: limengxuan <391013634@qq.com>
Signed-off-by: limengxuan <391013634@qq.com>
Signed-off-by: limengxuan <391013634@qq.com>
Signed-off-by: zoyopei <zoyopei@gmail.com>
Signed-off-by: limengxuan <391013634@qq.com>
Signed-off-by: zoyopei <zoyopei@gmail.com>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Signed-off-by: rongfu.leng <lenronfu@gmail.com>
…late

Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Signed-off-by: limengxuan <391013634@qq.com>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Bumps [docker/build-push-action](https://github.com/docker/build-push-action) from 6.7.0 to 6.8.0.
- [Release notes](https://github.com/docker/build-push-action/releases)
- [Commits](docker/build-push-action@v6.7.0...v6.8.0)

---
updated-dependencies:
- dependency-name: docker/build-push-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: limengxuan <391013634@qq.com>
Kyrie336 and others added 23 commits May 22, 2025 16:00
Signed-off-by: Lei Guo <Lei.Guo@metax-tech.com>
Co-authored-by: Lei Guo <Lei.Guo@metax-tech.com>
…patibility with prerelease versions (Project-HAMi#1072)

- Updates `kubeVersion` from '>= 1.16.0' to '>= 1.18.0-0' to support GKE pre-release versions like v1.32.3-gke.*
- Updates README to reflect the minimum supported Kubernetes version

Signed-off-by: Yu Yin <yu.yin@dynamia.ai>
Co-authored-by: Yu Yin <yu.yin@dynamia.ai>
Signed-off-by: wawa0210 <xiao.zhang@dynamia.ai>
Signed-off-by: wawa0210 <xiaozhang0210@hotmail.com>
Bumps [docker/build-push-action](https://github.com/docker/build-push-action) from 6.17.0 to 6.18.0.
- [Release notes](https://github.com/docker/build-push-action/releases)
- [Commits](docker/build-push-action@v6.17.0...v6.18.0)

---
updated-dependencies:
- dependency-name: docker/build-push-action
  dependency-version: 6.18.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: rongfu.leng <rongfu.leng@daocloud.io>
Signed-off-by: The Anh Nguyen <ntheanh201@gmail.com>
…DList

Signed-off-by: Shouren Yang <yangshouren@gmail.com>
…ByNode, EncodeContainerDeviceType and PatchPodAnnotations

Signed-off-by: Shouren Yang <yangshouren@gmail.com>
* feat: vGPUmonitor support MigInfo metrics Project-HAMi#1043

Signed-off-by: ouyangluwei(riseunion) <ouyangluwei@riseunion.io>

* feat: vGPUmonitor support MigInfo metrics Project-HAMi#1043

Signed-off-by: ouyangluwei(riseunion) <ouyangluwei@riseunion.io>

---------

Signed-off-by: ouyangluwei(riseunion) <ouyangluwei@riseunion.io>
Co-authored-by: ouyangluwei(riseunion) <ouyangluwei@riseunion.io>
Signed-off-by: rongfu.leng <rongfu.leng@daocloud.io>
Signed-off-by: limengxuan <mengxuan.li@dynamia.ai>
…#1097)

* Optimize Fit-in-device logic to make it device-specific

Signed-off-by: limengxuan <mengxuan.li@dynamia.ai>
- Add configurable node lock timeout parameter to scheduler config
- Add --node-lock-timeout flag to scheduler command line options
- Change NodeLockTimeout type from int to time.Duration for better type safety
- Update command line flag to use DurationVar instead of IntVar
- Remove default value description from comment as suggested in code review
- Initialize nodelock.NodeLockTimeout from config at startup
- Default timeout set to 5 minutes but now can be customized
- Support flexible time units (e.g., 30s, 2m, 1h) instead of just minutes
- This allows operators to tune lock timeout based on cluster characteristics

This addresses code review feedback from wawa0210 regarding:
1. Using proper time.Duration type for timeout configuration
2. Removing redundant default value comments

Signed-off-by: Kevinz857 <kevinnz@foxmail.com>
Signed-off-by: Goend <jian.mei@easystack.cn>
Bumps [aquasecurity/trivy-action](https://github.com/aquasecurity/trivy-action) from 0.30.0 to 0.31.0.
- [Release notes](https://github.com/aquasecurity/trivy-action/releases)
- [Commits](aquasecurity/trivy-action@0.30.0...0.31.0)

---
updated-dependencies:
- dependency-name: aquasecurity/trivy-action
  dependency-version: 0.31.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
fix golint-CI

Signed-off-by: limengxuan <mengxuan.li@dynamia.ai>
Signed-off-by: wawa0210 <xiao.zhang@dynamia.ai>
Support topology-awareness for Kunlunxin device

Signed-off-by: limengxuan <mengxuan.li@dynamia.ai>
Signed-off-by: Lei Guo <Lei.Guo@metax-tech.com>
Co-authored-by: Lei Guo <Lei.Guo@metax-tech.com>
Signed-off-by: lixd <xueduan.li@gmail.com>
Signed-off-by: learner0810 <zhongjun.li@daocloud.io>
@hami-robot
Copy link
Contributor

hami-robot bot commented Jun 23, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: learner0810
Once this PR has been reviewed and has the lgtm label, please assign wawa0210 for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@hami-robot hami-robot bot added the size/M label Jun 23, 2025
@codecov
Copy link

codecov bot commented Jun 23, 2025

Codecov Report

❌ Patch coverage is 52.51142% with 104 lines in your changes missing coverage. Please review.

Files with missing lines Patch % Lines
pkg/device/ascend/device.go 52.51% 101 Missing and 3 partials ⚠️
Flag Coverage Δ
unittests 53.04% <52.51%> (?)

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines Coverage Δ
pkg/device/cambricon/device.go 56.89% <ø> (ø)
pkg/device/common/common.go 100.00% <ø> (ø)
pkg/device/devices.go 73.65% <ø> (ø)
pkg/device/enflame/device.go 50.44% <ø> (ø)
pkg/device/hygon/device.go 63.00% <ø> (ø)
pkg/device/iluvatar/device.go 53.84% <ø> (ø)
pkg/device/kunlun/device.go 43.30% <ø> (ø)
pkg/device/metax/config.go 100.00% <ø> (ø)
pkg/device/metax/device.go 41.20% <ø> (ø)
pkg/device/metax/protocol.go 80.00% <ø> (ø)
... and 26 more
🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.