Skip to content

Add port definition to vGPU monitor DaemonSet#1085

Open
ntheanh201 wants to merge 855 commits intoProject-HAMi:masterfrom
ntheanh201:fix/helm-monitor
Open

Add port definition to vGPU monitor DaemonSet#1085
ntheanh201 wants to merge 855 commits intoProject-HAMi:masterfrom
ntheanh201:fix/helm-monitor

Conversation

@ntheanh201
Copy link
Contributor

What type of PR is this?

What this PR does / why we need it:

Which issue(s) this PR fixes:
Fixes #

Special notes for your reviewer:

Does this PR introduce a user-facing change?:

lengrongfu and others added 30 commits August 9, 2024 14:44
Signed-off-by: rongfu.leng <lenronfu@gmail.com>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Bumps [docker/build-push-action](https://github.com/docker/build-push-action) from 6.6.0 to 6.6.1.
- [Release notes](https://github.com/docker/build-push-action/releases)
- [Commits](docker/build-push-action@v6.6.0...v6.6.1)

---
updated-dependencies:
- dependency-name: docker/build-push-action
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: william-wang <wang.platform@gmail.com>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
* fix: fix duplicate resource keys in configmap

* fix: Update incorrect component names in monitorservice
Bumps [github.com/opencontainers/runc](https://github.com/opencontainers/runc) from 1.1.2 to 1.1.12.
- [Release notes](https://github.com/opencontainers/runc/releases)
- [Changelog](https://github.com/opencontainers/runc/blob/main/CHANGELOG.md)
- [Commits](opencontainers/runc@v1.1.2...v1.1.12)

---
updated-dependencies:
- dependency-name: github.com/opencontainers/runc
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: rongfu.leng <lenronfu@gmail.com>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Bumps [github/codeql-action](https://github.com/github/codeql-action) from 2 to 3.
- [Release notes](https://github.com/github/codeql-action/releases)
- [Changelog](https://github.com/github/codeql-action/blob/main/CHANGELOG.md)
- [Commits](github/codeql-action@v2...v3)

---
updated-dependencies:
- dependency-name: github/codeql-action
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Bumps [actions/setup-go](https://github.com/actions/setup-go) from 4 to 5.
- [Release notes](https://github.com/actions/setup-go/releases)
- [Commits](actions/setup-go@v4...v5)

---
updated-dependencies:
- dependency-name: actions/setup-go
  dependency-type: direct:production
  update-type: version-update:semver-major
...

Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: rongfu.leng <lenronfu@gmail.com>
Signed-off-by: rongfu.leng <lenronfu@gmail.com>
Signed-off-by: rongfu.leng <lenronfu@gmail.com>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Bumps [github.com/opencontainers/runc](https://github.com/opencontainers/runc) from 1.1.12 to 1.1.14.
- [Release notes](https://github.com/opencontainers/runc/releases)
- [Changelog](https://github.com/opencontainers/runc/blob/main/CHANGELOG.md)
- [Commits](opencontainers/runc@v1.1.12...v1.1.14)

---
updated-dependencies:
- dependency-name: github.com/opencontainers/runc
  dependency-type: indirect
...

Signed-off-by: dependabot[bot] <support@github.com>
Bumps [docker/build-push-action](https://github.com/docker/build-push-action) from 6.6.1 to 6.7.0.
- [Release notes](https://github.com/docker/build-push-action/releases)
- [Commits](docker/build-push-action@v6.6.1...v6.7.0)

---
updated-dependencies:
- dependency-name: docker/build-push-action
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io>
Signed-off-by: limengxuan <391013634@qq.com>
Signed-off-by: limengxuan <391013634@qq.com>
Signed-off-by: rongfu.leng <lenronfu@gmail.com>
Wangmin362 and others added 19 commits April 27, 2025 19:02
Project-HAMi#1023)

Signed-off-by: wangmin <wangmin@riseunion.io>

Co-authored-by: wangmin <wangmin@riseunion.io>
…i#1021)

* feat: Support for using RuntimeClass with nvidia devices

Signed-off-by: 王然 <ranwang@alauda.io>

* docs: runtimeClassName

Signed-off-by: 王然 <ranwang@alauda.io>

* feat: reset hasResource logic

Signed-off-by: 王然 <ranwang@alauda.io>

---------

Signed-off-by: 王然 <ranwang@alauda.io>
…1020)

Signed-off-by: wangmin <wangmin@riseunion.io>

Co-authored-by: wangmin <wangmin@riseunion.io>
…t after ConfigMap modification (Project-HAMi#1022)

Signed-off-by: 王然 <ranwang@alauda.io>
 (Project-HAMi#1012)

Signed-off-by: ouyangluwei(riseunion) <ouyangluwei@riseunion.io>
Co-authored-by: ouyangluwei(riseunion) <ouyangluwei@riseunion.io>
add new ai accelerator GCU S60 made by https://www.enflame-tech.com

Signed-off-by: winston-zhang-orz <73474183+winston-zhang-orz@users.noreply.github.com>
* update cambricon devices

Signed-off-by: limengxuan <391013634@qq.com>

* update

Signed-off-by: limengxuan <391013634@qq.com>

* update

Signed-off-by: limengxuan <mengxuan.li@dynamia.ai>

* update

Signed-off-by: limengxuan <mengxuan.li@dynamia.ai>

---------

Signed-off-by: limengxuan <391013634@qq.com>
Signed-off-by: limengxuan <mengxuan.li@dynamia.ai>
…roject-HAMi#1031)

Fix scheduler metrics can not be accessed when using master branch of HAMi

Signed-off-by: limengxuan <mengxuan.li@dynamia.ai>
Signed-off-by: rongfu.leng <rongfu.leng@daocloud.io>
…roject-HAMi#938)

* Separate options from client to make the responsibility more clear.
Remove the magic number in the main function and define it as a constant.

Signed-off-by: yangshiqi <yangshiqi@riseunion.io>

* fix merge bugs and add testcase.
remove some comments to try e2e

Signed-off-by: yangshiqi <yangshiqi@riseunion.io>

* debug for e2e

Signed-off-by: yangshiqi <yangshiqi@riseunion.io>

* fix e2e error

Signed-off-by: yangshiqi <yangshiqi@riseunion.io>

---------

Signed-off-by: yangshiqi <yangshiqi@riseunion.io>
Co-authored-by: yangshiqi <yangshiqi@riseunion.io>
Signed-off-by: rongfu.leng <rongfu.leng@daocloud.io>
Bumps [docker/build-push-action](https://github.com/docker/build-push-action) from 6.16.0 to 6.17.0.
- [Release notes](https://github.com/docker/build-push-action/releases)
- [Commits](docker/build-push-action@v6.16.0...v6.17.0)

---
updated-dependencies:
- dependency-name: docker/build-push-action
  dependency-version: 6.17.0
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] <support@github.com>
Signed-off-by: Shouren Yang <yangshouren@gmail.com>
Signed-off-by: wawa0210 <xiao.zhang@dynamia.ai>
Signed-off-by: Lei Guo <Lei.Guo@metax-tech.com>
Co-authored-by: Lei Guo <Lei.Guo@metax-tech.com>
…patibility with prerelease versions (Project-HAMi#1072)

- Updates `kubeVersion` from '>= 1.16.0' to '>= 1.18.0-0' to support GKE pre-release versions like v1.32.3-gke.*
- Updates README to reflect the minimum supported Kubernetes version

Signed-off-by: Yu Yin <yu.yin@dynamia.ai>
Co-authored-by: Yu Yin <yu.yin@dynamia.ai>
Signed-off-by: The Anh Nguyen <ntheanh201@gmail.com>
@hami-robott
Copy link
Contributor

hami-robott bot commented May 26, 2025

[APPROVALNOTIFIER] This PR is NOT APPROVED

This pull-request has been approved by: ntheanh201
Once this PR has been reviewed and has the lgtm label, please assign wawa0210 for approval. For more information see the Kubernetes Code Review Process.

The full list of commands accepted by this bot can be found here.

Details Needs approval from an approver in each of these files:

Approvers can indicate their approval by writing /approve in a comment
Approvers can cancel approval by writing /approve cancel in a comment

@hami-robott hami-robott bot added the size/XS label May 26, 2025
@codecov
Copy link

codecov bot commented May 27, 2025

Codecov Report

✅ All modified and coverable lines are covered by tests.

Flag Coverage Δ
unittests 61.01% <ø> (-0.07%) ⬇️

Flags with carried forward coverage won't be shown. Click here to find out more.
see 1 file with indirect coverage changes

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@wawa0210
Copy link
Member

/cc @archlitchi @lengrongfu Can you help review this pr ?

@wawa0210 wawa0210 added the kind/enhancement New feature or request label Jun 16, 2025
@FouoF
Copy link
Contributor

FouoF commented Jul 21, 2025

Can you describe more about why we need this pr?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.