Skip to content
Open
Show file tree
Hide file tree
Changes from all commits
Commits
Show all changes
637 commits
Select commit Hold shift + click to select a range
4ff59ba
Merge pull request #179 from lengrongfu/feat/replace-hamish
archlitchi Mar 13, 2024
b789a7f
Merge pull request #180 from calvin0327/add-robot
archlitchi Mar 13, 2024
155390c
Bump actions/stale from 6.0.1 to 9.0.0
dependabot[bot] Mar 13, 2024
595453b
Bump arduino/setup-protoc from 1.1.2 to 3.0.0
dependabot[bot] Mar 13, 2024
0672ed6
Bump k8s.io/klog/v2 from 2.100.1 to 2.120.1
dependabot[bot] Mar 13, 2024
093b5bd
Merge pull request #183 from Project-HAMi/dependabot/github_actions/a…
dependabot[bot] Mar 13, 2024
51d45d2
Merge pull request #188 from Project-HAMi/dependabot/go_modules/k8s.i…
dependabot[bot] Mar 13, 2024
aa46ce6
Bump github.com/google/uuid from 1.4.0 to 1.6.0
dependabot[bot] Mar 13, 2024
03dbaab
fix GPU not available in v2.3.9
Mar 14, 2024
4195db8
Merge branch 'master' of github.com:Project-HAMi/HAMi
Mar 14, 2024
f9732ba
Update configmap.yaml
sonhosya Mar 14, 2024
146a524
Merge pull request #200 from Project-HAMi/dependabot/go_modules/githu…
dependabot[bot] Mar 14, 2024
6da1414
Bump github.com/fsnotify/fsnotify from 1.6.0 to 1.7.0
dependabot[bot] Mar 14, 2024
c9beaa4
fix node config
Mar 14, 2024
58275e5
Merge branch 'master' of github.com:Project-HAMi/HAMi
Mar 14, 2024
452860a
Bump golang.org/x/net from 0.17.0 to 0.22.0
dependabot[bot] Mar 14, 2024
73a0bdf
fix devices registry name error for Cambricon and Hygon this two type
lengrongfu Mar 15, 2024
2070c82
fix: init controller-runtime log.SetLogger()
lengrongfu Mar 16, 2024
52ba1bd
fix klog upgrade after verify error
lengrongfu Mar 19, 2024
15a49c4
Merge pull request #214 from lengrongfu/fix/klog-upgrade
chaunceyjiang Mar 19, 2024
f986386
fix: when container nou user device resource index out of range
lengrongfu Mar 19, 2024
bf27c46
Generate different image tags for each PR
chaunceyjiang Mar 20, 2024
658d721
Optimize scheduling cache. Accelerate the discovery and deletion of n…
chaunceyjiang Mar 20, 2024
5348a86
add ut action
CoderTH Mar 21, 2024
37b4330
add ut action
CoderTH Mar 21, 2024
944780d
Bump docker/setup-buildx-action from 1 to 3
dependabot[bot] Mar 21, 2024
37b373d
Bump docker/login-action from 1.10.0 to 3.1.0
dependabot[bot] Mar 21, 2024
442db44
Merge pull request #215 from lengrongfu/fix/check-mulit-container
archlitchi Mar 26, 2024
f3c7220
Merge pull request #210 from lengrongfu/fix/logger_mode
archlitchi Mar 26, 2024
9a1a707
Merge pull request #207 from lengrongfu/fix/devices-registry-name
archlitchi Mar 26, 2024
ddc4054
fix: when multi container use device allocate fail
lengrongfu Mar 20, 2024
45fa7d9
Merge pull request #217 from lengrongfu/fix/deviceplugin-multicontain…
archlitchi Mar 26, 2024
28e7b11
init commit
haitwang-cloud Mar 12, 2024
e686398
Merge branch 'master' into add-webhook-ut
haitwang-cloud Mar 26, 2024
f126de9
remove duplicate code
haitwang-cloud Mar 26, 2024
af0ba46
add use can specify use or not use device id feature
lengrongfu Mar 15, 2024
fd23347
add lint script & fix code that fails static checking
CoderTH Mar 22, 2024
a375ed0
Merge pull request #225 from CoderTH/add-ci-lint
wawa0210 Mar 27, 2024
5def0ca
Bump actions/checkout from 2 to 4
dependabot[bot] Mar 27, 2024
1f8df63
Merge pull request #211 from lengrongfu/feat/add-user-uuid
archlitchi Mar 27, 2024
e529f78
modify ci
Mar 27, 2024
dc54dc9
fix code lint
lengrongfu Mar 27, 2024
f9a6e98
Merge pull request #229 from lengrongfu/fix/lint-check
wawa0210 Mar 27, 2024
0df52c8
upgrade HAMi-core version
chaunceyjiang Mar 27, 2024
04a570b
Merge pull request #230 from chaunceyjiang/core
archlitchi Mar 27, 2024
e7368cb
Use NODE_NAME as an environment variable uniformly
lengrongfu Mar 27, 2024
11f07cc
add a robot to mark labels
Mar 27, 2024
7156c22
Delete useless symbols and fix descriptive errors in yaml examples.
yangcheng-dev Mar 28, 2024
d25935d
Merge pull request #233 from yangcheng-dev/master
archlitchi Mar 28, 2024
cc04c5b
Merge pull request #159 from CoderTH/add-issue-temple
archlitchi Mar 28, 2024
c038d9c
Merge pull request #232 from calvin0327/add-label-robot
archlitchi Mar 28, 2024
e2ae421
update
Apr 1, 2024
57d0892
Merge pull request #216 from chaunceyjiang/schedule_score
archlitchi Apr 1, 2024
8fca4c5
Merge pull request #231 from lengrongfu/feat/NODE_NAME-name-uniformly
archlitchi Apr 1, 2024
5bb21c6
Merge pull request #205 from Project-HAMi/dependabot/go_modules/golan…
Apr 2, 2024
94b3786
Merge pull request #191 from Project-HAMi/dependabot/go_modules/githu…
Apr 2, 2024
e665c54
Merge pull request #187 from Project-HAMi/dependabot/github_actions/a…
Apr 2, 2024
6f99086
Bump gotest.tools/v3 from 3.4.0 to 3.5.1
dependabot[bot] Apr 2, 2024
a0d9dc1
Bump github.com/golang/glog from 1.1.0 to 1.2.0
dependabot[bot] Apr 2, 2024
52d5e5d
Merge pull request #184 from Project-HAMi/dependabot/github_actions/d…
Apr 2, 2024
da8a7a7
Bump actions/setup-go from 4 to 5
dependabot[bot] Apr 2, 2024
c978bb2
Merge pull request #199 from Project-HAMi/dependabot/github_actions/d…
Apr 2, 2024
cb37e30
Merge pull request #189 from Project-HAMi/dependabot/go_modules/gotes…
Apr 2, 2024
d7de632
Merge pull request #190 from Project-HAMi/dependabot/go_modules/githu…
Apr 2, 2024
f4fb5cb
Merge pull request #186 from Project-HAMi/dependabot/github_actions/a…
Apr 2, 2024
efb342f
Merge pull request #224 from Project-HAMi/dependabot/github_actions/a…
Apr 2, 2024
8ceaae3
Merge pull request #176 from lengrongfu/feat/remove-config
Apr 2, 2024
0b0119d
remove testcollector binrary
Apr 2, 2024
d2d84a4
Merge pull request #237 from wawa0210/master
Apr 3, 2024
65ae712
fix make lint error
Apr 3, 2024
4a72922
Merge pull request #242 from wawa0210/master
Apr 3, 2024
bad7d41
Bump github.com/onsi/gomega from 1.27.10 to 1.32.0
dependabot[bot] Apr 3, 2024
13096f9
Bump github.com/urfave/cli/v2 from 2.4.0 to 2.27.1
dependabot[bot] Apr 3, 2024
ce2f27c
Merge pull request #240 from Project-HAMi/dependabot/go_modules/githu…
Apr 3, 2024
6fcb06e
Merge pull request #239 from Project-HAMi/dependabot/go_modules/githu…
Apr 3, 2024
c1f50fa
Bump github.com/NVIDIA/k8s-device-plugin from 0.14.1 to 0.14.5
dependabot[bot] Apr 3, 2024
b81ab9c
add cncf landscape logo
Apr 7, 2024
f85f801
Merge pull request #247 from wawa0210/master
Apr 7, 2024
8cb0eef
update offline_install docs
archlitchi Apr 7, 2024
72fa079
Merge branch 'master' of github.com:Project-HAMi/HAMi
archlitchi Apr 7, 2024
58f61ff
Merge branch 'master' into add-webhook-ut
haitwang-cloud Apr 8, 2024
70a28c9
fix with lint
haitwang-cloud Apr 8, 2024
203ec33
Update offline-install.md
Apr 8, 2024
f829d94
update slack url
Apr 9, 2024
c65bd06
Merge pull request #250 from wawa0210/master
Apr 9, 2024
13fc125
add code of conduct
Apr 9, 2024
15389c1
Merge pull request #251 from wawa0210/master
Apr 9, 2024
c20e4e0
add contributing guideline and update contact
Apr 9, 2024
50add27
remove gitlab ci file
Apr 9, 2024
788707d
Merge pull request #252 from wawa0210/master
Apr 9, 2024
6934ce4
Merge pull request #253 from wawa0210/remote-gitlab
Apr 9, 2024
b644b11
Merge pull request #227 from haitwang-cloud/add-webhook-ut
Apr 9, 2024
d6a9663
add contributer ladder docs
Apr 9, 2024
8da3329
Merge pull request #243 from Project-HAMi/dependabot/go_modules/githu…
Apr 9, 2024
ef20f88
Bump github.com/stretchr/testify from 1.8.4 to 1.9.0
dependabot[bot] Apr 9, 2024
9633195
Merge pull request #244 from Project-HAMi/dependabot/go_modules/githu…
Apr 10, 2024
bbef03b
Merge pull request #254 from wawa0210/master
Apr 10, 2024
be8a1c2
doc: modify the error .so file description of the iluvatar device
LOMOGO Apr 10, 2024
8381474
Merge pull request #260 from LOMOGO/master
Apr 10, 2024
df82321
Bump nvidia/cuda in /docker
dependabot[bot] Apr 10, 2024
5212173
add another dockerfile with lib
archlitchi Apr 12, 2024
305060e
add another dockerfile with lib
archlitchi Apr 12, 2024
7ee021e
Merge branch 'master' of github.com:Project-HAMi/HAMi
archlitchi Apr 12, 2024
037975f
remove libvgpu.so
archlitchi Apr 12, 2024
f823372
trivy:fix specify card type to use error
Apr 12, 2024
7068d17
Merge pull request #266 from wawa0210/master
Apr 12, 2024
30c896e
update hami license and add license check (#268)
Apr 12, 2024
3617dde
fix bad link in README (#270)
panpan0000 Apr 15, 2024
2c1c469
add aliases import validation
Apr 16, 2024
9f445ac
fix: When a node cannot bind, it should stop scheduling pods to that …
chaunceyjiang Apr 16, 2024
3b7b606
update CONTRIBUTING.md
Apr 17, 2024
4e9dedb
Merge pull request #273 from chaunceyjiang/bind
Apr 17, 2024
b918011
Support GPU Utilization Metrics
chaunceyjiang Apr 18, 2024
68e3e3a
support new cambricon devices
archlitchi Apr 22, 2024
182fe85
remove unused files
archlitchi Apr 22, 2024
d9a8726
update mlu-related docs and examples
archlitchi Apr 22, 2024
c3c1d15
Merge pull request #280 from chaunceyjiang/device_util
Apr 22, 2024
1e781d7
update libvgpu
archlitchi Apr 22, 2024
103b2b6
Merge branch 'master' of github.com:Project-HAMi/HAMi
archlitchi Apr 22, 2024
0e2a6e4
add scheduler policy design doc
lengrongfu Feb 4, 2024
d472283
scheduler server add leader elect
lengrongfu Mar 13, 2024
e4a4600
Merge pull request #196 from lengrongfu/feat/add-leader-elect-param
Apr 22, 2024
4f1a323
Merge pull request #152 from lengrongfu/add_scheduler_policy
Apr 22, 2024
c30db33
fix nvidia device plugin nodename init sort error
Apr 25, 2024
4dd84d3
fix metrics error
archlitchi Apr 25, 2024
e892934
Merge branch 'master' of github.com:Project-HAMi/HAMi
archlitchi Apr 25, 2024
90af849
fix RegistrInAnnotation log time error
Apr 28, 2024
54715cd
fix device plugin crash whrn numa get err (#290)
Apr 28, 2024
d59ef57
update
archlitchi Apr 28, 2024
f859cf9
Merge branch 'master' of github.com:Project-HAMi/HAMi
archlitchi Apr 28, 2024
d770a47
To avoid swallowing exception information during device stop, it shou…
Apr 30, 2024
de36c8c
add break change document
lengrongfu Apr 30, 2024
7a5b7ea
Replace project architecture diagram (#296)
totapotato May 6, 2024
e678dbe
fix gpu type matching mechanism
May 6, 2024
fd8deca
triv:Remove the build double quotes
May 7, 2024
3648648
add support for 910B devices
archlitchi May 11, 2024
17bf9e5
Merge branch 'master' of github.com:Project-HAMi/HAMi
archlitchi May 11, 2024
548892a
update webhook and charts
archlitchi May 11, 2024
c0ce784
remove github robot
May 13, 2024
50cc567
update docs for v2.3.11
archlitchi May 13, 2024
dda23af
Merge branch 'master' of github.com:Project-HAMi/HAMi
archlitchi May 13, 2024
1cf3470
update ascend_docs
archlitchi May 13, 2024
45b97cd
update ascend_docs
archlitchi May 13, 2024
81fea57
add hami repos
May 13, 2024
9065c76
Merge pull request #263 from Project-HAMi/dependabot/docker/docker/nv…
hami-robott[bot] May 16, 2024
1c7418b
update dockerwithlib
archlitchi May 16, 2024
f2e3126
Merge branch 'master' of github.com:Project-HAMi/HAMi
archlitchi May 16, 2024
17052d8
add scheduler command
CoderTH May 15, 2024
5b04c5b
add scheduler policy doc (#312)
CoderTH May 17, 2024
796e5c5
update docs
archlitchi May 21, 2024
c14da94
Do not use job-patch to modify failurePolicy.
chaunceyjiang May 20, 2024
018881d
update
archlitchi May 23, 2024
834ba44
Merge branch 'master' of github.com:Project-HAMi/HAMi
archlitchi May 23, 2024
2e839f0
remove hygon related files
archlitchi May 23, 2024
7189c97
update docs
archlitchi May 23, 2024
e341df8
update hygon
archlitchi May 23, 2024
a7c1619
update charts
archlitchi May 24, 2024
3c0c045
update image
archlitchi May 24, 2024
e3305a6
update cambricon docs
archlitchi May 24, 2024
eafb2fd
update docs
archlitchi May 24, 2024
b398634
update
archlitchi May 27, 2024
5df963d
update
archlitchi May 27, 2024
18e86e5
update
archlitchi May 28, 2024
4af7069
The annotation of the device that has been deleted is incorrect. (#323)
chaunceyjiang May 28, 2024
70cd1e3
fix bugs
archlitchi May 31, 2024
29f353a
update hami roadmp
May 31, 2024
39baf55
Bump docker/login-action from 3.1.0 to 3.2.0
dependabot[bot] May 28, 2024
299e421
enhance unregisterd node logs
May 23, 2024
7ea1e8f
Bump ubuntu from 20.04 to 24.04 in /docker
dependabot[bot] May 23, 2024
cabb7fb
enhance get current node pendding pods (#340)
Jun 3, 2024
4399dc1
Fix the issue where scheduling can still occur on the node when the d…
chaunceyjiang Jun 4, 2024
8f82997
:bulb: Upate Benchmark build script and deployment files (#279)
haitwang-cloud Jun 4, 2024
7e23ec4
add pod utilization dashboard (#342)
CoderTH Jun 5, 2024
b379f1d
fix doc (#337)
lengrongfu Jun 11, 2024
128611a
fix: typo in use_exclusive_card.yaml and add new exclusive example (#…
JasonHe-WQ Jun 17, 2024
a8a4830
Merge branch 'master' of github.com:Project-HAMi/HAMi
archlitchi Jun 17, 2024
7ed4a58
optimization node compute score (#336)
lengrongfu Jun 19, 2024
7ffc4da
Bump golang.org/x/net from 0.22.0 to 0.26.0 (#344)
dependabot[bot] Jun 21, 2024
d063d1f
Merge branch 'master' of github.com:Project-HAMi/HAMi
archlitchi Jun 24, 2024
790fe12
Bump github.com/spf13/cobra from 1.7.0 to 1.8.1 (#362)
dependabot[bot] Jun 25, 2024
972d9f3
feat: Update webhook.yaml to whitelist kube-system and istio-system n…
haitwang-cloud Jun 25, 2024
af5767f
Fix HostGPUMemoryUsage wrong unit (#367)
joy717 Jun 25, 2024
53df0f2
upgrade go-nvml version (#371)
lengrongfu Jun 27, 2024
8b5e5b8
feat: add liveness probe on scheduler extender (#354)
JasonHe-WQ Jun 28, 2024
e0427fb
Merge branch 'master' of github.com:Project-HAMi/HAMi
archlitchi Jul 1, 2024
1d9faf4
docs: Optimize the user experience of using documents (#380)
wjp-cn Jul 10, 2024
ced2900
feat: support huawei ascend310p
peizhaoyou Jul 15, 2024
34bc150
Merge branch 'master' of github.com:Project-HAMi/HAMi
archlitchi Jul 17, 2024
22ef124
update charts
archlitchi Jul 19, 2024
ce92821
fix: CUDA_DISABLE_CONTROL not working properly (#384)
joy717 Jul 25, 2024
8f0ef98
feat: Add GPU node selector to scheduler deployment (#376)
haitwang-cloud Jul 25, 2024
92acc44
bump github.com/NVIDIA/k8s-device-plugin v0.15.0 (#298)
morlay Jul 25, 2024
87bd0fa
add workflow to build chart tag and image (#309)
calvin0327 Jul 25, 2024
2ffa5ac
update mountPath
archlitchi Jul 25, 2024
526cb3d
Merge branch 'master' of github.com:Project-HAMi/HAMi
archlitchi Jul 25, 2024
83197c8
update chart version
archlitchi Jul 25, 2024
6399471
update golang version
archlitchi Jul 25, 2024
f5a106b
update golang version
archlitchi Jul 25, 2024
45e16cc
remove arm64 from release ci
archlitchi Jul 25, 2024
03d17d6
add submodule to build-image-release
archlitchi Jul 25, 2024
9948e40
update master ci
archlitchi Jul 26, 2024
c7d3dbf
update ci
archlitchi Jul 26, 2024
cdc7262
update ci
archlitchi Jul 26, 2024
eb14ee2
update ci
archlitchi Jul 26, 2024
7e930b9
update ci
archlitchi Jul 26, 2024
0901691
update ci
archlitchi Jul 26, 2024
3f60981
Update MAINTAINERS.md
Jul 29, 2024
07b95b9
fixed go build image version
chaunceyjiang Jul 29, 2024
38313a4
fix goproxy.cn
archlitchi Aug 2, 2024
f1f3093
Support GPU Utilization Metrics (#258)
chaunceyjiang Aug 2, 2024
3ddebda
disable PreferredAllocation
lengrongfu Aug 2, 2024
2c972a0
Bump azure/setup-helm from 3 to 4
dependabot[bot] Aug 2, 2024
42c5754
Bump docker/build-push-action from 5.1.0 to 6.5.0
dependabot[bot] Aug 2, 2024
7a365d0
Bump docker/login-action from 3.2.0 to 3.3.0
dependabot[bot] Jul 26, 2024
facfff3
Signed-off-by: wawa0210 <xiao.zhang@daocloud.io> (#417)
Aug 5, 2024
83273db
update ci
archlitchi Aug 6, 2024
2a8884c
Merge branch 'master' of github.com:Project-HAMi/HAMi
archlitchi Aug 6, 2024
a3b625c
Bump docker/build-push-action from 6.5.0 to 6.6.0 (#420)
dependabot[bot] Aug 8, 2024
901b43a
optimization code
lengrongfu Jul 26, 2024
7a6319d
add hami community Governance
Aug 12, 2024
1d5f0e5
update roadmap&update community governace
Aug 12, 2024
2cf0dec
Bump docker/build-push-action from 6.6.0 to 6.6.1
dependabot[bot] Aug 8, 2024
adfc166
add vgpu doc and update the readme. (#430)
william-wang Aug 13, 2024
4b88138
add image vulnerability scanner
Aug 13, 2024
090c327
add hami vulnerability scan and report
Aug 13, 2024
c33bc6b
Update ci-image-scanning.yaml
Aug 13, 2024
b3dd1ca
Update ci-image-scanning.yaml
Aug 13, 2024
c81f6fd
Update ci-image-scanning.yaml
Aug 13, 2024
1ea64f3
fix: fix duplicate resource keys in configmap (#422)
devenami Aug 14, 2024
d534a1a
Bump github.com/opencontainers/runc from 1.1.2 to 1.1.12
dependabot[bot] Aug 14, 2024
5c5c393
fix data race when read pods info
lengrongfu Aug 7, 2024
5f71955
add codeql analysis
Aug 14, 2024
7a677c7
Bump github/codeql-action from 2 to 3
dependabot[bot] Aug 19, 2024
56bb5e9
Bump actions/setup-go from 4 to 5
dependabot[bot] Aug 19, 2024
b253768
update hami logo (#455)
lengrongfu Aug 27, 2024
07902a0
update hami logo (#456)
lengrongfu Aug 29, 2024
321e042
feat: Support for multiple versions of cudevshr for vGPUmonitor (#458)
zoyopei Sep 2, 2024
6683c37
add node record pod info (#451)
lengrongfu Sep 2, 2024
fcfee50
update hami sandbox project
Sep 4, 2024
5a5f955
Add OpenSSF Best Practices Badge
Sep 4, 2024
22112d3
add Coverage Analytics
Sep 6, 2024
5dfe03a
fix OpenSSF Best Practices
Sep 6, 2024
ca49092
Bump github.com/opencontainers/runc from 1.1.12 to 1.1.14
dependabot[bot] Sep 6, 2024
14faf52
Bump docker/build-push-action from 6.6.1 to 6.7.0
dependabot[bot] Aug 13, 2024
1682454
add issue template
Sep 9, 2024
7428a83
Optimize monitoring performance
coldzerofear Sep 10, 2024
647eb17
Update pod.go
coldzerofear Sep 13, 2024
3f6f308
update
coldzerofear Sep 13, 2024
ec30019
update
coldzerofear Sep 13, 2024
1592b47
update
coldzerofear Sep 13, 2024
File filter

Filter by extension

Filter by extension


Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
33 changes: 33 additions & 0 deletions .github/ISSUE_TEMPLATE/bug-report.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,33 @@
---
name: Bug Report
about: Report a bug encountered while using HAMi.
labels: kind/bug

---

<!-- Please use this template while reporting a bug and provide as much info as possible. Not doing so may result in your bug not being addressed in a timely manner. Thanks!
-->

**What happened**:

**What you expected to happen**:

**How to reproduce it (as minimally and precisely as possible)**:

**Anything else we need to know?**:

- The output of `nvidia-smi -a` on your host
- Your docker or containerd configuration file (e.g: `/etc/docker/daemon.json`)
- The hami-device-plugin container logs
- The hami-scheduler container logs
- The kubelet logs on the node (e.g: `sudo journalctl -r -u kubelet`)
- Any relevant kernel output lines from `dmesg`

**Environment**:
- HAMi version:
- nvidia driver or other AI device driver version:
- Docker version from `docker version`
- Docker command, image and tag used
- Kernel version from `uname -a`
- Others:

22 changes: 22 additions & 0 deletions .github/ISSUE_TEMPLATE/enhancement.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
---
name: Enhancement Request
about: Suggest an enhancement to the project
labels: kind/feature

---
<!-- Please only use this template for submitting enhancement requests -->

**What would you like to be added**:

**What type of PR is this?**

/kind feature

**What this PR does / why we need it**:

**Which issue(s) this PR fixes**:
Fixes #

**Special notes for your reviewer**:

**Does this PR introduce a user-facing change?**:
31 changes: 31 additions & 0 deletions .github/ISSUE_TEMPLATE/good-first.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,31 @@
---
name: Good First Issue
about: Publish a good first issue
labels: good first issue

---

<!-- Please use this template while publishing a good first issue. Thanks!
-->

**Task description**:

**Solution**:

**Who can join or take the task**:

The good first issue is intended for `first-time contributors` to get started on his/her contributor journey.

After a contributor has successfully completed 1-2 good first issue's,
they should be ready to move on to `help wanted` items, saving the remaining `good first issue` for other new contributors.

**How to join or take the task**:

Just reply on the issue with the message `/assign` in a separate line.

Then, the issue will be assigned to you.

**How to ask for help**:

If you need help or have questions, please feel free to ask on this issue.
The issue author or other members of the community will guide you through the contribution process.
15 changes: 15 additions & 0 deletions .github/ISSUE_TEMPLATE/question.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,15 @@
---
name: Question
about: Question relating to HAMi.
labels: kind/question

---

**Please provide an in-depth description of the question you have**:

**What do you think about this question?**:

**Environment**:
- HAMi version:
- Kubernetes version:
- Others:
22 changes: 22 additions & 0 deletions .github/PULL_REQUEST_TEMPLATE.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,22 @@
**What type of PR is this?**

<!--
Add one of the following kinds:
/kind bug
/kind cleanup
/kind deprecation
/kind design
/kind documentation
/kind failing-test
/kind feature
/kind flake
-->

**What this PR does / why we need it**:

**Which issue(s) this PR fixes**:
Fixes #

**Special notes for your reviewer**:

**Does this PR introduce a user-facing change?**:
21 changes: 21 additions & 0 deletions .github/dependabot.yml
Original file line number Diff line number Diff line change
@@ -0,0 +1,21 @@

# To get started with Dependabot version updates, you'll need to specify which
# package ecosystems to update and where the package manifests are located.
# Please see the documentation for all configuration options:
# https://docs.github.com/github/administering-a-repository/configuration-options-for-dependency-updates


version: 2
updates:
- package-ecosystem: "gomod"
directory: "/"
schedule:
interval: "daily"
- package-ecosystem: "docker"
directory: "/docker"
schedule:
interval: "daily"
- package-ecosystem: "github-actions"
directory: "/"
schedule:
interval: "daily"
36 changes: 36 additions & 0 deletions .github/workflows/build-helm-release.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,36 @@
name: Release helm

on:
workflow_dispatch:
push:
branches:
- main
tags:
- v*

jobs:
helm-release:
runs-on: ubuntu-latest
steps:
- name: Checkout
uses: actions/checkout@v4
with:
fetch-depth: 0

- name: Configure Git
run: |
git config user.name "$GITHUB_ACTOR"
git config user.email "$GITHUB_ACTOR@users.noreply.github.com"

- name: Install Helm
uses: azure/setup-helm@v4
with:
version: v3.7.1

- name: Run chart-releaser
uses: helm/chart-releaser-action@v1.6.0
with:
charts_dir: charts
env:
CR_TOKEN: "${{ secrets.GITHUB_TOKEN }}"

77 changes: 77 additions & 0 deletions .github/workflows/build-image-release.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,77 @@
name: Release Arm Image

env:
REGISTRY: docker.io
IMAGE_REPO: projecthami/hami
IMAGE_ROOT_PATH: docker
BUILD_PLATFORM: linux/arm64
REGISTER_USER: ${{ github.actor }}
REGISTER_PASSWORD: ${{ secrets.GITHUB_TOKEN }}

on:
workflow_dispatch:
push:
tags:
- v*

jobs:
docker-build:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@master
- name: Get the version
id: get_version
run: |
VERSION=${GITHUB_REF#refs/tags/}
if [[ ${GITHUB_REF} == "refs/heads/main" ]]; then
VERSION=latest
fi
echo ::set-output name=VERSION::${VERSION}

- name: Checkout code
uses: actions/checkout@v4
with:
fetch-depth: 0

- name: Checkout submodule
uses: Mushus/checkout-submodule@v1.0.1
with:
basePath: # optional, default is .
submodulePath: libvgpu

- name: Docker Login
uses: docker/login-action@v3.3.0
with:
username: ${{ secrets.DOCKERHUB_TOKEN }}
password: ${{ secrets.DOCKERHUB_PASSWD }}

- name: Set up QEMU
uses: docker/setup-qemu-action@v3

- name: Set up Docker Buildx
uses: docker/setup-buildx-action@v3
with:
driver-opts: image=moby/buildkit:master

- name: Extract metadata (tags, labels) for Docker
id: meta
uses: docker/metadata-action@v5
with:
images: ${{ env.REGISTRY }}/${{ env.IMAGE_REPO }}

- name: Build & Pushing hami image
uses: docker/build-push-action@v6.7.0
with:
context: .
file: ${{ env.IMAGE_ROOT_PATH }}/Dockerfile
labels: ${{ steps.meta.outputs.labels }}
platforms: ${{ env.BUILD_PLATFORM }}
build-args: |
VERSION=${{ steps.get_version.outputs.VERSION }}
GOLANG_IMAGE=golang:1.22.5-bullseye
NVIDIA_IMAGE=nvidia/cuda:12.2.0-devel-ubuntu20.04
DEST_DIR=/usr/local
tags: ${{ steps.meta.outputs.tags }}
push: true
github-token: ${{ env.REGISTER_PASSWORD }}

62 changes: 62 additions & 0 deletions .github/workflows/ci-image-scanning.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,62 @@
name: Trivy Scan
on:
schedule:
- cron: "0 0 * * *"
jobs:
trivy-scan:
permissions:
contents: read # for actions/checkout to fetch code
security-events: write # for github/codeql-action/upload-sarif to upload SARIF results
actions: read # only required for a private repository by github/codeql-action/upload-sarif to get the Action run status
runs-on: ubuntu-22.04
steps:
- name: Checkout
uses: actions/checkout@v4
- name: Checkout submodule
uses: Mushus/checkout-submodule@v1.0.1
with:
basePath: # optional, default is .
submodulePath: libvgpu
- name: Get branch name
uses: nelonoel/branch-name@v1.0.1
- name: Docker Login
uses: docker/login-action@v3.3.0
with:
username: ${{ secrets.DOCKERHUB_TOKEN }}
password: ${{ secrets.DOCKERHUB_PASSWD }}
- name: Set up Docker Buildx
id: buildx
uses: docker/setup-buildx-action@v3
- name: Generating image tag
id: runtime-tag
run: |
echo tag="$(git rev-parse --short HEAD)" >> $GITHUB_OUTPUT
- run: make tidy
- run: SHORT_VERSION=${{ steps.runtime-tag.outputs.tag }} bash ./hack/build.sh
- name: Run Trivy vulnerability scanner (table output)
# Prevent running from the forked repository that doesn't need to upload coverage.
# In addition, running on the forked repository would fail as missing the necessary secret.
if: ${{ github.repository == 'Project-HAMi/HAMi' }}
uses: aquasecurity/trivy-action@0.24.0
with:
image-ref: "projecthami/hami:${{ steps.runtime-tag.outputs.tag }}"
format: "table"
ignore-unfixed: true
severity: "HIGH,CRITICAL"
vuln-type: "os,library"
trivyignores: .trivyignore
- name: Run Trivy vulnerability scanner (SARIF)
uses: aquasecurity/trivy-action@0.24.0
with:
image-ref: "projecthami/hami:${{ steps.runtime-tag.outputs.tag }}"
format: "sarif"
output: "trivy-results.sarif"
ignore-unfixed: true
vuln-type: "os,library"
trivyignores: .trivyignore
if: always() && github.repository == 'Project-HAMi/HAMi'
- name: Upload Trivy scan results to GitHub Security tab
uses: github/codeql-action/upload-sarif@v3
with:
sarif_file: "trivy-results.sarif"
if: always() && github.repository == 'Project-HAMi/HAMi'
59 changes: 59 additions & 0 deletions .github/workflows/ci.yaml
Original file line number Diff line number Diff line change
@@ -0,0 +1,59 @@
name: CI
on:
pull_request:
push:
branches: ["master"]

env:
GO_VERSION: "1.21.6"

jobs:
golangci:
name: lint
runs-on: ubuntu-22.04
steps:
- name: checkout code
uses: actions/checkout@v4
- name: install Go
uses: actions/setup-go@v5
with:
go-version: "1.21"
- name: verify license
run: hack/verify-license.sh
- name: go tidy
run: make tidy
- name: lint
run: make lint
- name: import alias
run: hack/verify-import-aliases.sh
test:
name: Unit test
runs-on: ubuntu-latest
steps:
- name: Checkout code
uses: actions/checkout@v4
- name: Checkout submodule
uses: Mushus/checkout-submodule@v1.0.1
with:
basePath: # optional, default is .
submodulePath: libvgpu
- name: Install Go
uses: actions/setup-go@v5
with:
go-version: "1.21"
- run: make tidy
- run: make test
- name: Upload coverage to Codecov
# Prevent running from the forked repository that doesn't need to upload coverage.
# In addition, running on the forked repository would fail as missing the necessary secret.
if: ${{ github.repository == 'Project-HAMi/HAMi' }}
uses: codecov/codecov-action@v4
with:
# Even though token upload token is not required for public repos,
# but adding a token might increase successful uploads as per:
# https://community.codecov.com/t/upload-issues-unable-to-locate-build-via-github-actions-api/3954
token: ${{secrets.CODECOV_TOKEN}}
files: ./_output/coverage/coverage.out
flags: unittests
fail_ci_if_error: false
verbose: true
Loading