Skip to content

Commit d01f76a

Browse files
authored
chore: Cherrypick changelog r0.6.0 (#358)
* Update release notes header in changelog build (#355) Signed-off-by: Pablo Garay <[email protected]> * add changelog-config (#356) Signed-off-by: Pablo Garay <[email protected]> * Changelog 0.6.0 (#357) Signed-off-by: Pablo Garay <[email protected]> * spelling --------- Signed-off-by: Pablo Garay <[email protected]>
1 parent 04f900a commit d01f76a

File tree

4 files changed

+178
-3
lines changed

4 files changed

+178
-3
lines changed

.github/workflows/changelog-build.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -49,7 +49,7 @@ jobs:
4949
RELEASE_VERSION=${RELEASE_BRANCH#r}
5050
CHANGELOG=$(echo "$CHANGELOG" | sed '/^[[:blank:]]*#/s/#/###/')
5151
52-
RELEASE_NOTES="## NVIDIA Neural Modules $RELEASE_VERSION
52+
RELEASE_NOTES="## NVIDIA Nemo Run $RELEASE_VERSION
5353
5454
### Detailed Changelogs:
5555
Lines changed: 118 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,118 @@
1+
{
2+
"categories": [
3+
{
4+
"title": "## Executors\n\n",
5+
"labels": ["executor", "local", "slurm", "dgxcloud", "lepton", "skypilot", "docker"],
6+
"exclude_labels": ["ignore"]
7+
},
8+
{
9+
"title": "\n## Ray Integration\n\n",
10+
"labels": ["ray", "kuberay", "ray-slurm"],
11+
"exclude_labels": ["ignore"]
12+
},
13+
{
14+
"title": "\n## CLI & Configuration\n\n",
15+
"labels": ["cli", "config", "parsing"],
16+
"exclude_labels": ["ignore"]
17+
},
18+
{
19+
"title": "\n## Experiment & Job Management\n\n",
20+
"labels": ["experiment", "job", "task"],
21+
"exclude_labels": ["ignore"]
22+
},
23+
{
24+
"title": "\n## Packaging & Deployment\n\n",
25+
"labels": ["packaging", "deployment"],
26+
"exclude_labels": ["ignore"]
27+
},
28+
{
29+
"title": "\n## Documentation\n\n",
30+
"labels": ["docs", "documentation"],
31+
"exclude_labels": ["ignore"]
32+
},
33+
{
34+
"title": "\n## CI/CD\n\n",
35+
"labels": ["ci", "github-actions", "workflow"],
36+
"exclude_labels": ["ignore"]
37+
},
38+
{
39+
"title": "\n## Bug Fixes\n\n",
40+
"labels": ["bug", "bugfix", "fix"],
41+
"exclude_labels": ["ignore"]
42+
}
43+
],
44+
"ignore_labels": [
45+
"ignore",
46+
"skip-changelog"
47+
],
48+
"sort": "ASC",
49+
"template": "\n${{CHANGELOG}}\n## Others\n\n${{UNCATEGORIZED}}\n",
50+
"pr_template": "- ${{TITLE}} [#${{NUMBER}}](${{URL}})",
51+
"empty_template": "- No changes in this release",
52+
"label_extractor": [
53+
{
54+
"pattern": "(.*executor.*)|(.*local.*)|(.*slurm.*)|(.*dgxcloud.*)|(.*lepton.*)|(.*skypilot.*)|(.*docker.*)",
55+
"target": "executor",
56+
"flags": "gimu",
57+
"on_property": ["title", "body"]
58+
},
59+
{
60+
"pattern": "(.*ray.*)|(.*kuberay.*)",
61+
"target": "ray",
62+
"flags": "gimu",
63+
"on_property": ["title", "body"]
64+
},
65+
{
66+
"pattern": "(.*cli.*)|(.*command.*)|(.*parse.*)|(.*argument.*)",
67+
"target": "cli",
68+
"flags": "gimu",
69+
"on_property": ["title", "body"]
70+
},
71+
{
72+
"pattern": "(.*experiment.*)|(.*job.*)|(.*task.*)",
73+
"target": "experiment",
74+
"flags": "gimu",
75+
"on_property": ["title", "body"]
76+
},
77+
{
78+
"pattern": "(.*packaging.*)|(.*package.*)|(.*deploy.*)|(.*archive.*)|(.*mount.*)",
79+
"target": "packaging",
80+
"flags": "gimu",
81+
"on_property": ["title", "body"]
82+
},
83+
{
84+
"pattern": "(.*doc.*)|(.*readme.*)|(.*guide.*)|(.*tutorial.*)",
85+
"target": "docs",
86+
"flags": "gimu",
87+
"on_property": ["title", "body"]
88+
},
89+
{
90+
"pattern": "(.*\\bci\\b.*)|(.*github.*)|(.*workflow.*)|(.*action.*)",
91+
"target": "ci",
92+
"flags": "gimu",
93+
"on_property": ["title", "body"]
94+
},
95+
{
96+
"pattern": "(.*\\[bug.*)|(.*\\bfix\\b.*)|(.*bugfix.*)|(.*patch.*)",
97+
"target": "bug",
98+
"flags": "gimu",
99+
"on_property": ["title", "body"]
100+
}
101+
],
102+
"duplicate_filter": {
103+
"pattern": ".+",
104+
"on_property": "title",
105+
"method": "match"
106+
},
107+
"transformers": [
108+
],
109+
"max_tags_to_fetch": 100,
110+
"max_pull_requests": 500,
111+
"max_back_track_time_days": 365,
112+
"exclude_merge_branches": [
113+
],
114+
"tag_resolver": {
115+
"method": "semver"
116+
}
117+
}
118+

CHANGELOG.md

Lines changed: 57 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,6 +1,63 @@
11
# Changelog
22

33
<!-- Next changelog -->
4+
## NVIDIA Nemo Run 0.6.0
5+
6+
### Detailed Changelogs:
7+
8+
## Executors
9+
10+
- Added Pre-Launch Commands Support to LeptonExecutor [#312](https://github.com/NVIDIA-NeMo/Run/pull/312)
11+
- Remove breaking torchrun config for single-node runs [#292](https://github.com/NVIDIA-NeMo/Run/pull/292)
12+
- Upgrade skypilot to v0.10.0, introduce network_tier [#297](https://github.com/NVIDIA-NeMo/Run/pull/297)
13+
- Fixes for multi-node execution with torchrun + LocalExecutor [#251](https://github.com/NVIDIA-NeMo/Run/pull/251)
14+
- Add option to specify --container-env for srun [#293](https://github.com/NVIDIA-NeMo/Run/pull/293)
15+
- Fix skypilot archive mount bug [#288](https://github.com/NVIDIA-NeMo/Run/pull/288)
16+
- finetune on dgxcloud with nemo-run and deploy on bedrock example [#286](https://github.com/NVIDIA-NeMo/Run/pull/286)
17+
18+
## Ray Integration
19+
20+
- Add nsys patch in ray sub template [#318](https://github.com/NVIDIA-NeMo/Run/pull/318)
21+
- Add logs dir to container mount for ray slurm [#287](https://github.com/NVIDIA-NeMo/Run/pull/287)
22+
- Allow customizing folder for SlurmRayRequest [#281](https://github.com/NVIDIA-NeMo/Run/pull/281)
23+
24+
## CLI & Configuration
25+
26+
## Experiment & Job Management
27+
28+
- Use thread pool for status, run methods inside experiment + other fixes [#295](https://github.com/NVIDIA-NeMo/Run/pull/295)
29+
30+
## Packaging & Deployment
31+
32+
- Correctly append tar files for packaging [#317](https://github.com/NVIDIA-NeMo/Run/pull/317)
33+
34+
## Documentation
35+
36+
- Create CHANGELOG.md [#314](https://github.com/NVIDIA-NeMo/Run/pull/314)
37+
- docs: Fixing doc build issue [#290](https://github.com/NVIDIA-NeMo/Run/pull/290)
38+
- fix docs tutorial links and add intro to guides/index.md [#285](https://github.com/NVIDIA-NeMo/Run/pull/285)
39+
- README [#277](https://github.com/NVIDIA-NeMo/Run/pull/277)
40+
41+
## CI/CD
42+
43+
- changelog workflow [#315](https://github.com/NVIDIA-NeMo/Run/pull/315)
44+
- Update release.yml [#306](https://github.com/NVIDIA-NeMo/Run/pull/306)
45+
- ci(fix): Use GITHUB_TOKEN for community bot [#302](https://github.com/NVIDIA-NeMo/Run/pull/302)
46+
- ci: Add community-bot [#300](https://github.com/NVIDIA-NeMo/Run/pull/300)
47+
48+
## Bug Fixes
49+
50+
- [Bugfix] Adding a check for name length [#273](https://github.com/NVIDIA-NeMo/Run/pull/273)
51+
- misc fixes [#280](https://github.com/NVIDIA-NeMo/Run/pull/280)
52+
- adding fix for lowercase and name length k8s requirements [#274](https://github.com/NVIDIA-NeMo/Run/pull/274)
53+
54+
## Others
55+
56+
- Specify nodes for gpu metrics collection and split data to each rank [#320](https://github.com/NVIDIA-NeMo/Run/pull/320)
57+
- Apply '_enable_goodbye_message' check to both goodbye messages. [#319](https://github.com/NVIDIA-NeMo/Run/pull/319)
58+
- Update refs [#278](https://github.com/NVIDIA-NeMo/Run/pull/278)
59+
- chore: Bump to version 0.6.0rc0.dev0 [#272](https://github.com/NVIDIA-NeMo/Run/pull/272)
60+
461
## NVIDIA Nemo Run 0.5.0
562

663

nemo_run/run/experiment.py

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -636,13 +636,13 @@ def run(
636636
If sequential=True, all tasks will be run one after the other.
637637
The order is based on the order in which they were added.
638638
639-
Parallel mode only works if all exectuors in the experiment support it.
639+
Parallel mode only works if all executors in the experiment support it.
640640
Currently, all executors support parallel mode.
641641
642642
In sequential mode, if all executor supports dependencies, then all tasks will be scheduled at once
643643
by specifying the correct dependencies to each task.
644644
Otherwise, the experiment.run call will block and each task that is scheduled will be executed sequentially.
645-
In this particular case, we cannot guarantee the state of the exeperiment if the process exits in the middle.
645+
In this particular case, we cannot guarantee the state of the experiment if the process exits in the middle.
646646
647647
Currently, only the slurm executor supports dependencies.
648648

0 commit comments

Comments
 (0)