Commit db16cb4
Adds option to set object-store size when starting Ray cluster, uses a process group for managing ray child processes (#1274)
* Updates stop() to kill the ray process group created by init_cluster().
Signed-off-by: rlratzel <[email protected]>
* Creates a process group when running the 'ray start' command and kills the process group in RayClient.stop() to ensure all child processes are terminated.
Signed-off-by: rlratzel <[email protected]>
* Adds support for creating a process group for the ray subprocesses and killing the group to better ensure all processes are killed.
Signed-off-by: rlratzel <[email protected]>
* Wip ab/final metrics (#996)
* Adding prometheus and grafana to the nemo curator metrics path
Signed-off-by: Abhinav Garg <[email protected]>
* Implement safe extraction for tar files in file_utils.py
- Added functions `_is_safe_path` and `tar_safe_extract` to ensure safe extraction of tar files, preventing path traversal attacks.
- Included necessary imports and updated the file structure by removing outdated files from the ray-curator module.
Signed-off-by: [Your Name] [[email protected]]
Signed-off-by: Abhinav Garg <[email protected]>
* Refactor references from ray_curator to nemo_curator across multiple files
- Updated file paths and comments in api-design.md, __init__.py, client.py, and start_prometheus_grafana.py to reflect the new nemo_curator namespace.
- Changed package name in package_info.py from ray_curator to nemo_curator.
Signed-off-by: [Your Name] [[email protected]]
Signed-off-by: Abhinav Garg <[email protected]>
* Rename function `get_ray_client` to `start_prometheus_grafana` in start_prometheus_grafana.py for clarity and consistency with the new metrics path. Update the function call in the main execution block accordingly.
Signed-off-by: Abhinav Garg <[email protected]>
* Adding prometheus and grafana to the nemo curator metrics path
Signed-off-by: Abhinav Garg <[email protected]>
* Adding README for metrics
Signed-off-by: Abhinav Garg <[email protected]>
---------
Signed-off-by: Abhinav Garg <[email protected]>
Signed-off-by: [Your Name] [[email protected]]
Co-authored-by: Sarah Yurick <[email protected]>
Signed-off-by: rlratzel <[email protected]>
* undoes merge mistakes.
Signed-off-by: rlratzel <[email protected]>
* Fixes typo.
Signed-off-by: rlratzel <[email protected]>
* Adds OSError to exception handlers to handle process groups that have already terminated.
Signed-off-by: rlratzel <[email protected]>
---------
Signed-off-by: rlratzel <[email protected]>
Signed-off-by: Abhinav Garg <[email protected]>
Signed-off-by: [Your Name] [[email protected]]
Co-authored-by: Sarah Yurick <[email protected]>1 parent b323a3a commit db16cb4
File tree
3 files changed
+52
-18
lines changed- nemo_curator/core
- tests/core
3 files changed
+52
-18
lines changed| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
14 | 14 | | |
15 | 15 | | |
16 | 16 | | |
| 17 | + | |
17 | 18 | | |
18 | 19 | | |
19 | 20 | | |
| |||
56 | 57 | | |
57 | 58 | | |
58 | 59 | | |
| 60 | + | |
59 | 61 | | |
| 62 | + | |
60 | 63 | | |
61 | 64 | | |
62 | 65 | | |
| |||
72 | 75 | | |
73 | 76 | | |
74 | 77 | | |
| 78 | + | |
75 | 79 | | |
76 | 80 | | |
77 | 81 | | |
78 | | - | |
| 82 | + | |
79 | 83 | | |
80 | 84 | | |
81 | 85 | | |
82 | 86 | | |
83 | 87 | | |
84 | 88 | | |
85 | 89 | | |
86 | | - | |
| 90 | + | |
87 | 91 | | |
88 | 92 | | |
89 | 93 | | |
| |||
101 | 105 | | |
102 | 106 | | |
103 | 107 | | |
| 108 | + | |
| 109 | + | |
| 110 | + | |
| 111 | + | |
| 112 | + | |
| 113 | + | |
| 114 | + | |
| 115 | + | |
| 116 | + | |
| 117 | + | |
| 118 | + | |
104 | 119 | | |
105 | 120 | | |
106 | 121 | | |
| |||
119 | 134 | | |
120 | 135 | | |
121 | 136 | | |
122 | | - | |
123 | | - | |
124 | | - | |
125 | | - | |
126 | | - | |
127 | | - | |
128 | | - | |
129 | | - | |
130 | | - | |
| 137 | + | |
| 138 | + | |
| 139 | + | |
| 140 | + | |
| 141 | + | |
| 142 | + | |
| 143 | + | |
| 144 | + | |
| 145 | + | |
| 146 | + | |
131 | 147 | | |
132 | 148 | | |
133 | 149 | | |
| |||
140 | 156 | | |
141 | 157 | | |
142 | 158 | | |
143 | | - | |
144 | | - | |
| 159 | + | |
| 160 | + | |
| 161 | + | |
| 162 | + | |
| 163 | + | |
| 164 | + | |
| 165 | + | |
| 166 | + | |
| 167 | + | |
| 168 | + | |
| 169 | + | |
| 170 | + | |
| 171 | + | |
| 172 | + | |
| 173 | + | |
145 | 174 | | |
146 | 175 | | |
147 | 176 | | |
| |||
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
77 | 77 | | |
78 | 78 | | |
79 | 79 | | |
| 80 | + | |
80 | 81 | | |
81 | 82 | | |
82 | 83 | | |
| |||
99 | 100 | | |
100 | 101 | | |
101 | 102 | | |
| 103 | + | |
| 104 | + | |
102 | 105 | | |
103 | 106 | | |
104 | 107 | | |
| |||
124 | 127 | | |
125 | 128 | | |
126 | 129 | | |
127 | | - | |
| 130 | + | |
| 131 | + | |
| 132 | + | |
128 | 133 | | |
129 | | - | |
| 134 | + | |
130 | 135 | | |
131 | 136 | | |
| Original file line number | Diff line number | Diff line change | |
|---|---|---|---|
| |||
51 | 51 | | |
52 | 52 | | |
53 | 53 | | |
54 | | - | |
55 | | - | |
56 | | - | |
57 | 54 | | |
58 | 55 | | |
| 56 | + | |
| 57 | + | |
| 58 | + | |
59 | 59 | | |
60 | 60 | | |
61 | 61 | | |
| |||
0 commit comments