Description
What you would like to be added?
It would be nice to be able to specify which nodes Katib trials run on. This basically corresponds to the nodeSelector field of a Pod spec (node_selector in the Kubernetes Python client).
For example:
katib_client.tune(
    name=name,
    objective=objective,
    parameters=parameters,
    objective_metric_name="loss",
    max_trial_count=5,
    resources_per_trial={"cpu": "2"},
    node_selector={"nvidia.com/gpu.product": "NVIDIA-H100-NVL"},
)
Why is this needed?
In a heterogeneous cluster where node configurations differ (e.g. different GPU vendors or models), a user might want to specify which nodes a trial should run on.
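To make the request concrete, here is a minimal sketch of what the proposed node_selector argument could translate to on the trial Pod. It is not Katib's implementation: apply_node_selector is a hypothetical helper, the Pod spec is modeled as a plain dict, and the container name is made up for illustration. The only real Kubernetes detail it relies on is the nodeSelector field of a Pod spec.

```python
from copy import deepcopy


def apply_node_selector(pod_spec: dict, node_selector: dict) -> dict:
    """Hypothetical helper: return a copy of pod_spec with node_selector
    merged into its nodeSelector field. The input spec is not modified."""
    patched = deepcopy(pod_spec)
    # Keep any labels already present and let the new ones take precedence.
    merged = dict(patched.get("nodeSelector") or {})
    merged.update(node_selector)
    patched["nodeSelector"] = merged
    return patched


# Assumed shape of a trial Pod spec; the container name is illustrative only.
trial_pod_spec = {
    "containers": [
        {"name": "training-container", "resources": {"limits": {"cpu": "2"}}}
    ]
}

patched_spec = apply_node_selector(
    trial_pod_spec,
    {"nvidia.com/gpu.product": "NVIDIA-H100-NVL"},
)
print(patched_spec["nodeSelector"])
```

With a selector like this, the scheduler would only place the trial Pod on nodes carrying the matching label, which is the behavior asked for above.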
Love this feature?
Give it a 👍. We prioritize the features with the most 👍.