There are cases where it would be nice to turn off multiple deployments/statefulsets based on traffic coming to a single service. For example, when we have a web application with a database behind it.
It would be possible to use the annotation on all services, but then each deployment would be upscaled sequentially, most likely resulting in a timeout.
A similar pattern is apparently used in OpenShift (https://github.com/openshift/service-idler).
What would you like to be added?
Update the service annotation so it can take references to multiple deployments.
Why is this needed?
Make it possible to scale to 0 more complex applications.