Make it easier to show node assignments for the official python build. Right now it seems a debug build is required, which is very cumbersome. For the python distribution I think we are more lenient on wheel size and should include more profiling capabilities to boost productivity.
Additionally, it would be good to have a way to produce an artifact that records node assignment, beyond the log messages that we have now, to enable better benchmarking and performance analysis.
@tianleiwu @devang-ml