Can we add the first-occurrence time of each unique op to the ops_unique_args summary? This would let the ops be listed in order of execution and allow us to compare ROCm against CUDA in an apples-to-apples way.
The first-occurrence time of a unique op is the time at which that op occurs for the first time.
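To make the request concrete, here is a minimal sketch of the idea in plain Python. It is not the existing ops_unique_args implementation: the event layout (`name`, `args`, `ts` fields) and the function name `summarize_unique_ops` are assumptions made for illustration only. Each unique op is keyed by its name and arguments, its earliest timestamp is recorded, and the summary is sorted by that timestamp.

```python
def summarize_unique_ops(events):
    """Group events by (name, args) and record each group's first timestamp.

    `events` is assumed to be an iterable of dicts with hypothetical fields:
      name: op name, args: hashable tuple of arguments, ts: start timestamp.
    """
    summary = {}
    for ev in events:
        key = (ev["name"], ev["args"])
        row = summary.get(key)
        if row is None:
            summary[key] = {
                "name": ev["name"],
                "args": ev["args"],
                "count": 1,
                "first_ts": ev["ts"],
            }
        else:
            row["count"] += 1
            # Events may arrive out of order, so keep the minimum timestamp.
            row["first_ts"] = min(row["first_ts"], ev["ts"])
    # Sorting by first occurrence lists unique ops in execution order.
    return sorted(summary.values(), key=lambda r: r["first_ts"])


# Made-up events for illustration; timestamps are arbitrary units.
events = [
    {"name": "aten::mm", "args": ((64, 64), (64, 64)), "ts": 120.0},
    {"name": "aten::add", "args": ((64, 64),), "ts": 100.0},
    {"name": "aten::mm", "args": ((64, 64), (64, 64)), "ts": 300.0},
    {"name": "aten::mm", "args": ((128, 64), (64, 64)), "ts": 250.0},
]
rows = summarize_unique_ops(events)
```

With summaries from ROCm and CUDA both ordered this way, the rows could be lined up by position rather than by absolute timestamps, which generally differ between runs.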