we only focus on inputs now. it's better to use [launch_exit_hook](https://github.com/triton-lang/triton/blob/main/python/triton/knobs.py#L468) to collect the output of kernels.