Currently, after the mark inline phase, we run the simplifier on the whole program. Could this have undesired consequences, such as slowing down the code? Could we instead run the simplifier only on the functions that we want to fuse?
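
To make the second question concrete, here is a minimal sketch of what a restricted simplifier run could look like. It is not taken from the actual compiler: `Function`, `simplify_function`, `simplify_selected`, and the string-based "IR" are all hypothetical stand-ins, and the "simplification" is a toy rewrite that only drops `+ 0`.

```python
from dataclasses import dataclass
from typing import Callable, Dict, Set


@dataclass(frozen=True)
class Function:
    """Hypothetical stand-in for a function in the compiler's IR."""
    name: str
    body: str  # a real IR would be a tree, not a string


Program = Dict[str, Function]


def simplify_function(fn: Function) -> Function:
    """Toy simplifier (assumption): removes a redundant '+ 0' from the body."""
    return Function(fn.name, fn.body.replace(" + 0", ""))


def simplify_selected(
    program: Program,
    selected: Set[str],
    simplify: Callable[[Function], Function] = simplify_function,
) -> Program:
    """Run the simplifier only on the functions named in `selected`.

    Every other function is returned unchanged (the same object),
    so the pass does no work on it.
    """
    return {
        name: simplify(fn) if name in selected else fn
        for name, fn in program.items()
    }


# Usage: only "f" is marked as a fusion candidate, so only "f" is simplified.
program: Program = {
    "f": Function("f", "x + 0"),
    "g": Function("g", "y + 0"),
}
result = simplify_selected(program, {"f"})
```

In this sketch the set of fusion candidates is passed in explicitly; in the real pipeline it would presumably come from whatever the mark inline phase records, which is an assumption on my part.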