We should provide string diff output that pinpoints exactly what is different (similar to what other test frameworks already provide).
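As a rough sketch of what such output could look like: the example below is written in Python purely for illustration, and `first_difference` and `describe_string_diff` are hypothetical names, not part of any existing framework. It reports the index of the first mismatching character and appends a line-by-line diff from the standard library's `difflib.ndiff`.

```python
import difflib


def first_difference(expected: str, actual: str) -> int:
    """Return the index of the first differing character, or -1 if equal."""
    for i, (e, a) in enumerate(zip(expected, actual)):
        if e != a:
            return i
    # One string is a prefix of the other: they differ where the shorter ends.
    if len(expected) != len(actual):
        return min(len(expected), len(actual))
    return -1


def describe_string_diff(expected: str, actual: str) -> str:
    """Build a failure message that pinpoints where two strings differ."""
    idx = first_difference(expected, actual)
    if idx == -1:
        return "strings are equal"
    # ndiff compares sequences of lines and marks intraline changes with '?'.
    diff = difflib.ndiff(expected.splitlines(), actual.splitlines())
    return "\n".join(
        [
            f"strings differ at index {idx}",
            f"expected: {expected!r}",
            f"actual:   {actual!r}",
            "diff:",
            *diff,
        ]
    )


if __name__ == "__main__":
    print(describe_string_diff("hello world", "hello wurld"))
```

Whether the message should show a character index, a line-based diff, or both is a design choice; the sketch shows both so the trade-off is visible.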