Cosmin spoke ominously of some data-mining benchmark he'd like to add that uses multi-gigabyte input files. This won't fly with Git.