If I start extraction using the CLI and my program crashes halfway through, I am left with an incomplete directory that is named correctly, so on the next run the CLI will think the layer is already cached. We should checksum or otherwise validate that the cache is coherent between runs.
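
One possible approach, sketched in Python purely for illustration: extract into a temporary sibling directory, write a marker file containing a checksum of the extracted tree, and only then rename the directory to its final name. A directory with a missing or mismatched marker is treated as not cached. Every name here (`MARKER`, `dir_digest`, `is_cached`, `extract_layer`, and the `extract` callback) is hypothetical and not taken from the CLI's actual code.

```python
import hashlib
import os
import shutil
import tempfile
from pathlib import Path

MARKER = ".complete"  # hypothetical marker file name, written last


def dir_digest(root: Path) -> str:
    """Hash relative paths and file contents of a tree, in sorted order."""
    h = hashlib.sha256()
    marker = root / MARKER
    for path in sorted(p for p in root.rglob("*") if p.is_file() and p != marker):
        h.update(path.relative_to(root).as_posix().encode())
        h.update(b"\0")
        h.update(path.read_bytes())
        h.update(b"\0")
    return h.hexdigest()


def is_cached(layer_dir: Path) -> bool:
    """A layer counts as cached only if its marker matches its contents."""
    marker = layer_dir / MARKER
    if not marker.is_file():
        return False
    return marker.read_text().strip() == dir_digest(layer_dir)


def extract_layer(cache_root: Path, name: str, extract) -> Path:
    """Run extract(tmp_dir), then publish the result under its final name."""
    final = cache_root / name
    if is_cached(final):
        return final
    # Discard a leftover directory that fails validation.
    shutil.rmtree(final, ignore_errors=True)
    cache_root.mkdir(parents=True, exist_ok=True)
    tmp = Path(tempfile.mkdtemp(prefix=name + ".partial-", dir=cache_root))
    try:
        extract(tmp)
        (tmp / MARKER).write_text(dir_digest(tmp))
        # The final name only appears once extraction has finished.
        os.rename(tmp, final)
    except BaseException:
        shutil.rmtree(tmp, ignore_errors=True)
        raise
    return final
```

With this layout, a crash mid-extraction leaves at most a `*.partial-*` directory behind, which never matches the final layer name. Re-hashing the whole tree on every run may be too slow for large layers; checking only that the marker exists is a cheaper variant, though it would not catch later corruption.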