We want to address all parser failures in the Digital Corpora 767 dataset. [Reference](https://github.com/NVIDIA/nv-ingest/blob/main/evaluation/bo767_ids.txt) List of files known to fail - `bo767/1844014.pdf`: `page-dimensions`