Per #110 (comment), ideally we want to test core functionality (mostly in _benchmark.py) directly, instead of just via implemented and example benchmarks. Some non-exclusive options:
- Make more example benchmarks accessible from {arrowbench}
- Put necessary R and JSON in testing resources
- Mock out parts of classes in tests so smaller blocks of code can be tested