In the IO doc page of the user guide, there is a section comparing the performance of some the supported IO methods: https://pandas.pydata.org/pandas-docs/stable/user_guide/io.html#performance-considerations This is already quite old, and would be nice to update this: - add the parquet format to the compared set - rerun the code and update the code-block with your runs