Skip to content

Evaluate zlib-cloudflare for 15% performance speedup of WarcRecordWriter #22

@tfmorris

Description

@tfmorris

According to this 2019 analysis, fully 1/3 of WarcRecordWriter's time is being spent in zlib.so. Cloudflare has a performance enhanced drop-in compatible version of zlib, zlib-cloudflare, which is claimed to be almost twice as fast at gzip compression.

This could provide a significant speedup (~15% overall) for minimal implementation cost. There is documentation available which describes how to set it up. Ignore the fact that it's a Graviton page. It applies to all architectures.

While switching to a different algorithm is also possible, that would be much more disruptive to the ecosystem as compared to a drop-in replacement implementing the same algorithm.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions