Skip to content

Have you reconsidered adding WARC support?Β #1407

@YousufSSyed

Description

@YousufSSyed

I saw the issue for it posted all the way back in 2019 and I think its a really good time to look at supporting the WARC format.

  • There's a lot more software that supports it now.
  • It can be viewed in the browser with sites like https://replayweb.page.
  • WARCs (both .warc and .warc.gz) can easily be concatenated, with unix cat in the command line for instance.
  • When combined, their resources can be deduplicated, allowing space to be saved.
  • Can be added to Web replay software like Pywb.
  • There's currently no good addons to download pages as WARCs, Warcreate is only available on Chrome and not Firefox. Other software requires lots of setup outside of the browser.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions