Skip to content

Latest commit

 

History

History
2 lines (2 loc) · 335 Bytes

File metadata and controls

2 lines (2 loc) · 335 Bytes

warc

C++ library to parse WARC files according to the specification. Work in progress with no tests or support for decompressing response bodies or parsing HTTP headers in responses. Basic parsing works on a recent common crawl dump file.