A fork of https://github.com/linkchecker/linkchecker that includes:
- All the AnchorCheck fixes and improvements
- Just add
[AnchorCheck]to yourlinkchecker.rcto get high-performance anchor checking. - You might want to tune
anchorcachesize(in the[checking]section) if you see high memory use.
- Just add
- The HTTP performance fix
- The
maxrequestspersecondsetting is now actually obeyed. If you are testing against an internal (development) web server, I suggest setting it to1000to get nearly the performance of checking HTML files on disk. If you are testing against public / production web servers,10might still be a polite value.
- The
- A feature to report all references to a URL (i.e. to report ALL broken links, not just the first one encountered for each target)
linkchecker.rcsetting:reportallreferencesin the[output]section.- Command line argument:
--allrefs.
- A fix for working with a newer (4.10+) install of BeautifulSoup
- A warning when using AnchorCheck with links that point to a directory and have an anchor, indicating that those anchors won't be checked.
-
Clone this repository.
-
cd path/to/this/repo -
pip3 install -e .
Now you should be able to run linkchecker (and do everything else, as usual) per the main-repo docs, plus the notes above.