Commit f14c17c

Updated docs
1 parent 2174978 commit f14c17c

3 files changed (+36, -16 lines)

README.md

Lines changed: 35 additions & 14 deletions
@@ -5,21 +5,33 @@ collections have archived the URL. This kind of information can sometimes
 provide insight about why a particular web resource or set of web resources were
 archived from the web.

-## Install
+## Run

-pip install waybackprov
+If you have [uv] installed you can run `waybackprov` easily without installing anything:
+
+```
+uvx waybackprov
+```
+
+Otherwise you'll probably want to install it with `pip`:
+
+```
+pip install waybackprov
+```

 ## Basic Usage

 To check a particular URL here's how it works:

-% waybackprov https://twitter.com/EPAScottPruitt
-364 https://archive.org/details/focused_crawls
-306 https://archive.org/details/edgi_monitor
-151 https://archive.org/details/www3.epa.gov
-60 https://archive.org/details/epa.gov4
-47 https://archive.org/details/epa.gov5
-...
+```shell
+waybackprov https://twitter.com/EPAScottPruitt
+364 https://archive.org/details/focused_crawls
+306 https://archive.org/details/edgi_monitor
+151 https://archive.org/details/www3.epa.gov
+60 https://archive.org/details/epa.gov4
+47 https://archive.org/details/epa.gov5
+...
+```

 The first column contains the number of crawls for a particular URL, and the
 second column contains the URL for the Internet Archive collection that added
@@ -30,14 +42,18 @@ it.
 By default waybackprov will only look at the current year. If you would like it
 to examine a range of years use the `--start` and `--end` options:

-% waybackprov --start 2016 --end 2018 https://twitter.com/EPAScottPruitt
+```shell
+waybackprov --start 2016 --end 2018 https://twitter.com/EPAScottPruitt
+```

 ## Multiple Pages

 If you would like to look at all URLs at a particular URL prefix you can use the
 `--prefix` option:

-% waybackprov --prefix https://twitter.com/EPAScottPruitt
+```shell
+waybackprov --prefix https://twitter.com/EPAScottPruitt
+```

 This will use the Internet Archive's [CDX API](https://github.com/webrecorder/pywb/wiki/CDX-Server-API) to also include URLs that are extensions of the URL you supply, so it would include for example:
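As a side note, the two-column report shown under Basic Usage is straightforward to consume from a script. A minimal sketch (the parsing code is illustrative, not part of waybackprov; the sample lines are copied from the README output above):

```python
# Illustrative only: parse waybackprov's two-column report
# (crawl count, collection URL) into a dict keyed by collection.
sample = """364 https://archive.org/details/focused_crawls
306 https://archive.org/details/edgi_monitor
151 https://archive.org/details/www3.epa.gov"""

crawls = {}
for line in sample.splitlines():
    # Each line is "<count> <collection URL>"; URLs contain no spaces,
    # so a single split is enough.
    count, url = line.split(maxsplit=1)
    crawls[url] = int(count)

print(crawls["https://archive.org/details/edgi_monitor"])  # 306
print(sum(crawls.values()))  # 821
```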

@@ -53,7 +69,9 @@ interested in is highly recommended since it prevents lots of lookups for CSS,
 JavaScript and image files that are components of the resource that was
 initially crawled.

-% waybackprov --prefix --match 'status/\d+$' https://twitter.com/EPAScottPruitt
+```
+waybackprov --prefix --match 'status/\d+$' https://twitter.com/EPAScottPruitt
+```

 ## Collections
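One note on the `--match` option used above: the `status/\d+$` value reads as a regular expression that keeps only individual tweet status pages. A small sketch of that filtering, assuming regex matching against candidate URLs (the URL list here is hypothetical):

```python
import re

# Assumed behavior: --match patterns are regular expressions tested against
# candidate URLs; 'status/\d+$' keeps URLs ending in a numeric status ID.
pattern = re.compile(r"status/\d+$")

urls = [
    "https://twitter.com/EPAScottPruitt/status/123456789",  # kept
    "https://twitter.com/EPAScottPruitt/media",             # dropped
    "https://twitter.com/EPAScottPruitt/status/987654321",  # kept
]

matched = [u for u in urls if pattern.search(u)]
print(len(matched))  # 2
```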

@@ -78,12 +96,15 @@ rather than a summary.
 If you would like to see detailed information about what *waybackprov* is doing
 use the `--log` option to supply a file path to log to:

-% waybackprov --log waybackprov.log https://example.com/
+```shell
+waybackprov --log waybackprov.log https://example.com/
+```

 ## Test

 If you would like to test it, first install [pytest] and then:

-pytest test.py
+uv run pytest test.py

 [pytest]: https://docs.pytest.org/en/latest/
+[uv]: https://docs.astral.sh/uv/

pyproject.toml

Lines changed: 1 addition & 1 deletion
@@ -1,6 +1,6 @@
 [project]
 name = "waybackprov"
-version = "0.0.9"
+version = "0.1.0"
 description = "Checks the provenance of a URL in the Wayback machine"
 readme = "README.md"
 authors = [

src/waybackprov/__init__.py

Lines changed: 0 additions & 1 deletion
@@ -138,7 +138,6 @@ def get_crawls(
     # month. So some spots in the first and last row are null. Not
     # every day has any data if the URL wasn't crawled then.
     logging.info("getting calendar year %s for %s", year, url)
-    print("getting calendar year %s for %s", year, url)
     cal = get_json(api % (url, year))
     for month in cal:
         for week in month:
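Dropping that `print` call is a genuine fix rather than cleanup: `print` does not perform the lazy `%s` interpolation that the `logging` methods do, so the removed line emitted the raw format string followed by the arguments. A minimal illustration (the demo logger name and buffer are arbitrary):

```python
import io
import logging

# logging methods interpolate '%s' placeholders from the extra arguments.
buf = io.StringIO()
logger = logging.getLogger("waybackprov-demo")
logger.addHandler(logging.StreamHandler(buf))
logger.setLevel(logging.INFO)

logger.info("getting calendar year %s for %s", 2018, "https://example.com/")
print(buf.getvalue().strip())
# -> getting calendar year 2018 for https://example.com/

# print() just space-separates its arguments, leaving the %s untouched:
print("getting calendar year %s for %s", 2018, "https://example.com/")
# -> getting calendar year %s for %s 2018 https://example.com/
```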
