To get started, please ensure you have python3 and pip3 installed.
a. Clone this repository:
git clone https://github.com/michaelstresing/wiki_viz
b. Enter the repo:
cd wiki_viz
c. Install the requirements:
pip3 install -r requirements.txt
The script accepts the following arguments:

Argument | CLI Flag | Default |
---|---|---|
URL | --url | https://en.wikipedia.org/wiki/Kraftwerk |
Width (number of links scraped per page) | --w or --width | 3 |
Depth (number of steps away from the URL) | --d or --depth | 4 |
For example:
python3 main.py --url https://en.wikipedia.org/wiki/Sade_(singer) --d 4 --w 5
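
The flags and defaults in the table could be parsed along the following lines. This is only a sketch using the standard-library `argparse` module; the function name `build_parser` and the attribute names `width` and `depth` are assumptions for illustration, and the actual `main.py` may handle its arguments differently.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the flags and defaults listed in the table.

    Hypothetical helper: the real main.py may be structured differently.
    """
    parser = argparse.ArgumentParser(description="Crawl and graph Wikipedia links.")
    parser.add_argument(
        "--url",
        default="https://en.wikipedia.org/wiki/Kraftwerk",
        help="page to start crawling from",
    )
    # Both the short and the long spelling map onto the same attribute.
    parser.add_argument("--w", "--width", dest="width", type=int, default=3,
                        help="number of links scraped per page")
    parser.add_argument("--d", "--depth", dest="depth", type=int, default=4,
                        help="number of steps away from the start URL")
    return parser


# Parse the example command line shown above (passed explicitly as a list).
args = build_parser().parse_args(
    ["--url", "https://en.wikipedia.org/wiki/Sade_(singer)", "--d", "4", "--w", "5"]
)
print(args.url, args.width, args.depth)
```

Omitting a flag falls back to the default from the table, so `build_parser().parse_args([])` would yield a width of 3 and a depth of 4.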
Here is a sample output of the graph starting on Tim Berners-Lee's page, scraping 4 links per page, 3 layers deep.
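
To illustrate how width and depth interact, here is a minimal breadth-first sketch of a bounded crawl. It is not the project's actual crawler: `crawl`, `max_pages`, and the `get_links` callback are hypothetical names, and the links come from an in-memory dictionary rather than from Wikipedia.

```python
from collections import deque
from typing import Callable, Dict, List


def crawl(start: str, width: int, depth: int,
          get_links: Callable[[str], List[str]]) -> Dict[str, List[str]]:
    """Follow at most `width` links per page, up to `depth` steps from `start`.

    Returns a mapping of each expanded page to the links followed from it.
    """
    graph: Dict[str, List[str]] = {}
    queue = deque([(start, 0)])
    while queue:
        page, level = queue.popleft()
        # Skip pages already expanded and pages at the depth limit.
        if page in graph or level >= depth:
            continue
        links = get_links(page)[:width]
        graph[page] = links
        for link in links:
            queue.append((link, level + 1))
    return graph


def max_pages(width: int, depth: int) -> int:
    """Upper bound on distinct pages in the graph: 1 + w + w^2 + ... + w^d."""
    return sum(width ** k for k in range(depth + 1))


# Toy link table standing in for real pages (assumption for illustration).
FAKE_LINKS = {"A": ["B", "C", "D"], "B": ["E"], "C": [], "E": ["A"]}
print(crawl("A", width=2, depth=2, get_links=lambda p: FAKE_LINKS.get(p, [])))
print(max_pages(4, 3))
```

Because the bound grows geometrically, small increases in width or depth can multiply the number of pages requested, which is worth keeping in mind when choosing values.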
For an alternative rendering of the visualization using nx/matplotlib, first run:
export WIKI_CRAWL_OUTPUT="nx"
Then run the script as usual.
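
A program can pick up this setting by reading the environment variable at startup. The sketch below shows one way that might look; `select_renderer` and the fallback name `"default"` are placeholders, not names taken from this repository.

```python
import os
from typing import Mapping, Optional


def select_renderer(environ: Optional[Mapping[str, str]] = None) -> str:
    """Return "nx" when WIKI_CRAWL_OUTPUT is set to "nx", else a fallback.

    "default" is a placeholder for whatever renderer the project uses
    when the variable is unset (assumption for illustration).
    """
    env = os.environ if environ is None else environ
    value = env.get("WIKI_CRAWL_OUTPUT", "")
    return "nx" if value.strip().lower() == "nx" else "default"


print(select_renderer({"WIKI_CRAWL_OUTPUT": "nx"}))
print(select_renderer({}))
```

Note that `export` only affects the current shell session, so the variable needs to be set again in each new terminal.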