Commit bd75bc9

fix: add lxml dependency to environment2.yml and update argparse namespace in html_crawler.py
1 parent 42900e6 commit bd75bc9

2 files changed: 2 additions, 1 deletion

environment2.yml

Lines changed: 1 addition & 0 deletions
@@ -16,6 +16,7 @@ dependencies:
   - huggingface_hub==0.23.5
   - inflect==7.0.0
   - langdetect==1.0.9
+  - lxml==5.4.0
   - matplotlib==3.7.3
   - multiprocess==0.70.16
   - networkx==3.1

poligrapher/scripts/html_crawler.py

Lines changed: 1 addition & 1 deletion
@@ -93,7 +93,7 @@ def main(url, output):
         format="%(asctime)s [%(levelname)s] %(message)s", level=logging.INFO
     )
 
-    args = argparse.Namespace(url=url, output=output)
+    args = argparse.Namespace(url=url, output=output, no_readability_js=False)
     access_url = url_arg_handler(args.url)
 
     if access_url is None:
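
Why the extra keyword matters, as a minimal sketch: a hand-built argparse.Namespace only carries the attributes passed to its constructor, so any later read of a missing attribute raises AttributeError. The commit does not show where no_readability_js is read; the reads_flag helper below is a hypothetical stand-in for that code, and the URL and output values are placeholders.

```python
import argparse


def reads_flag(args: argparse.Namespace) -> bool:
    # Hypothetical consumer (not from the project): decides whether to
    # run readability processing based on the flag.
    return not args.no_readability_js


# Namespace as constructed before the commit: the flag is absent.
old_args = argparse.Namespace(url="https://example.com", output="out")
try:
    reads_flag(old_args)
    old_failed = False
except AttributeError:
    # Attribute lookup on a Namespace fails for names never set on it.
    old_failed = True

# Namespace as constructed after the commit: the flag has an explicit default.
new_args = argparse.Namespace(
    url="https://example.com", output="out", no_readability_js=False
)
new_result = reads_flag(new_args)
```

If the downstream code tolerated a missing flag instead, getattr(args, "no_readability_js", False) would be an alternative; the commit chose to set the default at construction time.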