Commit 6f1b449

Better CJK support (and fix for lxml-clean), thanks @cdhigh
1 parent f02d865 commit 6f1b449

5 files changed: 29 additions, 6 deletions

.gitignore

Lines changed: 2 additions & 1 deletion
@@ -14,4 +14,5 @@ nosetests.xml
 .idea
 .cache
 /.noseids
-/.venv
+/.venv
+/poetry.lock

Makefile

Lines changed: 4 additions & 3 deletions
@@ -50,13 +50,14 @@ clean_all: clean_venv
 # ###########
 .PHONY: dist
 dist:
+	$(PY) -m pip install wheel
 	$(PY) setup.py sdist bdist_wheel
 	$(TWINE) check dist/*
 
 .PHONY: upload
 upload:
 	$(TWINE) upload dist/*
 
-.PHONY: version_update
-version_update:
-	$(EDITOR) setup.py
+.PHONY: bump
+bump:
+	$(EDITOR) readability/__init__.py

README.rst

Lines changed: 3 additions & 1 deletion
@@ -48,7 +48,9 @@ Usage
 
 Change Log
 ----------
-
+- 0.8.4 Better CJK support, thanks @cdhigh
+- 0.8.3.1 Support for python 3.8 - 3.13
+- 0.8.3 We can now save all images via keep_all_images=True (default is to save 1 main image), thanks @botlabsDev
 - 0.8.2 Added article author(s) (thanks @mattblaha)
 - 0.8.1 Fixed processing of non-ascii HTMLs via regexps.
 - 0.8 Replaced XHTML output with HTML5 output in summary() call.

pyproject.toml

Lines changed: 19 additions & 0 deletions
@@ -0,0 +1,19 @@
+[tool.poetry]
+name = "readability-lxml"
+version = "0.8.4"
+description = "fast html to text parser (article readability tool) with python 3 support"
+authors = ["Yuri Baburov <[email protected]>"]
+license = "Apache License 2.0"
+readme = "README.rst"
+
+[tool.poetry.dependencies]
+python = ">=3.8.2,<3.14"
+chardet = "^5.2.0"
+cssselect = "~1.2"
+lxml = {extras = ["html-clean"], version = "^5.4.0"}
+lxml-html-clean = {markers = "python_version < \"3.11\"", version = "^0.4.2"}
+
+
+[build-system]
+requires = ["poetry-core"]
+build-backend = "poetry.core.masonry.api"
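
The dependency entries above use Poetry's caret (^) and tilde (~) shorthand. As a rough illustration of what those constraints allow, the sketch below expands them into explicit ranges. The helper name expand_constraint is hypothetical and not part of Poetry or this project; it only handles plain numeric versions such as the ones in this file.

```python
def expand_constraint(constraint: str) -> str:
    """Expand a caret (^) or tilde (~) constraint into an explicit range.

    Hypothetical helper for illustration; handles plain numeric versions only.
    """
    op, version = constraint[0], constraint[1:]
    parts = [int(p) for p in version.split(".")]

    if op == "^":
        # Caret: the left-most non-zero component must not change.
        # If every component is zero, the last given component is bumped.
        idx = next((i for i, p in enumerate(parts) if p != 0), len(parts) - 1)
    elif op == "~":
        # Tilde: bump the minor component if one is given, else the major.
        idx = 1 if len(parts) >= 2 else 0
    else:
        raise ValueError(f"unsupported constraint: {constraint!r}")

    # Upper bound: increment the chosen component and zero everything after it.
    upper = parts[:idx] + [parts[idx] + 1]
    upper += [0] * (3 - len(upper))
    # Lower bound: the version as written, padded to three components.
    lower = parts + [0] * (3 - len(parts))

    lower_text = ".".join(str(p) for p in lower)
    upper_text = ".".join(str(p) for p in upper)
    return f">={lower_text},<{upper_text}"


for spec in ("^5.2.0", "~1.2", "^5.4.0", "^0.4.2"):
    print(spec, "->", expand_constraint(spec))
```

Note that a caret on a 0.x version (as with lxml-html-clean) only permits patch-level updates, which is narrower than the same operator on a 5.x version.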

readability/__init__.py

Lines changed: 1 addition & 1 deletion
@@ -1,3 +1,3 @@
-__version__ = "0.8.3"
+__version__ = "0.8.4"
 
 from .readability import Document
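
The new bump target in the Makefile edits readability/__init__.py, which suggests the version string is maintained there. A build script could read it without importing the package (and therefore without needing lxml installed). The sketch below is one way to do that; read_version and the regular expression are assumptions for illustration, not code from this repository.

```python
import re

# Matches a top-level assignment such as: __version__ = "0.8.4"
VERSION_RE = re.compile(r'^__version__\s*=\s*["\']([^"\']+)["\']', re.MULTILINE)


def read_version(source: str) -> str:
    """Return the version string assigned to __version__ in the given source text."""
    match = VERSION_RE.search(source)
    if match is None:
        raise ValueError("no __version__ assignment found")
    return match.group(1)


# Contents of readability/__init__.py after this commit.
init_py = '__version__ = "0.8.4"\n\nfrom .readability import Document\n'
print(read_version(init_py))  # prints 0.8.4
```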
