Skip to content

Commit 397c9d1

Browse files
authored
Merge pull request #2556 from nexB/improve-copyright-detection
Improve copyright detection
2 parents 91b5f72 + 06baf2d commit 397c9d1

File tree

79 files changed

+572
-196
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

79 files changed

+572
-196
lines changed

CHANGELOG.rst

Lines changed: 5 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -39,6 +39,11 @@ Copyright detection:
3939
- The data structure in the JSON is now using consistently named attributes as
4040
opposed to a plain value.
4141

42+
- The copyright detection speed has been significantly improved with the tests
43+
taking roughly 1/2 of the time to run. This is achieved mostly by replacing
44+
NLTK with a the minimal and simplified subset we need in a new library named
45+
pygmars.
46+
4247

4348
Package detection:
4449
~~~~~~~~~~~~~~~~~~

requirements.txt

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -34,7 +34,6 @@ license-expression==1.2
3434
lxml==4.6.3
3535
MarkupSafe==2.0.1
3636
more-itertools==8.8.0
37-
nltk==3.4.5
3837
normality==2.2.2
3938
packageurl-python==0.9.4
4039
packaging==20.9
@@ -50,6 +49,7 @@ publicsuffix2==2.20191221
5049
pyahocorasick==1.4.2
5150
pycparser==2.20
5251
Pygments==2.9.0
52+
pygmars==0.5.0
5353
pymaven-patch==0.3.0
5454
pyparsing==2.4.7
5555
PyYAML==5.4.1

setup-mini.cfg

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -76,7 +76,6 @@ install_requires =
7676
license_expression >= 1.0
7777
lxml >= 4.6.3, < 5.0.0
7878
MarkupSafe >= 1.0
79-
nltk >= 3.2, !=3.6, < 4.0
8079
packageurl_python >= 0.9.0
8180
packaging > 20
8281
pdfminer.six >= 20200101
@@ -86,6 +85,7 @@ install_requires =
8685
plugincode >= 21.1.21
8786
publicsuffix2
8887
pyahocorasick >= 1.4.0, < 1.5
88+
pygmars
8989
pygments
9090
pymaven_patch >= 0.2.8
9191
requests >= 2.7.0, < 3.0.0

setup.cfg

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -76,7 +76,6 @@ install_requires =
7676
license_expression >= 1.0
7777
lxml >= 4.6.3, < 5.0.0
7878
MarkupSafe >= 1.0
79-
nltk >= 3.2, !=3.6, < 4.0
8079
packageurl_python >= 0.9.0
8180
packaging > 20
8281
pdfminer.six >= 20200101
@@ -86,6 +85,7 @@ install_requires =
8685
plugincode >= 21.1.21
8786
publicsuffix2
8887
pyahocorasick >= 1.4.0, < 1.5
88+
pygmars
8989
pygments
9090
pymaven_patch >= 0.2.8
9191
requests >= 2.7.0, < 3.0.0

0 commit comments

Comments
 (0)