Commit 2a712b1

Merge pull request #110 from microsoft/omri/prep_for_release
2 parents 80b3421 + 281c159 commit 2a712b1

2 files changed: +6 -2 lines changed
README.md

Lines changed: 6 additions & 0 deletions
@@ -44,9 +44,15 @@ To install the package:
 pip install poetry
 poetry install --with=dev
 
+# Download the spaCy pipeline used for tokenization
+poetry run python -m spacy download en_core_web_sm
+
 # To install with all additional NER dependencies (e.g. Flair, Stanza), run:
 # poetry install --with='ner,dev'
 
+# To use the default Presidio configuration, a spaCy model is required:
+poetry run python -m spacy download en_core_web_lg
+
 # Verify installation
 pytest
 ```

pyproject.toml

Lines changed: 0 additions & 2 deletions
@@ -22,8 +22,6 @@ xmltodict = "^0.12.0"
 python-dotenv = "^1.0.0"
 plotly= "^5.24.0"
 azure-ai-textanalytics = "^5.3.0"
-en_core_web_sm = {url = "https://github.com/explosion/spacy-models/releases/download/en_core_web_sm-3.8.0/en_core_web_sm-3.8.0.tar.gz"}
-en_core_web_lg = {url = "https://github.com/explosion/spacy-models/releases/download/en_core_web_lg-3.8.0/en_core_web_lg-3.8.0.tar.gz"}
 
 # optional dependencies for the different NLP approaches
 [tool.poetry.group.ner]
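
Since this change drops the two spaCy models from the Poetry dependencies, `poetry install` alone no longer provides them, and they have to be downloaded with the README commands. A minimal sketch of a pre-flight check follows. The helper names `missing_models` and `download_commands` are hypothetical and not part of this repository; the sketch assumes only that each spaCy model installs as an importable Python package under its own name.

```python
import importlib.util

# Model package names taken from the README commands added in this commit.
REQUIRED_MODELS = ("en_core_web_sm", "en_core_web_lg")


def missing_models(names=REQUIRED_MODELS):
    """Return the model packages that cannot be found in the current environment."""
    # find_spec returns None when a top-level package is not installed.
    return [name for name in names if importlib.util.find_spec(name) is None]


def download_commands(names):
    """Build the README-style download command for each given model name."""
    return [f"poetry run python -m spacy download {name}" for name in names]


if __name__ == "__main__":
    for command in download_commands(missing_models()):
        print(f"Missing model, run: {command}")
```

Running the script inside the Poetry environment (for example with `poetry run python`) prints one download command per missing model and nothing when both are installed.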
