Commit 277c993

drop reference to langchain (#152)
1 parent e629e92 commit 277c993

48 files changed, 70 additions and 84 deletions

CHANGELOG.md

Lines changed: 6 additions & 0 deletions
@@ -1,3 +1,9 @@
+## 0.0.23
+
+### Fixes
+
+* **Remove check for langchain dependency in embedders**
+
 ## 0.0.22
 
 ### Enhancements

docs/README.md

Lines changed: 1 addition & 1 deletion
@@ -146,7 +146,7 @@ if __name__ == "__main__":
 chunker_config=ChunkerConfig(
 chunking_strategy="basic",
 ),
-embedder_config=EmbedderConfig(embedding_provider="langchain-huggingface"),
+embedder_config=EmbedderConfig(embedding_provider="huggingface"),
 
 destination_connection_config=ChromaConnectionConfig(
 access_config=ChromaAccessConfig(settings=None, headers=None),

requirements/common/constraints.txt

Lines changed: 0 additions & 3 deletions
@@ -15,16 +15,13 @@ urllib3<1.27
 botocore<1.34.132
 # TODO: Constriant due to both 8.5.0 and 8.4.0 being installed during pip-compile
 importlib-metadata>=8.5.0
-# TODO: Constraint due to langchain, remove when that gets updated:
-packaging<24.0
 # TODO: Constraint due to boto, with python before 3.10 not requiring openssl 1.1.1, remove when that gets
 # updated or we drop support for 3.9
 urllib3<1.27
 unstructured-client>= 0.25.8
 fsspec==2024.5.0
 # python 3.12 support
 wrapt>=1.14.0
-langchain-community>=0.2.5
 # NOTE(robinson): chroma was pinned to importlib-metadata>=7.1.0 but 7.1.0 was installed
 # instead of 7.2.0. Need to investigate
 importlib-metadata==7.1.0

test_e2e/dest/astradb.sh

Lines changed: 1 addition & 1 deletion
@@ -56,7 +56,7 @@ PYTHONPATH=. ./unstructured_ingest/main.py \
 --chunking-strategy by_title \
 --chunk-max-characters 1500 \
 --chunk-multipage-sections \
---embedding-provider "langchain-huggingface" \
+--embedding-provider "huggingface" \
 astradb \
 --token "$ASTRA_DB_APPLICATION_TOKEN" \
 --api-endpoint "$ASTRA_DB_API_ENDPOINT" \

test_e2e/dest/azure-cognitive-search.sh

Lines changed: 1 addition & 1 deletion
@@ -86,7 +86,7 @@ PYTHONPATH=${PYTHONPATH:-.} "$RUN_SCRIPT" \
 --chunk-max-characters 2500 \
 --chunk-multipage-sections \
 --no-chunk-include-orig-elements \
---embedding-provider "langchain-huggingface" \
+--embedding-provider "huggingface" \
 azure-cognitive-search \
 --key "$AZURE_SEARCH_API_KEY" \
 --endpoint "$AZURE_SEARCH_ENDPOINT" \

test_e2e/dest/chroma.sh

Lines changed: 1 addition & 1 deletion
@@ -45,7 +45,7 @@ PYTHONPATH=. ./unstructured_ingest/main.py \
 --chunking-strategy by_title \
 --chunk-max-characters 1500 \
 --chunk-multipage-sections \
---embedding-provider "langchain-huggingface" \
+--embedding-provider "huggingface" \
 chroma \
 --host "localhost" \
 --port 8000 \

test_e2e/dest/couchbase.sh

Lines changed: 1 addition & 1 deletion
@@ -67,7 +67,7 @@ PYTHONPATH=. ./unstructured_ingest/main.py \
 --chunking-strategy by_title \
 --chunk-max-characters 1500 \
 --chunk-multipage-sections \
---embedding-provider "langchain-huggingface" \
+--embedding-provider "huggingface" \
 couchbase \
 --connection-string "$CB_CONN_STR" \
 --bucket "$CB_BUCKET" \

test_e2e/dest/elasticsearch.sh

Lines changed: 1 addition & 1 deletion
@@ -50,7 +50,7 @@ PYTHONPATH=. ./unstructured_ingest/main.py \
 --chunk-new-after-n-chars 2500 \
 --chunk-max-characters 38000 \
 --chunk-multipage-sections \
---embedding-provider "langchain-huggingface" \
+--embedding-provider "huggingface" \
 elasticsearch \
 --hosts http://localhost:9200 \
 --index-name ingest-test-destination \

test_e2e/dest/kafka-local.sh

Lines changed: 1 addition & 1 deletion
@@ -49,7 +49,7 @@ PYTHONPATH=. ./unstructured_ingest/main.py \
 --chunk-new-after-n-chars 2500 \
 --chunk-max-characters 38000 \
 --chunk-multipage-sections \
---embedding-provider "langchain-huggingface" \
+--embedding-provider "huggingface" \
 kafka \
 --topic "$KAFKA_TOPIC" \
 --bootstrap-server "$KAFKA_BOOTSTRAP_SERVER" \

test_e2e/dest/kdbai.sh

Lines changed: 1 addition & 1 deletion
@@ -48,7 +48,7 @@ PYTHONPATH=. ./unstructured_ingest/main.py \
 --reprocess \
 --input-path example-docs/pdf/fake-memo.pdf \
 --work-dir "$WORK_DIR" \
---embedding-provider "langchain-huggingface" \
+--embedding-provider "huggingface" \
 kdbai \
 --table-name "unstructured_test" \
 --batch-size 100
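
Migration note: every hunk above swaps the provider string "langchain-huggingface" for "huggingface". For callers that still hold the old value in stored configs, a small shim along these lines could translate it before building an `EmbedderConfig`. This is only a sketch: `RENAMED_PROVIDERS` and `normalize_embedding_provider` are hypothetical names that do not exist in the repository, and the mapping contains only the one rename this diff confirms.

```python
# Hypothetical helper, not part of unstructured-ingest.
# Only the rename visible in this commit is listed; other provider
# names are passed through unchanged because the diff says nothing about them.
RENAMED_PROVIDERS = {
    "langchain-huggingface": "huggingface",
}


def normalize_embedding_provider(name: str) -> str:
    """Return the current provider name for a possibly outdated one."""
    return RENAMED_PROVIDERS.get(name, name)
```

The result would then be passed as `embedding_provider=` in place of the stored value.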
