Commit e629e92

feat/drop langchain as dependency for embedders (#146)
* drop langchain as dependency for embedders
* bump changelog
* fix voyageai unit test
* swap deprecated .json() with .model_dump_json()
* swap deprecated .dict() with .model_dump()
* fix huggingface embedder
* remove import from voyageai method
* fix vertexai unit test
* Add vertexai to test deps to support unit test
* fix bedrock default model
* fix vertexai embedder
* fix vertexai unit test
* Add back in mixedbreadai deps
1 parent c403237 commit e629e92

51 files changed, with 402 additions and 730 deletions

CHANGELOG.md

Lines changed: 2 additions & 1 deletion
@@ -1,10 +1,11 @@
-## 0.0.22-dev2
+## 0.0.22

 ### Enhancements

 * **Add documentation for developing sources/destinations**
 * **Leverage `uv` for pip compile**
 * **Use incoming fsspec data to populate metadata** Rather than make additional calls to collect metadata after initial file list, use connector-specific data to populate the metadata.
+* **Drop langchain as dependency for embedders**

 ## 0.0.21

requirements/embed/aws-bedrock.in

Lines changed: 0 additions & 2 deletions
@@ -1,5 +1,3 @@
 -c ../common/constraints.txt
--c ../common/constraints.txt

 boto3
-langchain-community

requirements/embed/aws-bedrock.txt

Lines changed: 4 additions & 140 deletions
@@ -1,159 +1,23 @@
 # This file was autogenerated by uv via the following command:
-# uv pip compile ./embed/aws-bedrock.in --output-file ./embed/aws-bedrock.txt --no-strip-extras --python-version 3.9
-aiohappyeyeballs==2.4.3
-# via aiohttp
-aiohttp==3.10.8
-# via
-# langchain
-# langchain-community
-aiosignal==1.3.1
-# via aiohttp
-annotated-types==0.7.0
-# via pydantic
-anyio==4.6.0
-# via httpx
-async-timeout==4.0.3
-# via
-# aiohttp
-# langchain
-attrs==24.2.0
-# via aiohttp
+# uv pip compile aws-bedrock.in --output-file aws-bedrock.txt --no-strip-extras
 boto3==1.34.131
-# via -r ./embed/aws-bedrock.in
+# via -r aws-bedrock.in
 botocore==1.34.131
 # via
-# -c ./embed/../common/constraints.txt
+# -c ../common/constraints.txt
 # boto3
 # s3transfer
-certifi==2024.8.30
-# via
-# httpcore
-# httpx
-# requests
-charset-normalizer==3.3.2
-# via requests
-dataclasses-json==0.6.7
-# via langchain-community
-exceptiongroup==1.2.2
-# via anyio
-frozenlist==1.4.1
-# via
-# aiohttp
-# aiosignal
-h11==0.14.0
-# via httpcore
-httpcore==1.0.6
-# via httpx
-httpx==0.27.2
-# via langsmith
-idna==3.10
-# via
-# anyio
-# httpx
-# requests
-# yarl
 jmespath==1.0.1
 # via
 # boto3
 # botocore
-jsonpatch==1.33
-# via langchain-core
-jsonpointer==3.0.0
-# via jsonpatch
-langchain==0.3.1
-# via langchain-community
-langchain-community==0.3.1
-# via
-# -c ./embed/../common/constraints.txt
-# -r ./embed/aws-bedrock.in
-langchain-core==0.3.7
-# via
-# langchain
-# langchain-community
-# langchain-text-splitters
-langchain-text-splitters==0.3.0
-# via langchain
-langsmith==0.1.129
-# via
-# langchain
-# langchain-community
-# langchain-core
-marshmallow==3.22.0
-# via dataclasses-json
-multidict==6.1.0
-# via
-# aiohttp
-# yarl
-mypy-extensions==1.0.0
-# via typing-inspect
-numpy==1.26.4
-# via
-# -c ./embed/../common/constraints.txt
-# langchain
-# langchain-community
-orjson==3.10.7
-# via langsmith
-packaging==23.2
-# via
-# -c ./embed/../common/constraints.txt
-# langchain-core
-# marshmallow
-pydantic==2.9.2
-# via
-# langchain
-# langchain-core
-# langsmith
-# pydantic-settings
-pydantic-core==2.23.4
-# via pydantic
-pydantic-settings==2.5.2
-# via langchain-community
 python-dateutil==2.9.0.post0
 # via botocore
-python-dotenv==1.0.1
-# via pydantic-settings
-pyyaml==6.0.2
-# via
-# langchain
-# langchain-community
-# langchain-core
-requests==2.32.3
-# via
-# langchain
-# langchain-community
-# langsmith
 s3transfer==0.10.2
 # via boto3
 six==1.16.0
 # via python-dateutil
-sniffio==1.3.1
-# via
-# anyio
-# httpx
-sqlalchemy==2.0.35
-# via
-# langchain
-# langchain-community
-tenacity==8.5.0
-# via
-# langchain
-# langchain-community
-# langchain-core
-typing-extensions==4.12.2
-# via
-# anyio
-# langchain-core
-# multidict
-# pydantic
-# pydantic-core
-# sqlalchemy
-# typing-inspect
-typing-inspect==0.9.0
-# via dataclasses-json
 urllib3==1.26.20
 # via
-# -c ./embed/../common/constraints.txt
+# -c ../common/constraints.txt
 # botocore
-# requests
-yarl==1.13.1
-# via aiohttp
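
One way to confirm that a compiled requirements file no longer pulls in langchain is to scan its pinned lines. The following is a minimal sketch using only the standard library; the helper names `parse_pins` and `langchain_pins` are hypothetical and not part of this commit, and treating `langsmith` as langchain-related is an assumption based on its `# via` annotations in the diff. The sample strings are short excerpts of the old and new sides of `aws-bedrock.txt`, not the full files.

```python
import re

# Matches a pinned requirement line such as "boto3==1.34.131".
PIN_RE = re.compile(r"^([A-Za-z0-9][A-Za-z0-9._-]*)==(\S+)$")


def parse_pins(text: str) -> dict:
    """Return {package: version} for each `name==version` line; comment lines are ignored."""
    pins = {}
    for raw in text.splitlines():
        match = PIN_RE.match(raw.strip())
        if match:
            pins[match.group(1).lower()] = match.group(2)
    return pins


def langchain_pins(text: str) -> list:
    """List pinned packages named langchain* (plus langsmith, an assumption of this sketch)."""
    return sorted(
        name
        for name in parse_pins(text)
        if name.startswith("langchain") or name == "langsmith"
    )


# Excerpt of the old side of aws-bedrock.txt.
OLD = """\
boto3==1.34.131
# via -r ./embed/aws-bedrock.in
langchain==0.3.1
# via langchain-community
langchain-community==0.3.1
langchain-core==0.3.7
langsmith==0.1.129
"""

# Excerpt of the new side of aws-bedrock.txt.
NEW = """\
boto3==1.34.131
# via -r aws-bedrock.in
botocore==1.34.131
jmespath==1.0.1
"""

print(langchain_pins(OLD))  # ['langchain', 'langchain-community', 'langchain-core', 'langsmith']
print(langchain_pins(NEW))  # []
```

Running the same check over every file under `requirements/embed/` would be a quick regression guard, though whether the project does anything like this is not shown in the diff.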

requirements/embed/huggingface.in

Lines changed: 1 addition & 3 deletions
@@ -1,5 +1,3 @@
 -c ../common/constraints.txt
--c ../common/constraints.txt
-

-langchain-huggingface
+sentence-transformers
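
Swapping `langchain-huggingface` for `sentence-transformers` suggests the embedder now calls the model library directly. The sketch below illustrates that idea and is not the code from this commit: the class name `HuggingFaceEmbedderSketch`, its method names, and the default model name are all assumptions. It relies only on `SentenceTransformer(...)` and its `encode()` method, and imports the library lazily so the module loads even where it is not installed.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Sequence

# An encoder maps a batch of texts to one vector (list of floats) per text.
EncodeFn = Callable[[Sequence[str]], List[List[float]]]


def load_sentence_transformer(model_name: str) -> EncodeFn:
    """Build an encoder backed by sentence-transformers, imported only when needed."""
    from sentence_transformers import SentenceTransformer  # optional third-party dependency

    model = SentenceTransformer(model_name)
    # encode() returns a numpy array by default, so convert it to plain lists.
    return lambda texts: model.encode(list(texts)).tolist()


@dataclass
class HuggingFaceEmbedderSketch:
    """Hypothetical embedder for illustration; not the class defined in this commit."""

    model_name: str = "sentence-transformers/all-MiniLM-L6-v2"  # assumed default
    encode_fn: Optional[EncodeFn] = None  # injectable, e.g. for unit tests

    def _encoder(self) -> EncodeFn:
        # Load the real model on first use unless an encoder was injected.
        if self.encode_fn is None:
            self.encode_fn = load_sentence_transformer(self.model_name)
        return self.encode_fn

    def embed_documents(self, texts: Sequence[str]) -> List[List[float]]:
        return self._encoder()(texts)

    def embed_query(self, text: str) -> List[float]:
        return self.embed_documents([text])[0]


# Usage with a stand-in encoder, so no model download is required.
fake_encoder = lambda texts: [[float(len(t))] for t in texts]
embedder = HuggingFaceEmbedderSketch(encode_fn=fake_encoder)
print(embedder.embed_query("hello"))  # [5.0]
```

Injecting the encoder keeps unit tests independent of `torch` and of network access; leaving `encode_fn` unset would load the real model on first use.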

requirements/embed/huggingface.txt

Lines changed: 10 additions & 63 deletions
@@ -1,59 +1,30 @@
 # This file was autogenerated by uv via the following command:
-# uv pip compile ./embed/huggingface.in --output-file ./embed/huggingface.txt --no-strip-extras --python-version 3.9
-annotated-types==0.7.0
-# via pydantic
-anyio==4.6.0
-# via httpx
+# uv pip compile huggingface.in --output-file huggingface.txt --no-strip-extras
 certifi==2024.8.30
-# via
-# httpcore
-# httpx
-# requests
+# via requests
 charset-normalizer==3.3.2
 # via requests
-exceptiongroup==1.2.2
-# via anyio
 filelock==3.16.1
 # via
 # huggingface-hub
 # torch
 # transformers
 fsspec==2024.5.0
 # via
-# -c ./embed/../common/constraints.txt
+# -c ../common/constraints.txt
 # huggingface-hub
 # torch
-h11==0.14.0
-# via httpcore
-httpcore==1.0.6
-# via httpx
-httpx==0.27.2
-# via langsmith
 huggingface-hub==0.25.1
 # via
-# langchain-huggingface
 # sentence-transformers
 # tokenizers
 # transformers
 idna==3.10
-# via
-# anyio
-# httpx
-# requests
+# via requests
 jinja2==3.1.4
 # via torch
 joblib==1.4.2
 # via scikit-learn
-jsonpatch==1.33
-# via langchain-core
-jsonpointer==3.0.0
-# via jsonpatch
-langchain-core==0.3.7
-# via langchain-huggingface
-langchain-huggingface==0.1.0
-# via -r ./embed/huggingface.in
-langsmith==0.1.129
-# via langchain-core
 markupsafe==2.1.5
 # via jinja2
 mpmath==1.3.0
@@ -62,37 +33,26 @@ networkx==3.2.1
 # via torch
 numpy==1.26.4
 # via
-# -c ./embed/../common/constraints.txt
+# -c ../common/constraints.txt
 # scikit-learn
 # scipy
 # transformers
-orjson==3.10.7
-# via langsmith
 packaging==23.2
 # via
-# -c ./embed/../common/constraints.txt
+# -c ../common/constraints.txt
 # huggingface-hub
-# langchain-core
 # transformers
 pillow==10.4.0
 # via sentence-transformers
-pydantic==2.9.2
-# via
-# langchain-core
-# langsmith
-pydantic-core==2.23.4
-# via pydantic
 pyyaml==6.0.2
 # via
 # huggingface-hub
-# langchain-core
 # transformers
 regex==2024.9.11
 # via transformers
 requests==2.32.3
 # via
 # huggingface-hub
-# langsmith
 # transformers
 safetensors==0.4.5
 # via transformers
@@ -103,21 +63,14 @@ scipy==1.13.1
 # scikit-learn
 # sentence-transformers
 sentence-transformers==3.1.1
-# via langchain-huggingface
-sniffio==1.3.1
-# via
-# anyio
-# httpx
+# via -r huggingface.in
 sympy==1.13.3
 # via torch
-tenacity==8.5.0
-# via langchain-core
 threadpoolctl==3.5.0
 # via scikit-learn
 tokenizers==0.19.1
 # via
-# -c ./embed/../common/constraints.txt
-# langchain-huggingface
+# -c ../common/constraints.txt
 # transformers
 torch==2.4.1
 # via sentence-transformers
@@ -127,18 +80,12 @@ tqdm==4.66.5
 # sentence-transformers
 # transformers
 transformers==4.44.2
-# via
-# langchain-huggingface
-# sentence-transformers
+# via sentence-transformers
 typing-extensions==4.12.2
 # via
-# anyio
 # huggingface-hub
-# langchain-core
-# pydantic
-# pydantic-core
 # torch
 urllib3==1.26.20
 # via
-# -c ./embed/../common/constraints.txt
+# -c ../common/constraints.txt
 # requests

requirements/embed/mixedbreadai.in

Lines changed: 0 additions & 1 deletion
@@ -1,4 +1,3 @@
 -c ../common/constraints.txt
--c ../common/constraints.txt

 mixedbread-ai

requirements/embed/mixedbreadai.txt

Lines changed: 2 additions & 2 deletions
@@ -1,5 +1,5 @@
 # This file was autogenerated by uv via the following command:
-# uv pip compile ./embed/mixedbreadai.in --output-file ./embed/mixedbreadai.txt --no-strip-extras --python-version 3.9
+# uv pip compile mixedbreadai.in --output-file mixedbreadai.txt --no-strip-extras
 annotated-types==0.7.0
 # via pydantic
 anyio==4.6.0
@@ -21,7 +21,7 @@ idna==3.10
 # anyio
 # httpx
 mixedbread-ai==2.2.6
-# via -r ./embed/mixedbreadai.in
+# via -r mixedbreadai.in
 pydantic==2.9.2
 # via mixedbread-ai
 pydantic-core==2.23.4

requirements/embed/octoai.in

Lines changed: 0 additions & 1 deletion
@@ -1,5 +1,4 @@
 -c ../common/constraints.txt
--c ../common/constraints.txt

 openai
 tiktoken

requirements/embed/octoai.txt

Lines changed: 4 additions & 4 deletions
@@ -1,5 +1,5 @@
 # This file was autogenerated by uv via the following command:
-# uv pip compile ./embed/octoai.in --output-file ./embed/octoai.txt --no-strip-extras --python-version 3.9
+# uv pip compile octoai.in --output-file octoai.txt --no-strip-extras
 annotated-types==0.7.0
 # via pydantic
 anyio==4.6.0
@@ -31,7 +31,7 @@ idna==3.10
 jiter==0.5.0
 # via openai
 openai==1.51.0
-# via -r ./embed/octoai.in
+# via -r octoai.in
 pydantic==2.9.2
 # via openai
 pydantic-core==2.23.4
@@ -46,7 +46,7 @@ sniffio==1.3.1
 # httpx
 # openai
 tiktoken==0.7.0
-# via -r ./embed/octoai.in
+# via -r octoai.in
 tqdm==4.66.5
 # via openai
 typing-extensions==4.12.2
@@ -57,5 +57,5 @@ typing-extensions==4.12.2
 # pydantic-core
 urllib3==1.26.20
 # via
-# -c ./embed/../common/constraints.txt
+# -c ../common/constraints.txt
 # requests
