Commit c5d9469

feat: add xls support (#632)
Add support for older .xls files to the partition function in unstructured.partition.auto. This should also work on the centos7 unstructured image, given the requirements/*txt updates in this PR.
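
As a rough illustration of what the change enables, the sketch below routes a legacy workbook through the auto partition entry point. The helpers `looks_like_legacy_xls` and `partition_xls` are hypothetical names invented for this example and are not part of unstructured; the header check is only an assumed pre-check based on the OLE2 container signature that legacy .xls files use, not a description of how the library detects file types.

```python
from pathlib import Path

# Signature of the OLE2 compound-file container used by legacy .xls workbooks.
OLE2_MAGIC = bytes.fromhex("D0CF11E0A1B11AE1")


def looks_like_legacy_xls(path):
    """Hypothetical pre-check (not part of unstructured): extension plus header bytes."""
    p = Path(path)
    if p.suffix.lower() != ".xls":
        return False
    with p.open("rb") as fh:
        return fh.read(len(OLE2_MAGIC)) == OLE2_MAGIC


def partition_xls(path):
    """Hypothetical wrapper: send a legacy workbook through the auto partition function."""
    # Imported lazily so the pre-check above can be used without unstructured installed.
    from unstructured.partition.auto import partition

    if not looks_like_legacy_xls(path):
        raise ValueError(f"{path} does not look like a legacy .xls workbook")
    return partition(filename=str(path))
```

With unstructured and xlrd installed, `partition_xls("example-docs/tests-example.xls")` would be expected to return the elements extracted from the example workbook added in this commit.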
1 parent b767f6b commit c5d9469

15 files changed, 128 additions and 21 deletions

CHANGELOG.md

Lines changed: 10 additions & 0 deletions
@@ -1,3 +1,13 @@
+## 0.6.10
+
+### Enhancements
+
+* XLS support from auto partition
+
+### Features
+
+### Fixes
+
 ## 0.6.9
 
 ### Enhancements

example-docs/tests-example.xls

16 KB
Binary file not shown.

requirements/base.in

Lines changed: 1 addition & 0 deletions
@@ -14,3 +14,4 @@ python-pptx
 python-magic
 markdown
 requests
+xlrd

requirements/base.txt

Lines changed: 4 additions & 2 deletions
@@ -19,7 +19,7 @@ certifi==2023.5.7
 cffi==1.15.1
     # via cryptography
 chardet==5.1.0
-    # via unstructured (setup.py)
+    # via -r requirements/base.in
 charset-normalizer==3.1.0
     # via
     #   pdfminer-six
@@ -123,7 +123,7 @@ tqdm==4.65.0
     #   nltk
 typer==0.9.0
     # via argilla
-typing-extensions==4.6.0
+typing-extensions==4.6.2
     # via
     #   pydantic
     #   rich
@@ -136,6 +136,8 @@ wrapt==1.14.1
     # via
     #   argilla
     #   deprecated
+xlrd==2.0.1
+    # via -r requirements/base.in
 xlsxwriter==3.1.1
     # via python-pptx
 zipp==3.15.0

requirements/dev.txt

Lines changed: 5 additions & 5 deletions
@@ -39,8 +39,6 @@ cffi==1.15.1
     #   argon2-cffi-bindings
 cfgv==3.3.1
     # via pre-commit
-chardet==5.1.0
-    # via -r requirements/dev.in
 click==8.1.3
     # via
     #   -c requirements/base.txt
@@ -144,7 +142,7 @@ jupyter-core==5.3.0
     #   qtconsole
 jupyter-events==0.6.3
     # via jupyter-server
-jupyter-server==2.5.0
+jupyter-server==2.6.0
     # via
     #   nbclassic
     #   notebook-shim
@@ -192,6 +190,8 @@ notebook==6.5.4
     # via jupyter
 notebook-shim==0.2.3
     # via nbclassic
+overrides==7.3.1
+    # via jupyter-server
 packaging==23.1
     # via
     #   -c requirements/base.txt
@@ -221,7 +221,7 @@ platformdirs==3.5.1
     #   virtualenv
 pre-commit==3.3.2
     # via -r requirements/dev.in
-prometheus-client==0.16.0
+prometheus-client==0.17.0
     # via
     #   jupyter-server
     #   nbclassic
@@ -346,7 +346,7 @@ traitlets==5.9.0
     #   nbformat
     #   notebook
     #   qtconsole
-typing-extensions==4.6.0
+typing-extensions==4.6.2
     # via
     #   -c requirements/base.txt
     #   -c requirements/test.txt

requirements/huggingface.txt

Lines changed: 1 addition & 1 deletion
@@ -90,7 +90,7 @@ tqdm==4.65.0
     #   transformers
 transformers==4.29.2
     # via -r requirements/huggingface.in
-typing-extensions==4.6.0
+typing-extensions==4.6.2
     # via
     #   -c requirements/base.txt
     #   huggingface-hub

requirements/ingest-azure.txt

Lines changed: 1 addition & 1 deletion
@@ -93,7 +93,7 @@ six==1.16.0
     #   azure-core
     #   azure-identity
     #   isodate
-typing-extensions==4.6.0
+typing-extensions==4.6.2
     # via
     #   -c requirements/base.txt
     #   azure-core

requirements/ingest-google-drive.txt

Lines changed: 2 additions & 2 deletions
@@ -17,9 +17,9 @@ charset-normalizer==3.1.0
     #   requests
 google-api-core==2.11.0
     # via google-api-python-client
-google-api-python-client==2.86.0
+google-api-python-client==2.87.0
     # via -r requirements/ingest-google-drive.in
-google-auth==2.18.1
+google-auth==2.19.0
     # via
     #   google-api-core
     #   google-api-python-client

requirements/ingest-s3.txt

Lines changed: 1 addition & 1 deletion
@@ -52,7 +52,7 @@ six==1.16.0
     # via
     #   -c requirements/base.txt
     #   python-dateutil
-typing-extensions==4.6.0
+typing-extensions==4.6.2
     # via
     #   -c requirements/base.txt
     #   aioitertools

requirements/local-inference.txt

Lines changed: 2 additions & 2 deletions
@@ -102,7 +102,7 @@ numpy==1.23.5
     #   transformers
 omegaconf==2.3.0
     # via effdet
-onnxruntime==1.14.1
+onnxruntime==1.15.0
     # via unstructured-inference
 opencv-python==4.7.0.72
     # via
@@ -226,7 +226,7 @@ tqdm==4.65.0
     #   transformers
 transformers==4.29.2
     # via unstructured-inference
-typing-extensions==4.6.0
+typing-extensions==4.6.2
     # via
     #   -c requirements/base.txt
     #   huggingface-hub
