<Card title="Quantitative Reasoning with tables inside PDFs" href="https://app.hex.tech/5e6b6e24-dead-4d3b-b9da-a9a7ad587b96/hex/cb595d8a-6eac-4e19-96ed-b1540e5c031c/draft/logic">
<br/>
From Pixels to Insights: Seamlessly Extracting and Visualizing Table Data with Unstructured and Hex
<br/>
- ``Unstructured API`` ``Hex``
+ ``Unstructured API`` ``Hex`` ``Advanced notebook``
<br/>
</Card>
<Card title="PII removal with GLiNER in unstructured data ETL" href="https://colab.research.google.com/drive/1HwOMnGjrNbcHZ1vlhaAG0MSDBcwQfexF?usp=sharing">
<br/>
Remove Personally Identifiable Information (PII) as a part of unstructured data preprocessing.
<Card title="Unstructured data ETL from S3 to SingleStore DB" href="https://colab.research.google.com/drive/1Krvn5XlYNERQe7DNIXKEz3AmESJdABLF?usp=sharing">
<br/>
Learn to ingest, partition, chunk, embed and load data from an S3 bucket into SingleStore DB.
<Card title="Preprocess PDFs in AWS S3, load into Elasticsearch" href="https://colab.research.google.com/drive/1axADo7T_dMkeOWnZ5C4dKve16-wrtQuV?usp=sharing">
<br/>
Ingest PDF documents from an S3 bucket, transform them into normalized JSON with the Unstructured Serverless API, then chunk, embed, and load them into Elasticsearch.
<Card title="Preprocess documents in Google Drive, load into Databricks Volume" href="https://colab.research.google.com/drive/1gVd03geFUD_OTROMuhjVAHYQvgVbViq7?usp=sharing">
<br/>
Preprocess documents from Google Drive with the Unstructured Serverless API and load them into a Databricks Volume.
<Card title="Source references in RAG responses" href="https://colab.research.google.com/drive/1Lc8eq8P87JjzUhbYb33_c7h7njsWb-hn?usp=sharing">
<br/>
Add document source references to RAG responses based on document metadata.
<br/>
- ``Unstructured`` ``RAG`` ``LangChain``
+ ``Unstructured API`` ``RAG`` ``LangChain``
<br/>
</Card>
<Card title="Query processed PDF with HuggingChat" href="https://colab.research.google.com/drive/1rNVVX5qo7vyBwR7wTa-zS6lDMkKpTei0?usp=sharing">
<br/>
Send a PDF to Unstructured for processing, and send a subset of the returned PDF's processed text to [HuggingChat](https://huggingface.co/chat/) for chatbot-style querying.