Skip to content

Commit 56f7ebc

Browse files
authored
Embed conceptual videos (#761)
1 parent 19d2389 commit 56f7ebc

File tree

3 files changed

+30
-0
lines changed

3 files changed

+30
-0
lines changed

ui/chunking.mdx

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,16 @@
22
title: Chunking
33
---
44

5+
<iframe
6+
width="560"
7+
height="315"
8+
src="https://www.youtube.com/embed/ZWpm_TzwN-w"
9+
title="YouTube video player"
10+
frameborder="0"
11+
allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture"
12+
allowfullscreen
13+
></iframe>
14+
515
After partitioning, _chunking_ rearranges the resulting document elements into manageable "chunks" to stay within
616
the limits of an embedding model and to improve retrieval precision. The goal is to retrieve only parts of documents
717
that contain only the information that is relevant to a user's query. You can specify if and how Unstructured chunks

ui/document-elements.mdx

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,16 @@
22
title: Document elements and metadata
33
---
44

5+
<iframe
6+
width="560"
7+
height="315"
8+
src="https://www.youtube.com/embed/qWu9HPRkkPg"
9+
title="YouTube video player"
10+
frameborder="0"
11+
allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture"
12+
allowfullscreen
13+
></iframe>
14+
515
When Unstructured [partitions](/ui/partitioning) a file, the result is a list of _document elements_, sometimes referred to simply as _elements_. These elements represent different components of the source file.
616

717
## Element example

ui/partitioning.mdx

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -2,6 +2,16 @@
22
title: Partitioning
33
---
44

5+
<iframe
6+
width="560"
7+
height="315"
8+
src="https://www.youtube.com/embed/0HAWt9Xog-Y"
9+
title="YouTube video player"
10+
frameborder="0"
11+
allow="accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture"
12+
allowfullscreen
13+
></iframe>
14+
515
_Partitioning_ extracts content from raw unstructured files and outputs that content as structured [document elements](/ui/document-elements).
616

717
For specific file types, such as image files and PDF files, Unstructured offers special strategies to partition them. Each of these

0 commit comments

Comments
 (0)