Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
72 changes: 6 additions & 66 deletions notebooks/Dropbox_To_Pinecone_Connector_Quickstart.ipynb
Original file line number Diff line number Diff line change
Expand Up @@ -19,7 +19,7 @@
"\n",
"Unstructured can connect to several types of [sources](https://docs.unstructured.io/api-reference/workflow/sources/overview) that contain your organization's documents and data—and [destinations](https://docs.unstructured.io/api-reference/workflow/destinations/overview) that Unstructured can send its structured representations of these documents and data to.\n",
"\n",
"In this quickstart, you use the [Unstructured Workflow Endpoint](https://docs.unstructured.io/api-reference/workflow/overview) to programmatically create a [Dropbox source connector](https://docs.unstructured.io/api-reference/workflow/sources/dropbox) and a [Pinecone destination connector](https://docs.unstructured.io/api-reference/workflow/destinations/pinecone) in your Unstructured account. You then programmatically add these connectors to a [workflow](https://docs.unstructured.io/api-reference/workflow/workflows) in your Unstructured account, run that workflow as a [job](https://docs.unstructured.io/api-reference/workflow/jobs), and then explore the job's results.\n",
"In this quickstart, you use the workflow operations in the [Unstructured API](https://docs.unstructured.io/api-reference/workflow/overview) to programmatically create a [Dropbox source connector](https://docs.unstructured.io/api-reference/workflow/sources/dropbox) and a [Pinecone destination connector](https://docs.unstructured.io/api-reference/workflow/destinations/pinecone) in your Unstructured account. You then programmatically add these connectors to a [workflow](https://docs.unstructured.io/api-reference/workflow/workflows) in your Unstructured account, run that workflow as a [job](https://docs.unstructured.io/api-reference/workflow/jobs), and then explore the job's results.\n",
"\n",
"This source connector enables your workflows to process your files that you store in a Dropbox account. This destination connector enables your workflows to send processed data about your documents into a [Pinecone dense serverless index](https://docs.pinecone.io/guides/index-data/indexing-overview#dense-indexes) in your Pinecone account.\n",
"\n",
Expand Down Expand Up @@ -235,27 +235,15 @@
},
{
"cell_type": "code",
"execution_count": 3,
"execution_count": null,
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "CaMNwwTy9HQs",
"outputId": "8ad5f668-b182-4ebe-8bef-21bd1843aa48"
},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Requirement already satisfied: requests in /usr/local/lib/python3.12/dist-packages (2.32.4)\n",
"Requirement already satisfied: charset_normalizer<4,>=2 in /usr/local/lib/python3.12/dist-packages (from requests) (3.4.3)\n",
"Requirement already satisfied: idna<4,>=2.5 in /usr/local/lib/python3.12/dist-packages (from requests) (3.10)\n",
"Requirement already satisfied: urllib3<3,>=1.21.1 in /usr/local/lib/python3.12/dist-packages (from requests) (2.5.0)\n",
"Requirement already satisfied: certifi>=2017.4.17 in /usr/local/lib/python3.12/dist-packages (from requests) (2025.8.3)\n"
]
}
],
"outputs": [],
"source": [
"!pip install requests"
]
Expand Down Expand Up @@ -413,68 +401,20 @@
"id": "Cr0sgBj4Gej1"
},
"source": [
"👉 8. Run the following cell, which installs the `unstructured-client` and `datetime` libraries. These libraries are required for making HTTP requests to the Unstructured Workflow Endpoint and for assigning unique asset names in this example, respectively."
"👉 8. Run the following cell to install the `unstructured-client` and `datetime` libraries. The former is used to call the Unstructured API and the latter for assigning unique asset names in this example."
]
},
{
"cell_type": "code",
"execution_count": 6,
"execution_count": null,
"metadata": {
"colab": {
"base_uri": "https://localhost:8080/"
},
"id": "D3aA8dvuGpiK",
"outputId": "5def117e-c0a2-4388-c295-bdc88ac82e4f"
},
"outputs": [
{
"name": "stdout",
"output_type": "stream",
"text": [
"Collecting unstructured-client\n",
" Downloading unstructured_client-0.42.3-py3-none-any.whl.metadata (23 kB)\n",
"Collecting datetime\n",
" Downloading DateTime-5.5-py3-none-any.whl.metadata (33 kB)\n",
"Requirement already satisfied: aiofiles>=24.1.0 in /usr/local/lib/python3.12/dist-packages (from unstructured-client) (24.1.0)\n",
"Requirement already satisfied: cryptography>=3.1 in /usr/local/lib/python3.12/dist-packages (from unstructured-client) (43.0.3)\n",
"Requirement already satisfied: httpcore>=1.0.9 in /usr/local/lib/python3.12/dist-packages (from unstructured-client) (1.0.9)\n",
"Requirement already satisfied: httpx>=0.27.0 in /usr/local/lib/python3.12/dist-packages (from unstructured-client) (0.28.1)\n",
"Requirement already satisfied: pydantic>=2.11.2 in /usr/local/lib/python3.12/dist-packages (from unstructured-client) (2.11.7)\n",
"Collecting pypdf>=4.0 (from unstructured-client)\n",
" Downloading pypdf-6.0.0-py3-none-any.whl.metadata (7.1 kB)\n",
"Requirement already satisfied: requests-toolbelt>=1.0.0 in /usr/local/lib/python3.12/dist-packages (from unstructured-client) (1.0.0)\n",
"Collecting zope.interface (from datetime)\n",
" Downloading zope.interface-7.2-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl.metadata (44 kB)\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m44.4/44.4 kB\u001b[0m \u001b[31m3.1 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[?25hRequirement already satisfied: pytz in /usr/local/lib/python3.12/dist-packages (from datetime) (2025.2)\n",
"Requirement already satisfied: cffi>=1.12 in /usr/local/lib/python3.12/dist-packages (from cryptography>=3.1->unstructured-client) (1.17.1)\n",
"Requirement already satisfied: certifi in /usr/local/lib/python3.12/dist-packages (from httpcore>=1.0.9->unstructured-client) (2025.8.3)\n",
"Requirement already satisfied: h11>=0.16 in /usr/local/lib/python3.12/dist-packages (from httpcore>=1.0.9->unstructured-client) (0.16.0)\n",
"Requirement already satisfied: anyio in /usr/local/lib/python3.12/dist-packages (from httpx>=0.27.0->unstructured-client) (4.10.0)\n",
"Requirement already satisfied: idna in /usr/local/lib/python3.12/dist-packages (from httpx>=0.27.0->unstructured-client) (3.10)\n",
"Requirement already satisfied: annotated-types>=0.6.0 in /usr/local/lib/python3.12/dist-packages (from pydantic>=2.11.2->unstructured-client) (0.7.0)\n",
"Requirement already satisfied: pydantic-core==2.33.2 in /usr/local/lib/python3.12/dist-packages (from pydantic>=2.11.2->unstructured-client) (2.33.2)\n",
"Requirement already satisfied: typing-extensions>=4.12.2 in /usr/local/lib/python3.12/dist-packages (from pydantic>=2.11.2->unstructured-client) (4.15.0)\n",
"Requirement already satisfied: typing-inspection>=0.4.0 in /usr/local/lib/python3.12/dist-packages (from pydantic>=2.11.2->unstructured-client) (0.4.1)\n",
"Requirement already satisfied: requests<3.0.0,>=2.0.1 in /usr/local/lib/python3.12/dist-packages (from requests-toolbelt>=1.0.0->unstructured-client) (2.32.4)\n",
"Requirement already satisfied: setuptools in /usr/local/lib/python3.12/dist-packages (from zope.interface->datetime) (75.2.0)\n",
"Requirement already satisfied: pycparser in /usr/local/lib/python3.12/dist-packages (from cffi>=1.12->cryptography>=3.1->unstructured-client) (2.22)\n",
"Requirement already satisfied: charset_normalizer<4,>=2 in /usr/local/lib/python3.12/dist-packages (from requests<3.0.0,>=2.0.1->requests-toolbelt>=1.0.0->unstructured-client) (3.4.3)\n",
"Requirement already satisfied: urllib3<3,>=1.21.1 in /usr/local/lib/python3.12/dist-packages (from requests<3.0.0,>=2.0.1->requests-toolbelt>=1.0.0->unstructured-client) (2.5.0)\n",
"Requirement already satisfied: sniffio>=1.1 in /usr/local/lib/python3.12/dist-packages (from anyio->httpx>=0.27.0->unstructured-client) (1.3.1)\n",
"Downloading unstructured_client-0.42.3-py3-none-any.whl (207 kB)\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m207.8/207.8 kB\u001b[0m \u001b[31m8.3 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[?25hDownloading DateTime-5.5-py3-none-any.whl (52 kB)\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m52.6/52.6 kB\u001b[0m \u001b[31m4.0 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[?25hDownloading pypdf-6.0.0-py3-none-any.whl (310 kB)\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m310.5/310.5 kB\u001b[0m \u001b[31m14.8 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[?25hDownloading zope.interface-7.2-cp312-cp312-manylinux_2_5_x86_64.manylinux1_x86_64.manylinux_2_17_x86_64.manylinux2014_x86_64.whl (264 kB)\n",
"\u001b[2K \u001b[90m━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━━\u001b[0m \u001b[32m264.7/264.7 kB\u001b[0m \u001b[31m20.4 MB/s\u001b[0m eta \u001b[36m0:00:00\u001b[0m\n",
"\u001b[?25hInstalling collected packages: zope.interface, pypdf, datetime, unstructured-client\n",
"Successfully installed datetime-5.5 pypdf-6.0.0 unstructured-client-0.42.3 zope.interface-7.2\n"
]
}
],
"outputs": [],
"source": [
"!pip install -U unstructured-client datetime"
]
Expand Down