Skip to content

Commit 59cc893

Browse files
authored
Merge pull request #4941 from morsapaes/docs-s3-pipe
ClickPipes: restructure documentation for Object Storage pipes
2 parents 406e933 + 72ec0c0 commit 59cc893

File tree

34 files changed

+718
-249
lines changed

34 files changed

+718
-249
lines changed

docs/integrations/data-ingestion/clickpipes/object-storage/01_create_clickpipe_for_object_storage.md renamed to docs/_snippets/clickpipes/object-storage/_create_clickpipe.md

Lines changed: 24 additions & 36 deletions
Original file line numberDiff line numberDiff line change
@@ -1,17 +1,6 @@
1-
---
2-
sidebar_label: 'Create your first object storage ClickPipe'
3-
description: 'Seamlessly connect your object storage to ClickHouse Cloud.'
4-
slug: /integrations/clickpipes/object-storage
5-
title: 'Creating your first object-storage ClickPipe'
6-
doc_type: 'guide'
7-
integration:
8-
- support_level: 'core'
9-
- category: 'clickpipes'
10-
---
11-
1+
import Image from '@theme/IdealImage';
122
import cp_step0 from '@site/static/images/integrations/data-ingestion/clickpipes/cp_step0.png';
13-
import cp_step1 from '@site/static/images/integrations/data-ingestion/clickpipes/cp_step1.png';
14-
import cp_step2_object_storage from '@site/static/images/integrations/data-ingestion/clickpipes/cp_step2_object_storage.png';
3+
import cp_step2 from '@site/static/images/integrations/data-ingestion/clickpipes/object-storage/amazon-s3/cp_step2.png';
154
import cp_step3_object_storage from '@site/static/images/integrations/data-ingestion/clickpipes/cp_step3_object_storage.png';
165
import cp_step4a from '@site/static/images/integrations/data-ingestion/clickpipes/cp_step4a.png';
176
import cp_step4a3 from '@site/static/images/integrations/data-ingestion/clickpipes/cp_step4a3.png';
@@ -21,41 +10,38 @@ import cp_success from '@site/static/images/integrations/data-ingestion/clickpip
2110
import cp_remove from '@site/static/images/integrations/data-ingestion/clickpipes/cp_remove.png';
2211
import cp_destination from '@site/static/images/integrations/data-ingestion/clickpipes/cp_destination.png';
2312
import cp_overview from '@site/static/images/integrations/data-ingestion/clickpipes/cp_overview.png';
24-
import Image from '@theme/IdealImage';
25-
26-
Object Storage ClickPipes provide a simple and resilient way to ingest data from Amazon S3, Google Cloud Storage, Azure Blob Storage, and DigitalOcean Spaces into ClickHouse Cloud. Both one-time and continuous ingestion are supported with exactly-once semantics.
2713

28-
# Creating your first object storage ClickPipe {#creating-your-first-clickpipe}
14+
import S3DataSource from '@site/docs/_snippets/clickpipes/object-storage/amazon-s3/_1-data-source.md';
15+
import GCSSDataSource from '@site/docs/_snippets/clickpipes/object-storage/google-cloud-storage/_1-data-source.md';
16+
import ABSDataSource from '@site/docs/_snippets/clickpipes/object-storage/azure-blob-storage/_1-data-source.md';
2917

30-
## Prerequisite {#prerequisite}
18+
<VerticalStepper type="numbered" headerLevel="h2">
3119

32-
- You have familiarized yourself with the [ClickPipes intro](../index.md).
20+
## Select the data source {#1-select-the-data-source}
3321

34-
## Navigate to data sources {#1-load-sql-console}
22+
**1.** In ClickHouse Cloud, select **Data sources** in the main navigation menu and click **Create ClickPipe**.
3523

36-
In the cloud console, select the `Data Sources` button on the left-side menu and click on "Set up a ClickPipe"
24+
<Image img={cp_step0} alt="Select imports" size="lg" border/>
3725

38-
<Image img={cp_step0} alt="Select imports" size="lg" border/>
26+
{props.provider === 's3' && <S3DataSource />}
27+
{props.provider === 'gcs' && <GCSSDataSource />}
28+
{props.provider === 'abs' && <ABSDataSource />}
3929

40-
## Select a data source {#2-select-data-source}
30+
## Setup your ClickPipe connection {#2-setup-your-clickpipe-connection}
4131

42-
Select your data source.
32+
**1.** To setup a new ClickPipe, you must provide details on how to connect to and authenticate with your object storage service.
4333

44-
<Image img={cp_step1} alt="Select data source type" size="lg" border/>
34+
{props.provider === 's3' && <S3DataSource />}
35+
{props.provider === 'gcs' && <GCSSDataSource />}
36+
{props.provider === 'abs' && <ABSDataSource />}
4537

46-
## Configure the ClickPipe {#3-configure-clickpipe}
38+
**2.** Click **Incoming data**. ClickPipes will fetch metadata from your bucket for the next step.
4739

48-
Fill out the form by providing your ClickPipe with a name, a description (optional), your IAM role or credentials, and bucket URL.
49-
You can specify multiple files using bash-like wildcards.
50-
For more information, [see the documentation on using wildcards in path](/integrations/clickpipes/object-storage/reference/#limitations).
51-
52-
<Image img={cp_step2_object_storage} alt="Fill out connection details" size="lg" border/>
53-
54-
## Select data format {#4-select-format}
40+
## Select data format {#3-select-data-format}
5541

5642
The UI will display a list of files in the specified bucket.
5743
Select your data format (we currently support a subset of ClickHouse formats) and if you want to enable continuous ingestion.
58-
([More details below](/integrations/clickpipes/object-storage/reference/#continuous-ingest)).
44+
See the "continuous ingest" section in the overview page for more details.
5945

6046
<Image img={cp_step3_object_storage} alt="Set data format and topic" size="lg" border/>
6147

@@ -77,7 +63,7 @@ In that case, the UI will allow you to map fields from the source to the ClickHo
7763
<Image img={cp_step4b} alt="Use an existing table" size="lg" border/>
7864

7965
:::info
80-
You can also map [virtual columns](../../sql-reference/table-functions/s3#virtual-columns), like `_path` or `_size`, to fields.
66+
You can also map [virtual columns](/sql-reference/table-functions/s3#virtual-columns), like `_path` or `_size`, to fields.
8167
:::
8268

8369
## Configure permissions {#6-configure-permissions}
@@ -107,5 +93,7 @@ As well as controls to remove the ClickPipe and display a summary of the ingest
10793
<Image img={cp_overview} alt="View overview" size="lg" border/>
10894

10995
**Congratulations!** you have successfully set up your first ClickPipe.
110-
If this is a streaming ClickPipe, it will be continuously running, ingesting data in real-time from your remote data source.
96+
If this is a ClickPipe configure for continuous ingestion, it will be continuously running, ingesting data in real-time from your remote data source.
11197
Otherwise, it will ingest the batch and complete.
98+
99+
</VerticalStepper>
Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,10 @@
1+
import cp_step1 from '@site/static/images/integrations/data-ingestion/clickpipes/object-storage/amazon-s3/cp_step1.png';
2+
import Image from '@theme/IdealImage';
3+
4+
**2.** Click the **Amazon S3** tile. You can also use this tile to connect to other S3-compatible services not listed in the ClickPipes UI.
5+
6+
<Image img={cp_step1} alt="Select imports" size="lg" border/>
7+
8+
:::tip
9+
Due to differences in URL formats and API implementations across object storage service providers, not all S3-compatible services are supported out-of-the-box. If you're running into issues with a service that is not listed under [supported data sources](/integrations/clickpipes/object-storage/s3/overview#supported-data-sources), please [reach out to our team](https://clickhouse.com/company/contact?loc=clickpipes).
10+
:::
Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
import cp_step2 from '@site/static/images/integrations/data-ingestion/clickpipes/object-storage/amazon-s3/cp_step2.png';
2+
import Image from '@theme/IdealImage';
3+
4+
<Image img={cp_step2} alt="Fill out connection details" size="lg" border/>
5+
6+
* **Authentication method**: the S3 ClickPipe supports [IAM credentials](/integrations/clickpipes/object-storage/amazon-s3/overview/#iam-credentials) (`Credentials`) and [IAM role-based authentication](/integrations/clickpipes/object-storage/amazon-s3/overview/#iam-role) (`IAM role`). See the [reference documentation](/integrations/clickpipes/object-storage/overview/#access-control) for guidance on authentication and permissions.
7+
8+
* **S3 file path**: the S3 ClickPipe expects a [virtual-hosted-style URI](https://docs.aws.amazon.com/AmazonS3/latest/userguide/VirtualHosting.html#virtual-hosted-style-access).
9+
10+
```bash
11+
https://bucket-name.s3.region-code.amazonaws.com/key-name
12+
```
13+
14+
You can use POSIX wildcards to match multiple files or prefixes. See the [reference documentation](/integrations/clickpipes/object-storage/overview/#file-pattern-matching) for guidance on supported patterns.
Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,6 @@
1+
import cp_step1 from '@site/static/images/integrations/data-ingestion/clickpipes/object-storage/azure-blob-storage/cp_step1.png';
2+
import Image from '@theme/IdealImage';
3+
4+
**2.** Click the **Azure Blob Storage** tile.
5+
6+
<Image img={cp_step1} alt="Select imports" size="lg" border/>
Lines changed: 6 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,6 @@
1+
import cp_step1 from '@site/static/images/integrations/data-ingestion/clickpipes/object-storage/google-cloud-storage/cp_step1.png';
2+
import Image from '@theme/IdealImage';
3+
4+
**2.** Click the **Google Cloud Storage** tile.
5+
6+
<Image img={cp_step1} alt="Select imports" size="lg" border/>
Lines changed: 14 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,14 @@
1+
import cp_step2 from '@site/static/images/integrations/data-ingestion/clickpipes/object-storage/azure-blob-storage/cp_step2.png';
2+
import Image from '@theme/IdealImage';
3+
4+
<Image img={cp_step2} alt="Fill out connection details" size="lg" border/>
5+
6+
* **Authentication method**: the ABS ClickPipe supports [HMAC credentials](/integrations/clickpipes/object-storage/azure-blob-storage/overview/#) (`Credentials`). See the [reference documentation](/integrations/clickpipes/object-storage/azure-blob-storage/overview/#access-control) for guidance on authentication and permissions.
7+
8+
* **GCS file path**: The GCS ClickPipe uses the Cloud Storage [XML API](https://docs.cloud.google.com/storage/docs/interoperability) for interoperability, which requires the `storage.googleapis.com` endpoint:
9+
10+
```bash
11+
https://storage.googleapis.com/bucket-name/key-name
12+
```
13+
14+
You can use POSIX wildcards to match multiple files or prefixes. See the [reference documentation](/integrations/clickpipes/object-storage/overview/#file-pattern-matching) for guidance on supported patterns.

docs/integrations/data-ingestion/clickpipes/kafka/index.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -15,9 +15,9 @@ keywords: ['Kafka ClickPipes', 'Apache Kafka', 'streaming ingestion', 'real-time
1515
<!--AUTOGENERATED_START-->
1616
| Page | Description |
1717
|-----|-----|
18-
| [Creating your first Kafka ClickPipe](/integrations/clickpipes/kafka/create-your-first-kafka-clickpipe) | Step-by-step guide to creating your first Kafka ClickPipe. |
19-
| [Schema registries for Kafka ClickPipe](/integrations/clickpipes/kafka/schema-registries) | How to integrate for ClickPipes with a schema registry for schema management |
2018
| [Reference](/integrations/clickpipes/kafka/reference) | Details supported formats, sources, delivery semantics, authentication and experimental features supported by Kafka ClickPipes |
21-
| [Best practices](/integrations/clickpipes/kafka/best-practices) | Details best practices to follow when working with Kafka ClickPipes |
19+
| [Schema registries for Kafka ClickPipe](/integrations/clickpipes/kafka/schema-registries) | How to integrate for ClickPipes with a schema registry for schema management |
20+
| [Creating your first Kafka ClickPipe](/integrations/clickpipes/kafka/create-your-first-kafka-clickpipe) | Step-by-step guide to creating your first Kafka ClickPipe. |
2221
| [Kafka ClickPipes FAQ](/integrations/clickpipes/kafka/faq) | Frequently asked questions about ClickPipes for Kafka |
22+
| [Best practices](/integrations/clickpipes/kafka/best-practices) | Details best practices to follow when working with Kafka ClickPipes |
2323
<!--AUTOGENERATED_END-->

docs/integrations/data-ingestion/clickpipes/object-storage/02_reference.md

Lines changed: 0 additions & 102 deletions
This file was deleted.

0 commit comments

Comments
 (0)