You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* datalakes_featuregate
Updating set up instructions to replace the need to contact Support to get the in-app set up link
* Update src/connections/storage/catalog/data-lakes/index.md
Co-authored-by: LRubin <[email protected]>
* Update src/connections/storage/catalog/data-lakes/index.md
Co-authored-by: LRubin <[email protected]>
Co-authored-by: LRubin <[email protected]>
Copy file name to clipboardExpand all lines: src/connections/storage/catalog/data-lakes/index.md
+6-9Lines changed: 6 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -27,24 +27,21 @@ The Terraform module and manual set up instructions both provide a base level of
27
27
28
28
## Step 2 - Enable Data Lakes Destination
29
29
30
-
After you set up the necessary AWS resources:
30
+
After you set up the necessary AWS resources, the next step is to set up the Data Lakes destination within Segment:
31
31
32
-
1.[Contact the Support team](https://segment.com/help/contact/) to receive a link to the Data Lakes landing page in your workspace.
32
+
1.In the [Segment App](https://app.segment.com/goto-my-workspace/overview), click **Add Destination**, then search for and select **Data Lakes**.
33
33
34
-
2. Click the link provided, and from the Data Lakes landing page, click **Configure Data Lakes**.
35
-
36
-
3. Select the source to connect to the Data Lakes destination.
37
-
38
-
Each source must be individually connected to the Data Lakes destination. However, you can copy the settings from another source by clicking the “…” button (next to the button for “Setup Guide”).
39
-
40
-
> **Note**: You must include all source ids in the external ID list in the IAM policy, or else the source data cannot be synced to S3.
34
+
2. Click **Configure Data Lakes** and select the source to connect to the Data Lakes destination.
35
+
> **Warning**: You must include all source ids in the external ID list in the IAM policy, or else the source data cannot be synced to S3.
41
36
42
37
4. In the Settings tab, enter and save the following connection settings:
43
38
-**AWS Region**: The AWS Region where your EMR cluster, S3 Bucket and Glue DB reside.
44
39
-**EMR Cluster ID**: The EMR Cluster ID where the Data Lakes jobs will be run.
45
40
-**Glue Catalog ID**: The Glue Catalog ID (this must be the same as your AWS account ID).
46
41
-**IAM Role ARN**: The ARN of the IAM role that Segment will use to connect to Data Lakes.
47
42
-**S3 Bucket**: Name of the S3 bucket used by Data Lakes. The EMR cluster will store logs in this bucket.
43
+
44
+
You must individually connect each source to the Data Lakes destination. However, you can copy the settings from another source by clicking **…** ("more") (next to the button for “Set up Guide”).
48
45
49
46
5._(Optional)_**Date Partition**: Optional advanced setting to change the date partition structure, with a default structure `day=<YYYY-MM-DD>/hr=<HH>`. To use the default, leave this setting unchanged. To partition the data by a different date structure, choose one of the following options:
0 commit comments