Commit f35a05a

Merge pull request #673 from segmentio/repo-sync
repo sync
2 parents ef4ea3c + 9ba32ec commit f35a05a

1 file changed: +1 -1 lines changed

src/connections/storage/data-lakes/index.md

Lines changed: 1 addition & 1 deletion
@@ -55,7 +55,7 @@ When setting up your data lake using the [Data Lakes catalog page](/docs/connect

 #### EMR

-Data Lakes uses an EMR cluster to run jobs that load events from all sources into Data Lakes. The [AWS resources portion of the set up instructions](/docs/connections/storage/catalog/data-lakes/#step-1---set-up-aws-resources) sets up an EMR cluster using the `m5.xlarge` node type. Data Lakes keeps the cluster always running, however the cluster auto-scales to ensure it's not always running at full capacity. Check the Terraform module documentation for the [EMR specifications](https://github.com/segmentio/terraform-aws-data-lake/tree/master/modules/emr){:target="_blank"}.
+Data Lakes uses an EMR cluster to run jobs that load events from all sources into Data Lakes. The [AWS resources portion of the set up instructions](/docs/connections/storage/catalog/data-lakes/#step-1---set-up-aws-resources) sets up an EMR cluster using the `m5.xlarge` node type. Data Lakes keeps the cluster always running, however the cluster auto-scales to ensure it's not always running at full capacity. Check the Terraform module documentation for the [EMR specifications](https://github.com/segmentio/terraform-segment-data-lakes/tree/master/aws_datalake/modules/emr){:target="_blank"}.

 #### AWS IAM role