Commit 2de497d

Merge pull request #265264 from AjayBathini-MSFT/patch-142
(AzureCXP) fixes MicrosoftDocs/azure-docs#119494
2 parents e65cc8a + b565ddc commit 2de497d

File tree: 1 file changed, +4 −4 lines


articles/data-factory/concepts-integration-runtime-performance.md

Lines changed: 4 additions & 4 deletions
@@ -29,11 +29,11 @@ If your data flow has many joins and lookups, you may want to use a **memory opt
 
 ## Cluster size
 
-Data flows distribute the data processing over different nodes in a Spark cluster to perform operations in parallel. A Spark cluster with more cores increases the number of nodes in the compute environment. More nodes increase the processing power of the data flow. Increasing the size of the cluster is often an easy way to reduce the processing time.
+Data flows distribute the data processing over different cores in a Spark cluster to perform operations in parallel. A Spark cluster with more cores increases the number of cores in the compute environment. More cores increase the processing power of the data flow. Increasing the size of the cluster is often an easy way to reduce the processing time.
 
-The default cluster size is four driver nodes and four worker nodes (small). As you process more data, larger clusters are recommended. Below are the possible sizing options:
+The default cluster size is four driver cores and four worker cores (small). As you process more data, larger clusters are recommended. Below are the possible sizing options:
 
-| Worker Nodes | Driver Nodes | Total Nodes | Notes |
+| Worker Cores | Driver Cores | Total Cores | Notes |
 | ------------ | ------------ | ----------- | ----- |
 | 4 | 4 | 8 | Small |
 | 8 | 8 | 16 | Medium |
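The sizing table above pairs with the article's vcore-hour pricing model: cost scales with total cores multiplied by execution time. A minimal sketch of that arithmetic (the helper name and sample runtimes are illustrative assumptions, not Azure pricing data):

```python
def vcore_hours(total_cores: int, runtime_minutes: float) -> float:
    """Billed compute for one data flow run: cores x hours."""
    return total_cores * runtime_minutes / 60.0

# If doubling the cluster roughly halves the runtime (well-partitioned
# data), the billed vcore-hours stay about the same:
small = vcore_hours(total_cores=8, runtime_minutes=60)    # 8.0 vcore-hours
medium = vcore_hours(total_cores=16, runtime_minutes=30)  # 8.0 vcore-hours
```

This is why scaling up can cut wall-clock time without proportionally raising cost, as the pricing note in the diff below the table states.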
@@ -46,7 +46,7 @@ The default cluster size is four driver cores and four worker cores (small). As
 Data flows are priced at vcore-hrs meaning that both cluster size and execution-time factor into this. As you scale up, your cluster cost per minute will increase, but your overall time will decrease.
 
 > [!TIP]
-> There is a ceiling on how much the size of a cluster affects the performance of a data flow. Depending on the size of your data, there is a point where increasing the size of a cluster will stop improving performance. For example, If you have more nodes than partitions of data, adding additional nodes won't help.
+> There is a ceiling on how much the size of a cluster affects the performance of a data flow. Depending on the size of your data, there is a point where increasing the size of a cluster will stop improving performance. For example, if you have more cores than partitions of data, adding additional cores won't help.
 A best practice is to start small and scale up to meet your performance needs.
 
 ## Custom shuffle partition
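The ceiling described in the tip can be sketched as a simple `min()` relationship: Spark runs at most one task per data partition at a time, so cores beyond the partition count sit idle. The helper below is a hypothetical illustration, not part of any Azure or Spark API:

```python
def effective_parallelism(worker_cores: int, partitions: int) -> int:
    """Tasks that can actually run concurrently: capped by partition count."""
    return min(worker_cores, partitions)

effective_parallelism(8, 100)  # 8  -> more cores would still help here
effective_parallelism(64, 16)  # 16 -> cores beyond 16 add cost, not speed
```

This is the quantitative reason behind the "start small and scale up" best practice: scaling helps only while partitions outnumber cores.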
