Skip to content

Commit d060865

Browse files
Update apache-spark-performance.md
1 parent bc14353 commit d060865

File tree

1 file changed

+5
-4
lines changed

1 file changed

+5
-4
lines changed

articles/synapse-analytics/spark/apache-spark-performance.md

Lines changed: 5 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -57,11 +57,12 @@ Spark provides its own native caching mechanisms, which can be used through diff
5757
Spark operates by placing data in memory, so managing memory resources is a key aspect of optimizing the execution of Spark jobs. There are several techniques you can apply to use your cluster's memory efficiently.
5858

5959
* Prefer smaller data partitions and account for data size, types, and distribution in your partitioning strategy.
60-
* In Synapse Spark (Runtime 3.1 or higher) you get Kryo data serialization enabled by default [Kryo data serialization](https://github.com/EsotericSoftware/kryo).
61-
* You can customize the kryoserializer buffer size using Spark configuration based on your workload requirements
60+
* In Synapse Spark (Runtime 3.1 or higher), Kryo data serialization is enabled by default Kryo data serialization.
61+
* You can customize the kryoserializer buffer size using Spark configuration based on your workload requirements:
6262

63-
`// Set the desired property`
64-
`spark.conf.set("spark.kryoserializer.buffer.max", "256m") `
63+
```scala
64+
// Set the desired property
65+
spark.conf.set("spark.kryoserializer.buffer.max", "256m")
6566

6667
* Monitor and tune Spark configuration settings.
6768

0 commit comments

Comments
 (0)