Commit f949d33

committed
fixed the block
1 parent 634a60d commit f949d33

1 file changed

+7
-7
lines changed

articles/synapse-analytics/monitoring/apache-spark-advisor.md

Lines changed: 7 additions & 7 deletions
@@ -1,6 +1,6 @@
 ---
 title: Spark Advisor
-description: Spark Advisor would be a system to automatically analyze commands/queries, and show the appropriate advise when customer execute code or query.
+description: Spark Advisor is a system to automatically analyze commands/queries, and show the appropriate advice when a customer executes code or query.
 services: synapse-analytics
 author: jejiang
 ms.author: jejiang
@@ -13,19 +13,19 @@ ms.date: 06/23/2022

 # Spark Advisor

-Spark Advisor would be a system to automatically analyze commands/queries, and show the appropriate advise when customer executes code or query. After applying the advise, you would have chance to improve your execution performance, decrease cost and fix the execution failures.
+Spark Advisor is a system to automatically analyze commands/queries, and show the appropriate advise when customer executes code or query. After applying the advice, you would have chance to improve your execution performance, decrease cost and fix the execution failures.


-## Advises provided
+## Advice provided

-## May return inconsistent results when using 'randomSplit'
+### May return inconsistent results when using 'randomSplit'
 Inconsistent or inaccurate results may be returned when working with the results of the 'randomSplit' method. Use Apache Spark (RDD) caching before using the 'randomSplit' method.

-Method randomSplit() is equivalent to performing sample() on your data frame multiple times, with each sample refetching, partitioning, and sorting your data frame within partitions. The data distribution across partitions and sorting order is important for both randomSplit() and sample(). If either change upon data refetch, there may be duplicates, or missing values across splits and the same sample using the same seed may produce different results.
+Method randomSplit() is equivalent to performing sample() on your data frame multiple times, with each sample refetching, partitioning, and sorting your data frame within partitions. The data distribution across partitions and sorting order is important for both randomSplit() and sample(). If either changes upon data refetch, there may be duplicates, or missing values across splits and the same sample using the same seed may produce different results.

 These inconsistencies may not happen on every run, but to eliminate them completely, cache your data frame, repartition on a column(s), or apply aggregate functions such as groupBy.

-## Table/View Name is already in use
+### Table/view name is already in use
 A view already exists with the same name as the created table, or a table already exists with the same name as the created view.
 When this name is used in queries or applications, only the view will be returned no matter, which one created first. To avoid conflicts, rename either the table or the view.

@@ -37,7 +37,7 @@ The selected query contains a hint that isn't recognized. Verify that the hint i
 spark.sql("SELECT /*+ unknownHint */ * FROM t1")
 ```

-### Unable to find a specified Relation name(s)
+### Unable to find a specified relation name(s)
 Unable to find the relation(s) specified in the hint. Verify that the relation(s) are spelled correctly and accessible within the scope of the hint.

 ```scala
