
Commit 96f9611

Merge pull request #205511 from MariDjo/master
Added: Resolving Delta logs failed
2 parents 8ac17a2 + 421e599 commit 96f9611

File tree

1 file changed: +15 -1 lines changed


articles/synapse-analytics/sql/resources-self-help-sql-on-demand.md

Lines changed: 15 additions & 1 deletion
@@ -6,7 +6,7 @@ ms.service: synapse-analytics
ms.topic: overview
ms.subservice: sql
ms.custom: event-tier1-build-2022
-ms.date: 05/16/2022
+ms.date: 07/21/2022
ms.author: stefanazaric
ms.reviewer: sngun, wiassaf
---
@@ -883,6 +883,20 @@ If the dataset is valid, [create a support ticket](../../azure-portal/supportabi

Now you can continue using the Delta Lake folder with Spark pool. You'll provide copied data to Microsoft support if you're allowed to share this information. The Azure team will investigate the content of the `delta_log` file and provide more information about possible errors and workarounds.

### Resolving Delta logs failed

The following error indicates that serverless SQL pool cannot resolve Delta logs:
```
Resolving Delta logs on path '%ls' failed with error: Cannot parse json object from log folder.
```

The most common cause is that the `last_checkpoint_file` in the `_delta_log` folder is larger than 200 bytes because of the `checkpointSchema` field added in Spark 3.3.
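
If you want to confirm that this is the cause, you can inspect the size of the last checkpoint file before changing anything. The following is only a minimal sketch, assuming an Azure Databricks (PySpark) notebook where the `dbutils.fs` helper is available; the storage path is a hypothetical placeholder, and the file is assumed to use the standard Delta Lake name `_last_checkpoint`.

```python
# Minimal sketch: check whether the last checkpoint file exceeds 200 bytes.
# Assumptions: Azure Databricks notebook (dbutils is available); the path below
# is a hypothetical placeholder for the _delta_log directory of your Delta Lake folder.
delta_log_path = "abfss://files@contosostorage.dfs.core.windows.net/delta/covid/_delta_log"

for entry in dbutils.fs.ls(delta_log_path):
    if entry.name == "_last_checkpoint":
        print(f"{entry.name}: {entry.size} bytes")
        if entry.size > 200:
            print("Larger than 200 bytes - serverless SQL pool may fail to parse it.")
```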

There are two options available to circumvent this error:

* Modify the appropriate configuration in your Spark notebook and generate a new checkpoint so that `last_checkpoint_file` is re-created. If you're using Azure Databricks, the configuration change is `spark.conf.set("spark.databricks.delta.checkpointSchema.writeThresholdLength", 0);` (a sketch follows this list).
* Downgrade to Spark 3.2.1.
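
As a sketch of the first option, the snippet below sets the configuration for the current notebook session; it assumes an Azure Databricks (PySpark) notebook and uses the configuration key quoted above. How quickly `last_checkpoint_file` is re-created depends on when the next checkpoint is written, which by default happens every 10 commits.

```python
# Minimal sketch of the first option (assumes an Azure Databricks PySpark notebook).

# Stop embedding the checkpointSchema field (introduced in Spark 3.3) in the
# last checkpoint file; that field is what pushes it past 200 bytes.
spark.conf.set("spark.databricks.delta.checkpointSchema.writeThresholdLength", 0)

# Optional: confirm the setting is active in the current session.
print(spark.conf.get("spark.databricks.delta.checkpointSchema.writeThresholdLength"))

# With the setting in place, continue writing to the Delta table through your
# normal pipeline. Delta Lake writes a new checkpoint after a number of commits
# (10 by default, controlled by the delta.checkpointInterval table property),
# and the re-created last checkpoint file no longer contains checkpointSchema.
```

After the new checkpoint has been written, retry the query from serverless SQL pool.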

Our engineering team is currently working on full support for Spark 3.3.
## Performance

Serverless SQL pool assigns the resources to the queries based on the size of the dataset and query complexity. You can't change or limit the resources that are provided to the queries. There are some cases where you might experience unexpected query performance degradations and you might have to identify the root causes.
