
Commit 91eea8a

Merge pull request #223019 from ilijazagorac/patch-1: Wildcard expansion issue

2 parents: e27df14 + 57db980

File tree: 1 file changed (+15 −0 lines)


articles/synapse-analytics/sql/resources-self-help-sql-on-demand.md

@@ -707,6 +707,21 @@ This error might indicate that some internal process issue happened in serverles
Describe anything that might be unusual compared to the regular workload. For example, perhaps there was a large number of concurrent requests or a special workload or query started executing before this error happened.
### Wildcard expansion timed out
As described in the [Query folders and multiple files](../sql/query-folders-multiple-csv-files.md) section, serverless SQL pool supports reading multiple files and folders by using wildcards, with a maximum of 10 wildcards per query. Be aware that this functionality comes at a cost: the serverless pool must list all the files that can match each wildcard. This listing introduces latency, and the latency grows with the number of files you are trying to query. In that case, you can run into the following error:
```
"Wildcard expansion timed out after X seconds."
```
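
For illustration, a query of the following shape forces the serverless pool to enumerate every file matching the wildcards before any data is read. This is a minimal sketch; the storage account, container, and `year=`/`month=` folder layout are hypothetical:

```sql
-- Hypothetical layout: two wildcards (year and month folders) that the
-- serverless pool must expand by listing every matching file before the
-- query can start reading data.
SELECT COUNT_BIG(*)
FROM OPENROWSET(
    BULK 'https://contosostorage.dfs.core.windows.net/sales/year=*/month=*/*.parquet',
    FORMAT = 'PARQUET'
) AS [rows];
```

The more files the wildcards match, the longer the expansion phase takes, and with enough files it can exceed the timeout.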
There are several mitigation steps that you can take to avoid this error:
- Apply the best practices described in [Best Practices Serverless SQL Pool](../sql/best-practices-serverless-sql-pool.md).
- Reduce the number of files you are querying by compacting them into larger ones. Try to keep your file sizes above 100 MB.
- Make sure that filters over partitioning columns are used wherever possible.
- If you use the Delta file format, use the optimize write feature in Spark. Optimize write can improve query performance by reducing the amount of data that needs to be read and processed. How to use optimize write is described in [Using optimize write on Apache Spark](../spark/optimize-write-for-apache-spark.md).
- Use [dynamic SQL](../sql/develop-dynamic-sql.md) to avoid some of the top-level wildcards by effectively hardcoding the implicit filters over partitioning columns.
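
As a sketch of the last suggestion, dynamic SQL can splice a known partition value into the path so that a top-level wildcard disappears and the pool lists far fewer files. The storage URL, folder layout, and `@year` value here are hypothetical:

```sql
-- Hypothetical sketch: replace the top-level year=* wildcard with an
-- explicit partition value supplied at run time, leaving fewer wildcards
-- for the serverless pool to expand.
DECLARE @year varchar(4) = '2022';
DECLARE @sql nvarchar(max) = N'
SELECT COUNT_BIG(*)
FROM OPENROWSET(
    BULK ''https://contosostorage.dfs.core.windows.net/sales/year=' + @year + N'/month=*/*.parquet'',
    FORMAT = ''PARQUET''
) AS [rows];';

EXEC sp_executesql @sql;
```

The same pattern extends to any partitioning column whose values are known before the query runs.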
## Configuration
Serverless SQL pools enable you to use T-SQL to configure database objects. There are some constraints:
