
Commit 5659706

Merge branch 'patch-1' of https://github.com/tempacct791/azure-docs into public-prs-feb-2025-2

2 parents fcaaaf8 + df7e839

File tree: 1 file changed (+11 −11 lines)

articles/synapse-analytics/spark/apache-spark-data-visualization-tutorial.md
````diff
@@ -35,16 +35,16 @@ Create an Apache Spark Pool by following the [Create an Apache Spark pool tutori
 3. Because the raw data is in a Parquet format, you can use the Spark context to pull the file into memory as a DataFrame directly. Create a Spark DataFrame by retrieving the data via the Open Datasets API. Here, we use the Spark DataFrame *schema on read* properties to infer the datatypes and schema.
 
 ```python
-from azureml.opendatasets import NycTlcYellow
-
-from datetime import datetime
-from dateutil import parser
-
-end_date = parser.parse('2018-05-08 00:00:00')
-start_date = parser.parse('2018-05-01 00:00:00')
-
-nyc_tlc = NycTlcYellow(start_date=start_date, end_date=end_date)
-filtered_df = spark.createDataFrame(nyc_tlc.to_pandas_dataframe())
+from azureml.opendatasets import NycTlcYellow
+
+from datetime import datetime
+from dateutil import parser
+
+end_date = parser.parse('2018-05-08 00:00:00')
+start_date = parser.parse('2018-05-01 00:00:00')
+
+nyc_tlc = NycTlcYellow(start_date=start_date, end_date=end_date)
+df = spark.createDataFrame(nyc_tlc.to_pandas_dataframe())
 
 ```
 
@@ -174,4 +174,4 @@ After you finish running the application, shut down the notebook to release the
 ## Next steps
 
 - [Azure Synapse Analytics](../index.yml)
-- [Apache Spark official documentation](https://spark.apache.org/docs/latest/)
+- [Apache Spark official documentation](https://spark.apache.org/docs/latest/)
````
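For context, the edited snippet loads one week of the NYC Yellow Taxi open dataset into a Spark DataFrame; the visible content change is the rename of `filtered_df` to `df`, presumably so the variable matches how later cells in the tutorial reference it. Below is a minimal sketch of the resulting code, assuming a Synapse notebook where `spark` (a `SparkSession`) is predefined and the `azureml-opendatasets` package is installed; the trailing `printSchema`/`count` calls are illustrative additions, not part of the tutorial.

```python
# Assumes a Synapse/Spark notebook: `spark` (SparkSession) is predefined
# and the azureml-opendatasets package is installed.
from azureml.opendatasets import NycTlcYellow
from dateutil import parser

end_date = parser.parse('2018-05-08 00:00:00')
start_date = parser.parse('2018-05-01 00:00:00')

# Pull the filtered NYC Yellow Taxi data via the Open Datasets API and
# convert it to a Spark DataFrame; datatypes and schema are inferred on read.
nyc_tlc = NycTlcYellow(start_date=start_date, end_date=end_date)
df = spark.createDataFrame(nyc_tlc.to_pandas_dataframe())

# After the rename, downstream cells can refer to `df` consistently
# (illustrative checks, not in the original snippet):
df.printSchema()   # inspect the inferred schema
print(df.count())  # row count for the selected week
```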
