Skip to content

Commit dc69e1e

Browse files
Merge pull request #5896 from Albertyang0/2025_07-Monthly-broken-links-fix-ay-100
2025_07 - Fix monthly broken links
2 parents be2ddfa + ba7d78a commit dc69e1e

File tree

1 file changed

+1
-1
lines changed

1 file changed

+1
-1
lines changed

articles/machine-learning/how-to-access-data-interactive.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -655,7 +655,7 @@ df.head()
655655
> [!TIP]
656656
> Pandas is not designed to handle large datasets. Pandas can only process data that can fit into the memory of the compute instance.
657657
>
658-
> For large datasets, we recommend use of Azure Machine Learning managed Spark. This provides the [PySpark Pandas API](https://spark.apache.org/docs/latest/api/python/user_guide/pandas_on_spark/index.html).
658+
> For large datasets, we recommend use of Azure Machine Learning managed Spark. This provides the [PySpark Pandas API](https://spark.apache.org/docs/3.5.3/api/python/user_guide/pandas_on_spark/index.html).
659659

660660
You might want to iterate quickly on a smaller subset of a large dataset before scaling up to a remote asynchronous job. `mltable` provides in-built functionality to get samples of large data using the [take_random_sample](/python/api/mltable/mltable.mltable.mltable#mltable-mltable-mltable-take-random-sample) method:
661661

0 commit comments

Comments
 (0)