Skip to content

Commit ce7ea68

Browse files
Merge pull request #268974 from SturgeonMi/patch-28
Update how-to-mltable.md
2 parents a57712b + ec4994d commit ce7ea68

File tree

2 files changed

+12
-6
lines changed

2 files changed

+12
-6
lines changed

articles/machine-learning/how-to-mltable.md

Lines changed: 8 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -486,7 +486,7 @@ MLTable supports the following path types:
486486
> `mltable` handles user credential passthrough for paths on Azure Storage and Azure Machine Learning datastores. If you don't have permission to the data on the underlying storage, you can't access the data.
487487
488488
#### A note on defining paths for Delta Lake Tables
489-
Defining paths to read Delta Lake tables is different compared to the other file types. For Delta Lake tables, the path points to a *single* folder (typically on ADLS gen2) that contains the Delta table. *time travel* is supported. The following code shows how to define a path for a Delta Lake table:
489+
Defining paths to read Delta Lake tables is different compared to the other file types. For Delta Lake tables, the path points to a *single* folder (typically on ADLS gen2) that contains the "_delta_log" folder and data files. *time travel* is supported. The following code shows how to define a path for a Delta Lake table:
490490

491491
```python
492492
import mltable
@@ -516,6 +516,13 @@ tbl = mltable.from_delta_lake(delta_table_path, timestamp_as_of=current_timestam
516516
df = tbl.to_pandas_dataframe()
517517
```
518518

519+
> [!IMPORTANT]
520+
> **Limitation**: `mltable` doesn't support extracting partition keys when reading data from Delta Lake.
521+
> The `mltable` transformation `extract_columns_from_partition_format` won't work when you are reading Delta Lake data via `mltable`.
522+
523+
> [!IMPORTANT]
524+
> `mltable` handles user credential passthrough for paths on Azure Storage and Azure Machine Learning datastores. If you don't have permission to the data on the underlying storage, you can't access the data.
525+
519526
### Files, folders and globs
520527

521528
Azure Machine Learning Tables support reading from:
@@ -525,10 +532,6 @@ Azure Machine Learning Tables support reading from:
525532
- [glob](https://wikipedia.org/wiki/Glob_(programming)) pattern(s), for example `abfss://<file_system>@<account_name>.dfs.core.windows.net/my-folder/*.csv`
526533
- Or, a combination of files, folders and globbing patterns
527534

528-
> [!IMPORTANT]
529-
> In your list of paths you **must**:
530-
> - Use the **same** schemed URI paths. For example, they must all be `abfss://` **or** `wasbs://` **or** `https://` **or** `./local_path`.
531-
> - Use Azure Machine Learning Datastores URI paths **or** Storage URI paths. For example, you cannot mix `azureml://` with `abfss://` URI paths in the list of paths.
532535

533536
### Supported data loading transformations
534537

articles/machine-learning/reference-yaml-mltable.md

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -232,6 +232,9 @@ transformations:
232232
# alternative:
233233
# version_as_of: 1
234234
```
235+
> [!IMPORTANT]
236+
> **Limitation**: `mltable` doesn't support extracting partition keys when reading data from Delta Lake.
237+
> The `mltable` transformation `extract_columns_from_partition_format` won't work when you are reading Delta Lake data via `mltable`.
235238

236239
### JSON
237240
```yaml
@@ -248,4 +251,4 @@ transformations:
248251
## Next steps
249252

250253
- [Install and use the CLI (v2)](how-to-configure-cli.md)
251-
- [Working with tables in Azure Machine Learning](how-to-mltable.md)
254+
- [Working with tables in Azure Machine Learning](how-to-mltable.md)

0 commit comments

Comments
 (0)