Skip to content

Commit 309733c

Browse files
authored
Update how-to-mltable.md
Recorded mltable's limitation when extracting partition keys for data from Delta Lake > [!IMPORTANT] > **Limitation**: `mltable` doesn't support extracting partition keys when reading data from Delta Lake. > The `mltable` API `extract_columns_from_partition_format() ` won't work when you are reading Delta Lake data via `mltable`.
1 parent b1973bf commit 309733c

File tree

1 file changed

+8
-5
lines changed

1 file changed

+8
-5
lines changed

articles/machine-learning/how-to-mltable.md

Lines changed: 8 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -486,7 +486,7 @@ MLTable supports the following path types:
486486
> `mltable` handles user credential passthrough for paths on Azure Storage and Azure Machine Learning datastores. If you don't have permission to the data on the underlying storage, you can't access the data.
487487
488488
#### A note on defining paths for Delta Lake Tables
489-
Defining paths to read Delta Lake tables is different compared to the other file types. For Delta Lake tables, the path points to a *single* folder (typically on ADLS gen2) that contains the Delta table. *time travel* is supported. The following code shows how to define a path for a Delta Lake table:
489+
Defining paths to read Delta Lake tables is different compared to the other file types. For Delta Lake tables, the path points to a *single* folder (typically on ADLS gen2) that contains the "_delta_log" folder and data files. *time travel* is supported. The following code shows how to define a path for a Delta Lake table:
490490

491491
```python
492492
import mltable
@@ -516,6 +516,13 @@ tbl = mltable.from_delta_lake(delta_table_path, timestamp_as_of=current_timestam
516516
df = tbl.to_pandas_dataframe()
517517
```
518518

519+
> [!IMPORTANT]
520+
> **Limitation**: `mltable` doesn't support extracting partition keys when reading data from Delta Lake.
521+
> The `mltable` API `extract_columns_from_partition_format() ` won't work when you are reading Delta Lake data via `mltable`.
522+
523+
> [!IMPORTANT]
524+
> `mltable` handles user credential passthrough for paths on Azure Storage and Azure Machine Learning datastores. If you don't have permission to the data on the underlying storage, you can't access the data.
525+
519526
### Files, folders and globs
520527

521528
Azure Machine Learning Tables support reading from:
@@ -525,10 +532,6 @@ Azure Machine Learning Tables support reading from:
525532
- [glob](https://wikipedia.org/wiki/Glob_(programming)) pattern(s), for example `abfss://<file_system>@<account_name>.dfs.core.windows.net/my-folder/*.csv`
526533
- Or, a combination of files, folders and globbing patterns
527534

528-
> [!IMPORTANT]
529-
> In your list of paths you **must**:
530-
> - Use the **same** schemed URI paths. For example, they must all be `abfss://` **or** `wasbs://` **or** `https://` **or** `./local_path`.
531-
> - Use Azure Machine Learning Datastores URI paths **or** Storage URI paths. For example, you cannot mix `azureml://` with `abfss://` URI paths in the list of paths.
532535

533536
### Supported data loading transformations
534537

0 commit comments

Comments
 (0)