You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/integrations/delta-lake-daft.md
+5-5Lines changed: 5 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,6 @@
1
1
# Using Delta Lake with Daft
2
2
3
-
[Daft](https://www.getdaft.io) is a framework for ETL, analytics, and ML/AI at scale with a familiar Python dataframe API, implemented in Rust.
3
+
[Daft](https://www.daft.ai) is a framework for ETL, analytics, and ML/AI at scale with a familiar Python dataframe API, implemented in Rust.
4
4
5
5
Daft and Delta Lake work really well together. Daft provides unified compute for Delta Lake’s unified storage. Together, Delta Lake and Daft give you high-performance query optimization and distributed compute on massive datasets.
6
6
@@ -85,7 +85,7 @@ You can use `write_deltalake` to write a Daft DataFrame to a Delta table:
Daft supports multiple write modes. See the [Daft documentation](https://www.getdaft.io/projects/docs/en/stable/api/dataframe/?h=write+delta#daft.DataFrame.write_deltalake) for more information.
88
+
Daft supports multiple write modes. See the [Daft documentation](https://docs.getdaft.io/en/stable/io/delta_lake/#write-to-delta-lake) for more information.
89
89
90
90
## What can I do with a Daft DataFrame?
91
91
@@ -97,7 +97,7 @@ On top of this, Daft also gives you:
97
97
-**Expressions API** for easy column transformations
98
98
-**UDFs** for multi-column transformation, incl. ML applications
99
99
100
-
Check out the [Daft User Guide](https://www.getdaft.io/projects/docs/en/stable/index.html) for a complete list of DataFrame operations.
100
+
Check out the [Daft User Guide](https://docs.getdaft.io/en/stable/) for a complete list of DataFrame operations.
101
101
102
102
## Data Skipping Optimizations
103
103
@@ -198,12 +198,12 @@ Read [High-Performance Querying on Massive Delta Lake Tables with Daft](https://
198
198
199
199
Daft has a rich multimodal type-system with support for Python objects, Images, URLs, Tensors and more.
200
200
201
-
The [Expressions API](https://www.getdaft.io/projects/docs/en/stable/api/expressions) provides useful tools to work with these data types. By combining multimodal data support with the [User-Defined Functions API](https://www.getdaft.io/projects/docs/en/stable/api/udf) you can run ML workloads right within your DataFrame.
201
+
The [Expressions API](https://docs.getdaft.io/en/stable/api/expressions/#expressions) provides useful tools to work with these data types. By combining multimodal data support with the [User-Defined Functions API](https://docs.getdaft.io/en/stable/api/udf/#daft.udf.UDF) you can run ML workloads right within your DataFrame.
202
202
203
203
Take a look at the notebook in the [`delta-examples` Github repository](https://github.com/delta-io/delta-examples) for a closer look at how Daft handles URLs, images and ML applications.
204
204
205
205
## Contribute to `daft`
206
206
207
207
Excited about Daft and want to contribute? Join them on [Github](https://github.com/Eventual-Inc/Daft) 🚀
208
208
209
-
Like many technologies, Daft collects some non-identifiable telemetry to improve the product. This is strictly non-identifiable metadata. You can disable telemetry by setting the following environment variable: `DAFT_ANALYTICS_ENABLED=0`. Read more in the [Daft documentation](https://www.getdaft.io/projects/docs/en/stable/resources/telemetry/).
209
+
Like many technologies, Daft collects some non-identifiable telemetry to improve the product. This is strictly non-identifiable metadata. You can disable telemetry by setting the following environment variable: `DAFT_ANALYTICS_ENABLED=0`. Read more in the [Daft documentation](https://docs.getdaft.io/en/stable/resources/telemetry/#telemetry).
0 commit comments