You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+31-31Lines changed: 31 additions & 31 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -138,7 +138,13 @@ See [examples](examples/README.md) for more information.
138
138
139
139
-[Serialize query plans using Substrait](https://github.com/apache/datafusion-python/blob/main/examples/substrait.py)
140
140
141
-
## How to install (from pip)
141
+
## How to install
142
+
143
+
### uv
144
+
145
+
```bash
146
+
uv add datafusion
147
+
```
142
148
143
149
### Pip
144
150
@@ -164,24 +170,21 @@ You can verify the installation by running:
164
170
165
171
## How to develop
166
172
167
-
This assumes that you have rust and cargo installed. We use the workflow recommended by [pyo3](https://github.com/PyO3/pyo3) and [maturin](https://github.com/PyO3/maturin).
173
+
This assumes that you have rust and cargo installed. We use the workflow recommended by [pyo3](https://github.com/PyO3/pyo3) and [maturin](https://github.com/PyO3/maturin). The Maturin tools used in this workflow can be installed either via `uv` or `pip`. Both approaches should offer the same experience. It is recommended to use `uv` since it has significant performance improvements
174
+
over `pip`.
168
175
169
-
The Maturin tools used in this workflow can be installed either via Conda or Pip. Both approaches should offer the same experience. Multiple approaches are only offered to appease developer preference. Bootstrapping for both Conda and Pip are as follows.
Copy file name to clipboardExpand all lines: dev/release/README.md
+2-21Lines changed: 2 additions & 21 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -218,28 +218,9 @@ uploading them using `twine`:
218
218
twine upload --repository pypi dist-release/*
219
219
```
220
220
221
-
### Publish Python Artifacts to Anaconda
221
+
### Publish Python Artifacts to conda-forge
222
222
223
-
Publishing artifacts to Anaconda is similar to PyPi. First, Download the source tarball created in the previous step and untar it.
224
-
225
-
```bash
226
-
# Assuming you have an existing conda environment named `datafusion-dev` if not see root README for instructions
227
-
conda activate datafusion-dev
228
-
conda build .
229
-
```
230
-
231
-
This will setup a virtual conda environment and build the artifacts inside of that virtual env. This step can take a few minutes as the entire build, host, and runtime environments are setup. Once complete a local filesystem path will be emitted for the location of the resulting package. Observe that path and copy to your clipboard.
Copy file name to clipboardExpand all lines: docs/mdbook/src/installation.md
+27-26Lines changed: 27 additions & 26 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -18,44 +18,45 @@
18
18
19
19
DataFusion is easy to install, just like any other Python library.
20
20
21
-
## Using pip
21
+
## Using uv
22
22
23
-
```bash
24
-
pip install datafusion
25
-
```
23
+
If you do not yet have a virtual environment, create one:
26
24
27
-
## Conda & JupyterLab setup
25
+
```bash
26
+
uv venv
27
+
```
28
28
29
-
This section explains how to install DataFusion in a conda environment with other libraries that allow for a nice Jupyter workflow. This setup is completely optional. These steps are only needed if you'd like to run DataFusion in a Jupyter notebook and have an interface like this:
29
+
You can add datafusion to your virtual environment with the usual:
30
30
31
-

31
+
```bash
32
+
uv pip install datafusion
33
+
```
32
34
33
-
Create a conda environment with DataFusion, Jupyter, and other useful dependencies in the `datafusion-env.yml` file:
35
+
Or, to add to a project:
34
36
37
+
```bash
38
+
uv add datafusion
35
39
```
36
-
name: datafusion-env
37
-
channels:
38
-
- conda-forge
39
-
- defaults
40
-
dependencies:
41
-
- python=3.9
42
-
- ipykernel
43
-
- nb_conda
44
-
- jupyterlab
45
-
- jupyterlab_code_formatter
46
-
- isort
47
-
- black
48
-
- pip
49
-
- pip:
50
-
- datafusion
51
40
41
+
## Using pip
42
+
43
+
```bash
44
+
pip install datafusion
52
45
```
53
46
54
-
Create the environment with `conda env create -f datafusion-env.yml`.
47
+
## uv & JupyterLab setup
55
48
56
-
Activate the environment with `conda activate datafusion-env`.
49
+
This section explains how to install DataFusion in a uv environment with other libraries that allow for a nice Jupyter workflow. This setup is completely optional. These steps are only needed if you'd like to run DataFusion in a Jupyter notebook and have an interface like this:
57
50
58
-
Run `jupyter lab` or open the [JupyterLab Desktop application](https://github.com/jupyterlab/jupyterlab-desktop) to start running DataFusion in a Jupyter notebook.
51
+

52
+
53
+
Create a virtual environment with DataFusion, Jupyter, and other useful dependencies and start the desktop application.
0 commit comments