
Commit 0d7c306

mrjsj authored and committed
Update README and documentation structure
1 parent d5bc86e commit 0d7c306

5 files changed: 29 additions and 41 deletions

.github/workflows/docs.yml

Lines changed: 5 additions & 0 deletions
@@ -14,10 +14,15 @@ jobs:
     runs-on: ubuntu-latest
     steps:
       - uses: actions/checkout@v4
+
       - uses: actions/setup-python@v4
         with:
           python-version: 3.x
+
       - run: |
           python -m pip install --upgrade pip
           pip install -e .[docs]
+          mv README.md docs/index.md
+          mv assets/images docs/assets/images
+
       - run: mkdocs gh-deploy --force
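
The two added `mv` commands stage the README as the documentation home page and move the image folder next to it, so the README's relative image path (`assets/images/...`) still resolves from `docs/`. A minimal Python sketch of that staging step follows. It assumes the repository layout implied by this commit; `stage_docs` and `demo` are hypothetical helpers and are not part of the project. Unlike plain `mv`, the sketch creates `docs/assets` first, on the assumption that the folder may not exist yet.

```python
import shutil
import tempfile
from pathlib import Path


def stage_docs(root: Path) -> None:
    """Mirror the two `mv` commands from the workflow step (hypothetical helper)."""
    docs = root / "docs"
    # Assumption: docs/assets may be missing, so create it before moving.
    (docs / "assets").mkdir(parents=True, exist_ok=True)
    # Overwrite docs/index.md (the placeholder) with the README.
    (root / "README.md").replace(docs / "index.md")
    # Move the image directory under docs/assets/.
    shutil.move(str(root / "assets" / "images"), str(docs / "assets" / "images"))


def demo() -> bool:
    """Build a throwaway repo layout, stage it, and check the image link."""
    image_link = "assets/images/select-python-notebooks.png"
    with tempfile.TemporaryDirectory() as tmp:
        root = Path(tmp)
        (root / "docs").mkdir()
        (root / "assets" / "images").mkdir(parents=True)
        (root / image_link).write_bytes(b"")
        (root / "README.md").write_text(
            f"![How to select Python Notebook]({image_link})\n"
        )
        (root / "docs" / "index.md").write_text("# Placeholder\n")

        stage_docs(root)

        index = root / "docs" / "index.md"
        # The README content is now the docs home page, and the relative
        # image path resolves from the docs/ directory.
        return image_link in index.read_text() and (index.parent / image_link).is_file()


if __name__ == "__main__":
    print(demo())
```

Calling `demo()` runs the whole check inside a temporary directory, so it does not touch a real checkout.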

README.md

Lines changed: 22 additions & 7 deletions
@@ -2,7 +2,12 @@
 A collection of Spark-free Python utilities for working with Microsoft Fabric in Python Notebooks.
 Mainly focused on Data Engineering tasks utilising [Polars](https://github.com/pola-rs/polars) and [delta-rs](https://github.com/delta-io/delta-rs).
 
-![How to select Python Notebook](docs/images/select-python-notebooks.png)
+![How to select Python Notebook](assets/images/select-python-notebooks.png)
+
+## Installation
+```bash
+pip install msfabricutils
+```
 
 ## Features
 
@@ -14,6 +19,19 @@ Mainly focused on Data Engineering tasks utilising [Polars](https://github.com/p
 - Common transformations (add audit columns, reorder columns, deduplicate etc.)
 - Load delta tables with one of the provided load methods (upsert, overwrite, append etc.)
 
+
+
+### Fabric API
+- Create, update, delete workspaces, lakehouses, notebooks using the Fabric REST API.
+
+### Command Line Interface
+- Create, update, delete workspaces, lakehouses, notebooks using the built-in CLI.
+
+To get started, run:
+```bash
+msfu --help
+```
+
 ### Fabric DuckDB Connection
 Seamless integration between DuckDB and Microsoft Fabric Lakehouses for data exploration and analysis.
 
@@ -23,20 +41,17 @@ Seamless integration between DuckDB and Microsoft Fabric Lakehouses for data exp
 - Support for Delta Lake tables
 - Flexible table name referencing (1-part to 4-part names)
 
-## Installation
-```bash
-pip install msfabricutils
-```
 
 ## Documentation
 For quick start and detailed documentation, examples, and API reference, visit our [GitHub Pages documentation](https://mrjsj.github.io/msfabricutils/).
 
-# Contributing
+## Contributing
 Contributions are welcome! Here are some ways you can contribute:
+
 - Report bugs and feature requests through GitHub issues
 - Submit pull requests for bug fixes or new features
 - Improve documentation
 - Share ideas for new utilities
 
-# License
+## License
 This project is licensed under the MIT License - see the LICENSE file for details.

docs/index.md

Lines changed: 1 addition & 33 deletions
@@ -1,33 +1 @@
-# MSFabricUtils
-
-A collection of **Spark-free** and **local-first** Python utilities for working with Microsoft Fabric Lakehouses locally or in the Fabric Python Notebook experience.
-
-## Features
-
-- **ETL Utilities** - Extract, Transform, Load data from and to Microsoft Fabric Lakehouses. While the utilities can be configured to fit different needs, its defaults are highly opinionated for what we believe are sensible defaults for many use cases
-- **Fabric API** - Access Fabric APIs from Python, such as workspaces and lakehouses
-- **Local development first** - Aim to provide a local development for Microsoft Fabric solutions
-- **DuckDB Connection**
-- Seamless integration between DuckDB and Microsoft Fabric Lakehouses
-- Cross-lakehouse and cross-workspace querying
-- Delta Lake writing features
-
-## Core dependencies
-
-MSFabricUtils is built on top of modern, high-performance Python libraries:
-
-- **[delta-rs](https://delta-io.github.io/delta-rs)** - A native Rust implementation of Delta Lake, providing fast and reliable Delta Lake operations without the need for a Spark cluster
-- **[Polars](https://pola.rs)** - A lightning-fast DataFrame library written in Rust, offering superior performance for data manipulation tasks
-- **[DuckDB](https://duckdb.org)** - An embedded analytical database engine, enabling SQL queries with at blazing speed
-
-These dependencies were chosen specifically to:
-
-- Provide Spark-like functionality without the overhead of a Spark cluster
-- Enable high-performance data processing on a single machine
-- Support both local development and cloud deployment scenarios
-- Maintain compatibility with Delta Lake format used in Microsoft Fabric
-
-
-## Ideas for improvements
-Got an idea? Add an issue on [github](https://www.github.com/mrjsj/msfabricutils/issues)!
-
+# Placeholder

docs/usage/fabric-api.md

Lines changed: 1 addition & 1 deletion
@@ -1,6 +1,6 @@
 # Fabric API
 
-A collection of helper functions for working with the Fabric API. See the [API Reference](../core/fabric-api.md) for more details.
+A collection of helper functions for working with the Fabric API. See the [API Reference](../core/fabric-api/index.md) for more details.
 
 ## List workspaces
 
