You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: web/pandas/community/ecosystem.md
+6-157Lines changed: 6 additions & 157 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -149,20 +149,6 @@ or MATLAB, modified in a GUI, or embedded in apps and dashboards. Plotly
149
149
is free for unlimited sharing, and has cloud, offline, or on-premise
150
150
accounts for private use.
151
151
152
-
### [Lux](https://github.com/lux-org/lux)
153
-
154
-
Lux is a Python library that facilitates fast and easy experimentation with data by automating the visual data exploration process. To use Lux, simply add an extra import alongside pandas:
155
-
156
-
```python
157
-
import lux
158
-
import pandas as pd
159
-
160
-
df = pd.read_csv("data.csv")
161
-
df # discover interesting insights!
162
-
```
163
-
164
-
By printing out a dataframe, Lux automatically [recommends a set of visualizations](https://raw.githubusercontent.com/lux-org/lux-resources/master/readme_img/demohighlight.gif) that highlights interesting trends and patterns in the dataframe. Users can leverage any existing pandas commands without modifying their code, while being able to visualize their pandas data structures (e.g., DataFrame, Series, Index) at the same time. Lux also offers a [powerful, intuitive language](https://lux-api.readthedocs.io/en/latest/source/guide/vis.html) that allow users to create Altair, matplotlib, or Vega-Lite visualizations without having to think at the level of code.
165
-
166
152
### [D-Tale](https://github.com/man-group/dtale)
167
153
168
154
D-Tale is a lightweight web client for visualizing pandas data structures. It
@@ -384,92 +370,14 @@ Use `pandas_gbq.read_gbq` and `pandas_gbq.to_gbq`, instead.
ArcticDB is a serverless DataFrame database engine designed for the Python Data Science ecosystem. ArcticDB enables you to store, retrieve, and process pandas DataFrames at scale. It is a storage engine designed for object storage and also supports local-disk storage using LMDB. ArcticDB requires zero additional infrastructure beyond a running Python environment and access to object storage and can be installed in seconds. Please find full documentation [here](https://docs.arcticdb.io/latest/).
388
-
389
-
#### ArcticDB Terminology
390
-
391
-
ArcticDB is structured to provide a scalable and efficient way to manage and retrieve DataFrames, organized into several key components:
392
-
393
-
-`Object Store` Collections of libraries. Used to separate logical environments from each other. Analogous to a database server.
394
-
-`Library` Contains multiple symbols which are grouped in a certain way (different users, markets, etc). Analogous to a database.
395
-
-`Symbol` Atomic unit of data storage. Identified by a string name. Data stored under a symbol strongly resembles a pandas DataFrame. Analogous to tables.
396
-
-`Version` Every modifying action (write, append, update) performed on a symbol creates a new version of that object.
397
-
398
-
#### Installation
399
-
400
-
To install, simply run:
401
-
402
-
```console
403
-
pip install arcticdb
404
-
```
405
-
406
-
To get started, we can import ArcticDB and instantiate it:
407
-
408
-
```python
409
-
import arcticdb as adb
410
-
import numpy as np
411
-
import pandas as pd
412
-
# this will set up the storage using the local file system
413
-
arctic = adb.Arctic("lmdb://arcticdb_test")
414
-
```
415
-
416
-
> **Note:** ArcticDB supports any S3 API compatible storage, including AWS. ArcticDB also supports Azure Blob storage.
417
-
> ArcticDB also supports LMDB for local/file based storage - to use LMDB, pass an LMDB path as the URI: `adb.Arctic('lmdb://path/to/desired/database')`.
418
-
419
-
#### Library Setup
420
-
421
-
ArcticDB is geared towards storing many (potentially millions) of tables. Individual tables (DataFrames) are called symbols and are stored in collections called libraries. A single library can store many symbols. Libraries must first be initialized prior to use:
Now we have a library set up, we can get to reading and writing data. ArcticDB has a set of simple functions for DataFrame storage. Let's write a DataFrame to storage.
430
-
431
-
```python
432
-
df = pd.DataFrame(
433
-
{
434
-
"a": list("abc"),
435
-
"b": list(range(1, 4)),
436
-
"c": np.arange(3, 6).astype("u1"),
437
-
"d": np.arange(4.0, 7.0, dtype="float64"),
438
-
"e": [True, False, True],
439
-
"f": pd.date_range("20130101", periods=3)
440
-
}
441
-
)
442
-
443
-
df
444
-
df.dtypes
445
-
```
446
-
447
-
Write to ArcticDB.
448
-
449
-
```python
450
-
write_record = lib.write("test", df)
451
-
```
452
-
453
-
> **Note:** When writing pandas DataFrames, ArcticDB supports the following index types:
> The "row" concept in `head`/`tail` refers to the row number ('iloc'), not the value in the `pandas.Index` ('loc').
373
+
ArcticDB is a serverless DataFrame database engine designed for the Python Data Science ecosystem.
374
+
ArcticDB enables you to store, retrieve, and process pandas DataFrames at scale.
375
+
It is a storage engine designed for object storage and also supports local-disk storage using LMDB.
376
+
ArcticDB requires zero additional infrastructure beyond a running Python environment and access
377
+
to object storage and can be installed in seconds.
461
378
462
-
#### Reading Data from ArcticDB
379
+
Please find full documentation [here](https://docs.arcticdb.io/latest/).
463
380
464
-
Read the data back from storage:
465
-
466
-
```python
467
-
read_record = lib.read("test")
468
-
read_record.data
469
-
df.dtypes
470
-
```
471
-
472
-
ArcticDB also supports appending, updating, and querying data from storage to a pandas DataFrame. Please find more information [here](https://docs.arcticdb.io/latest/api/processing/#arcticdb.QueryBuilder).
0 commit comments