You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
* Import druiddb
* Update requirements and entry points
* Remove fstrings
* Fix unit test
* Move unit test to sub directory
* Update docs
* Change history filename
* Fix name
Copy file name to clipboardExpand all lines: README.md
+59-9Lines changed: 59 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,6 +1,8 @@
1
-
#pydruid
2
-
pydruid exposes a simple API to create, execute, and analyze [Druid](http://druid.io/) queries. pydruid can parse query results into [Pandas](http://pandas.pydata.org/) DataFrame objects for subsequent data analysis -- this offers a tight integration between [Druid](http://druid.io/), the [SciPy](http://www.scipy.org/stackspec.html) stack (for scientific computing) and [scikit-learn](http://scikit-learn.org/stable/) (for machine learning). Additionally, pydruid can export query results into TSV or JSON for further processing with your favorite tool, e.g., R, Julia, Matlab, Excel.
3
-
It provides both synchronous and asynchronous clients.
1
+
# pydruid
2
+
3
+
pydruid exposes a simple API to create, execute, and analyze [Druid](http://druid.io/) queries. pydruid can parse query results into [Pandas](http://pandas.pydata.org/) DataFrame objects for subsequent data analysis -- this offers a tight integration between [Druid](http://druid.io/), the [SciPy](http://www.scipy.org/stackspec.html) stack (for scientific computing) and [scikit-learn](http://scikit-learn.org/stable/) (for machine learning). pydruid can export query results into TSV or JSON for further processing with your favorite tool, e.g., R, Julia, Matlab, Excel. It provides both synchronous and asynchronous clients.
4
+
5
+
Additionally, pydruid implements the [Python DB API 2.0](https://www.python.org/dev/peps/pep-0249/), a [SQLAlchemy dialect](http://docs.sqlalchemy.org/en/latest/dialects/), and a provides a command line interface to interact with Druid.
4
6
5
7
To install:
6
8
```python
@@ -11,10 +13,15 @@ pip install pydruid[async]
11
13
pip install pydruid[pandas]
12
14
# or, if you intend to do both
13
15
pip install pydruid[async, pandas]
16
+
# or, if you want to use the SQLAlchemy engine
17
+
pip install pydruid[sqlalchemy]
18
+
# or, if you want to use the CLI
19
+
pip install pydruid[cli]
14
20
```
15
21
Documentation: https://pythonhosted.org/pydruid/.
16
22
17
-
#examples
23
+
# examples
24
+
18
25
The following exampes show how to execute and analyze the results of three types of queries: timeseries, topN, and groupby. We will use these queries to ask simple questions about twitter's public data set.
0 commit comments