Skip to content

Commit c95dc6d

Browse files
committed
add docs to readme until GH pages successfully deployed
1 parent 0085386 commit c95dc6d

File tree

1 file changed

+64
-0
lines changed

1 file changed

+64
-0
lines changed

README.md

Lines changed: 64 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -19,3 +19,67 @@ to [MLJBase.jl](https://github.com/JuliaAI/MLJBase.jl) which do not
1919
appear in the commit history of this repository.
2020

2121
Package documentation is [here](https://JuliaAI.github.io/OpenML.jl/dev).
22+
23+
## Summary of functionality
24+
25+
- `OpenML.list_tags()`: for listing all dataset tags
26+
27+
- `OpenML.list_datasets(; tag=nothing, filter=nothing, output_format=...)`: for listing available datasets
28+
29+
- `OpenML.describe_dataset(id)`: to describe a particular dataset
30+
31+
- `OpenML.load(id; parser=:arff)`: to download a dataset
32+
33+
34+
## Installation
35+
36+
```julia
37+
using Pkg
38+
Pkg.add("OpenML")
39+
```
40+
41+
If running the demonstration below:
42+
43+
```julia
44+
Pkg.add("DataFrames")
45+
Pkg.add("ScientificTypes")
46+
```
47+
48+
## Sample usage
49+
50+
```julia
51+
using OpenML # or using MLJ
52+
using DataFrames
53+
54+
OpenML.list_tags()
55+
```
56+
57+
Listing all datasets with the "OpenML100" tag which also have `n`
58+
instances and `p` features, where `100 < n < 1000` and `1 < p < 10`:
59+
60+
```julia
61+
ds = OpenML.list_datasets(
62+
tag = "OpenML100",
63+
filter = "number_instances/100..1000/number_features/1..10",
64+
output_format = DataFrame)
65+
```
66+
67+
Describing and loading one of these datasets:
68+
69+
```julia
70+
OpenML.describe_dataset(15)
71+
table = OpenML.load(15)
72+
```
73+
74+
Converting to a data frame:
75+
76+
```julia
77+
df = DataFrame(table)
78+
```
79+
80+
Inspecting it's schema:
81+
82+
```julia
83+
using ScientificTypes
84+
schema(table)
85+
```

0 commit comments

Comments
 (0)