JuliaAI
diff --git a/‎Project.toml‎
Lines changed: 1 addition & 0 deletions b/‎Project.toml‎
Lines changed: 1 addition & 0 deletions
diff --git a/‎README.md‎
Lines changed: 1 addition & 1 deletion b/‎README.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎docs/make.jl‎
Lines changed: 1 addition & 1 deletion b/‎docs/make.jl‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎docs/src/accessor_functions.md‎
Lines changed: 2 additions & 2 deletions b/‎docs/src/accessor_functions.md‎
Lines changed: 2 additions & 2 deletions
diff --git a/‎docs/src/algorithm_traits.md‎
Lines changed: 144 additions & 0 deletions b/‎docs/src/algorithm_traits.md‎
Lines changed: 144 additions & 0 deletions
@@ -4,6 +4,7 @@ authors = ["Anthony D. Blaom <[email protected]>"]
 version = "0.1.0"
 
 [deps]
+InteractiveUtils = "b77e0a4c-d291-57a0-90e8-8db25a27a240"
 Statistics = "10745b16-79ce-11e8-11f9-7d13ad32a3b2"
 
 [extras]
 
@@ -1,6 +1,6 @@
 # LearnAPI.jl
 
-A Julia interface for training and applying machine learning models. 
+A base Julia interface for machine learning and statistics
 
 
 **Devlopement Status:**
 
@@ -16,7 +16,7 @@ makedocs(;
         "Predict and other operations" => "operations.md",
         "Accessor Functions" => "accessor_functions.md",
         "Optional Data Interface" => "optional_data_interface.md",
-        "Model Traits" => "model_traits.md",
+        "Algorithm Traits" => "algorithm_traits.md",
         "Common Implementation Patterns" => "common_implementation_patterns.md",
         "Testing an Implementation" => "testing_an_implementation.md",
     ],
 
@@ -1,8 +1,8 @@
 # Accessor Functions 
 
 > **Summary.** While byproducts of training are ordinarily recorded in the `report`
-> component of the output of `fit`/`update!`/`ingest!`, some families of models report an
-> item that is likely shared by multiple model types, and it is useful to have common
+> component of the output of `fit`/`update!`/`ingest!`, some families of algorithms report an
+> item that is likely shared by multiple algorithm types, and it is useful to have common
 > interface for accessing these directly. Training losses and feature importances are two
 > examples.
 
 
@@ -0,0 +1,144 @@
+# Algorithm Traits
+
+> **Summary.** Traits allow one to promise particular behaviour for an algorithm, such as:
+> *This algorithm supports per-observation weights, which must appear as the third
+> argument of `fit`*, or *This algorithm's `transform` method predicts `Real` vectors*.
+
+For any (non-trivial) algorithm, [`LearnAPI.functions`](@ref)`(algorithm)` must be
+overloaded to list the LearnAPI methods that have been explicitly implemented/overloaded
+(algorithm traits excluded). Overloading other traits is optional, except where required
+by the implementation of some LearnAPI method and explicitly documented in that method's
+docstring.
+
+Traits are often called on instances but are usually *defined* on algorithm *types*, as in
+
+```julia
+LearnAPI.is_pure_julia(::Type{<:MyAlgorithmType}) = true
+```
+
+which has the shorthand
+
+```julia
+@trait MyAlgorithmType is_pure_julia=true
+```
+
+So, for convenience, every trait `t` is provided the fallback implementation
+
+```julia
+t(algorithm) = t(typeof(algorithm))
+```
+
+This means `LearnAPI.is_pure_julia(algorithm) = true` whenever `algorithm isa MyAlgorithmType` in the
+above example.
+
+Every trait has a global fallback implementation for `::Type`. See the table below.
+
+## When traits depdend on more than algorithm type
+
+Traits that vary from instance to instance of the same type are disallowed, except in the
+case of composite algorithms (`is_wrapper(algorithm) = true`) where this is typically
+unavoidable. The reason for this is so one can associate, with each non-composite
+algorithm type, unique trait-based "algorithm metadata", for inclusion in searchable
+algorithm databases. This requirement occasionally requires that an existing algorithm
+implementation be split into separate LearnAPI implementations (e.g., one for regression
+and another for classification).
+
+## Special two-argument traits
+
+The two-argument version of [`LearnAPI.predict_output_scitype`](@ref) and
+[`LearnAPI.predict_output_scitype`](@ref) are the only overloadable traits with more than
+one argument. They cannot be declared using the `@trait` macro.
+
+## Trait summary
+
+**Overloadable traits** are available for overloading by any new LearnAPI
+implementation. **Derived traits** are not, and should not be called by performance
+critical code
+
+## Overloadable traits
+
+In the examples column of the table below, `Table`, `Continuous`, `Sampleable` are names owned by the
+package [ScientificTypesBase.jl](https://github.com/JuliaAI/ScientificTypesBase.jl/).
+
+| trait                                            | fallback value        | return value  | example |
+|:-------------------------------------------------|:----------------------|:--------------|:--------|
+| [`LearnAPI.functions`](@ref)`(algorithm)`            | `()`                  | implemented LearnAPI functions (traits excluded) | `(:fit, :predict)` |
+| [`LearnAPI.preferred_kind_of_proxy`](@ref)`(algorithm)` | `LearnAPI.None()`   | an instance `tp` of `KindOfProxy` for which an implementation of `LearnAPI.predict(algorithm, tp, ...)` is guaranteed. | `LearnAPI.Distribution()` |
+| [`LearnAPI.position_of_target`](@ref)`(algorithm)`   | `0`                   | ¹ the positional index of the **target** in `data` in `fit(..., data...; metadata)` calls | 2 |
+| [`LearnAPI.position_of_weights`](@ref)`(algorithm)`  | `0`                   | ¹ the positional index of **per-observation weights** in `data` in `fit(..., data...; metadata)` | 3 |
+| [`LearnAPI.descriptors`](@ref)`(algorithm)`          | `()`                  | lists one or more suggestive algorithm descriptors from `LearnAPI.descriptors()` | (:classifier, :probabilistic) |
+| [`LearnAPI.is_pure_julia`](@ref)`(algorithm)`        | `false`               | is `true` if implementation is 100% Julia code | `true` |
+| [`LearnAPI.pkg_name`](@ref)`(algorithm)`             | `"unknown"`           | name of package providing core code (may be different from package providing LearnAPI.jl implementation) | `"DecisionTree"` |
+| [`LearnAPI.pkg_license`](@ref)`(algorithm)`          | `"unknown"`             | name of license of package providing core code | `"MIT"` |
+| [`LearnAPI.doc_url`](@ref)`(algorithm)`               | `"unknown"`             | url providing documentation of the core code  | `"https://en.wikipedia.org/wiki/Decision_tree_learning"` |
+| [`LearnAPI.load_path`](@ref)`(algorithm)`            | `"unknown"`             | a string indicating where the struct for `typeof(algorithm)` is defined, beginning with name of package providing implementation | `FastTrees.LearnAPI.DecisionTreeClassifier` |
+| [`LearnAPI.is_wrapper`](@ref)`(algorithm)`          | `false`                | is `true` if one or more properties (fields) of `algorithm` may be an algorithm | `true` |
+| [`LearnAPI.human_name`](@ref)`(algorithm)`          | type name with spaces  | human name for the algorithm; should be a noun | "elastic net regressor" |
+| [`LearnAPI.iteration_parameter`](@ref)`(algorithm)` | `nothing`                | symbolic name of an iteration parameter | :epochs |
+| [`LearnAPI.fit_keywords`](@ref)`(algorithm)`        |  `()`                  | tuple of symbols for keyword arguments accepted by `fit` (corresponding  to metadata) | `(:class_weights,)` |
+| [`LearnAPI.fit_scitype`](@ref)`(algorithm)`      | `Union{}` | upper bound on `scitype(data)` in `fit(algorithm, verbosity, data...)`² | `Tuple{Table(Continuous), AbstractVector{Continuous}}` |
+| [`LearnAPI.fit_observation_scitype`](@ref)`(algorithm)` | `Union{}`| upper bound on `scitype(observation)` for `observation` in `data` and `data` in `fit(algorithm, verbosity, data...)`² | `Tuple{AbstractVector{Continuous}, Continuous}` |
+| [`LearnAPI.fit_type`](@ref)`(algorithm)`            | `Union{}` | upper bound on `type(data)` in `fit(algorithm, verbosity, data...)`² | `Tuple{AbstractMatrix{<:Real}, AbstractVector{<:Real}}` |
+| [`LearnAPI.fit_observation_type`](@ref)`(algorithm)`    | `Union{}`| upper bound on `type(observation)` for `observation` in `data` and `data` in `fit(algorithm, verbosity, data...)`*    | `Tuple{AbstractVector{<:Real}, Real}` |
+| [`LearnAPI.predict_input_scitype`](@ref)`(algorithm)`  | `Union{}` | upper bound on `scitype(data)` in `predict(algorithm, fitted_params, data...)`²   | `Table(Continuous)` |
+| [`LearnAPI.predict_output_scitype`](@ref)`(algorithm, kind_of_proxy)` | `Any`     | upper bound on `scitype(first(predict(algorithm, kind_of_proxy, ...)))` | `AbstractVector{Continuous}` |
+| [`LearnAPI.predict_input_type`](@ref)`(algorithm)`     | `Union{}` | upper bound on `typeof(data)` in `predict(algorithm, fitted_params, data...)`²    | `AbstractMatrix{<:Real}` |
+| [`LearnAPI.predict_output_type`](@ref)`(algorithm, kind_of_proxy)`    | `Any`     | upper bound on `typeof(first(predict(algorithm, kind_of_proxy, ...)))`                           | `AbstractVector{<:Real}` |
+| [`LearnAPI.transform_input_scitype`](@ref)`(algorithm)`  | `Union{}` | upper bound on `scitype(data)` in `transform(algorithm, fitted_params, data...)`²   | `Table(Continuous)` |
+| [`LearnAPI.transform_output_scitype`](@ref)`(algorithm)` | `Any`     | upper bound on `scitype(first(transform(algorithm, ...)))`                          |  `Table(Continuous)` |
+| [`LearnAPI.transform_input_type`](@ref)`(algorithm)`     | `Union{}` | upper bound on `typeof(data)` in `transform(algorithm, fitted_params, data...)`²    | `AbstractMatrix{<:Real}}` |
+| [`LearnAPI.transform_output_type`](@ref)`(algorithm)`    | `Any`     | upper bound on `typeof(first(transform(algorithm, ...)))`                           | `AbstractMatrix{<:Real}` |
+
+¹ If the value is `0`, then the variable in boldface type is not supported and not
+expected to appear in `data`. If `length(data)` is less than the trait value, then `data`
+is understood to exclude the variable, but note that `fit` can have multiple signatures of
+varying lengths, as in `fit(algorithm, verbosity, X, y)` and `fit(algorithm, verbosity, X, y,
+w)`. A non-zero value is a promise that `fit` includes a signature of sufficient length to
+include the variable.
+
+² Assuming no [optional data interface](@ref data_interface) is implemented. See docstring
+for the general case.
+
+
+## Derived Traits
+
+The following convenience methods are provided but intended for overloading:
+
+| trait                                | return value                              | example    |
+|:-------------------------------------|:------------------------------------------|:-----------|
+| `LearnAPI.name(algorithm)`           | algorithm type name as string                 | "PCA"  |
+| `LearnAPI.is_algorithm(algorithm)`   | `true` if `functions(algorithm)` is not empty | `true` |
+| [`LearnAPI.predict_output_scitype`](@ref)(algorithm) | dictionary of upper bounds on the scitype of predictions, keyed on subtypes of [`LearnAPI.KindOfProxy`](@ref) |
+| [`LearnAPI.predict_output_type`](@ref)(algorithm)    | dictionary of upper bounds on the type of predictions, keyed on subtypes of [`LearnAPI.KindOfProxy`](@ref)    |
+
+
+## Reference
+
+```@docs
+LearnAPI.functions
+LearnAPI.preferred_kind_of_proxy
+LearnAPI.position_of_target
+LearnAPI.position_of_weights
+LearnAPI.descriptors
+LearnAPI.is_pure_julia
+LearnAPI.pkg_name
+LearnAPI.pkg_license
+LearnAPI.doc_url
+LearnAPI.load_path
+LearnAPI.is_wrapper
+LearnAPI.fit_keywords
+LearnAPI.human_name
+LearnAPI.iteration_parameter
+LearnAPI.fit_scitype
+LearnAPI.fit_type
+LearnAPI.fit_observation_scitype
+LearnAPI.fit_observation_type
+LearnAPI.predict_input_scitype
+LearnAPI.predict_output_scitype
+LearnAPI.predict_input_type
+LearnAPI.predict_output_type
+LearnAPI.transform_input_scitype
+LearnAPI.transform_output_scitype
+LearnAPI.transform_input_type
+LearnAPI.transform_output_type
+```