
Commit 8f548ad

OkonSamuel and ablaom authored

For a 0.2.0 release (#20)

* fix docstring and build documentation
* fix typos in ci file
* add MLJDecisionTreeInterface to docs/Project.toml file
* update index.md
* Traits (#18)
  * remove dependence of is_wrapper trait on base model.
  * bug fixes and doc build
  * fix typo in docstring
* Fix doctests (#19)
  * remove dependence of is_wrapper trait on base model.
  * bug fixes and doc build
  * fix typo in docstring
  * fix doctests
* update codecov version to avoid github rate limit
* update codecov badge
* bump 0.2.0

Co-authored-by: Anthony Blaom, PhD <[email protected]>

1 parent 5d605c6 commit 8f548ad

File tree

9 files changed: +320 −184 lines changed

.github/workflows/ci.yml

Lines changed: 5 additions & 3 deletions

```diff
@@ -66,9 +66,10 @@ jobs:
       env:
         JULIA_NUM_THREADS: 2
       - uses: julia-actions/julia-processcoverage@v1
-      - uses: codecov/codecov-action@v1
+      - uses: codecov/codecov-action@v4
         with:
           file: lcov.info
+          token: ${{ secrets.CODECOV_TOKEN }}
   docs:
     name: Documentation
     runs-on: ubuntu-latest
@@ -125,9 +126,9 @@ jobs:
         julia --project=docs -e '
           if ENV["BUILD_DOCS"] == "true"
             using Documenter: doctest
-            using MLJBase
+            using FeatureSelection
             @info "attempting to run the doctests"
-            doctest(MLJBase)
+            doctest(FeatureSelection)
           else
             @info "skipping the doctests"
           end'
@@ -142,3 +143,4 @@ jobs:
       env:
         GITHUB_TOKEN: ${{ secrets.GITHUB_TOKEN }}
         DOCUMENTER_KEY: ${{ secrets.DOCUMENTER_KEY }}
+
```

Project.toml

Lines changed: 2 additions & 2 deletions

```diff
@@ -1,7 +1,7 @@
 name = "FeatureSelection"
 uuid = "33837fe5-dbff-4c9e-8c2f-c5612fe2b8b6"
 authors = ["Anthony D. Blaom <[email protected]>", "Samuel Okon <[email protected]>"]
-version = "0.1.1"
+version = "0.2.0"

 [deps]
 MLJModelInterface = "e80e1ace-859a-464e-9ed9-23947d8ae3ea"
@@ -45,4 +45,4 @@ test = [
     "StableRNGs",
     "StatisticalMeasures",
     "Test"
-]
+]
```

README.md

Lines changed: 20 additions & 3 deletions

```diff
@@ -1,7 +1,24 @@
 # FeatureSelection.jl

-| Linux | Coverage | Code Style
-| :------------ | :------- | :------------- |
-| [![Build Status](https://github.com/JuliaAI/FeatureSelection.jl/workflows/CI/badge.svg)](https://github.com/JuliaAI/FeatureSelection.jl/actions) | [![Coverage](https://codecov.io/gh/JuliaAI/FeatureSelection.jl/branch/master/graph/badge.svg)](https://codecov.io/github/JuliaAI/FeatureSelection.jl?branch=dev) | [![Code Style: Blue](https://img.shields.io/badge/code%20style-blue-4495d1.svg)](https://github.com/invenia/BlueStyle) |
+| Linux | Coverage | Documentation | Code Style
+| :------------ | :------- | :------------- | :------------- |
+| [![Build Status](https://github.com/JuliaAI/FeatureSelection.jl/workflows/CI/badge.svg)](https://github.com/JuliaAI/FeatureSelection.jl/actions) | [![Coverage](https://codecov.io/gh/JuliaAI/FeatureSelection.jl/branch/dev/graph/badge.svg)](https://codecov.io/github/JuliaAI/FeatureSelection.jl?branch=dev) | [![Stable](https://img.shields.io/badge/docs-stable-blue.svg)](https://juliaai.github.io/FeatureSelection.jl/dev/) | [![Code Style: Blue](https://img.shields.io/badge/code%20style-blue-4495d1.svg)](https://github.com/invenia/BlueStyle) |

 Repository housing feature selection algorithms for use with the machine learning toolbox [MLJ](https://juliaai.github.io/MLJ.jl/dev/).
+
+This package provides a collection of feature selection algorithms designed for use with MLJ, a powerful machine learning toolbox in Julia. It aims to facilitate the process of selecting the most relevant features from your datasets, enhancing the performance and interpretability of your machine learning models.
+
+## Key Features
+- Integration with MLJ: Seamlessly integrates with MLJ's extensive suite of tools and models.
+- Variety of Algorithms: Includes multiple feature selection algorithms to suit different types of data and models.
+- User-friendly: Easy to use with clear documentation and examples.
+
+## Getting Started
+To get started with this package, refer to the documentation for installation instructions, usage guides, and API references.
+
+## Contributing
+Contributions are welcome! Please refer to MLJ contributing [guidelines](https://github.com/JuliaAI/MLJ.jl/blob/dev/CONTRIBUTING.md) for more information.
+
+## License
+This project is licensed under the MIT License - see the [LICENSE](LICENSE) file for details.
```

docs/Project.toml

Lines changed: 2 additions & 0 deletions

```diff
@@ -1,11 +1,13 @@
 [deps]
 Documenter = "e30172f5-a6a5-5a46-863b-614d45cd2de4"
 MLJ = "add582a8-e3ab-11e8-2d5e-e98b27df1bc7"
+MLJDecisionTreeInterface = "c6f25543-311c-4c74-83dc-3ea6d1015661"
 FeatureSelection = "33837fe5-dbff-4c9e-8c2f-c5612fe2b8b6"
 StableRNGs = "860ef19b-820b-49d6-a774-d7a799459cd3"

 [compat]
 Documenter = "^1.4"
 MLJ = "^0.20"
+MLJDecisionTreeInterface = "^0.4.2"
 StableRNGs = "^1.0"
 julia = "^1.0"
```

docs/make.jl

Lines changed: 1 addition & 0 deletions

```diff
@@ -30,5 +30,6 @@ makedocs(;
 deploydocs(;
     deploy_config = Documenter.GitHubActions(),
     repo="github.com/JuliaAI/FeatureSelection.jl.git",
+    devbranch="dev",
     push_preview=true
 )
```

docs/src/api.md

Lines changed: 5 additions & 0 deletions

````diff
@@ -6,4 +6,9 @@ CurrentModule = FeatureSelection
 ```@docs
 FeatureSelector
 RecursiveFeatureElimination
+```
+# Internal Utils
+```@docs
+abs_last
+score_features!
 ```
````

docs/src/index.md

Lines changed: 46 additions & 48 deletions

````diff
@@ -20,7 +20,7 @@ recursive feature elimination should return the first columns as important features
 ```@meta
 DocTestSetup = quote
     using MLJ, FeatureSelection, StableRNGs
-    rng = StableRNG(10)
+    rng = StableRNG(123)
     A = rand(rng, 50, 10)
     X = MLJ.table(A) # features
     y = @views(
@@ -52,16 +52,16 @@ end
 ```
 ```@example example1
 using MLJ, FeatureSelection, StableRNGs
-rng = StableRNG(10)
+rng = StableRNG(123)
 A = rand(rng, 50, 10)
 X = MLJ.table(A) # features
-y = @views(
-    10 .* sin.(
-        pi .* A[:, 1] .* A[:, 2]
-    ) .+ 20 .* (A[:, 3] .- 0.5).^ 2 .+ 10 .* A[:, 4] .+ 5 * A[:, 5]
+y = @views(
+    10 .* sin.(
+        pi .* A[:, 1] .* A[:, 2]
+    ) + 20 .* (A[:, 3] .- 0.5).^ 2 .+ 10 .* A[:, 4] .+ 5 * A[:, 5]
 ) # target
 ```
-Now that we have our data we can create our recursive feature elimination model and
+Now that we have our data, we can create our recursive feature elimination model and
 train it on our dataset
 ```@example example1
 RandomForestRegressor = @load RandomForestRegressor pkg=DecisionTree
@@ -74,51 +74,49 @@ fit!(mach)
 ```
 We can inspect the feature importances in two ways:
 ```jldoctest
-julia> report(mach).ranking
-10-element Vector{Int64}:
-  1
-  1
-  1
-  1
-  1
-  2
-  3
-  4
-  5
-  6
+julia> report(mach).scores
+Dict{Symbol, Int64} with 10 entries:
+  :x9  => 4
+  :x2  => 6
+  :x5  => 6
+  :x6  => 3
+  :x7  => 2
+  :x3  => 6
+  :x8  => 1
+  :x4  => 6
+  :x10 => 5
+  :x1  => 6

 julia> feature_importances(mach)
 10-element Vector{Pair{Symbol, Int64}}:
- :x1 => 6
- :x2 => 5
- :x3 => 4
- :x4 => 3
- :x5 => 2
- :x6 => 1
- :x7 => 1
+ :x9 => 4
+ :x2 => 6
+ :x5 => 6
+ :x6 => 3
+ :x7 => 2
+ :x3 => 6
  :x8 => 1
- :x9 => 1
- :x10 => 1
+ :x4 => 6
+ :x10 => 5
+ :x1 => 6
 ```
-Note that a variable with lower rank has more significance than a variable with higher rank while a variable with higher feature importance is better than a variable with lower feature importance.
-
 We can view the important features used by our model by inspecting the `fitted_params`
 object.
 ```jldoctest
 julia> p = fitted_params(mach)
-(features_left = [:x1, :x2, :x3, :x4, :x5],
+(features_left = [:x4, :x2, :x1, :x5, :x3],
 model_fitresult = (forest = Ensemble of Decision Trees
 Trees:      100
-Avg Leaves: 25.26
-Avg Depth:  8.36,),)
+Avg Leaves: 25.3
+Avg Depth:  8.01,),)

 julia> p.features_left
 5-element Vector{Symbol}:
- :x1
- :x2
- :x3
  :x4
+ :x2
+ :x1
  :x5
+ :x3
 ```
 We can also call the `predict` method on the fitted machine, to predict using a
 random forest regressor trained using only the important features, or call the `transform`
@@ -149,24 +147,24 @@ As before we can inspect the important features by inspecting the object returned
 ```jldoctest
 julia> fitted_params(self_tuning_rfe_mach).best_fitted_params.features_left
 5-element Vector{Symbol}:
- :x1
- :x2
- :x3
  :x4
+ :x2
+ :x1
  :x5
+ :x3

 julia> feature_importances(self_tuning_rfe_mach)
 10-element Vector{Pair{Symbol, Int64}}:
- :x1 => 6
- :x2 => 5
- :x3 => 4
- :x4 => 3
- :x5 => 2
- :x6 => 1
+ :x9 => 2
+ :x2 => 6
+ :x5 => 6
+ :x6 => 4
  :x7 => 1
- :x8 => 1
- :x9 => 1
- :x10 => 1
+ :x3 => 6
+ :x8 => 5
+ :x4 => 6
+ :x10 => 3
+ :x1 => 6
 ```
 and call `predict` on the tuned model machine as shown below
 ```@example example1
````
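Taken together, the index.md changes above amount to a revised recursive feature elimination walkthrough. As a quick orientation, the workflow the updated docs describe can be assembled from the snippets in the diff into one runnable sketch (hedged: this assumes MLJ, FeatureSelection, MLJDecisionTreeInterface, and StableRNGs are installed, and that the `RecursiveFeatureElimination` constructor accepts its base model via the `model` keyword, as the docs suggest):

```julia
using MLJ, FeatureSelection, StableRNGs

# Synthetic regression data from the docs: only the first five of ten
# columns actually influence the target.
rng = StableRNG(123)
A = rand(rng, 50, 10)
X = MLJ.table(A)  # features
y = @views(
    10 .* sin.(
        pi .* A[:, 1] .* A[:, 2]
    ) + 20 .* (A[:, 3] .- 0.5).^2 .+ 10 .* A[:, 4] .+ 5 * A[:, 5]
)  # target

# Wrap a base model in the recursive feature elimination meta-model.
RandomForestRegressor = @load RandomForestRegressor pkg=DecisionTree
rfe = RecursiveFeatureElimination(model=RandomForestRegressor())
mach = machine(rfe, X, y)
fit!(mach)

# Per-feature scores (post-0.2.0, `report` exposes `scores` rather than
# the old `ranking`; higher score means more important).
report(mach).scores

# The features retained after elimination.
fitted_params(mach).features_left
```

Note how this release flips the reporting convention: the removed prose in the diff explains that a *lower* `ranking` used to mean *more* significant, whereas the new `scores` dictionary and `feature_importances` output follow the usual higher-is-better convention.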
