docs

Michael Abbott · Michael Abbott · commit ce17dd51fc9c · 2019-06-11T09:47:46.000+02:00
diff --git a/LICENSE.md b/LICENSE.md
@@ -0,0 +1,22 @@
+The SliceMap.jl package is licensed under the MIT "Expat" License:
+
+> Copyright (c) 2019: Michael Abbott.
+>
+> Permission is hereby granted, free of charge, to any person obtaining a copy
+> of this software and associated documentation files (the "Software"), to deal
+> in the Software without restriction, including without limitation the rights
+> to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
+> copies of the Software, and to permit persons to whom the Software is
+> furnished to do so, subject to the following conditions:
+>
+> The above copyright notice and this permission notice shall be included in all
+> copies or substantial portions of the Software.
+>
+> THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
+> IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
+> FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
+> AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
+> LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
+> OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
+> SOFTWARE.
+>
diff --git a/README.md b/README.md
@@ -15,140 +15,14 @@ maprows(f, M) ≈ mapslices(f, M, dims=2)
 slicemap(f, A; dims) ≈ mapslices(f, A, dims=dims) # only Zygote
 ```
 
-<!--
-It also defines Zygote gradients for the Slice/Align functions in 
+The capitalised functions differ in two ways: they use [StaticArrays](https://github.com/JuliaArrays/StaticArrays.jl)
+slices, and they use [ForwardDiff](https://github.com/JuliaDiff/ForwardDiff.jl) for the gradient of each slice,
+instead of the same reverse-mode Tracker/Zygote used for the outer gradient.
+For small slices, this will often be much faster, with or without gradients.
+
+The package also defines Zygote gradients for the Slice/Align functions in 
 [JuliennedArrays](https://github.com/bramtayl/JuliennedArrays.jl), 
 and the slice/glue functions in [TensorCast](https://github.com/mcabbott/TensorCast.jl), 
-both of which are good ways to roll-your-own `mapslices`-like behaviour.
--->
-
-### Simple example
-
-```julia
-mat = rand(1:9, 3,10)
-fun(x) = 2 .+ x.^2
-mapslices(fun, mat, dims=1)
-
-using SliceMap
-mapcols(fun, mat)     # eachcol(m)
-MapCols{3}(fun, mat)  # reinterpret(SArray,...)
-
-using ForwardDiff, Tracker, Zygote
-ForwardDiff.gradient(m -> sum(sin, mapslices(fun, m, dims=1)), mat)
-
-Tracker.gradient(m -> sum(sin, mapcols(fun, m)), mat)[1]     # Tracker.forward per slice
-Tracker.gradient(m -> sum(sin, MapCols{3}(fun, m)), mat)[1]  # ForwardDiff on slices
-
-Zygote.gradient(m -> sum(sin, mapcols(fun, m)), mat)[1]      # Zygote.forward per slice
-Zygote.gradient(m -> sum(sin, MapCols{3}(fun, m)), mat)[1]
-```
-
-These are a bit faster than `mapslices` too. Although storing all the backward functions, 
-which is what `mapcols` does, seems not to be so quick:
-
-```julia
-using BenchmarkTools
-mat1k = rand(3,1000);
-
-@btime mapreduce(fun, hcat, eachcol($mat1k)) # 1.522 ms
-@btime mapslices(fun, $mat1k, dims=1)        # 1.017 ms
-
-@btime mapcols(fun, $mat1k)                  #   399.016 μs
-@btime MapCols{3}(fun, $mat1k)               #    15.564 μs
-@btime MapCols(fun, $mat1k)                  #    16.774 μs  without size
-
-@btime ForwardDiff.gradient(m -> sum(sin, mapslices(fun, m, dims=1)), $mat1k); # 372.705 ms
-@btime Tracker.gradient(m -> sum(sin, mapcols(fun, m)), $mat1k);               #  70.203 ms
-@btime Tracker.gradient(m -> sum(sin, MapCols{3}(fun, m)), $mat1k);            #     146.561 μs, 330.51 KiB
-@btime Zygote.gradient(m -> sum(sin, mapcols(fun, m)), $mat1k);                #  20.018 ms, 3.82 MiB
-@btime Zygote.gradient(m -> sum(sin, MapCols{3}(fun, m)), $mat1k);             #     245.550 μs
-```
-
-### Other packages
-
-This package also provides Zygote gradients for the slice/glue functions in 
-[TensorCast](https://github.com/mcabbott/TensorCast.jl),
-which can be used to write many mapslices-like operations.
-(The function `slicemap(f, A, dims)` uses these functions, without having to write index notation.)
-
-```julia
-using TensorCast
-@cast [i,j] := fun(mat[:,j])[i]                        # same as mapcols
-
-tcm(mat) = @cast out[i,j] := fun(mat[:,j])[i]
-Zygote.gradient(m -> sum(sin, tcm(m)), mat)[1]
-
-@btime tcm($mat1k)                                     #    407.176 μs
-@btime Zygote.gradient(m -> sum(sin, tcm(m)), $mat1k); # 19.086 ms
-```
-
-Similar gradients work for the Slice/Align functions in 
-[JuliennedArrays](https://github.com/bramtayl/JuliennedArrays.jl),
-so it defines these too:
-
-```julia
-using JuliennedArrays
-jumap(f,m) = Align(map(f, Slices(m, True(), False())), True(), False())
-jumap(fun, mat)                                               # same as mapcols
-Zygote.gradient(m -> sum(sin, jumap(fun, m)), mat)[1]
-
-@btime jumap(fun, $mat1k);                                    #    408.259 μs
-@btime Zygote.gradient(m -> sum(sin, jumap(fun, m)), $mat1k); # 18.638 ms
-```
-
-That's a 2-line gradient definition, so borrowing it may be easier than depending on this package. 
-
-The original purpose of `MapCols`, with ForwardDiff on slices, was that this works well when
-the function being mapped integrates some differential equation. 
-
-```julia
-using DifferentialEquations, ParameterizedFunctions
-ode = @ode_def begin
-  du = ( - k2 * u )/(k1 + u) # an equation with 2 parameters
-end k1 k2
-
-function g(k::AbstractVector{T}, times) where T
-    u0 = T[ 1.0 ] # NB convert initial values to eltype(k)
-    prob = ODEProblem(ode, u0, (0.0, 0.0+maximum(times)), k)
-    Array(solve(prob, saveat=times))::Matrix{T}
-end
-
-kay = rand(2,50);
-MapCols{2}(g, kay, 1:5) # 5 time steps, for each col of parameters
-
-Tracker.gradient(k -> sum(sin, MapCols{2}(g, k, 1:5)), kay)[1]
-```
-
-This is both quite efficient, and seems to go well with multi-threading:
-
-```julia
-@btime MapCols{2}(g, $kay, 1:5)        # 1.369 ms
-@btime ThreadMapCols{2}(g, $kay, 1:5)  #   670.384 μs
-
-@btime Tracker.gradient(k -> sum(sin, MapCols{2}(g, k, 1:5)), $kay)[1]       # 2.438 ms
-@btime Tracker.gradient(k -> sum(sin, ThreadMapCols{2}(g, k, 1:5)), $kay)[1] # 1.229 ms
-
-Threads.nthreads() == 4
-```
-
-### Elsewhere
-
-Issues about mapslices:
-* https://github.com/FluxML/Zygote.jl/issues/92
-* https://github.com/FluxML/Flux.jl/issues/741
-* https://github.com/JuliaLang/julia/issues/29146
-
-Differential equations:
-* https://arxiv.org/abs/1812.01892 "DSAAD"
-* http://docs.juliadiffeq.org/latest/analysis/sensitivity.html
-
-Other packages which define gradients of possible interest:
-* https://github.com/GiggleLiu/LinalgBackwards.jl
-* https://github.com/mcabbott/ArrayAllez.jl
+both of which are good ways to roll your own `mapslices`-like operations.
 
-Differentiation packages this could perhaps support, quite the zoo:
-* https://github.com/dfdx/Yota.jl
-* https://github.com/invenia/Nabla.jl
-* https://github.com/denizyuret/AutoGrad.jl
-* https://github.com/Roger-luo/YAAD.jl
-* And perhaps one day, just https://github.com/JuliaDiff/ChainRules.jl
+There are more details & examples at [docs/intro.md](docs/intro.md). 
diff --git a/docs/intro.md b/docs/intro.md
@@ -0,0 +1,134 @@
+# SliceMap.jl
+
+Some examples & benchmarks. 
+
+## Simple example
+
+```julia
+mat = rand(1:9, 3,10)
+fun(x) = 2 .+ x.^2
+mapslices(fun, mat, dims=1)
+
+using SliceMap
+mapcols(fun, mat)     # slices via eachcol(mat)
+MapCols{3}(fun, mat)  # slices via reinterpret(SArray, ...)
+
+using ForwardDiff, Tracker, Zygote
+ForwardDiff.gradient(m -> sum(sin, mapslices(fun, m, dims=1)), mat)
+
+Tracker.gradient(m -> sum(sin, mapcols(fun, m)), mat)[1]     # Tracker.forward per slice
+Tracker.gradient(m -> sum(sin, MapCols{3}(fun, m)), mat)[1]  # ForwardDiff on slices
+
+Zygote.gradient(m -> sum(sin, mapcols(fun, m)), mat)[1]      # Zygote.forward per slice
+Zygote.gradient(m -> sum(sin, MapCols{3}(fun, m)), mat)[1]   # ForwardDiff on slices
+```
+
+These are a bit faster than `mapslices` too, although storing all the backward functions,
+which is what `mapcols` does, has some overhead:
+
+```julia
+using BenchmarkTools
+mat1k = rand(3,1000);
+
+@btime mapreduce(fun, hcat, eachcol($mat1k)) # 1.522 ms
+@btime mapslices(fun, $mat1k, dims=1)        # 1.017 ms
+
+@btime mapcols(fun, $mat1k)                  #   399.016 μs
+@btime MapCols{3}(fun, $mat1k)               #    15.564 μs
+@btime MapCols(fun, $mat1k)                  #    16.774 μs  without size
+
+@btime ForwardDiff.gradient(m -> sum(sin, mapslices(fun, m, dims=1)), $mat1k); # 372.705 ms
+@btime Tracker.gradient(m -> sum(sin, mapcols(fun, m)), $mat1k);               #  70.203 ms
+@btime Tracker.gradient(m -> sum(sin, MapCols{3}(fun, m)), $mat1k);            #     146.561 μs, 330.51 KiB
+@btime Zygote.gradient(m -> sum(sin, mapcols(fun, m)), $mat1k);                #  20.018 ms, 3.82 MiB
+@btime Zygote.gradient(m -> sum(sin, MapCols{3}(fun, m)), $mat1k);             #     245.550 μs
+```
+
+## Other packages
+
+This package also provides Zygote gradients for the slice/glue functions in 
+[TensorCast](https://github.com/mcabbott/TensorCast.jl),
+which can be used to write many `mapslices`-like operations.
+(The function `slicemap(f, A; dims)` uses these functions, without having to write index notation.)
+
+```julia
+using TensorCast
+@cast [i,j] := fun(mat[:,j])[i]                        # same as mapcols
+
+tcm(mat) = @cast out[i,j] := fun(mat[:,j])[i]
+Zygote.gradient(m -> sum(sin, tcm(m)), mat)[1]
+
+@btime tcm($mat1k)                                     #    407.176 μs
+@btime Zygote.gradient(m -> sum(sin, tcm(m)), $mat1k); # 19.086 ms
+```
+
+Similar gradients work for the Slice/Align functions in 
+[JuliennedArrays](https://github.com/bramtayl/JuliennedArrays.jl),
+so this package defines these too:
+
+```julia
+using JuliennedArrays
+jumap(f,m) = Align(map(f, Slices(m, True(), False())), True(), False())
+jumap(fun, mat)                                               # same as mapcols
+Zygote.gradient(m -> sum(sin, jumap(fun, m)), mat)[1]
+
+@btime jumap(fun, $mat1k);                                    #    408.259 μs
+@btime Zygote.gradient(m -> sum(sin, jumap(fun, m)), $mat1k); # 18.638 ms
+```
+
+That's a 2-line gradient definition, so borrowing it may be easier than depending on this package. 
+
+The original motivation for `MapCols`, with ForwardDiff on slices, was that it works well when
+the function being mapped integrates some differential equation.
+
+```julia
+using DifferentialEquations, ParameterizedFunctions
+ode = @ode_def begin
+  du = ( - k2 * u )/(k1 + u) # an equation with 2 parameters
+end k1 k2
+
+function g(k::AbstractVector{T}, times) where T
+    u0 = T[ 1.0 ] # NB convert initial values to eltype(k)
+    prob = ODEProblem(ode, u0, (0.0, 0.0+maximum(times)), k)
+    Array(solve(prob, saveat=times))::Matrix{T}
+end
+
+kay = rand(2,50);
+MapCols{2}(g, kay, 1:5) # 5 time steps, for each column of parameters
+
+Tracker.gradient(k -> sum(sin, MapCols{2}(g, k, 1:5)), kay)[1]
+```
+
+This is quite efficient, and seems to go well with multi-threading:
+
+```julia
+@btime MapCols{2}(g, $kay, 1:5)        # 1.423 ms
+@btime ThreadMapCols{2}(g, $kay, 1:5)  #   713.748 μs
+
+@btime Tracker.gradient(k -> sum(sin, MapCols{2}(g, k, 1:5)), $kay)[1]       # 2.535 ms
+@btime Tracker.gradient(k -> sum(sin, ThreadMapCols{2}(g, k, 1:5)), $kay)[1] # 1.333 ms
+
+Threads.nthreads() == 4 # on my 2/4-core laptop
+```
+
+## Elsewhere
+
+Issues about mapslices:
+* https://github.com/FluxML/Zygote.jl/issues/92
+* https://github.com/FluxML/Flux.jl/issues/741
+* https://github.com/JuliaLang/julia/issues/29146
+
+Differential equations:
+* https://arxiv.org/abs/1812.01892 "DSAAD"
+* http://docs.juliadiffeq.org/latest/analysis/sensitivity.html
+
+Other packages which define gradients of possible interest:
+* https://github.com/GiggleLiu/LinalgBackwards.jl
+* https://github.com/mcabbott/ArrayAllez.jl
+
+Differentiation packages this could perhaps support, quite the zoo:
+* https://github.com/dfdx/Yota.jl
+* https://github.com/invenia/Nabla.jl
+* https://github.com/denizyuret/AutoGrad.jl
+* https://github.com/Roger-luo/YAAD.jl
+* And perhaps one day, just https://github.com/JuliaDiff/ChainRules.jl