# SuiteSparseGraphBLAS.jl
Fast sparse linear algebra is an essential part of the scientific computing toolkit. Outside of the usual applications, like differential equations, sparse linear algebra provides an elegant way to express graph algorithms on adjacency and incidence matrices. The GraphBLAS standard specifies a set of operations for computing graph algorithms as sparse matrix operations, in a vein similar to the BLAS or LAPACK standards.
SuiteSparseGraphBLAS.jl is a blazing fast package for shared memory sparse matrix operations which wraps Tim Davis' SuiteSparse:GraphBLAS implementation of the GraphBLAS C specification.
# Installation
```julia
using Pkg
Pkg.add("SuiteSparseGraphBLAS")
```
The SuiteSparse:GraphBLAS binary, SSGraphBLAS_jll.jl, is installed automatically.
Then, in the REPL or a script, `using SuiteSparseGraphBLAS` will make the package available for use.
# Introduction
GraphBLAS harnesses the well-understood duality between graphs and matrices.
Specifically, a graph can be represented by the [adjacency matrix](https://en.wikipedia.org/wiki/Adjacency_matrix) and/or [incidence matrix](https://en.wikipedia.org/wiki/Incidence_matrix), or one of the many variations on those formats.
With this matrix representation in hand, we have a way to operate on the graph with linear algebra.
One important algorithm that maps well to linear algebra is Breadth First Search (BFS).
A simple BFS step is just a matrix-vector multiplication, where `A` is the adjacency matrix and `v` is the vector of source nodes.
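The underlying idea is language-agnostic. Here is an illustrative sketch in plain Python (using a hypothetical 4-node directed graph, not the SuiteSparseGraphBLAS.jl API) of one BFS step as a matrix-vector product over the Boolean semiring, where multiplication is `and` and addition is `or`:

```python
# One BFS step over the Boolean (or, and) semiring.
# Hypothetical 4-node directed graph; A[i][j] == True means an edge i -> j.
A = [
    [False, True,  False, False],  # vertex 1 -> 2
    [False, False, True,  True ],  # vertex 2 -> 3, 2 -> 4
    [False, False, False, False],
    [True,  False, False, False],  # vertex 4 -> 1
]

def bfs_step(A, v):
    """Return the frontier reachable in one hop from frontier v."""
    n = len(A)
    # y[j] = OR_i (v[i] AND A[i][j]): a matrix-vector product on the Boolean semiring.
    return [any(v[i] and A[i][j] for i in range(n)) for j in range(n)]

v = [True, False, False, False]  # start the search at vertex 1
print(bfs_step(A, v))            # the neighbors of vertex 1
```

Iterating this step (while tracking visited vertices) yields a full BFS; GraphBLAS libraries perform the same product on compressed sparse structures instead of dense lists.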
## GBArrays
The core SuiteSparseGraphBLAS.jl array types are `GBVector` and `GBMatrix`, which are subtypes of `SparseArrays.AbstractSparseVector` and `SparseArrays.AbstractSparseMatrix` respectively. There are also several auxiliary array types that restrict one or more behaviors, such as row or column orientation. More info on those types can be found ### HERE ###
!!! note "GBArray"
    These docs will often refer to the `GBArray` type, which is the union of `AbstractGBVector`, `AbstractGBMatrix`, and their lazy Transpose objects.
```@setup intro
using SuiteSparseGraphBLAS
```

Here we can already see several differences compared to `SparseArrays.SparseMatrixCSC`.
The first is that `A` is stored in `hypersparse` format, and by row.
`GBArrays` are (technically) opaque to the user in order to allow the library author to choose the best storage format.\
GraphBLAS takes advantage of this by storing matrices in one of four formats: `dense`, `bitmap`, `sparse-compressed`, or `hypersparse-compressed`; and in either `row` or `column` major orientation.\
Different matrices may be better suited to storage in one of those formats, and certain operations may perform differently on `row` or `column` major matrices.
!!! warning "Default Orientation"
    The default orientation of a `GBMatrix` is by-row, the opposite of Julia arrays. However, a `GBMatrix` constructed from a `SparseMatrixCSC` or
Information about storage formats, orientation, conversion, construction and more…
The second difference is that a `GBArray` doesn't assume the fill-in value of a sparse array.\
Since `A[1, 1]` isn't stored in the matrix (it's been "compressed" out), we return `nothing`.\
This matches the GraphBLAS spec, where `NO_VALUE` is returned rather than `zero(eltype(A))`, and is better suited to graph algorithms, where returning `zero(eltype(A))` could imply the presence of an edge with weight `zero`.\
However, this behavior can be changed with the [`setfill!`](@ref) and [`setfill`](@ref) functions.
```@repl intro
A[1, 1] === nothing
B = setfill(A, 0) # no-copy alias
B[1, 1]
```
An empty matrix and vector won't do us much good, so let's see how to construct the matrix and vector from the graphic above. Both `A` and `v` below are constructed from coordinate format or COO.
GraphBLAS operations are, where possible, methods of existing Julia functions listed in the table below.

| GraphBLAS | Math | Julia |
|:----------|:-----|:------|
|`mxm`, `mxv`, `vxm`|``\bf C \langle M \rangle = C \odot AB``|`mul[!]` or `*`|
|`eWiseMult`|``\bf C \langle M \rangle = C \odot (A \otimes B)``|`emul[!]` or `.` broadcasting |
|`eWiseAdd`|``\bf C \langle M \rangle = C \odot (A \oplus B)``|`eadd[!]`|
|`extract`|``\bf C \langle M \rangle = C \odot A(I,J)``|`extract[!]`, `getindex`|
|`subassign`|``\bf C (I,J) \langle M \rangle = C(I,J) \odot A``|`subassign[!]` or `setindex!`|
|`assign`|``\bf C \langle M \rangle (I,J) = C(I,J) \odot A``|`assign[!]`|
|`apply`|``{\bf C \langle M \rangle = C \odot} f{\bf (A)}``|`apply[!]`, `map[!]` or `.` broadcasting |
||``{\bf C \langle M \rangle = C \odot} f({\bf A},y)``||
||``{\bf C \langle M \rangle = C \odot} f(x,{\bf A})``||
|`select`|``{\bf C \langle M \rangle = C \odot} f({\bf A},k)``|`select[!]`|
|`transpose`|``\bf C \langle M \rangle = C \odot A^{\sf T}``|`gbtranspose[!]`, lazy: `transpose`, `'`|
|`kronecker`|``\bf C \langle M \rangle = C \odot \text{kron}(A, B)``|`kron[!]`|
where ``\bf M`` is a `GBArray` mask, ``\odot`` is a binary operator for accumulating into ``\bf C``, and ``\otimes`` and ``\oplus`` are a binary operation and commutative monoid respectively. ``f`` is either a unary or binary operator.
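To make the `eWiseMult` / `eWiseAdd` distinction in the table concrete: per the GraphBLAS spec, `eWiseMult` applies ``\otimes`` only on the *intersection* of the two stored patterns, while `eWiseAdd` applies ``\oplus`` on the *union*, passing unmatched entries through. A hedged sketch of that semantics (sparse vectors as plain Python dicts, not the package API):

```python
# Sparse vectors as {index: value} dicts; unstored entries are implicit.
u = {0: 1.0, 2: 2.0}
v = {2: 10.0, 3: 5.0}

def emul(u, v, op):
    """eWiseMult: apply op over the INTERSECTION of stored indices."""
    return {i: op(u[i], v[i]) for i in u.keys() & v.keys()}

def eadd(u, v, op):
    """eWiseAdd: apply op over the UNION; unmatched entries pass through."""
    out = {}
    for i in u.keys() | v.keys():
        if i in u and i in v:
            out[i] = op(u[i], v[i])
        else:
            out[i] = u.get(i, v.get(i))  # whichever side stores i
    return out

print(emul(u, v, lambda a, b: a * b))  # only index 2 is stored in both
print(eadd(u, v, lambda a, b: a + b))  # indices 0, 2, and 3
```

This is why elementwise multiplication of sparse structures produces a (usually) smaller pattern, while elementwise addition produces a larger one.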
## GraphBLAS Operators
Many GraphBLAS operations take additional arguments called *operators*. In the table above operators are denoted by ``\odot``, ``\otimes``, ``\oplus``, and ``f``; they behave similarly to the function argument of `map`. A closer look at operators can be found in [Operators](@ref).
A GraphBLAS operator is a unary or binary function, the commutative monoid form of a binary function,
or a semiring, made up of a binary op and a commutative monoid.
SuiteSparse:GraphBLAS ships with many of the common unary and binary operators as built-ins,
along with monoids and semirings commonly used in graph algorithms.
These built-in operators are *fast*, and should be used where possible. However, users are also free to provide their own functions as operators when necessary.
SuiteSparseGraphBLAS.jl will *mostly* take care of operators behind the scenes, and in most cases users should pass in normal functions like `+` and `sin`. For example:
```@repl intro
emul(A, A, ^) # elementwise exponent
map(sin, A)
```
Broadcasting is also supported: `A .^ A` will lower to `emul(A, A, ^)`, and `sin.(A)` will lower to `map(sin, A)`.
Matrix multiplication, which accepts a semiring, can be called with either `*(max, +)(A, B)` or
`mul(A, B, (max, +))`.
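To see what swapping in a semiring like `(max, +)` actually changes, here is a hedged plain-Python sketch (dense, and not the package API) of matrix multiplication parameterized by an "add" monoid and a "multiply" operator:

```python
def semiring_matmul(A, B, add, mul, identity):
    """Dense C = A*B where scalar +/* are replaced by (add, mul);
    identity is the identity element of the add monoid."""
    n, k, m = len(A), len(B), len(B[0])
    C = [[identity] * m for _ in range(n)]
    for i in range(n):
        for j in range(m):
            acc = identity
            for p in range(k):
                acc = add(acc, mul(A[i][p], B[p][j]))
            C[i][j] = acc
    return C

A = [[1, 2], [3, 4]]
B = [[5, 6], [7, 8]]

# The usual arithmetic (+, *) semiring with identity 0.
arithmetic = semiring_matmul(A, B, lambda a, b: a + b, lambda a, b: a * b, 0)
# A tropical (max, +) semiring with identity -inf: longest single-hop paths.
tropical = semiring_matmul(A, B, max, lambda a, b: a + b, float("-inf"))
```

The `(max, +)` variant is what `*(max, +)(A, B)` or `mul(A, B, (max, +))` computes, on sparse data and with highly optimized kernels.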
We can also use functions that are not already built into SuiteSparseGraphBLAS.jl:
```@repl intro
M = GBMatrix([[1,2] [3,4]])
increment(x) = x + 1
map(increment, M)
```
Unfortunately this has a couple of problems. The first is that it's slow.\
Compared to `M .+ 1`, which lowers to `apply(+, M, 1)`, the `map` call above is ~2.5x slower due to function pointer overhead.
The second is that every time we call `map(increment, M)` we will be re-creating the function pointer for `increment`, matched to the type of `M`.\
To avoid this, there's a convenience macro [`@unop`](@ref), which provides a permanent constant that is reused every time `increment` is called in a GraphBLAS operation. See [Operators](@ref) for more information.
!!! warning "Performance of User Defined Functions"
    Operators which are not already built-in are automatically constructed using function pointers when called.
    Note, however, that their performance is significantly degraded compared to built-in operators,
    and where possible user code should avoid this capability. See [Operators](@ref).
## Example
Here is a quick example of two different methods of triangle counting with GraphBLAS.
The methods are drawn from the LAGraph [repo](https://github.com/GraphBLAS/LAGraph).
Input `A` must be a square, symmetric matrix with any element type.
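As a baseline for what any triangle-counting method should compute (this is a naive plain-Python sketch, not the LAGraph methods or the package API), one can use the classic identity that the number of triangles in an undirected graph equals trace(A³)/6 for a 0/1 symmetric adjacency matrix:

```python
def matmul(A, B):
    """Naive dense matrix multiplication over nested lists."""
    n, k, m = len(A), len(B), len(B[0])
    return [[sum(A[i][p] * B[p][j] for p in range(k)) for j in range(m)]
            for i in range(n)]

def count_triangles(A):
    """Triangles via trace(A^3) / 6; each triangle is counted 6 times
    (3 starting vertices x 2 directions)."""
    A3 = matmul(matmul(A, A), A)
    return sum(A3[i][i] for i in range(len(A))) // 6

# Hypothetical example: a 4-cycle 1-2-3-4-1 with one chord (1-3),
# giving triangles {1,2,3} and {1,3,4} (1-indexed labels).
A = [
    [0, 1, 1, 1],
    [1, 0, 1, 0],
    [1, 1, 0, 1],
    [1, 0, 1, 0],
]
print(count_triangles(A))
```

The LAGraph methods refine this idea with masking and sparsity-aware products so the full A³ is never materialized.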