@@ -148,7 +148,57 @@ class CholeskyCorr(Transform):
148
148
149
149
#### Mathematical Details
150
150
151
- [Include detailed mathematical explanations similar to the original TFP bijector.]
151
+ This bijector provides a change of variables from unconstrained reals to a
152
+ parameterization of the CholeskyLKJ distribution. The CholeskyLKJ distribution
153
+ [1] is a distribution on the set of Cholesky factors of positive definite
154
+ correlation matrices. The CholeskyLKJ probability density function is
155
+ obtained from the LKJ density on n x n matrices as follows:
156
+
157
+ 1 = int p(A | eta) dA
158
+ = int Z(eta) * det(A) ** (eta - 1) dA
159
+ = int Z(eta) L_ii ** {(n - i - 1) + 2 * (eta - 1)} ^dL_ij (0 <= i < j < n)
160
+
161
+ where Z(eta) is the normalizer; the matrix L is the Cholesky factor of the
162
+ correlation matrix A; and ^dL_ij denotes the wedge product (or differential)
163
+ of the strictly lower triangular entries of L. The entries L_ij are
164
+ constrained such that each entry lies in [-1, 1] and the norm of each row is
165
+ 1. The norm includes the diagonal; which is not included in the wedge product.
166
+ To preserve uniqueness, we further specify that the diagonal entries are
167
+ positive.
168
+
169
+ The image of unconstrained reals under the `CorrelationCholesky` bijector is
170
+ the set of correlation matrices which are positive definite. A [correlation
171
+ matrix](https://en.wikipedia.org/wiki/Correlation_and_dependence#Correlation_matrices)
172
+ can be characterized as a symmetric positive semidefinite matrix with 1s on
173
+ the main diagonal.
174
+
175
+ For a lower triangular matrix `L` to be a valid Cholesky-factor of a positive
176
+ definite correlation matrix, it is necessary and sufficient that each row of
177
+ `L` have unit Euclidean norm [1]. To see this, observe that if `L_i` is the
178
+ `i`th row of the Cholesky factor corresponding to the correlation matrix `R`,
179
+ then the `i`th diagonal entry of `R` satisfies:
180
+
181
+ 1 = R_i,i = L_i . L_i = ||L_i||^2
182
+
183
+ where '.' is the dot product of vectors and `||...||` denotes the Euclidean
184
+ norm.
185
+
186
+ Furthermore, observe that `R_i,j` lies in the interval `[-1, 1]`. By the
187
+ Cauchy-Schwarz inequality:
188
+
189
+ |R_i,j| = |L_i . L_j| <= ||L_i|| ||L_j|| = 1
190
+
191
+ This is a consequence of the fact that `R` is symmetric positive definite with
192
+ 1s on the main diagonal.
193
+
194
+ We choose the mapping from x in `R^{m}` to `R^{n^2}` where `m` is the
195
+ `(n - 1)`th triangular number; i.e. `m = 1 + 2 + ... + (n - 1)`.
196
+
197
+ L_ij = x_i,j / s_i (for i < j)
198
+ L_ii = 1 / s_i
199
+
200
+ where s_i = sqrt(1 + x_i,0^2 + x_i,1^2 + ... + x_(i,i-1)^2). We can check that
201
+ the required constraints on the image are satisfied.
152
202
153
203
#### Examples
154
204
0 commit comments