@@ -149,7 +149,57 @@ class CholeskyCorr(Transform):
149
149
150
150
#### Mathematical Details
151
151
152
- [Include detailed mathematical explanations similar to the original TFP bijector.]
152
+ This bijector provides a change of variables from unconstrained reals to a
153
+ parameterization of the CholeskyLKJ distribution. The CholeskyLKJ distribution
154
+ [1] is a distribution on the set of Cholesky factors of positive definite
155
+ correlation matrices. The CholeskyLKJ probability density function is
156
+ obtained from the LKJ density on n x n matrices as follows:
157
+
158
+ 1 = int p(A | eta) dA
159
+ = int Z(eta) * det(A) ** (eta - 1) dA
160
+ = int Z(eta) L_ii ** {(n - i - 1) + 2 * (eta - 1)} ^dL_ij (0 <= i < j < n)
161
+
162
+ where Z(eta) is the normalizer; the matrix L is the Cholesky factor of the
163
+ correlation matrix A; and ^dL_ij denotes the wedge product (or differential)
164
+ of the strictly lower triangular entries of L. The entries L_ij are
165
+ constrained such that each entry lies in [-1, 1] and the norm of each row is
166
+ 1. The norm includes the diagonal; which is not included in the wedge product.
167
+ To preserve uniqueness, we further specify that the diagonal entries are
168
+ positive.
169
+
170
+ The image of unconstrained reals under the `CorrelationCholesky` bijector is
171
+ the set of correlation matrices which are positive definite. A [correlation
172
+ matrix](https://en.wikipedia.org/wiki/Correlation_and_dependence#Correlation_matrices)
173
+ can be characterized as a symmetric positive semidefinite matrix with 1s on
174
+ the main diagonal.
175
+
176
+ For a lower triangular matrix `L` to be a valid Cholesky-factor of a positive
177
+ definite correlation matrix, it is necessary and sufficient that each row of
178
+ `L` have unit Euclidean norm [1]. To see this, observe that if `L_i` is the
179
+ `i`th row of the Cholesky factor corresponding to the correlation matrix `R`,
180
+ then the `i`th diagonal entry of `R` satisfies:
181
+
182
+ 1 = R_i,i = L_i . L_i = ||L_i||^2
183
+
184
+ where '.' is the dot product of vectors and `||...||` denotes the Euclidean
185
+ norm.
186
+
187
+ Furthermore, observe that `R_i,j` lies in the interval `[-1, 1]`. By the
188
+ Cauchy-Schwarz inequality:
189
+
190
+ |R_i,j| = |L_i . L_j| <= ||L_i|| ||L_j|| = 1
191
+
192
+ This is a consequence of the fact that `R` is symmetric positive definite with
193
+ 1s on the main diagonal.
194
+
195
+ We choose the mapping from x in `R^{m}` to `R^{n^2}` where `m` is the
196
+ `(n - 1)`th triangular number; i.e. `m = 1 + 2 + ... + (n - 1)`.
197
+
198
+ L_ij = x_i,j / s_i (for i < j)
199
+ L_ii = 1 / s_i
200
+
201
+ where s_i = sqrt(1 + x_i,0^2 + x_i,1^2 + ... + x_(i,i-1)^2). We can check that
202
+ the required constraints on the image are satisfied.
153
203
154
204
#### Examples
155
205
0 commit comments