You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+18-23Lines changed: 18 additions & 23 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,11 +9,6 @@ _o2plsda_ provides functions to do O2PLS-DA analysis for multiple omics integrat
9
9
In order to avoid overfitting of the model, the optimal number of latent variables for each model structure was estimated using group-balanced MCCV. The package could use the group information when we select the best paramaters with cross-validation. In cross-validation (CV) one minimizes a certain measure of error over some parameters that should be determined a priori. Here, we have three parameters: (nc, nx, ny). A popular measure is the prediction error ||Y - \hat{Y}||, where \hat{Y} is a prediction of Y. In our case the O2PLS method is symmetric in X and Y, so we minimize the sum of the prediction errors:
10
10
||X - \hat{X}||+||Y - \hat{Y}||.
11
11
12
-
And we also calculate the the average Q^2 values:
13
-
14
-
Q^2 = 1 - err / Var_{total};
15
-
16
-
err = Var_{expected} - Var_{estimated};
17
12
18
13
Here nc should be a positive integer, and nx and ny should be non-negative. The best integers are then the minimizers of the prediction error.
19
14
@@ -48,38 +43,38 @@ set.seed(123)
48
43
## ncores : parallel paramaters for large datasets
0 commit comments