You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
The `.Random.seed` state will be loaded at this exact state, and will always be repeated at this state as well (in case `c` is typed in the debugger, or somehow the error is harder to find while walking through the debug mode). Hence, users must type `Q` to exit the debugger after they have better understood the nature of the error message first-hand, after which the `load_seed` input should be removed.
150
150
151
-
# Converting warings to errors explicitly via `manageWarnings()`
151
+
# Converting warnings to errors explicitly via `manageWarnings()`
Copy file name to clipboardExpand all lines: vignettes/HPC-computing.Rmd
+9-4Lines changed: 9 additions & 4 deletions
Original file line number
Diff line number
Diff line change
@@ -13,8 +13,10 @@ output:
13
13
smooth_scroll: false
14
14
vignette: >
15
15
%\VignetteIndexEntry{HPC cluster array jobs (e.g., via Slurm)}
16
-
%\VignetteEngine{knitr::rmarkdown}
17
16
%\VignetteEncoding{UTF-8}
17
+
%\VignetteEngine{knitr::rmarkdown}
18
+
editor_options:
19
+
chunk_output_type: console
18
20
---
19
21
20
22
```{r nomessages, echo = FALSE}
@@ -121,6 +123,7 @@ While generally effective at distributing the computational load, there are a fe
121
123
5) Submitting independent multiple batches to the cluster makes it more difficult to guarantee the quality of the random numbers
122
124
- Setting the `seed` for each condition ensure that within each `design` condition the random numbers are high quality, however there is no guarantee that repeated use of `set.seed()` will result in high-quality random numbers (see next section for example)
123
125
- Hence, repeated job submissions of this type, even with unique seeds per condition, may not generate high quality numbers if repeated too many times (alternative is to isolate each `design` row and submit each row as a unique job, which is demonstrated near the end of this vignette)
126
+
- Note that there are workarounds with more recent `runSimulation()` implementations when combined with `expandDesign()` (see below), however given the nature of the complete job submission these remain less ideal
124
127
6) Finally, and perhaps most problematic in simulation experiment applications, schedulers frequently cap the maximum number of resources that can be requested (e.g., 256 GB of RAM, 200 CPUs), which limits the application of large RAM and CPU jobs
125
128
- Note that to avoid wasting time by swapping/paging, schedulers will never allocate jobs whose memory requirements exceed the amount of available memory
The above approach assumes that each `design` condition is equally balanced in terms of computing time and resources, though if this is not the case (e.g., the last condition contains notably higher computing times than the first two conditions) then `repeat_conditions` can be specified as a vector instead, such as `repeat_conditions = c(100, 100, 1000)`, which for the latter portion would be associated with a 10 replications per distributed node instead of 100.
Copy file name to clipboardExpand all lines: vignettes/SimDesign-intro.Rmd
+12-1Lines changed: 12 additions & 1 deletion
Original file line number
Diff line number
Diff line change
@@ -214,7 +214,18 @@ As can be seen from the printed results from the `res` object, each result from
214
214
respective condition, meta-statistics have been properly named, and three additional columns have been appended
215
215
to the results: `REPLICATIONS`, which indicates how many time the conditions were performed, `SIM_TIME`, indicating
216
216
the time (in seconds) it took to completely finish the respective conditions, and `SEED`, which indicates the random seeds used by `SimDesign` for each condition (for reproducibility). A call to `View()` in the
217
-
R console may also be a nice way to sift through the `res` object.
217
+
R console may also be a nice way to sift through the `res` object, while for the
218
+
`results` object one can use functions like `descript()` and verbs from the `dplyr` package to get a better understanding of the distributions.
219
+
220
+
```{r}
221
+
# summary statistics for complete results
222
+
descript(results)
223
+
224
+
# conditional summary statistics using dplyr verbs
0 commit comments