You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: vignettes/sampleSize_parallel_2A3E.Rmd
+18-18Lines changed: 18 additions & 18 deletions
Original file line number
Diff line number
Diff line change
@@ -24,9 +24,9 @@ doc.cache <- T #for cran; change to F
24
24
25
25
# Introduction
26
26
27
-
In many studies, it is necessary to evaluate equivalence across multiple primary variables. For instance, the European Medicines Agency (EMA) recommends demonstrating equivalence for both **Area Under the Curve** (AUC) and **maximum concentration** (Cmax) when assessing pharmacokinetic properties. This vignette presents advanced techniques for calculating sample size in parallel trial designs involving three treatment arms and two endpoints.
27
+
In many studies, the aim is to evaluate equivalence across multiple primary endpoints. The European Medicines Agency (EMA) recommends demonstrating bioequivalence for both **Area Under the Curve** (AUC) and **maximum concentration** (Cmax) when assessing pharmacokinetic properties. This vignette presents advanced techniques for calculating sample size in parallel trial designs involving three treatment arms and two endpoints.
28
28
29
-
As an illustrative example, we consider published data from the phase-1 trial [NCT01922336](https://clinicaltrials.gov/study/NCT01922336#study-overview). This trial assessed the pharmacokinetics of SB2 compared to its EU-sourced reference product (EU_Remicade). The following outcomes were reported following a single dose of SB2 or its EU reference product [@shin_randomized_2015]:
29
+
As an illustrative example, we consider published data from the phase-1 trial [NCT01922336](https://clinicaltrials.gov/study/NCT01922336#study-overview). This trial measured the pharmacokinetics (PK) of SB2 compared to its EU-sourced reference product (EU_Remicade). The following PK measures were reported following a single dose of SB2 or its EU reference product Remicade[@shin_randomized_2015]:
30
30
31
31
```{r, echo=FALSE}
32
32
data <- data.frame("PK measure" = c("AUCinf ($\\mu$g*h/mL)","AUClast ($\\mu$g*h/mL)","Cmax ($\\mu$g/mL)"),
In the sections below, we examine various strategies for determining the sample size required for a parallel trial to establish equivalence across three co-primary endpoints. These strategies are based on the Ratio of Means (ROM) approach, using equivalence bounds set between 80% and 125%. Additionally, we demonstrate how the test can be adjusted to establish equivalence for at least $k=1$ primary endpoint, providing flexibility in meeting equivalence criteria.
42
+
In the sections below, we describe various strategies for determining the sample size required for a parallel trial to establish equivalence across three co-primary endpoints. These strategies are based on the Ratio of Means (ROM) approach with equivalence bounds set between 80% and 125%. Additionally, we show how this approach can be adjusted to demonstrate equivalence for at least $k=1$ of $m=3$ co-primary endpoints, providing flexibility in meeting equivalence criteria.
43
43
44
44
# Independent Testing of Co-Primary Endpoints
45
-
A conservative approach to sample size calculation involves testing each pharmacokinetic (PK) measure independently. This method assumes that the endpoints are uncorrelated and that equivalence must be demonstrated for each endpoint separately. Consequently, the overall sample size required for the trial is the sum of the sample sizes for each PK measure.
45
+
A conservative approach to sample size calculation involves testing each pharmacokinetic (PK) measure independently. This approach assumes that endpoints are uncorrelated and that equivalence is to be demonstrated for each endpoint separately. Consequently, the overall sample size required for the trial is the sum of the sample sizes calculated for each PK measure separately.
46
46
47
47
```{r}
48
48
library(SimTOST)
@@ -91,18 +91,18 @@ library(SimTOST)
91
91
))
92
92
```
93
93
94
-
If we were to test each PK measure independently, we would find a total sample size of`r sim_AUCinf$response$n_total` for AUCinf, `r sim_AUClast$response$n_total` for AUClast, and `r sim_Cmax$response$n_total` for Cmax. This means that we would have to enroll `r sim_AUCinf$response$n_total` + `r sim_AUClast$response$n_total` + `r sim_Cmax$response$n_total` = `r sim_AUCinf$response$n_total + sim_AUClast$response$n_total + sim_Cmax$response$n_total` patients in order to reject $H_0$ at a significance level of 5\%. For context, the original trial was a randomized, single-blind, three-arm, parallel-group study conducted in 159 healthy subjects, slightly more than the `r sim_AUCinf$response$n_total + sim_AUClast$response$n_total + sim_Cmax$response$n_total` patients estimated as necessary. This suggests that the original trial had a small buffer above the calculated sample size requirements.
94
+
When testing each PK measure independently, the total sample size is`r sim_AUCinf$response$n_total` for AUCinf, `r sim_AUClast$response$n_total` for AUClast, and `r sim_Cmax$response$n_total` for Cmax. This means that we would have to enroll `r sim_AUCinf$response$n_total` + `r sim_AUClast$response$n_total` + `r sim_Cmax$response$n_total` = `r sim_AUCinf$response$n_total + sim_AUClast$response$n_total + sim_Cmax$response$n_total` patients in order to reject $H_0$ at a significance level of 5\%. For context, the original trial was a randomized, single-blind, three-arm, parallel-group study conducted in 159 healthy subjects, slightly more than the `r sim_AUCinf$response$n_total + sim_AUClast$response$n_total + sim_Cmax$response$n_total` patients estimated to be necessary. This suggests that the original trial had a small buffer above the calculated sample size requirements.
95
95
96
96
# Simultaneous Testing of Independent Co-Primary Endpoints
97
-
This approach focuses on simultaneous testing of pharmacokinetic (PK) measures while assuming independence between endpoints. Unlike the previous approach, which evaluated each PK measure independently, this method integrates comparisons across multiple endpoints, accounting for correlations (or lack thereof) between them. By doing so, it enables simultaneous testing for equivalence without inflating the overall Type I error rate.
97
+
This approach focuses on simultaneous testing of PK measures while assuming independence between endpoints. Unlike the previous approach, which tested each PK measure independently, this method integrates comparisons across multiple endpoints, accounting for correlations (or lack thereof) between them, thus enabling simultaneous testing for equivalence without inflating the overall Type I error rate.
98
98
99
99
## Key Assumptions
100
100
In the calculations below, the following assumptions are made:
101
101
102
102
* Hypothesis Testing Approach: Ratio of Means (ROM)
103
-
* Design: A parallel trial design
103
+
* Design: Parallel trial design
104
104
* Distribution: PK measures follow a log-normal distribution.
105
-
* Standard Deviation: All treatments share a common standard deviation for each endpoint
105
+
* Standard Deviation: All treatments share a common standard deviation for each endpoint
106
106
* Multiplicity: No multiplicity adjustments are applied.
107
107
* Equivalence Criterion: Equivalence is required for all $k=m=3$ endpoints.
108
108
* Independence: All endpoints are assumed to be uncorrelated, specified by setting the correlation parameter to $\rho=0$.
@@ -123,7 +123,7 @@ sigma_list <- list(
123
123
```
124
124
125
125
## Equivalence Boundaries
126
-
Since we are comparing multiple co-primary endpoints, it is essential to define the lower and upper equivalence boundaries for each endpoint. These boundaries determine the acceptable range for the Ratio of Means (ROM) within which equivalence is established.
126
+
It is essential to define the lower and upper equivalence boundaries for each co-primary endpoint. These boundaries determine the acceptable range for the Ratio of Means (ROM) within which equivalence is established.
127
127
128
128
For simplicity, the same equivalence boundaries are applied to all endpoints:
By default, it is required that all $k=m$ co-primary endpoints have to be equivalent:
137
+
By default, all $k=m$ co-primary endpoints are required to be equivalent.
138
138
139
139
## Computing Sample Size
140
140
@@ -155,15 +155,15 @@ By default, it is required that all $k=m$ co-primary endpoints have to be equiva
155
155
seed = 1234))
156
156
```
157
157
158
-
We can inspect more detailed sample size requirements as follows:
158
+
We can inspect the sample size requirements in more detail as follows:
159
159
160
160
```{r}
161
161
N_ss$response
162
162
```
163
163
164
164
# Simultaneous Testing of Correlated Co-Primary Endpoints
165
165
166
-
Incorporating the correlation among endpoints into power and sample size calculations for co-primary continuous endpoints offers significant advantages.[@sozu_sample_2015] Without accounting for correlation, adding more endpoints typically reduces the power. However, by including positive correlations in the calculations, power can be increased, and required sample sizes may be reduced.
166
+
Incorporating the correlations between endpoints in sample size calculations for continuous-valued co-primary endpoints offers significant advantages [@sozu_sample_2015]. Adding more endpoints typically reduces power if such correlations are not accounted for. However, by including positive correlations in the calculations, power can be increased, and the required sample sizes may consequently be reduced.
167
167
168
168
For this analysis, we proceed with the same values used previously but now assume that a correlation exists between endpoints. Specifically, we set $\rho = 0.6$, assuming a common correlation across all endpoints.
169
169
@@ -187,10 +187,10 @@ If correlations differ between endpoints, they can be specified individually usi
187
187
seed = 1234))
188
188
```
189
189
190
-
Referring to the output above, the required sample size for this setting is `r N_mult_corr$response$n_total`. This is `r N_ss$response$n_total - N_mult_corr$response$n_total` fewer patients than the scenario where the endpoints are assumed to be uncorrelated.
190
+
The required total sample size for this example is `r N_mult_corr$response$n_total`. This is `r N_ss$response$n_total - N_mult_corr$response$n_total` fewer patients than the scenario in which endpoints are assumed to be uncorrelated.
191
191
192
-
# Simultaneous Testing of Correlated Primary Endpoints
193
-
Consider now we are interested in demonstrating equivalence for at least $k=1$ of the $m=3$ primary endpoints. Unlike the previous cases, where equivalence was required for all endpoints, this setting necessitates an adjustment for multiplicity to control the family-wise error rate.
192
+
# Simultaneous Testing of Correlated Co-Primary Endpoints
193
+
Imagine that we are interested in demonstrating equivalence for at least $k=1$ of the $m=3$ co-primary endpoints. Unlike the previous cases, in which equivalence was required for all endpoints, this scenario requires an adjustment for multiplicity (Bonferroni correction) to control the family-wise error rate.
194
194
195
195
```{r}
196
196
(N_mp_bon <- sampleSize(
@@ -213,7 +213,7 @@ Consider now we are interested in demonstrating equivalence for at least $k=1$ o
213
213
seed = 1234 # Random seed for reproducibility
214
214
))
215
215
```
216
-
As mentioned in [the Introduction](../articles/intopkg.html), the Bonferroni correction is often overly conservative, especially in scenarios with correlated tests. A less restrictive alternative is the *k*-adjustment, which specifically accounts for the number of tests and the number of endpoints required for equivalence.
216
+
As mentioned in [the Introduction](../articles/intopkg.html), Bonferroni adjustment is often overly conservative, especially in scenarios with correlated tests. A less restrictive alternative is the *k*-adjustment, which specifically accounts for the number of tests and the number of endpoints required for equivalence.
217
217
218
218
```{r}
219
219
(N_mp_k <- sampleSize(
@@ -237,9 +237,9 @@ As mentioned in [the Introduction](../articles/intopkg.html), the Bonferroni cor
237
237
))
238
238
```
239
239
240
-
A more advanced testing approach involves Sequential Adjustment, where the significance level is adjusted separately for groups of endpoints (primary and secondary). A Bonferroni adjustment is applied to the primary endpoints based on their number. If the null hypothesis for the primary endpoints is rejected, testing proceeds sequentially to the secondary endpoints, which are also Bonferroni-adjusted based on the number of secondary endpoints.
240
+
A more advanced testing approach involves sequential adjustment, in which the significance level is adjusted separately for groups of endpoints (i.e., primary and secondary). A Bonferroni adjustment is first applied to the group of primary endpoints, based on the number of primary endpoints. If the null hypothesis for this group is rejected, testing proceeds sequentially to the group secondary endpoints, which are also Bonferroni-adjusted based on the number of secondary endpoints.
241
241
242
-
In this example, the sequential adjustment can be implemented by setting the function argument to `adjust = "seq"` and specifying the type of each endpoint through the `type_y` parameter. For instance, we define "AUCinf" as a primary endpoint, while "AUClast" and "Cmax" are designated as secondary endpoints.
242
+
In the below example, sequential adjustment can be implemented by setting the argument `adjust = "seq"` and for each endpoint specifying its type using the `type_y` parameter. For instance, we define "AUCinf" as a primary endpoint, while "AUClast" and "Cmax" are designated as secondary endpoints.
0 commit comments