You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: vignettes/sampleSize_parallel_2A3E.Rmd
+38-7Lines changed: 38 additions & 7 deletions
Original file line number
Diff line number
Diff line change
@@ -64,12 +64,12 @@ For both endpoints, the analysis assumes that:
64
64
65
65
To evaluate bioequivalence, we apply the $80\%/125\%$ rule, which defines equivalence bounds relative to the reference mean. The evaluation is conducted using a one-sided significance level of 5\%, with a target statistical power of 90\%.
66
66
67
-
# Testing the Difference of Means (DOM) for multiple Co-primary Endpoints
68
-
In biosimilar development, it is important to demonstrate equivalence across all relevant doses, routes of administration, patient populations, and endpoints. To establish equivalence between two treatments, the difference in means for each endpoint, $\mu_{T}^{(j)} - \mu_{R}^{(j)}$, must lie within a predefined equivalence margin around zero for all primary endpoints.
69
67
70
68
## Hypotheses
71
69
The null and alternative hypotheses for the equivalence test are as follows:
72
70
71
+
### Difference of Means (DOM)
72
+
73
73
Null Hypothesis ($H_0$): At least one endpoint does not meet the equivalence criteria:
74
74
75
75
$$H_0: \mu_T^{(j)} - \mu_R^{(j)} \le E_L ~~ \text{or}~~ \mu_T^{(j)} - \mu_R^{(j)} \ge E_U \quad \text{for at least one}\;j$$
@@ -78,14 +78,39 @@ Alternative Hypothesis ($H_1$): All endpoints meet the equivalence criteria:
The null hypothesis ($H_0$) is rejected if, and only if, all null hypotheses associated with the $K$ primary endpoints are rejected at a significance level of $\alpha$. This ensures that equivalence is established across all endpoints simultaneously.
81
+
The null hypothesis ($H_0$) is rejected if, and only if, all null hypotheses associated with the $K$ primary endpoints are rejected at a significance level of $\alpha$. This ensures that equivalence is established across all endpoints simultaneously.
82
+
83
+
### Ratio of Means (ROM)
84
+
The equivalence hypotheses can also be expressed as a Ratio of Means (ROM), which is often used in bioequivalence studies:
85
+
86
+
Null Hypothesis ($H_0$): At least one endpoint does not meet the equivalence criteria:
87
+
88
+
$$H_0: \frac{\mu_T^{(j)}}{\mu_R^{(j)}} \le \log(E_L) ~~ \text{or}~~ \frac{\mu_T^{(j)}}{\mu_R^{(j)}} \ge \log(E_U) \quad \text{for at least one}\;j$$
89
+
90
+
Alternative Hypothesis ($H_1$): All endpoints meet the equivalence criteria:
***Type I Error Control**: Since rejection of $H_0$ requires all individual null hypotheses to be rejected, there is no need for multiplicity adjustments. The Type I error rate is controlled by the design.
86
-
***Impact on Power**: Requiring equivalence across multiple endpoints decreases the overall power of the test. The Type II error increases as the number of primary endpoints ($K$) grows, which can make equivalence testing more challenging [@mielke_sample_2018].
97
+
### Consistency Across Endpoints
98
+
For equivalence to be established, all primary endpoints must simultaneously satisfy the equivalence criteria. This applies whether the criteria are expressed as:
87
99
88
-
## Independent Testing of Pharmacokinetic (PK) Measures
100
+
* The Difference of Means (DOM) approach measures absolute differences between treatment means.
101
+
* The Ratio of Means (ROM) approach captures relative differences and is commonly used when analyzing log-transformed data, such as in pharmacokinetic studies.
102
+
103
+
### Type I Error Control
104
+
Rejection of the null hypothesis ($H_0$) requires that all individual null hypotheses across endpoints be rejected. Since the test is designed to achieve equivalence simultaneously for all endpoints, there is no need for multiplicity adjustments, and the Type I error rate is controlled by the study design.
105
+
106
+
### Impact on Power
107
+
Requiring equivalence across multiple endpoints reduces the overall power of the test. Specifically:
108
+
109
+
* The Type II error increases as the number of primary endpoints ($K$) grows.
110
+
* This makes equivalence testing more challenging for studies with multiple endpoints, as additional endpoints require larger sample sizes or stronger effect sizes to achieve sufficient power [@mielke_sample_2018].
111
+
112
+
113
+
## Independent Testing of PK Measures
89
114
If each pharmacokinetic (PK) measure is tested independently, the following sample sizes would be required for each endpoint to achieve a 5\% significance level:
90
115
91
116
```{r}
@@ -135,8 +160,14 @@ library(SimTOST)
135
160
))
136
161
```
137
162
163
+
If we were to test each PK measure independently, we would find a total sample size of `r sim_AUCinf$response$n_total` for AUCinf, `r sim_AUClast$response$n_total` for AUClast, and `r sim_Cmax$response$n_total` for Cmax. This means that we would have to enroll `r sim_AUCinf$response$n_total` + `r sim_AUClast$response$n_total` + `r sim_Cmax$response$n_total` = `r sim_AUCinf$response$n_total + sim_AUClast$response$n_total + sim_Cmax$response$n_total` patients in order to reject $H_0$ at a significance level of 5\%. For context, the original trial was a randomized, single-blind, three-arm, parallel-group study conducted in 159 healthy subjects, slightly more than the `r sim_AUCinf$response$n_total + sim_AUClast$response$n_total + sim_Cmax$response$n_total` patients estimated as necessary. This suggests that the original trial had a small buffer above the calculated sample size requirements.
164
+
165
+
## Simultaneous Testing of PK Measures with Independent Endpoints
166
+
This approach focuses on simultaneous testing of pharmacokinetic (PK) measures while assuming independence between endpoints. Unlike the previous approach, which evaluated each PK measure independently, this method integrates comparisons across multiple endpoints, accounting for correlations (or lack thereof) between them. By doing so, it enables simultaneous testing for equivalence without inflating the overall Type I error rate.
167
+
168
+
In this setting, equivalence is required for at least one endpoint rather than all endpoints, reducing the overall sample size compared to independent testing. Furthermore, this approach allows for greater flexibility by enabling users to specify correlation structures or work with uncorrelated endpoints as a default assumption.
169
+
138
170
139
-
If we were to test each PK measure independently, we would find a total sample size of `r sim_AUCinf$response$n_total` for AUCinf, `r sim_AUClast$response$n_total` for AUClast, and `r sim_Cmax$response$n_total` for Cmax. This means that we would have to enroll `r sim_AUCinf$response$n_total` + `r sim_AUClast$response$n_total` + `r sim_Cmax$response$n_total` = `r sim_AUCinf$response$n_total + sim_AUClast$response$n_total + sim_Cmax$response$n_total`$ patients in order to reject $H_0$ at a significance level of 5\%.
0 commit comments