You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: vignettes/intopkg.Rmd
+71Lines changed: 71 additions & 0 deletions
Original file line number
Diff line number
Diff line change
@@ -17,3 +17,74 @@ knitr::opts_chunk$set(
17
17
```{r setup}
18
18
library(SimTOST)
19
19
```
20
+
21
+
22
+
Methodology and Assumptions
23
+
24
+
25
+
# Hypotheses
26
+
The null and alternative hypotheses for the equivalence test are as follows:
27
+
28
+
## Difference of Means (DOM)
29
+
A common approach to assessing bioequivalence involves comparing the log-transformed pharmacokinetic (PK) measures between the test and reference products. This is done using the following interval (null) hypothesis:
30
+
31
+
Null Hypothesis ($H_0$): At least one endpoint does not meet the equivalence criteria:
32
+
33
+
$$H_0: m_T^{(j)} - m_R^{(j)} \le \delta_L ~~ \text{or}~~ m_T^{(j)} - m_R^{(j)} \ge \delta_U \quad \text{for at least one}\;j$$
34
+
35
+
Alternative Hypothesis ($H_1$): All endpoints meet the equivalence criteria:
Here, $m_T$ and $m_R$ represent the logarithmically transformed mean responses of the test product (the proposed biosimilar) and the reference product, respectively. The equivalence limits, $\delta_L$ and $\delta_u$, are typically chosen to be symmetric, such that $\delta = - \delta_L = \delta_U$. The FDA further recommends that the equivalence acceptance criterion (EAC) be defined as $\delta = EAC = 1.5 \sigma_R$, where $\sigma_R$ represents the variability of the log-transformed endpoint for the reference product.
40
+
41
+
The null hypothesis ($H_0$) is rejected if, and only if, all null hypotheses associated with the $K$ primary endpoints are rejected at a significance level of $\alpha$. This ensures that equivalence is established across all endpoints simultaneously.
42
+
43
+
44
+
## Ratio of Means (ROM)
45
+
The equivalence hypotheses can also be expressed as a Ratio of Means (ROM):
46
+
47
+
Null Hypothesis ($H_0$): At least one endpoint does not meet the equivalence criteria:
48
+
49
+
$$H_0: \frac{\mu_T^{(j)}}{\mu_R^{(j)}} \le E_L ~~ \text{or}~~ \frac{\mu_T^{(j)}}{\mu_R^{(j)}} \ge E_U \quad \text{for at least one}\;j$$
50
+
51
+
Alternative Hypothesis ($H_1$): All endpoints meet the equivalence criteria:
Here, $\mu_T$ and $\mu_R$ represent the arithmetic mean responses of the test product (the proposed biosimilar) and the reference product, respectively.
56
+
57
+
# Log-Transformation and Parameter Adjustments in sampleSize()
58
+
In [sampleSize()](../reference/sampleSize.html), Ratio of Means (ROM) tests are converted to Difference of Means (DOM) tests by log-transforming the data. Equivalence limits are applied to the log-transformed data, and the results are back-transformed to the original scale for interpretation. This approach leverages the log-normal distribution of pharmacokinetic (PK) measures like AUC and Cmax.
59
+
60
+
## Logarithmic Mean
61
+
The logarithmic mean is derived from the provided `mu_list` (arithmetic means) and `sigma_list` (variances) using the following formula:
This formula adjusts the arithmetic mean to account for the skewness of log-normal data, ensuring that the central tendency on the log scale aligns with the transformed data.
65
+
66
+
## Logarithmic Variance Transformation
67
+
To fully operate within the log-normal framework, the variances on the original scale (`sigma_list`) must also be transformed. The variance on the log scale is calculated using the normalized variance formula:
For equivalence to be established, all primary endpoints must simultaneously satisfy the equivalence criteria. This applies whether the criteria are expressed as:
77
+
78
+
* The Difference of Means (DOM) approach measures absolute differences between treatment means.
79
+
* The Ratio of Means (ROM) approach captures relative differences and is commonly used when analyzing log-transformed data, such as in pharmacokinetic studies.
80
+
81
+
## Type I Error Control
82
+
Rejection of the null hypothesis ($H_0$) requires that all individual null hypotheses across endpoints be rejected. Since the test is designed to achieve equivalence simultaneously for all endpoints, there is no need for multiplicity adjustments, and the Type I error rate is controlled by the study design.
83
+
84
+
## Impact on Power
85
+
Requiring equivalence across multiple endpoints reduces the overall power of the test. Specifically:
86
+
87
+
* The Type II error increases as the number of primary endpoints ($K$) grows.
88
+
* This makes equivalence testing more challenging for studies with multiple endpoints, as additional endpoints require larger sample sizes or stronger effect sizes to achieve sufficient power [@mielke_sample_2018].
In the sections below, we explore various strategies for determining the sample size required for a parallel trial to demonstrate equivalence across the three co-primary endpoints. These strategies are based on the Ratio of Means (ROM) approach, with equivalence bounds set between 80\%and 125\%.
54
54
55
-
The bioequivalence analysis focuses on two key pharmacokinetic endpoints:
56
-
57
-
* AUCinf: Area Under the Curve (infinity)
58
-
* Cmax: Maximum concentration
59
-
60
-
For both endpoints, the analysis assumes that:
61
-
62
-
* Summary data (e.g., mean and standard deviation) are available on the original scale.
63
-
* These data are provided for each treatment arm.
64
-
65
-
To evaluate bioequivalence, we apply the $80\%/125\%$ rule, which defines equivalence bounds relative to the reference mean. The evaluation is conducted using a one-sided significance level of 5\%, with a target statistical power of 90\%.
66
-
67
-
68
-
## Hypotheses
69
-
The null and alternative hypotheses for the equivalence test are as follows:
70
-
71
-
### Difference of Means (DOM)
72
-
73
-
Null Hypothesis ($H_0$): At least one endpoint does not meet the equivalence criteria:
74
-
75
-
$$H_0: \mu_T^{(j)} - \mu_R^{(j)} \le E_L ~~ \text{or}~~ \mu_T^{(j)} - \mu_R^{(j)} \ge E_U \quad \text{for at least one}\;j$$
76
-
77
-
Alternative Hypothesis ($H_1$): All endpoints meet the equivalence criteria:
The null hypothesis ($H_0$) is rejected if, and only if, all null hypotheses associated with the $K$ primary endpoints are rejected at a significance level of $\alpha$. This ensures that equivalence is established across all endpoints simultaneously.
82
-
83
-
### Ratio of Means (ROM)
84
-
The equivalence hypotheses can also be expressed as a Ratio of Means (ROM), which is often used in bioequivalence studies:
85
-
86
-
Null Hypothesis ($H_0$): At least one endpoint does not meet the equivalence criteria:
87
-
88
-
$$H_0: \frac{\mu_T^{(j)}}{\mu_R^{(j)}} \le \log(E_L) ~~ \text{or}~~ \frac{\mu_T^{(j)}}{\mu_R^{(j)}} \ge \log(E_U) \quad \text{for at least one}\;j$$
89
-
90
-
Alternative Hypothesis ($H_1$): All endpoints meet the equivalence criteria:
For equivalence to be established, all primary endpoints must simultaneously satisfy the equivalence criteria. This applies whether the criteria are expressed as:
99
-
100
-
* The Difference of Means (DOM) approach measures absolute differences between treatment means.
101
-
* The Ratio of Means (ROM) approach captures relative differences and is commonly used when analyzing log-transformed data, such as in pharmacokinetic studies.
102
-
103
-
### Type I Error Control
104
-
Rejection of the null hypothesis ($H_0$) requires that all individual null hypotheses across endpoints be rejected. Since the test is designed to achieve equivalence simultaneously for all endpoints, there is no need for multiplicity adjustments, and the Type I error rate is controlled by the study design.
105
-
106
-
### Impact on Power
107
-
Requiring equivalence across multiple endpoints reduces the overall power of the test. Specifically:
108
-
109
-
* The Type II error increases as the number of primary endpoints ($K$) grows.
110
-
* This makes equivalence testing more challenging for studies with multiple endpoints, as additional endpoints require larger sample sizes or stronger effect sizes to achieve sufficient power [@mielke_sample_2018].
111
-
112
-
113
-
## Independent Testing of PK Measures
114
-
If each pharmacokinetic (PK) measure is tested independently, the following sample sizes would be required for each endpoint to achieve a 5\% significance level:
55
+
# Independent Testing of PK Measures
56
+
A conservative approach to sample size calculation involves testing each pharmacokinetic (PK) measure independently. This method assumes that the endpoints are uncorrelated and that equivalence must be demonstrated for each endpoint separately. Consequently, the overall sample size required for the trial is the sum of the sample sizes for each PK measure.
115
57
116
58
```{r}
117
59
library(SimTOST)
@@ -162,9 +104,12 @@ library(SimTOST)
162
104
163
105
If we were to test each PK measure independently, we would find a total sample size of `r sim_AUCinf$response$n_total` for AUCinf, `r sim_AUClast$response$n_total` for AUClast, and `r sim_Cmax$response$n_total` for Cmax. This means that we would have to enroll `r sim_AUCinf$response$n_total` + `r sim_AUClast$response$n_total` + `r sim_Cmax$response$n_total` = `r sim_AUCinf$response$n_total + sim_AUClast$response$n_total + sim_Cmax$response$n_total` patients in order to reject $H_0$ at a significance level of 5\%. For context, the original trial was a randomized, single-blind, three-arm, parallel-group study conducted in 159 healthy subjects, slightly more than the `r sim_AUCinf$response$n_total + sim_AUClast$response$n_total + sim_Cmax$response$n_total` patients estimated as necessary. This suggests that the original trial had a small buffer above the calculated sample size requirements.
164
106
165
-
##Simultaneous Testing of PK Measures with Independent Endpoints
107
+
# Simultaneous Testing of PK Measures with Independent Endpoints
166
108
This approach focuses on simultaneous testing of pharmacokinetic (PK) measures while assuming independence between endpoints. Unlike the previous approach, which evaluated each PK measure independently, this method integrates comparisons across multiple endpoints, accounting for correlations (or lack thereof) between them. By doing so, it enables simultaneous testing for equivalence without inflating the overall Type I error rate.
167
109
110
+
111
+
112
+
168
113
In this setting, equivalence is required for at least one endpoint rather than all endpoints, reducing the overall sample size compared to independent testing. Furthermore, this approach allows for greater flexibility by enabling users to specify correlation structures or work with uncorrelated endpoints as a default assumption.
0 commit comments