You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Here, $\mu_T$ and $\mu_R$ represent the arithmetic mean responses of the test product (the proposed biosimilar) and the reference product, respectively.
56
60
61
+
## Regulatory Requirements
62
+
When evaluating bioequivalence, certain statistical and methodological requirements must be adhered to, as outlined in the European Medicines Agency's bioequivalence guidelines [@CHMP2010]. These requirements ensure that the test and reference products meet predefined criteria for equivalence in terms of pharmacokinetic parameters. The key considerations are summarized below:
63
+
64
+
* Hypothesis testing should be based on the ratio of the population geometric means
65
+
* The 90% confidence interval for the ratio of the test and reference products should be contained within the acceptance interval of 80.00 to 125.00%.
66
+
* A margin of clinical equivalence ($\Delta$) is chosen by defining the largest difference that is clinically acceptable, so that a difference bigger than this would matter in practice.
67
+
* The data should be transformed prior to analysis using a logarithmic transformation and subsequently be analyzed using ANOVA
68
+
69
+
57
70
# Log-Transformation and Parameter Adjustments in sampleSize()
58
71
In [sampleSize()](../reference/sampleSize.html), Ratio of Means (ROM) tests are converted to Difference of Means (DOM) tests by log-transforming the data. Equivalence limits are applied to the log-transformed data, and the results are back-transformed to the original scale for interpretation. This approach leverages the log-normal distribution of pharmacokinetic (PK) measures like AUC and Cmax.
59
72
@@ -70,21 +83,22 @@ To fully operate within the log-normal framework, the variances on the original
It is often required to investigate equivalence for more than one primary variable. [@sozu_sample_2015] For example, EMA recommends showing equivalence both for AUC and Cmax
88
+
89
+
A decision must be made as to whether it is desirable to
90
+
91
+
* Demonstrate equivalence for all primary endpoints: most common setting (also known as **multiple co-primary endpoints**)
92
+
* Demonstrate equivalence for at least one of the primary endpoints (also known as **multiple primary endpoints**)
93
+
74
94
75
-
## Consistency Across Endpoints
76
-
For equivalence to be established, all primary endpoints must simultaneously satisfy the equivalence criteria. This applies whether the criteria are expressed as:
95
+
## Multiplicity
77
96
78
-
* The Difference of Means (DOM) approach measures absolute differences between treatment means.
79
-
* The Ratio of Means (ROM) approach captures relative differences and is commonly used when analyzing log-transformed data, such as in pharmacokinetic studies.
97
+
When a trial aims to evaluate the joint effects across all $m$ co-primary endpoints [@sozu_sample_2015], no multiplicity adjustment is required to control the Type I error rate, as all null hypotheses must be rejected to establish equivalence. However, as the number of endpoints ($K$) increases, the Type II error rate also increases. [@mielke_sample_2018] This leads to the following implications:
80
98
81
-
## Type I Error Control
82
-
Rejection of the null hypothesis ($H_0$) requires that all individual null hypotheses across endpoints be rejected. Since the test is designed to achieve equivalence simultaneously for all endpoints, there is no need for multiplicity adjustments, and the Type I error rate is controlled by the study design.
99
+
* The power to detect equivalence decreases for a fixed sample size.
100
+
* The probability of trial success is reduced as more endpoints are evaluated simultaneously.
83
101
84
-
## Impact on Power
85
-
Requiring equivalence across multiple endpoints reduces the overall power of the test. Specifically:
86
102
87
-
* The Type II error increases as the number of primary endpoints ($K$) grows.
88
-
* This makes equivalence testing more challenging for studies with multiple endpoints, as additional endpoints require larger sample sizes or stronger effect sizes to achieve sufficient power [@mielke_sample_2018].
0 commit comments