Helmut ★★★ Vienna, Austria, 2024-08-26 14:52 (23 d 21:36 ago) Posting: # 24160 Views: 794 |
|
To whom it may concern. With ICH M13C on the horizon (initial work is expected to start in December 2024), a publication [1] come to my attention. A great review outlining the current approaches and conditions in various jurisdictions. However, I think that the authors erred in one specific case, which I give below in part. I changed only the numbering of references and added two [5, 9] which are missing in this section of the discussion (pages 14–15 of the PDF).
I became interested in adaptive designs for bioequivalence almost thirty years ago [22, 23]. It is disheartening to observe the lack of advancement and the prevalence of misinterpretations of methods [24].
-scripts:
— Dif-tor heh smusma 🖖🏼 Довге життя Україна! Helmut Schütz The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
Achievwin ★★ US, 2024-08-28 04:16 (22 d 08:12 ago) @ Helmut Posting: # 24166 Views: 492 |
|
I feel your pain, just curious if the two stage design went beyond 2x2 crossover or we are stuck in 1990s time? Any example how two stage adaptive design is applied/validated for parallel study or replicate design? In case of parallel design what kind of statistical factor we need to use Potvin or Bonferroni and same for 4-period or 3-period replicate design? |
Helmut ★★★ Vienna, Austria, 2024-08-28 12:54 (21 d 23:34 ago) @ Achievwin Posting: # 24168 Views: 469 |
|
Hi Achievwin, ❝ I feel your pain… ❝ Any example how two stage adaptive design is applied/validated for parallel study or replicate design? However, if you want something else, you need own simulations to find a suitable adjusted α. Optionally you can also explore a futility criterion for the maximum total sample size. Not complicated in the -package Power2Stage .Hint: In all TSDs the maximum inflation of the Type I Error occurs at a combination of low CV and small n1. Therefore, explore this area first. Once you found a suitable adjusted α, simulate power and the empiric Type I Error for the entire grid. Regulators will ask you for that. For parallel TSDs there are two functions in Power2Stage , namely power.tsd.pAF() and power.tsd.p() :
❝ In case of parallel design what kind of statistical factor we need to use Potvin … Anders used α = 0.294 for the analogues of Potvin’s methods B and C. As usual, a slight inflation of the Type I Error in method C with CV ≤ 20% – which is unlikely in parallel designs anyway. Evaluation by the Welch-Satterthwaite test (for unequal variances and group sizes). If someone knows what might be meant in the ICH M13A’s Section 2.2.3.4 … The use of stratification in the randomisation procedure based on a limited number of known relevant factors is therefore recommended. Those factors are also recommended to be accounted for […] … please enlighten me.❝ … or Bonferroni … I think (‼) that it will be an acceptable alternative because it is the most conservative one (strictly speaking, it is not correct in a TSD because the hypotheses are not independent). Assessors love Signore Bonferroni. ❝ … and same for 4-period or 3-period replicate design? If you mean reference-scaling, no idea. You can try Bonferroni as well. Recently something was published by the FDA but it is wacky (see this post why I think so). I’m not convinced that it is worth the efforts. Plan the study for the assumed CVwR (and the CVwT if you have the information). In reference-scaling the observed CVwR is taken into account anyway. If the variability is higher than assumed, you can scale more and will gain power. If it is lower than assumed, bad luck. However, the crucial point is – as always – the GMR… If you mean by ‘3-period replicate design’ the partial replicate (TRR|RTR|RRT) and want to use the FDA’s RSABE, please don’t (see this article why). It is fine for the EMA’s ABEL. If you want a 3-period replicate for the FDA, please opt for one of the full replicates (TRT|RTR, TTR|RRT, or TRR|RTT). Otherwise, you might be in deep shit.
0.0296 (Type I Error 0.050182 < 0.050360 ). If you are a disciple of Madame Potvin, even 0.0305 would be OK (0.051616 < 0.052 ) . Say, you opted for belt plus suspenders 0.0293 (0.049518 < 0.05 ), planned the first stage with 300 subjects, and observed a CV of 40%. You had some dropouts (15 in one group and 20 in the other). Therefore, instead of n1 = 300, specify n1 = c(135, 130) . What can you expect?
However, in this method you can specify one. Say, you don’t want more than 450 subjects:
Let’s compare now the empiric Type I Errors for both.
A caveat: Actually it is not that simple. In practice you have to repeat this exercise for a range of unequal variances and group sizes in the first stage. It might be that you have to adjust more based on the worst case combination. I did that some time ago. Took me a week, four simultaneous -sessions, CPU-load close to 90%… — Dif-tor heh smusma 🖖🏼 Довге життя Україна! Helmut Schütz The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |