d_labes ★★★ Berlin, Germany, 2013-03-15 16:56 (4417 d 09:38 ago) Posting: # 10202 Views: 31,070 |
|
Dear All! We had already from time to time the question which combination of CVwR and GMR apply to find alpha ≤ 0.05 from power calculations. No one was really sure. Except the statement that power calculations at the conventional BE limits 1.25 (or 0.8) as for the ABE test will not be appropriate. A rather natural choice IMHO would be to calculate the power at the border(s) of the widened BE acceptance limits. Thus I employed power.scABEL() and power.RSABE() from R-package PowerTOST. Here the results for each method on its own: GMR=1.25 if CV<=0.3 or GMR=exp(0.8925742*CV2se(CV)) if CV>0.3 for the FDA RSABE GMR=1.25 if CV<=0.3 or GMR=exp(0.7601283*CV2se(CV)) if CV>0.3 for the EMA scABEL and cap on if CV>0.5 CV GMR pBE_ABEL GMR2 pBE_RSABE ![]() Interesting! At least for me ![]() Both methods start for low CV's with pBE ~ 0.05. Classical ABE evaluation controls alpha≤0.05 ![]() Approaching CV=0.3 the probability of (falsely?) accepting BE raises, for the FDA method up to 0.116. This is understandable since a increasing part of our simulated studies will have sample CV's >0.3 which are then evaluated with widened limits or downscaled linear scaled ABE criterion. After/at CV=0.3 pBE for the FDA method drops sharply to or below 0.05 due to the known discontinuity in the widened BE limits and becomes with rising CV lower and lower (dominated by the point estimator criterion? conservative?). For the EMA method the pBE stays above 0.05 up to CV=0.45. The cap at CV≥0.5 then prevents the decrease for higher CV's seen in the FDA method. pBE first rises slightly with increasing CV. This behaviour (power increases with increasing CV) we have already seen here. But then a decrease is seen (due to the point estimator criterion?). If we really estimating here the type I error then both methods are cases of an alpha inflation!? The FDA method in the vicinity of CV≤0.3 and the EMA method also but additionally up to CV=0.45. And the 'alpha inflation' is much more pronounced as that what the EMA lead to question Potvin method C. — Regards, Detlew |
Helmut ★★★ ![]() ![]() Vienna, Austria, 2013-03-15 18:27 (4417 d 08:07 ago) @ d_labes Posting: # 10203 Views: 24,563 |
|
Dear Detlew! ❝ A rather natural choice IMHO would be to calculate the power at the border(s) of the widened BE acceptance limits. Agree. ❝ GMR=1.25 if CV<=0.3 or GMR=exp(0.8925742*CV2se(CV)) if CV>0.3 for the FDA RSABE ❝ GMR=1.25 if CV<=0.3 or GMR=exp(0.7601283*CV2se(CV)) if CV>0.3 for the EMA scABEL and cap on if CV>0.5 <nitpicking> I would prefer a GMR of 0.80 since PowerTOST employs simulations – remember the “rounding thread”? Which designs / sample sizes have you used? I’m struggling reproducing your numbers. ❝ Interesting! At least for me For me as well. I will ask the two Lászlós and maybe Panos Macheras. — Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
d_labes ★★★ Berlin, Germany, 2013-03-16 21:10 (4416 d 05:24 ago) @ Helmut Posting: # 10208 Views: 24,219 |
|
Dear Helmut! ❝ <nitpicking> I would prefer a GMR of 0.80 since PowerTOST employs simulations – remember the “rounding thread”? No rounding at all employed. Even 0.760... not. ❝ Which designs / sample sizes have you used? I’m struggling reproducing your numbers. No explicit design written down in the code. That means default -> "2x3x3". n=48 if I remember correctly. — Regards, Detlew |
d_labes ★★★ Berlin, Germany, 2013-03-18 09:11 (4414 d 17:23 ago) (edited on 2013-03-18 09:52) @ d_labes Posting: # 10216 Views: 24,074 |
|
Dear Helmut! ❝ No explicit design written down in the code. That means default -> "2x3x3". n=48 if I remember correctly. Must correct me: N=24 in the calculations above. nsims=1E6. N=48 comes here: CV GMR pBE_ABEL GMR2 pBE_RSABE ![]() — Regards, Detlew |
martin ★★ Austria, 2013-03-25 21:47 (4407 d 04:47 ago) @ d_labes Posting: # 10279 Views: 24,001 |
|
dear all! this behavior may be due to an adaptive choice of a statistical method in face of the data without any "adjustment" for this adaptation. this reminds me on the "flowchart" approach many CROs are using which should be avoided. best regards martin PS.: no idea at the moment how to account for this. PPS.: why flowcharts should not be used link. |
Helmut ★★★ ![]() ![]() Vienna, Austria, 2013-03-26 16:21 (4406 d 10:13 ago) @ martin Posting: # 10286 Views: 24,500 |
|
Dear all! ❝ this behavior may be due to an adaptive choice of a statistical method in face of the data without any "adjustment" for this adaptation. Yep. The acceptance range is data-driven (random variable dependent on CVWR). ❝ PS.: no idea at the moment how to account for this. Adjust α? Quick shot: library(PowerTOST) ![]() ![]() Take my code with a grain of salt. Cases where no adjustment is necessary agree with Detlew’s results. Note: Any result <0.05035995 is not significantly >0.05 in 106 simulations. However, if you want to get a more conservative estimate replace
Edit: Much faster code in this post. — Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
martin ★★ Austria, 2013-03-26 17:19 (4406 d 09:15 ago) @ Helmut Posting: # 10287 Views: 23,952 |
|
dear helmut! iteratively adjusted alpha – super idea! you may find of interest that two weirdos published a paper about iterative alpha adjustment a few years ago (paper) soley based on simulation results (not enough skilled to provide evidence via mathematical proof). best regards martin |
Helmut ★★★ ![]() ![]() Vienna, Austria, 2013-03-26 19:02 (4406 d 07:32 ago) @ martin Posting: # 10289 Views: 24,031 |
|
Hi Martin! ❝ iteratively adjusted alpha – super idea! If the inflation of the consumer’s risk is real (based on our assumption to estimate pBE with a GMR at the borders of the scaled acceptance range) regulators should revise their methods. Would increase the sample size and IMHO requires a multi-step procedure.
❝ […] two weirdos published a paper […] Not their only crime. ![]() — Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
martin ★★ Austria, 2013-03-26 20:44 (4406 d 05:50 ago) @ Helmut Posting: # 10290 Views: 23,889 |
|
dear Helmut! totally agree: If the inflation of the type-I-error is real = serious risk to public health (to quote a famous colleague of mine). best regards martin PS.: IMHO - a-priori specification of the method should control the type-I-error (i.e. CV from historical data or more stringent the corresponding lower confidence limit should be greater than 30% to allow scaling). |
Helmut ★★★ ![]() ![]() Vienna, Austria, 2013-03-26 22:18 (4406 d 04:16 ago) @ martin Posting: # 10291 Views: 24,027 |
|
Hi Martin! ❝ totally agree: If the inflation of the type-I-error is real = serious risk to public health (to quote a famous colleague of mine). Is this the same guy who is (in)famous for publicly calling some approaches “bullshit”? BTW, α-inflation is not unknown. See the figures and table 1 of the two Lászlós.1 ❝ PS.: IMHO - a-priori specification of the method should control the type-I-error (i.e. CV from historical data or more stringent the corresponding lower confidence limit should be greater than 30% to allow scaling). I would also say so. Already noted in the past: Besides, it is not clear at present what kind of regulatory policy will be followed when several σWR estimates (i.e. results of previous submissions) are available to a drug regulatory agency. If regulators use all available data, then they can get an improved estimate for σWR, and possibly can draw a different conclusion from the sponsor.2 But likely it is wishful thinking to expect a list of acceptance ranges or CVs of reference formulations.
Edit: EMA’s data set I (full replicate, n1 36, n2 37, CVWR 0.4696) needs no α-adjustment, pBE 0.012061. If the two outliers are excluded (n1 34, n2 37, CVWR 0.3216) we require an adjusted α of 0.03149 in order to maintain the consumer’s risk. The 93.70% CIs of 106.03–126.16% (Method A) and 106.10–126.24% (Method B) lie still within the scaled acceptance range of 78.79–126.93%, but it is a close shave… — Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
Helmut ★★★ ![]() ![]() Vienna, Austria, 2013-08-11 18:19 (4268 d 09:15 ago) @ Helmut Posting: # 11262 Views: 22,836 |
|
Hi all! slow code to estimate sample sizes for adjusted α. library(PowerTOST) ![]() I set the expected ratio to 0.9 (more realistic than 0.95 for HVDs/HVDPs). Observations: Sample size penality due to αadj is higher in the fully replicated design. Generally no more penalty for CVWR > ~40%. Bonus question: αadj depends on CVWR observed in the study, which regulators probably don’t like. In a deficiency letter on Potvin’s method C The Netherland’s MEB stated: “Adapting the confidence intervals based upon power is not acceptable and also not in accordance with the EMA guideline. Confidence intervals should be selected a priori, without evaluation of the power.” (my emphases)Let’s say in study planning we expect a CVWR of 40% (T/R 0.9, 90% power). In a fully replicate design we would estimate the sample size with 42 – but α will be inflated to 0.05555. So we adjust α to 0.04377 (91.25% CI), which would need 44 subjects. In the study CVWR turned out to be 35%, which would call for a more conservative αadj of 0.03812 (92.38% CI). If we would use the αadj from study planning, we would face an inflation to 0.05681. Maybe it is acceptable to regulators to apply a lower α (wider CI) than in study planning, but what if it is the other way round? If it is not acceptable to use a higher α (narrower CI) we would loose power. — Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
apapanas ☆ Greece, 2024-10-31 07:02 (169 d 19:32 ago) @ Helmut Posting: # 24251 Views: 2,959 |
|
Noticed that a letter is missing below, which would not let me run the code ![]() ❝ ❝ ❝ ❝ |
Helmut ★★★ ![]() ![]() Vienna, Austria, 2024-10-31 09:41 (169 d 16:53 ago) @ apapanas Posting: # 24252 Views: 2,980 |
|
Hi Antigoni, ❝ Noticed that a letter is missing below, which would not let me run the code Oops! I corrected it. However, much easier with the function sampleN.scABEL.ad() , which was implemented in version 1.3-3 (2016-01-15) of PowerTOST .
To get only the sample size use
![]() For examples and more elaborated — Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
apapanas ☆ Greece, 2024-11-08 11:48 (161 d 14:46 ago) @ Helmut Posting: # 24264 Views: 2,099 |
|
Many thanks dear Helmut! |
Helmut ★★★ ![]() ![]() Vienna, Austria, 2024-11-08 15:34 (161 d 11:00 ago) @ apapanas Posting: # 24265 Views: 2,052 |
|
Hi Antigoni, ❝ Many thanks dear Helmut! See also the function sample.size.adj() in this article. It implements our method1,2 and a more conservative one.3,4Of course, such a conservatism comes with a price. The sample sizes can be prohibitively large.
— Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
Helmut ★★★ ![]() ![]() Vienna, Austria, 2013-03-27 16:38 (4405 d 09:56 ago) @ martin Posting: # 10297 Views: 23,972 |
|
Hi Martin, a technical question (see the note at the end of my previous post). Though any result <0.05035995 is not statistically significant >0.05 shouldn’t I set the target ( sig ) in the algo to 0.05?— Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
martin ★★ Austria, 2013-03-27 17:36 (4405 d 08:58 ago) @ Helmut Posting: # 10298 Views: 23,885 |
|
Hi Helmut! IMHO: yes as this is a constant and should not dependent on the number of simulation runs. however, differences should be rather small by using 1E6 runs. best regards martin Ps.: ... and yes its the same guy who is famous for publicly calling some approaches “bullshit” ![]() |
Helmut ★★★ ![]() ![]() Vienna, Austria, 2013-03-29 00:20 (4404 d 02:14 ago) @ martin Posting: # 10305 Views: 24,148 |
|
Hi Martin! ❝ IMHO: yes as this is a constant and should not dependent on the number of simulation runs. Oh yes, sure. Sorry for my stupidity. ❝ Ps.: ... and yes its the same guy who is famous for publicly calling some approaches “bullshit” I always thought he should work on his social skills. Here the results of my sim’s for target α 0.05. Partial and full replicate designs, n=24/48. ![]() ![]() The FDA’s method would lead to a large α-inflation (as presented by Detlew above) for CVWR ≤30%. This could be corrected by adjusting α. However, since scaling is only allowed for CVWR >30% nothing has to be done. Like the conventional TOST the test gets more conservative with increasing CV (and n). ![]() ![]() The EMA’s method is another cup of tea. α-inflation is seen up to 45%! The test is closer to the nominal level than the FDA’s. Interesting the behavior of the partial replicate with n=24. ![]() Code (10–25 iterations to reach convergence, runtime on my machine 15–45 seconds depending on design, n, CV): require(PowerTOST) — Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
martin ★★ Austria, 2013-03-28 16:08 (4404 d 10:26 ago) @ Helmut Posting: # 10304 Views: 23,760 |
|
Hi Helmut & simulants ! What do you think about a kind of adaptive group-sequential design with one interim analysis (i.e. estimate CV) allowing to make a decision regarding average BE or scaled BE in the final analysis? This may solve the problem. best regards martin |
Helmut ★★★ ![]() ![]() Vienna, Austria, 2013-03-29 00:34 (4404 d 02:00 ago) @ martin Posting: # 10306 Views: 24,051 |
|
Hi Martin! ❝ What do you think about a kind of adaptive group-sequential design with one interim analysis (i.e. estimate CV) allowing to make a decision regarding average BE or scaled BE in the final analysis? This may solve the problem. Nope. Actually the methods are (hidden) sequential. Most clear in the FDA’s guidance: Determine sWR, the within-subject standard deviation (SD) of the reference product, for the pharmacokinetic (PK) parameters AUC and Cmax.
— Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
martin ★★ Austria, 2013-03-29 16:07 (4403 d 10:27 ago) @ Helmut Posting: # 10311 Views: 23,735 |
|
Hi Helmut ! Yep; your are totally right (should have done my homework first ![]() Drawback: BUT what if the CV at interim is smaller than 0.3 ... ![]() best regards martin |
Helmut ★★★ ![]() ![]() Vienna, Austria, 2013-03-29 17:10 (4403 d 09:24 ago) @ martin Posting: # 10312 Views: 24,023 |
|
Hi Martin! ❝ what about defining the extent of scaling after interim analysis of the first stage data (CV estimated in 1st stage) within a group-sequential adaptive design? Theoretically OK. ![]() ❝ Drawback: BUT what if the CV at interim is smaller than 0.3 ... … or it is larger than 0.3 in stage 1 and in the pooled data set (n↑ = more precise estimate) you find one which is ≤0.3? If the inflation is real (still not sure) we have a problem – only with EMA’s method – in the CV-range 0.3–0.5. IMHO there are two options:
— Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |