ElMaestro
★★★

Denmark,
2009-05-05 21:49
(5835 d 16:43 ago)

Posting: # 3649
Views: 11,013
 

 Power with a Danish twist [Power / Sample Size]

Dear all,

I am still wondering about the Danish policy for BE, so I got curious and wanted to see some power curves for 2,2,2-BE studies when 1.0 must be part of the 90% CI. The data I have are based on a brute-force method, i.e. the calculated power asymptotically approaches the true power as the number of iterations/resamples approaches infinity.

So, here is an example, using R lingo, where N is the number of sbj in each sequence, Pwr1 is power according to the Danish requirements, and Pwr0 is power with the usual requirements. CV was 28%, R/T was 95%, and 120000 resamples were used:

N=c(16, 20, 24, 28, 32, 36, 40, 44, 48, 50, 60, 70, 80, 90, 100, 110)
Pwr1=c(0.712, 0.751, 0.754, 0.738, 0.718, 0.696, 0.677, 0.654, 0.637, 0.629, 0.581, 0.535, 0.493, 0.451, 0.415, 0.38)
Pwr0=c(0.772, 0.856, 0.911, 0.946, 0.967, 0.98, 0.989, 0.993, 0.996, 0.997, 0.999, 1, 1, 1, 1, 1)

An example of how to read this: If we have 16 sbj in each sequence then the power with the standard requirements is approximately 77.2% (Fartssie gives 77.6% on my machine); with the Danish requirements it is approximately 71.2%.
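
For illustration, here is a minimal R sketch of what such a brute-force check could look like (this is not my actual simulation code; the function name sim.power and its defaults are just made up for the example). It simulates the point estimate and the residual variance on the log scale, builds the 90% CI, and counts how often the standard criterion and the additional Danish condition (1.0 inside the CI) are met.

sim.power <- function(n.per.seq, CV = 0.28, ratio = 0.95,
                      nsims = 120000, alpha = 0.05) {
  N      <- 2*n.per.seq             # total number of subjects
  df     <- N - 2                   # residual degrees of freedom
  sigmaW <- sqrt(log(1 + CV^2))     # intra-subject SD on the log scale
  se     <- sigmaW*sqrt(2/N)        # SE of the difference of log-means
  d      <- rnorm(nsims, mean = log(ratio), sd = se)  # simulated point estimates
  s2     <- sigmaW^2*rchisq(nsims, df)/df             # simulated residual variances
  hw     <- qt(1 - alpha, df)*sqrt(2*s2/N)            # 90% CI half-width (log scale)
  lo     <- exp(d - hw); hi <- exp(d + hw)
  std    <- lo >= 0.80 & hi <= 1.25         # usual acceptance range
  danish <- std & lo <= 1 & hi >= 1         # additionally 1.0 inside the CI
  c(standard = mean(std), danish = mean(danish))
}
sim.power(16)   # e.g. 16 sbj per sequence

With nsims = 120000 the simulation error of each estimate is only on the order of a tenth of a percentage point.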

These data are rather shocking to me, provided they are true. The power has a maximum of about 75% when the number of subjects per sequence is in the early twenties. So if a company believes their product has a CV of 28% and T/R=95%, then there is no way of powering the study to 80%.

So my questions: Are these data correct - could someone with more specialised software check those values? I think the assumed CV and R/T values are reasonable, agreed? Did anyone ever understand the Danish thinking? Talk to a regulator there perhaps?

Best regards
EM.



*: What's the real power at CV=65%, R/T=95%, 38 sbj (19 in each seq)??
martin
★★  

Austria,
2009-05-05 23:32
(5835 d 15:01 ago)

@ ElMaestro
Posting: # 3650
Views: 9,310
 

 population parameter

Dear EM!

A larger sample size will result in a narrower confidence interval for the unknown population parameter (rule of thumb: quadrupling the sample size will double the precision). You chose R/T=95% rather than R/T=100% as the expected population parameter for your simulations, and the narrower confidence intervals (around the expected true ratio of R/T=95%) at larger sample sizes will give you the results observed in your simulation study. Just use R/T=100% as the population parameter (i.e. assuming perfect BE ;-) ) and the differences in power will be smaller.
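
As a quick numeric illustration of that rule of thumb (my own R sketch; CV=28% is just an example value): the expected half-width of the 90% CI on the log scale shrinks roughly with the square root of the total sample size.

CV     <- 0.28
sigmaW <- sqrt(log(1 + CV^2))              # intra-subject SD on the log scale
for (N in c(32, 64, 128, 256)) {           # total number of subjects
  hw <- qt(0.95, N - 2)*sigmaW*sqrt(2/N)   # expected 90% CI half-width
  cat("N =", N, " half-width =", round(hw, 4), "\n")
}

Quadrupling N from 32 to 128 roughly halves the half-width.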

hope this helps

martin
ElMaestro
★★★

Denmark,
2009-05-05 23:48
(5835 d 14:45 ago)

@ martin
Posting: # 3651
Views: 9,346
 

 population parameter

Dear Martin,

I think you answered a question I did not ask. No worries, this has happened before. If we ask ourselves at which T/R the odd Danish requirement has the least impact (as compared to the standard criteria) then I would expect this to be at T/R = 1.0 (the chance of the CI traversing 1.0 is highest there for stochastic reasons), which I guess is what you also expressed.

Best regards,
EM.
martin
★★  

Austria,
2009-05-06 00:59
(5835 d 13:33 ago)

@ ElMaestro
Posting: # 3652
Views: 9,275
 

 population parameter

Dear EM !

What I tried to explain is that your “shocking data” are due to the belief/assumption regarding the true R/T ratio. In the case of R/T=1, power based on the Danish requirement increases as the sample size increases, but (IMHO) a larger sample size is still required compared to the “usual requirement” for a given power.

A second attempt to answer your question: yes; under the assumption of R/T=0.95 it will be rather difficult to find a sample size for a power of 80% (power does not increase monotonically with the sample size), whereas under the assumption of R/T=1 you will find a sample size to show BE with a power of at least 80%.

best regards

martin
d_labes
★★★

Berlin, Germany,
2009-05-08 11:49
(5833 d 02:44 ago)

@ ElMaestro
Posting: # 3659
Views: 9,276
 

 Wrong Question Answer in Danish

Dear ElMaestro,

very interesting!

❝ So my questions: Are these data correct - could someone with more specialised software check those values?


I have not recalculated your values, but I have the very strong feeling that they are correct. What your data show: the higher your N, the higher the chance of failing the Danish BE criterion if the variability is low enough.

This is reasonable to me because with higher N the confidence interval gets tighter (as Martin has already stated), and therefore 1.0 is not contained in the CI if your point estimate of the BE ratio is distinct enough from 1.0.
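
Just to put a rough number on that (my own back-of-the-envelope sketch in R, approximating the realised half-width by its expected value): the probability that the 90% CI covers 1.0 drops steadily with N when the true ratio is 0.95 and the CV is 28%.

CV     <- 0.28
ratio  <- 0.95
sigmaW <- sqrt(log(1 + CV^2))
for (N in c(32, 64, 128, 256)) {                 # total number of subjects
  se <- sigmaW*sqrt(2/N)                         # SE of the log point estimate
  hw <- qt(0.95, N - 2)*se                       # approximate 90% CI half-width
  p  <- pnorm(hw, mean = log(ratio), sd = se) -
        pnorm(-hw, mean = log(ratio), sd = se)   # P(CI covers 1.0), approximately
  cat("N =", N, " P(1.0 in CI) approx.", round(p, 3), "\n")
}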

❝ Did anyone ever understand the Danish thinking?


No, never. :no:
They have changed the usually accepted BE test from
"The BE ratio (population) is allowed to vary between 0.80 and 1.25"
by act of law to the test
"The ratio (population) must be not distinct from 1.0".

This results in your "strange" power numbers.

Suppose the BE metric varies only to a very low extent (zero as a limit, but then we statisticians become unemployed :-D ): then you will always conclude "No bioequivalence" in the Danish thinking, provided your BE ratio is anything below or above 1.0.
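
A tiny numeric illustration of that limit (again my own sketch; N=24 total is picked arbitrarily): as the CV shrinks the CI collapses onto the point estimate, so with a true ratio of 0.95 it can no longer cover 1.0.

N <- 24                                          # total number of subjects (arbitrary)
for (CV in c(0.30, 0.15, 0.05, 0.01)) {
  sigmaW <- sqrt(log(1 + CV^2))
  hw     <- qt(0.95, N - 2)*sigmaW*sqrt(2/N)     # 90% CI half-width on the log scale
  cat(sprintf("CV %4.1f%%: CI around a PE of 0.95 roughly %.3f to %.3f\n",
              100*CV, 0.95*exp(-hw), 0.95*exp(hw)))
}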

IMHO this is not the correct answer to the Bioequivalence question.

But let me restate: "How unsearchable are Regulator's judgments and how inscrutable Regulator's ways! Amen." (Romans 11:33, 36)

Regards,

Detlew
d_labes
★★★

Berlin, Germany,
2009-05-08 17:53
(5832 d 20:40 ago)

@ d_labes
Posting: # 3661
Views: 9,264
 

 Danish numbers

Dear ElMaestro,

PS: Meanwhile I have tried a simulation with "The power to knoff".
Here are my results using CV=28% and T/R=95%:

              empirical Danish
runs    n     power     power
-------------------------------
2000    16    0.7745    0.7395
        20    0.8675    0.7830
        24    0.9120    0.7625
        28    0.9485    0.7520
        32    0.9625    0.7260
        36    0.9835    0.6975
        40    0.9915    0.6655
8000    16    0.7684    0.7323
        20    0.8606    0.7734
        24    0.9174    0.7645
        28    0.9445    0.7446
        32    0.9695    0.7214
        36    0.9836    0.7036
        40    0.9875    0.6743


Not exactly the same numbers as yours, but the same trend.

BTW: How long did you wait for your 120 000 simulations each?
Did you get bored? :-D

Regards,

Detlew
ElMaestro
★★★

Denmark,
2009-05-08 22:19
(5832 d 16:14 ago)

@ d_labes
Posting: # 3667
Views: 9,233
 

 Danish numbers

Hi dlabes,

❝ Not exactly the same numbers as yours, but the same trend.

❝ BTW: How long did you wait for your 120 000 simulations each?


As this is a simulation study I think the numbers are VERY similar. Or did I overlook sumfin?
On my machine it takes 4.7 secs to do 120000 simulations. I bet I can get it working at a much faster speed once (if!) I optimise my code, see below.

My machine is a laptop with Celeron M 1.5 GHz (yes, they knew how to make those laptops back then). I used MinGW version 1.4.3 interfaced by DevC++ 4.9.9.2 with optimisations for x86 turned on.

Best regards
EM.
Helmut
★★★
Vienna, Austria,
2009-05-08 15:23
(5832 d 23:09 ago)

@ ElMaestro
Posting: # 3660
Views: 9,285
 

 Power with a Danish twist

Dear ElMaestro!

❝ I am still wondering about the Danish policy for BE [...]



You are not alone. :angry:

❝ *: What's the real power at CV=65%, R/T=95%, 38 sbj (19 in each seq)??


Good question. I think we are pushing software to the limits, i.e., running into trouble getting a reasonable value of the noncentral t-distribution (numeric precision, ...).
Fartssie comes up with -0.0278 (!), StudySize 2.01 simply gives up (-, with 20 subjects/sequence gives 0.421%), and my R-code
a       <- 0.05     # alpha
CV      <- 0.65     # intra-subject coefficient of variation
Theta1  <- 0.8      # lower acceptance limit
Theta2  <- 1/Theta1 # upper acceptance limit
Limit   <- 20000    # upper limit for the sample-size search
Ratio   <- 0.95     # expected T/R ratio
SigmaW  <- sqrt(log(1+CV^2))  # intra-subject SD on the log scale
s       <- sqrt(2)*SigmaW
for (Aimed in c(0.00001,0.0001,0.0005,0.001,0.005,0.01,0.5,0.7,0.8,0.9)) {
  n <- 4            # start value of sample size search
  repeat {
    df    <- n-2
    t1    <- qt(1-a,df)
    t2    <- -t1
    nc1   <- sqrt(n)*((log(Ratio)-log(Theta1))/s)
    nc2   <- sqrt(n)*((log(Ratio)-log(Theta2))/s)
    prob1 <- pt(t1,df,nc1)
    prob2 <- pt(t2,df,nc2)
    power <- prob2-prob1
    n     <- n+2
    if (power >= Aimed | (n-2) >= Limit) break
  }
  Total <- n-2
  if (Total == Limit) {
    cat("Aimed",Aimed*100,"%, CV",CV*100,"%, Stopped at Limit",Limit," Power",power*100,"%\n")
  } else {
    cat("Aimed",Aimed*100,"%, CV",CV*100,"%, Sample Size",Total," Power",power*100,"%\n")
  }
}
gets stuck at 40 subjects (power 0.454%)...

The nasty point in the Danish requirement is formulations with low CVs.
The current guideline states

"The clinical and analytical standards imposed may also influence the statistically determined number of subjects. However, generally the minimum number of subjects should be not smaller than 12 unless justified."

whereas the BE-draft comes up with

"The minimum number of subjects in a cross-over study should be 12."

  1. Have you ever seen a BE study which was performed in fewer than 12 subjects, including a justification of the sample size like: "low variability, small sample size in order to meet Danish requirements, :blahblah:"?
  2. What about the draft? If we read "should be" = "has to be" it will be quite nasty.

d_labes
★★★

Berlin, Germany,
2009-05-08 18:06
(5832 d 20:26 ago)

@ Helmut
Posting: # 3662
Views: 9,512
 

 Power with a Danish twist

Dear Helmut, dear ElMaestro!

❝ ❝ *: What's the real power at CV=65%, R/T=95%, 38 sbj (19 in each seq)??


❝ Good question. I think we are pushing software to the limits, i.e., running into trouble getting a reasonable value of the noncentral t-distribution (numeric precision, ...).

❝ Fartssie comes up with -0.0278 (!), StudySize 2.01 simply gives up (-), and my R-code [...] gets stuck at 40 subjects (power 0.454%)...


My Extreme computing SASophylistic gives
Power = 4.50940122 %
As you know precise up to the last digit :-P .

Simulating with 8000 resamples, I come up with 4.6% (normal BE test).

Regards,

Detlew
Helmut
★★★
Vienna, Austria,
2009-05-08 18:15
(5832 d 20:18 ago)

@ d_labes
Posting: # 3663
Views: 9,270
 

 Power with a Danish twist

Dear D. Labes!

❝ My Extreme computing SASophylistic gives
❝ Power = 4.50940122 %
❝ As you know precise up to the last digit :-P .


Oh wow, what a beasty number cruncher you have!

ElMaestro
★★★

Denmark,
2009-05-08 21:40
(5832 d 16:53 ago)

(edited on 2009-05-09 10:13)
@ Helmut
Posting: # 3666
Views: 9,189
 

 Power with a Danish twist

Hi HS,

❝ Good question. I think we are pushing software to the limits, i.e., running into trouble getting a reasonable value of the noncentral t-distribution (numeric precision, ...).

❝ Fartssie comes up with -0.0278 (!), StudySize 2.01 simply gives up (-, with 20 subjects/sequence gives 0.421%), and (blah blah blah)


I actually meant this as a trick question. As you correctly point out this is, even with advanced and trusted software, not an exact science. While mathematicians may say that a problem (such as power in a BE study) has this and that exact solution and be pointing at some ridiculously complex integrals, many such problems are solved numerically today. Integration is a typical example; it requires some Al Gore Rhythms which are heavily parameterised. Robustness is all about finding the set of parameters that give a consistent answer, but we can sometimes find conditions where the algorithm fails or miscalculates. The figures above illustrate it. Thus we might ask further: Under which conditions will Software X give me an answer that is 5% wrong? That is one helluva difficult question to answer when exact solutions are not available to compare with.
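
Just to illustrate the kind of cross-check I mean (a rough sketch, not production code; it re-uses the noncentral-t formula from your R code above with the SE written out, and checks it against a plain lognormal/chi-square brute-force simulation of the same TOST decision, at the problematic settings CV=65%, T/R=0.95, N=38):

set.seed(1)
N      <- 38; df <- N - 2; a <- 0.05
CV     <- 0.65; Ratio <- 0.95
SigmaW <- sqrt(log(1 + CV^2)); se <- SigmaW*sqrt(2/N)
# noncentral-t approximation (same formula as above, SE written out)
t1  <- qt(1 - a, df)
nc1 <- (log(Ratio) - log(0.80))/se
nc2 <- (log(Ratio) - log(1.25))/se
pow.approx <- pt(-t1, df, nc2) - pt(t1, df, nc1)
# brute-force simulation of the same TOST decision
nsims <- 120000
d   <- rnorm(nsims, log(Ratio), se)
s2  <- SigmaW^2*rchisq(nsims, df)/df
hw  <- qt(1 - a, df)*sqrt(2*s2/N)
pow.sim <- mean(exp(d - hw) >= 0.80 & exp(d + hw) <= 1.25)
cat("approximation:", pow.approx, " simulation:", pow.sim, "\n")

If the two disagree badly, that is a hint the approximation (or its implementation) is struggling at those settings.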

EM.
Helmut
★★★
Vienna, Austria,
2009-05-08 18:56
(5832 d 19:37 ago)

@ ElMaestro
Posting: # 3664
Views: 9,203
 

 Power with a Danish twist

Dear ElMaestro!

❝ N=c(16, 20, 24, 28, 32, 36, 40, 44, 48, 50, 60, 70, 80, 90, 100, 110)
❝ Pwr1=c(0.712, 0.751, 0.754, 0.738, 0.718, 0.696, 0.677, 0.654, 0.637, 0.629, 0.581, 0.535, 0.493, 0.451, 0.415, 0.38)
❝ Pwr0=c(0.772, 0.856, 0.911, 0.946, 0.967, 0.98, 0.989, 0.993, 0.996, 0.997, 0.999, 1, 1, 1, 1, 1)

❝ An example of how to read this: If we have 16 sbj in each sequence then the power with the standard requirements is approximately 77.2% (Fartssie gives 77.6% on my machine); with the Danish requirements it is approximately 71.2%.


I get 77.62276% (N = n1+n2 = 32). FARTSIE on my machine: 77.6228% and StudySize 77.607% (120000 Monte Carlo simulations: 77.81% after 9 seconds on a dual Xeon 2.8 GHz machine).
How did you set limits for the Danish requirements?

ElMaestro
★★★

Denmark,
2009-05-08 21:17
(5832 d 17:16 ago)

(edited on 2009-05-09 10:51)
@ Helmut
Posting: # 3665
Views: 9,200
 

 Power with a Danish twist

Hi HS

❝ I get 77.62276% (N = n1+n2 = 32). FARTSIE on my machine: 77.6228% and StudySize 77.607% (120000 Monte Carlo simulations: 77.81% after 9 seconds on a dual Xeon 2.8 GHz machine).


Good, it seems I have good agreement with your software; I am happy to see that.

❝ How did you set limits for the Danish requirements?


I am not sure I understand what you mean when asking how I set the limits for the Danish reqs. I evaluate BE, then I use a further requirement which goes like this: if the upper bound < 1.0 then it is a failure, and if the lower bound > 1.0 then it also fails.
In C lingo:
if (FailWhen1NotPartOfCI)
{
    if (exp(LogLo)>1.0) OK=0;   /* lower CI bound above 1.0: fail */
    if (exp(LogHi)<1.0) OK=0;   /* upper CI bound below 1.0: fail */
}

Where OK is just my own private little boolean (actually in C it is an int, but that's another story) to indicate whether a dataset is accepted or rejected. But right now I have a feeling this is not really what you meant?


Best regards.
EM

PS: If there are some geekolophystic souls out there to whom it looks like my code has not been optimised for speed (or who would like to tell me log(1)=0) then yes, you are right, there's still work to be done :-D