Bioequivalence and Bioavailability Forum

Dr_Dan
★★

Germany,
2011-09-01 13:27
(5400 d 15:53 ago)

Posting: # 7315
Views: 10,436

Adverse events [General Statistics]

Dear all
an assessor realized that the number of AEs differ between the treatment groups and he/she concluded that the tolerability of one product is worse than the other.
I would reply that bioavailability in terms of peak and total exposure is taken as a surrogate parameter for efficacy and safety and I would point out that bioequivalence studies are not designed to investigate the safety profile in terms of AEs of one formulation in comparison to another. Due to the small numbers of subjects and small number of AEs any differences become significant but statistically not relevant.
Is this correct or do you have other suggestions?
Looking forward to your replies.
Kind regards
Dan

—
Kind regards and have a nice day
Dr_Dan

Helmut
★★★

Vienna, Austria,
2011-09-01 14:18
(5400 d 15:02 ago)

@ Dr_Dan
Posting: # 7316
Views: 8,793

Significant differences in AEs

Post reply

Dear Dan!

❝ an assessor realized that the number of AEs differ between the treatment groups and he/she concluded that the tolerability of one product is worse than the other.

Oh no. I got the same question once – many, many years ago.

❝ I would reply that bioavailability in terms of peak and total exposure is taken as a surrogate parameter for efficacy and safety and I would point out that bioequivalence studies are not designed to investigate the safety profile in terms of AEs of one formulation in comparison to another.

Correct.

❝ Due to the small numbers of subjects and small number of AEs any differences become significant but statistically not relevant.

❝ Is this correct or do you have other suggestions?

First I would not accept the idea to simply compare the number of AEs. IMHO only a comparison of ADRs (relationship to IMP rated certainly or probably/likely) makes sense. Or would you include headache, which is known to occur in up to 30 % of subjects due to the setting of the study (less sleep, no breakfast until four hours, no coffee, :blahblah:

). Furthermore I would not pool ADRs (apples with oranges), but evaluate them separately. If done that way I would expect differences between treatments to diminish. Only if you still get significant differences, I would continue with your argument.

—
Dif-tor heh smusma 🖖🏼 Довге життя Україна!
Helmut Schütz

The quality of responses received is directly proportional to the quality of the question asked. 🚮
Science Quotes

Dr_Dan
★★

Germany,
2011-09-01 17:44
(5400 d 11:36 ago)

@ Helmut
Posting: # 7318
Views: 8,659

Significant differences in AEs

Post reply

Dear Helmut
In the study in question seven subjects who received Treatment A (Test Product) reported 15 treatment-emergent adverse events (TEAEs) and 8 TEAEs were reported by eight subjects who received Treatment B (Reference Product) leading to the misinterpretation that the test product causes almost twice number of adverse reactions in comparison with the reference product. A second study with the same formulations but with a higher strength shows the opposite picture in terms of TEAEs: 12 TEAEs were reported by ten of the 36 subjects who received Treatment A (Test Product) and 20 TEAEs reported by thirteen of the 34 subjects who received Treatment B (Reference Product)

study 1: A: 7 subjects with in total 15 TEAE (6 unrelated)
B: 8 subjects with in total 8 TEAE (2 unrelated)
Of the 24 TEAEs reported, the relationship of 1 was judged as “probable”, 14 as “possible”, and 9 as "unrelated". One TEAE that was experienced at an onset time unknown cannot be assigned with certainty to a treatment.

Treatment  A  B

general    1  -

Nerv       3  2

Musc       2  -

Inv        3  3

GI         -  1

You say you would not pool ADRs (apples with oranges), but evaluate them separately. If done that way you would expect differences between treatments to diminish.
Sorry, I do not understand. If you evaluate seperately then the differences become even bigger, right? see table above
Please advise
Kind regards
Dan

—
Kind regards and have a nice day
Dr_Dan

ElMaestro
★★★

Denmark,
2011-09-01 15:40
(5400 d 13:41 ago)

@ Dr_Dan
Posting: # 7317
Views: 8,690

Significant differences in AEs

Post reply

Dear Dr_Dan

Sorry to hear this.
If the CIs show BE and the assessor can raise a PSRtPH on basis of count data, then the BE guideline apparently needs to be re-written as the traditional metrics do not assure BE for drug products like yours.
Less clear of course if your CI goes from 1.18-1.24 etc, when the assessor can definitely argue the exposure is higher. But if the CI spans over 1.0 this should -at least in theory- be a piece of cake.

—
Pass or fail!
ElMaestro

ElMaestro
★★★

Denmark,
2011-09-01 20:12
(5400 d 09:09 ago)

@ Dr_Dan
Posting: # 7321
Views: 8,593

Adverse events

Post reply

Dear Dr_Dan,

the table you gave is a nice but sparse contingency table.
When traditional chi-square is calculated the resulting p-value is often imprecise if the table is sparse (low numbers and/or zeros being frequent) which is arguably the case for your data.
For this reason these tables are typically evaluated with resampling.

With R:

DrDansTable= as.table(rbind(c(1,3,2,3,0), c(0,2,0,3,1)))

chisq.test(DrDansTable, simulate.p.value = TRUE, B = 100000)



        Pearson's Chi-squared test with simulated p-value (based on 1e+05

        replicates)



data:  DrDansTable 

X-squared = 3.75, df = NA, p-value = 0.6612

If you do the table with ordinary calculation of chi square then you end up at p=0.44.

Thus:

"The applicant acknowledges the concern raised by the assessor and has undertaken a further analysis of the occurrence of adverse events for the two treatments in trial XV63-GFR-Q-61 as follows.
The p-value for the contigency table (insert it around here) as evaluated by resampling using 100.000 random replicates is approx. 0.66; for a significant difference to be present the p-values should be 0.05. The assessor's concern is therefore not supported by the numbers presented in the applicant's study report.
Evaluation of the table with ordinary chi-square gives a p-value of 0.44 which likewise is very much above the significance limit. Because the table is sparse, the applicant contends that preference should be given to the evaluation with resampling. It will always converge towards the true p-value with increasing sampling number.
Regardless of the evaluation method, the result does not in any way suggest the presence of a difference, which is in perfect accordance with the outcome of the trial in terms of the primary parameters Cmax and AUCt. Should the two types of outcomes differ in their interpretations, the primary parameters should prevail anyway, as the trial was dimensioned to compare these rather than the secondary parameters.
All in all, the primary trial outcomes as well as further analysis of the AEs observed in the study do not support the notion of the test product having a safety profile that differs from that of the reference product."

Someone should check my calculations, though, as I frequently err.

—
Pass or fail!
ElMaestro

Helmut
★★★

Vienna, Austria,
2011-09-01 20:45
(5400 d 08:35 ago)

@ ElMaestro
Posting: # 7322
Views: 8,653

Bravo!

Post reply

Dear ElMaestro,

great post and especially the jargon in your suggested reply! An alternative statistic would be Fisher’s exact test (again in R):

fisher.test(DrDansTable)



        Fisher's Exact Test for Count Data



data:  DrDansTable 

p-value = 0.7403

alternative hypothesis: two.sided

—
Dif-tor heh smusma 🖖🏼 Довге життя Україна!
Helmut Schütz

The quality of responses received is directly proportional to the quality of the question asked. 🚮
Science Quotes

Dr_Dan ★★ Germany, 2011-09-02 11:00 (5399 d 18:20 ago) @ Helmut Posting: # 7323 Views: 8,504	Bravo! Post reply
	Dear ElMaestro I can only agree with Helmut. Great post! Thank you very much to both of you. Kind regards Dan — Kind regards and have a nice day Dr_Dan

d_labes
★★★

Berlin, Germany,
2011-09-02 11:28
(5399 d 17:52 ago)

@ Helmut
Posting: # 7324
Views: 8,511

Bravo?

Post reply

Dear ElMaestro, dear Helmut!

Cough ... :smoke:

The statistics you do here is at least very approximative.

Consider that you evaluate data from a crossover study that are not independent, i.e. the AE counts under Test or Reference may come from the same subject(s).

Keywords (evaluation of incidence of one sort of AEs, f.i. SOC General):
McNemar test if you neglect period effects,
Mainland-Gart or Prescott's test considering period effects.

See for this
Stephen Senn
"Cross-over Trials in Clinical Research"
Chapter 4
Wiley, Chichester 2002

See this online paper for a SAS solution and/or for a description how these tests work.

A test of the whole AE profile from a cross-over I'm not aware of, sorry.

@EM: Full ACK with Helmut's rating of the jargon. Brilliant as always from you :cool:

—
Regards,

Detlew

ElMaestro
★★★

Denmark,
2011-09-02 20:48
(5399 d 08:32 ago)

@ d_labes
Posting: # 7329
Views: 8,527

Bravo?

Post reply

Dear d_labes,

❝ The statistics you do here is at least very approximative.

yes, just like mandatory parametric evaluation of CI's even when residuals are non-normal. But we never do a parametric test when we don't know the distribution of residuals do we? :-D

To be honest, I had no thought of your point at all when I wrote the post above. I guess one can argue that, there's probably both intra-row and intra-column correlation (same subject experiencing AEs in two or more categories for the same treatment, as well as same subject experiencing AE with test and ref respectively. There's always a whining subject with a ton of AEs, especially if the trial is in a Western country).

To HS: I had no idea Fisher's exact test could be done on tables larger than 2x2... This great forum is an eternal source of knowledge. Thank you for improving the statistic for Dr_Dan as well as extending my horizon.

—
Pass or fail!
ElMaestro

martin
★★

Austria,
2011-09-02 13:02
(5399 d 16:18 ago)

@ Dr_Dan
Posting: # 7325
Views: 8,634

Adverse events

Post reply

Dear Dr_Dan!

This is an interesting question but IMHO this is a non-inferiority hypothesis, i.e. the sponsor would like to show that the occurrence of AEs with the test treatment is non-inferior to the reference treatment. However, I think a consensus on a-priori definition of relevance threshold is difficult or hopeless. According to my opinion it’s very similar to statistical problems occurring in non-clinical toxicological studies. Hothorn (2010) suggests estimation of confidence intervals and their post-hoc interpretation in terms of tolerable thresholds when no a-priori definition of relevance thresholds is available.

What do you think by providing a confidence interval plot? The y-axis depicts the different type of AEs (e.g. according to MedDRA classification) and the x-axis shows the difference in proportion between test and reference and the corresponding confidence interval. A similar plot - for statistical analysis of organ weights - can be found in Hothorn (2010, slide 30) for illustration of this idea.

Please let me know what you think about this approach!

Best regards

Martin

Hothorn LA (2010). Statistical analysis of short-term studies in regulatory toxicology using R. Webinar organized by the American Statistician Association, Biopharmaceutical Section on Thursday, May 20.
http://www.biopharmnet.com/doc/doc03002-07.html or http://www.biopharmnet.com/doc/2010_05_20_webinar.pdf

PS.: when you would like to use the McNemar test you can calculate the odds ratio and the corresponding confidence interval instead of differences in proportions using the R function mcnemar.exact of R package exact2x2.

Adverse events [General Sta­tis­tics]

Significant differences in AEs

Significant differences in AEs

Significant differences in AEs

Adverse events

Bravo!

Bravo!

Bravo?

Bravo?

Adverse events

Adverse events [General Statistics]