|
d_labes ★★★ Berlin, Germany, 2011-08-23 15:01 (5424 d 02:28 ago) Posting: # 7290 Views: 10,079 |
|
|
Dear All! In this posting I had observed a type III sum-of-square peculiarity in the EMA recommended code (Helmut calls it crippled ) for obtaining the intra-subject variability for the Reference formulation in replicate crossover studies.Now I had coded (not remembering the code literally) Proc GLM data=EMAset1;The result was EMA set 1 ![]() Of course I knew that the type I sum-of-squares are order dependent. But df=0 was a great surprise for me. — Regards, Detlew |
|
ElMaestro ★★★ Denmark, 2011-08-23 15:15 (5424 d 02:14 ago) @ d_labes Posting: # 7291 Views: 8,823 |
|
|
Oh you mighty numbercruncher, ❝ model logData = period3 sequence1 subject(sequence)2; How does it look for sequence and typeIII SS if you fit the muddle without subject(sequence)? — Pass or fail! ElMaestro |
|
d_labes ★★★ Berlin, Germany, 2011-08-23 15:58 (5424 d 01:31 ago) @ ElMaestro Posting: # 7293 Views: 8,801 |
|
|
Old Pirate, ❝ How does it look for sequence and typeIII SS if you fit the muddle without subject(sequence)? Very high residual error (CVRef=121.8%) The GLM ProcedureSeems this does'nt make any sense. — Regards, Detlew |
|
ElMaestro ★★★ Denmark, 2011-08-23 17:02 (5424 d 00:27 ago) @ d_labes Posting: # 7295 Views: 8,752 |
|
|
Dear d_labes, ❝ Seems this does'nt make any sense. How extremely odd that is. There certainly is a sequence component which isn't zero. Have you found a bug in SAS? Further, I wonder: The dataset contains, if I am not mistaking: 4 periods 2 sequences 77 subjects (there is no subject called 61) I just wonder about another thing; in your original post that you link to above and this in this thread as well SAS says the model has 78 df's. Apprently (from your linked post) df periods = 2 df seqs =1 df subjs(seq) =75 why df periods =2 and not 3? (note: I just tried to construct the original unreduced model matrix: 4 columns for periods, 2 for seqs, and 77 for subjs. Then qr'ed the matrix in R and got a rank of 81 (had expected 78 if SAS is correct); prolly I made a mistake somewhere )— Pass or fail! ElMaestro |
|
d_labes ★★★ Berlin, Germany, 2011-08-23 17:28 (5424 d 00:01 ago) @ ElMaestro Posting: # 7296 Views: 8,778 |
|
|
Dear ElMaestro, ❝ I just wonder about another thing; in your original post that you link to above and this in this thread as well SAS says the model has 78 df's. ❝ Apprently (from your linked post) ❝ df periods = 2 ❝ df seqs =1 ❝ df subjs(seq) =75 ❝ why df periods =2 and not 3? Good question. Next question .Seems there is some inter-dependence between periods and sequences. Subjects with sequence TRTR have only periods 2 and 4, those with sequence RTRT have periods 1 and 3 (Note: we are dealing only with the data of the Reference). Maybe this is the source of this odd degree of freedom? BTW: If you have the data in R could you lm() them to see what's the result? — Regards, Detlew |
|
Helmut ★★★ ![]() Vienna, Austria, 2011-08-23 17:36 (5423 d 23:52 ago) @ d_labes Posting: # 7297 Views: 8,814 |
|
|
Dear both of you! Funny stuff! I’m a little bit too busy to be of help by now. ❝ BTW: If you have the data in R could you lm() them to see what's the result? @ElMaestro: If you don’t want to extract the data from EMA’s PDF, here is an Excel-sheet for download. — Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
|
ElMaestro ★★★ Denmark, 2011-08-23 18:26 (5423 d 23:02 ago) (edited on 2011-08-23 20:54) @ d_labes Posting: # 7298 Views: 8,797 |
|
|
Dear d_labes, ❝ Seems there is some inter-dependence between periods and sequences. ❝ Subjects with sequence TRTR have only periods 2 and 4, those with sequence RTRT have periods 1 and 3 (Note: we are dealing only with the data of the Reference). ❝ Maybe this is the source of this odd degree of freedom? Important edit: I apologise - overlooked that we are only dealing with the ref. I will let my initial response remain below. I am not sure this is the explanation, here's my logic: If we for any observation for any subject within the i'th sequence have info abut which three periods the obserbation does not come from thenwe can infer which period it comes from. Thus df(per)=4-1=3. Same for sequence, df(seq)=2-1. For the subject; if we know which of the subjects it isn't then we can infer which subject (in seqs) it comes from (that's minus 1) and we can even infer which if it is the subject from seq 1 or seq 2 (minus 1 again). df(subj)=77-2=75. 75+3+1=79 (according to Excel, this is not on an early-age pentium machine ). ❝ BTW: If you have the data in R could you lm() them to see what's the result? Will do, sir! Please allow me a little time for that. — Pass or fail! ElMaestro |
|
ElMaestro ★★★ Denmark, 2011-08-23 18:58 (5423 d 22:31 ago) @ d_labes Posting: # 7299 Views: 8,763 |
|
|
This post is extensively edited as I did initially not catch that we are only dealing with observations of the Reference formulation. Dear d_labes and HS, Took the file provided by HS and and rn it through R as follows:
What it really, really strange is this: M=model.matrix(Muddle)Nützt nüchts. Ich werde's nie verstehen. Ist auch scheissegal. — Pass or fail! ElMaestro |
|
Helmut ★★★ ![]() Vienna, Austria, 2011-08-24 02:44 (5423 d 14:45 ago) @ ElMaestro Posting: # 7300 Views: 8,765 |
|
|
Dear comrades-in-arms! ❝ Nützt nüchts. Ich werde's nie verstehen. Ist auch scheissegal. Yep. Let me throw in Phoenix6.2 in the spirit of EMA’s Q&A p25: ‘The analysis presented above show that this approach (Method A) is feasible even for unbalanced replicate design studies. The advantage of this approach is that it is straightforward and that it appears to be software and software option independent.’ (my emphases)Sequential TestsPartial Tests‘Sequential Tests’ in Phoenix/WinNonlin = SAS I Quote from the manual about ‘Partial Tests’: The partial tests in LinMix are not equivalent to the Type III method in SAS though they coincide in most situations. Aha! — Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
|
d_labes ★★★ Berlin, Germany, 2011-08-24 13:24 (5423 d 04:05 ago) @ Helmut Posting: # 7302 Views: 8,805 |
|
|
Dear brothers-in-arms! What does this all tell us? What lessons learned? ![]() First I hope that the statisticians at EMA responsible for that paragraph of the Q&A read this thread. It must be a splashy slap in their face considering Helmut's quotation of "the spirit of EMA’s Q&A". Second I come more and more to the conclusion that a decomposition of the subject effect into the terms sequence and subject nested within sequence is not really possible with those crippled datasets (reasons not really understood, but I'm only a bloody amateur and have no such deep understanding of concepts like rank of the model.matrix). The robust model without any peculiarities should therefore include period and subject effects only. Do I err here? Third the intra-individual variability (represented by the residual error) is fortunately not affected by all that muddle. And this is the only interesting number in this analysis. The rest is really "scheissegal" (best vulgar translation found: don't give a flying fuck ).Also I personally have a deep horror to see df=0 or 'not estimable' in a model. — Regards, Detlew |
|
Helmut ★★★ ![]() Vienna, Austria, 2011-08-24 15:51 (5423 d 01:38 ago) @ d_labes Posting: # 7304 Views: 8,752 |
|
|
Dear control freaks! ❝ Second I come more and more to the conclusion that a decomposition of the subject effect into the terms sequence and subject nested within sequence is not really possible with those crippled datasets […]. The robust model without any peculiarities should therefore include period and subject effects only. ❝ Do I err here? If you want to get a second amateur’s opinion: exactly. That’s why I called the datasets (after removal of T!) ‘crippled’. All information about sequences (RTRT ^ TRTR ¬ RR!) is lost. Martin (not an amateur) once told me that you can do only statistics if you understand the data generating process. In all replicate datasets I have seen until now, EMA’s method gives a smaller CVWR than the full model (= FDA’s or EMA’s ‘Method C’). Politics = narrower AR? If we throw away sequences, it doesn’t make any sense to include them in the model (not to speak about nested subjects either). BTW, more than 20 years after Freeman’s paper1, Senn’s book2, and the empiric study by D’Angelo et al.3 EMA has realised (see IR GL) that sequence effects are of no importance in a properly designed cross-over study. But: Would anybody evaluate a 2×2 cross-over by Y=treatment+period+subject only - though the CI stays the same? Agree with your third point. Quotes on a sad occasion:
— Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
|
ElMaestro ★★★ Denmark, 2011-08-28 23:34 (5418 d 17:54 ago) @ d_labes Posting: # 7307 Views: 8,504 |
|
|
Hi all, quick note: There is a sequence effect to be fit, no doubt about that. The fact that we boil TRTR and RTRT down to RR is not the direct culprit here as treatment is not considered. But as was said by Helmut there is an interaction or nesting of period in sequence. So just as is the case with subejcts, whenever period is in the Muddle, any type III anova goes bonkers. Compare the two sequential ones: Muddle=lm(log(b$Data)~factor(b$Period) + factor(b$Sequence))Remember earlier discussions about three-treatment designs where EMA occasionally does not like that the residual is polluted by an irrelevant treatment A when comparison of treatment B and C is done. In those cases the dataset is pruned down to a "pseudo" two-treatment, two-sequence, two-period dataset and analysed the usual way. We could do the same to periods here and get rid of the ugly 'issue', I think. I don't think this will change the residual SS (which is what is important), however, but I haven't done it or thought it fully through. — Pass or fail! ElMaestro |
|
d_labes ★★★ Berlin, Germany, 2011-08-24 12:47 (5423 d 04:42 ago) @ ElMaestro Posting: # 7301 Views: 8,770 |
|
|
Dear ElMaestro! ❝ Muddle=lm(log(b$Data)~factor(b$Period)+factor(b$Sequence)+factor(b$Subject)) ❝ anova(Muddle) ❝ Analysis of Variance Table ❝ ❝ ❝ factor(b$Period) 3 1.570 0.52326 2.6253 0.05703 . ❝ factor(b$Subject) 75 118.662 1.58216 7.9380 2.454e-16 *** ❝ Residuals 71 14.151 0.19931 factor(b$sequence) vanished in nirvana Clever or bug? In terms of the EMA code literally taken but treated in R you can observe a similar "cleverle" but now with type III (I have renamed Data to Cmax and done a factor transformation before calling lm()): muddle2 <- lm(log(Cmax)~ sequence + subject%in%sequence + period, data=refdata)Note the period DF's. Also note the reversed order of effects. ❝ Nützt nüchts. Ich werde's nie verstehen. Ist auch scheissegal. — Regards, Detlew |
|
ElMaestro ★★★ Denmark, 2011-08-24 15:04 (5423 d 02:25 ago) @ d_labes Posting: # 7303 Views: 8,658 |
|
|
Dear d_labes ❝ ❝ Clever or bug? ❝ ❝ drop1 does not work like in SAS when it comes to sequence. When R drops Sequence to get the deltaSS for this factor it does not automatically recognise that Sequence is already in the model because it is contained in Subject. SAS does that (or at least: should do it, but might obviously be failing here). Your example with first the sequential muddle with period last and then a drop1 is truly brilliant. Here I would expect exactly the same SS for period between the two anovas. It just isn't the case. Where's hrotter and jdetlor? There must be an explanation. — Pass or fail! ElMaestro |
|
Helmut ★★★ ![]() Vienna, Austria, 2011-08-24 16:01 (5423 d 01:28 ago) @ ElMaestro Posting: # 7305 Views: 8,733 |
|
|
Comrade! ❝ Where's hrotter and jdetlor? Wouldn’t be too optimistic about Hermann Rotter (last login 2010-04-08), maybe J. Detlor (last login 2011-07-29) is willing to step into the muddle. — Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
|
jdetlor ☆ 2011-11-01 04:05 (5354 d 12:24 ago) @ Helmut Posting: # 7568 Views: 7,743 |
|
|
❝ Comrade! ❝ ❝ ❝ Where's hrotter and jdetlor? Dear All! It's been a while since I've logged on because I have left the pharmaceutical/bioequivalence consulting industry. I have gone back to finance. I'm now using various statistical methods to build decision trees to evaluate customer risk. Anyways, if I read the problem correctly, I believe the issue lies within the modification to your design matrix (X-matrix). As a full replicate design, the periods and sequences are independent with respect to treatment. When you delete a specific treatment from the data, the columns for sequence and period are not linearly independent anymore, and thus when PROC GLM tries to assign degrees of freedom based on independent columns, it fails to assign some to the sequence effect. Hope this helps with the issue, and I will be updating my profile with a working email address. Thanks J. Detlor Edit: Forgot to mention — in SAS you can see this problem if you use PROC GENMOD with the modified data set (same model statement though) and request the design matrix through the ODS output. |
|
ElMaestro ★★★ Denmark, 2011-11-02 10:48 (5353 d 05:41 ago) @ jdetlor Posting: # 7580 Views: 17,297 |
|
|
Hi jdetlor, ❝ It's been a while since I've logged on because I have left the pharmaceutical/bioequivalence consulting industry. I have gone back to finance. I am sure they need your competent help in the finance sector. Best regards, EM. |
) for obtaining the intra-subject variability for the Reference formulation in replicate crossover studies.
)
![[image]](https://static.bebac.at/pics/Blue_and_yellow_ribbon_UA.png)
![[image]](https://static.bebac.at/img/CC by.png)
Quotes on a sad occasion: