Oiinkie ☆ The Netherlands, 2013-01-16 17:42 (4484 d 18:27 ago) Posting: # 9869 Views: 16,444 |
|
Dear all, As already indicated here in the past by Dr. Dan, when analyzing imbalanced datasets (in my case N=23, n1(RT)=12, n2(TR)=11, 2x2x2) with bear v2.5.3 the output of ANOVA_stat.txt and Statistical_summaries.txt do not allign. Different PE's and CI's are presented, while it should be the same in both files. Thus, now it's the question which output is the correct one? In my data set 23 subjects have completed the study (subject 24 is a dropout who only has data on T). When I actually include the data available of subject 24, bear crashes after NCA (which actually is logical)... If you need more information or the actual data set (N=23) I am analyzing, please let me know. Many thanks in advance! Best regards, Oiinkie — Regards, Oiinkie |
yjlee168 ★★★ ![]() ![]() Kaohsiung, Taiwan, 2013-01-16 18:25 (4484 d 17:43 ago) (edited on 2013-01-16 19:22) @ Oiinkie Posting: # 9871 Views: 15,126 |
|
Dear Oiinkie, Yes, please provide your data set if possible. Also if you have validated results with SAS or SPSS (as your post here) vs. bear with the same imbalanced data set, we would like to see that too. It will be helpful. Thank you so much. ❝ ... ❝ If you need more information or the actual data set (N=23) I am analyzing, please let me know. ❝ ... — All the best, -- Yung-jin Lee bear v2.9.2:- created by Hsin-ya Lee & Yung-jin Lee Kaohsiung, Taiwan https://www.pkpd168.com/bear Download link (updated) -> here |
Oiinkie ☆ The Netherlands, 2013-01-16 20:18 (4484 d 15:50 ago) @ yjlee168 Posting: # 9875 Views: 15,208 |
|
Dear Yung-jin, Thanks for your reply! ❝ Yes, please provide your data set if possible. Sure, please find it below (csv). This is the full data set including subject 24 (the dropout who only completed period 1, T) which causes bear to crash after NCA. The imbalanced data set of concern is without this dropout, so N=23.
subject,sequence,period,time,conc Continued in next post... — Regards, Oiinkie |
Oiinkie ☆ The Netherlands, 2013-01-16 20:19 (4484 d 15:50 ago) @ yjlee168 Posting: # 9876 Views: 15,183 |
|
1,1,1,0.00,0 ❝ Also if you have validated results with SAS or SPSS (as your post here) vs. bear with the same imbalanced data set, we would like to see that too. Unfortunately, I currently do not have access to SAS or SPSS (dead laptop...) so I was not able to validate results. This is actually the reason why I asked which output file of bear "tells the truth": ANOVA_stat.txt or Statistical_summaries.txt? Would you already be able to answer this question? ![]() Many thanks! Oiinkie — Regards, Oiinkie |
yjlee168 ★★★ ![]() ![]() Kaohsiung, Taiwan, 2013-01-16 20:43 (4484 d 15:25 ago) @ Oiinkie Posting: # 9879 Views: 15,038 |
|
Dear Oiinkie, Thank you for your data set. At the end of 2010, I tried to fix the inconsistency between two outputs. Then I faced the same problem as you did: which one should be correct? Thus why I did not release bear v2.5.4 to public. The data set I used to test bear was obtained from full data set by deleting one subject's data. I was not sure if I was doing the right thing at that moment. I thought that I should confirm the final results with SAS. Don't quite remember everything right now. Anyway, I will test bear again with your data set, as well as using SAS. Sorry about this. ❝ ... ❝ Unfortunately, I currently do not have access to SAS or SPSS (dead laptop...) so I was not able to validate results. This is actually the reason why I asked which output file of bear "tells the truth": ANOVA_stat.txt or Statistical_summaries.txt? Would you already be able to answer this question? ❝ ... — All the best, -- Yung-jin Lee bear v2.9.2:- created by Hsin-ya Lee & Yung-jin Lee Kaohsiung, Taiwan https://www.pkpd168.com/bear Download link (updated) -> here |
ElMaestro ★★★ Denmark, 2013-01-16 23:10 (4484 d 12:58 ago) @ yjlee168 Posting: # 9880 Views: 15,016 |
|
Dear yjlee, ❝ Thank you for your data set. At the end of 2010, I tried to fix the inconsistency between two outputs. Then I faced the same problem as you did: which one should be correct? You can easily check the result by fitting an lm without intertcept, e.g. with Formulation specified as the first term. The difference in 'LSMeans' for lack of better wording on the log scale will be directly extractable from the coefficients vector's first and second values. — Pass or fail! ElMaestro |
Oiinkie ☆ The Netherlands, 2013-01-22 14:28 (4478 d 21:41 ago) @ yjlee168 Posting: # 9897 Views: 14,571 |
|
Dear Yung-jin (and other forum members), ❝ I will test bear again with your data set, as well as using SAS. Have you already been able to test the data set in order to find out whether ANOVA_stat.txt or Statistical_summaries.txt gives the "correct answer"? This would help me a lot. Many thanks! — Regards, Oiinkie |
Helmut ★★★ ![]() ![]() Vienna, Austria, 2013-01-22 16:06 (4478 d 20:03 ago) @ Oiinkie Posting: # 9898 Views: 14,668 |
|
Dear Oiinkie, maybe you are interested in an independent evaluation (Phoenix 6.3). Only Cmax since I don’t know your AUC-algo.
— Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |
Oiinkie ☆ The Netherlands, 2013-01-23 13:58 (4477 d 22:10 ago) @ Helmut Posting: # 9904 Views: 14,570 |
|
Dear Helmut, Many thanks for your assessment! ❝ • subject(sequence) fixed, n=23/23: ❝ 104.76% (99.61–110.18%) CVintra 9.95% ❝ EMA wants only the last variant. I am aware of these "rules" of the EMA ![]() ❝ Only Cmax since I don’t know your AUC-algo. AUC would be AUC(0-t) calculated by linear trapezoid method (bear's standard; not the nicest one, but I want to stick with method of the CRO for this reanalysis of this very old study). AUC(0-inf) is not critical at the moment (secondary parameter). — Regards, Oiinkie |
Oiinkie ☆ The Netherlands, 2013-01-23 16:07 (4477 d 20:02 ago) @ Oiinkie Posting: # 9906 Views: 14,489 |
|
In addition to the previous post and for information purposes... ❝ It seems that with transfer of the results from ANOVA_stat.txt to Statistical_summaries.txt (I assume the latter is compiled based on the former) something goes wrong for imbalanced data sets... After looking into the output files a bit more, Statistical_summaries.txt presents the PE by calculating (geometric mean T)/(geometric mean R), which is in general not correct for imbalanced data sets (PE should be the square root of (lower limit CI)*(upper limit CI)). It looks like (wild guess ![]() Could my assessment be correct? — Regards, Oiinkie |
yicaoting ★ NanKing, China, 2013-02-15 19:58 (4454 d 16:11 ago) @ Helmut Posting: # 10037 Views: 14,299 |
|
❝ • subject(sequence) random, n=24/23: ❝ 104.98% (99.83–110.40%), CVintra 9.96% ❝ • subject(sequence) fixed, n=24/23: ❝ 104.76% (99.61–110.18%), CVintra 9.95% ❝ • subject(sequence) random, n=23/23: ❝ 104.76% (99.61–110.18%) CVintra 9.95% ❝ • subject(sequence) fixed, n=23/23: ❝ 104.76% (99.61–110.18%) CVintra 9.95%[/list] Using the same data with SAS's GLM procedure, results are: n=24/23 Least Squares Means for Effect formulation n=23/23 Least Squares Means for Effect formulation Exactly the same. Transform the above results to PE and CI: PE: 1.047582365 |
yjlee168 ★★★ ![]() ![]() Kaohsiung, Taiwan, 2013-01-22 20:57 (4478 d 15:12 ago) @ Oiinkie Posting: # 9899 Views: 14,601 |
|
Dear Oiinkie, Not yet and still working on it. I will post the final results here. Apparently, validating with SAS does not provide enough information. Will figure out Elmaestro's method posted in this thread. ❝ Have you already been able to test the data set in order to find out whether ANOVA_stat.txt or Statistical_summaries.txt gives the "correct answer"? This would help me a lot. — All the best, -- Yung-jin Lee bear v2.9.2:- created by Hsin-ya Lee & Yung-jin Lee Kaohsiung, Taiwan https://www.pkpd168.com/bear Download link (updated) -> here |
yjlee168 ★★★ ![]() ![]() Kaohsiung, Taiwan, 2013-03-01 02:01 (4441 d 10:08 ago) @ Oiinkie Posting: # 10137 Views: 14,098 |
|
Dear Oiinkie, Helmut and yicaoting Thank you all for running WNL (by Helmut) and SAS (by yicaoting) for imbalanced data provided by Oiinkie. This is definitely a great Forum. O.K., bear will crash with incomplete data. The original question was: with imbalanced data, bear got two output files (ANOVA_stat.txt and Statistical_summaries.txt). The results showed inconsistent. After running the imbalance data, it can be concluded:
I only abstract the differences as mentioned by Oiinkie as follows:
[from ANOVA_stat.txt]
... and
[from Statistical_summaries.txt]
... ps2: A good news is that I think the inconsistency has been fixed back to 2011-12-22 (bear v2.5.4, not released to the public yet). However, the crash with incomplete data has been not yet.[edited] ❝ Have you already been able to test the data set in order to find out whether ANOVA_stat.txt or Statistical_summaries.txt gives the "correct answer"? This would help me a lot. — All the best, -- Yung-jin Lee bear v2.9.2:- created by Hsin-ya Lee & Yung-jin Lee Kaohsiung, Taiwan https://www.pkpd168.com/bear Download link (updated) -> here |
yjlee168 ★★★ ![]() ![]() Kaohsiung, Taiwan, 2013-03-15 08:58 (4427 d 03:11 ago) @ yjlee168 Posting: # 10198 Views: 14,114 |
|
Dear all, I am working on bear v2.5.4 right now and try to solve this problem. The error (something like ..error in data.frame... arguments imply differing number of rows...) with incomplete dataset with bear is due to "...data.frame [in R] is an object, which resembles a table from Excel, it has (must have!) the same number of rows in each column." That's funny rule in R. ![]() ❝ ... ❝ 3. the error only occurs in imbalanced data; for incomplete data, bear will be crashed. ❝ ... — All the best, -- Yung-jin Lee bear v2.9.2:- created by Hsin-ya Lee & Yung-jin Lee Kaohsiung, Taiwan https://www.pkpd168.com/bear Download link (updated) -> here |
yicaoting ★ NanKing, China, 2013-02-15 19:48 (4454 d 16:21 ago) @ Oiinkie Posting: # 10036 Views: 14,364 |
|
Dear Oiinkie, yjlee168, ElMaestro, Helmut, I don't think Oiinkie's data is a set of imbalanced data, but a set of incomplete data. In my personal opinion, In 2*2*2 BE study, imbalanced data: the number of subjects of Sequence(RT) is not equal to that of Sequence(TR). incomplete data: the number of subjects in Period 1 is not equal to that in Period 2. Oiinkie's data included Subject(24) in Period 1 and missed Subject(24) in Period 2, so Oiinkie's data is a set of incomplete data. If we delete Subject(24) in Period 1, it becomes a set of imbalanced data. Is it right? |
Helmut ★★★ ![]() ![]() Vienna, Austria, 2013-02-16 15:41 (4453 d 20:28 ago) @ yicaoting Posting: # 10043 Views: 14,237 |
|
Dear Zhang Yong! ❝ imbalanced data: the number of subjects of Sequence(RT) is not equal to that of Sequence(TR). ❝ incomplete data: the number of subjects in Period 1 is not equal to that in Period 2. ❝ ❝ Is it right? Exactly. — Dif-tor heh smusma 🖖🏼 Довге життя Україна! ![]() Helmut Schütz ![]() The quality of responses received is directly proportional to the quality of the question asked. 🚮 Science Quotes |