Discovery of Owen's Q in SAS [Power / Sample Size]
Hi Helmut!
Mentioned above the Analyst application in SAS before V9.1 allowed the sample size estimation for the paired design and the parallel group design. Within that application (designed as some sort of "point and click") the user had the possibility to look behind the scenes and get the code the
really used.
And oh wonder there where code lines like
Remembering having heard the name Owen sometimes ago, getting knowledge of the formulas behind that AlGore-Rhythms it was not too hard -but costs nevertheless some sweat- for me amateur to figure out what the arguments of the OwensQ where
.
Regarding your validation question:
I would say in principle it could be validated what we do with such undocumented functions, features or whatever if we could show the results are what is expected.
I have seen all around in the software industry usage of undocumented functions, features, data structures, especially in using features of the Betriebssystem Windoofs. Maybe the well-known stability of Progs under Kleinweich (measured as blue screen probability) was partly due to that.
The problem with such undocumented features/functions is that they may change in future versions, maintenance releases or even in hot-/bug-fixes without any notice. In the best case our use of them will then throw errors (or blue screens). But according to McMurphy's laws I expect we get wrong results without noticing them and are surprised
if others do.
I had decided to use OwenQ because at that time I had no other opportunity to do my sample size planning the right way within "The validated power ...". Thus the alternatives were "use or die".
Meanwhile R and the author of PowerTOST came to rescue
.
Now Owen's Q function is documented, the implementation code can be inspected by every body, but is it validated? What does this mean?
Of course the author of PowerTOST has checked the results of using it in sample size estimation as far as possible (regarding time, money and other resources like literature tables, access to other power/sample size software and and ...). Really! Nevertheless during the life cycle of PowerTOST up to now cases arose (extremal with respect to the usual use, see NEWS file of PowerTOST) where the implementation lead to implausible or erroneous results. What does this mean with respect to validation? Does this mean we have to validate every of our application cases because we don't know if this is an extremal? If yes how? Extremal cases are usually not tabulated. Other software may use other algorithms/approximations which by themselves call for validation/qualification. Hier beisst sich die Katze in den Schwanz. Two pices of software each giving 2+2=5 (a well known result) validate one another.
I meanwhile think the whole software validation fairy tale is a Chimera invented by the Greeks having nothing to do (the Greeks are nowadays responsible for all)
.
Validated - Has been tried once before.
Guernsey McPearson's Drug Development Dictionary
❝ BTW, @Detlew: How did you discover the existence of Owen’s Q in SAS?
Mentioned above the Analyst application in SAS before V9.1 allowed the sample size estimation for the paired design and the parallel group design. Within that application (designed as some sort of "point and click") the user had the possibility to look behind the scenes and get the code the
really used.And oh wonder there where code lines like
...
df=n-1
t1=tinv(1-alpha,df);
P1=OwenQ(t1,d1,0,R,df);
P2=OwenQ(-t1,d2,0,R,df);
power=P2-P1;
...Remembering having heard the name Owen sometimes ago, getting knowledge of the formulas behind that AlGore-Rhythms it was not too hard -but costs nevertheless some sweat- for me amateur to figure out what the arguments of the OwensQ where
.Regarding your validation question:
I would say in principle it could be validated what we do with such undocumented functions, features or whatever if we could show the results are what is expected.
I have seen all around in the software industry usage of undocumented functions, features, data structures, especially in using features of the Betriebssystem Windoofs. Maybe the well-known stability of Progs under Kleinweich (measured as blue screen probability) was partly due to that.
The problem with such undocumented features/functions is that they may change in future versions, maintenance releases or even in hot-/bug-fixes without any notice. In the best case our use of them will then throw errors (or blue screens). But according to McMurphy's laws I expect we get wrong results without noticing them and are surprised
if others do.I had decided to use OwenQ because at that time I had no other opportunity to do my sample size planning the right way within "The validated power ...". Thus the alternatives were "use or die".
Meanwhile R and the author of PowerTOST came to rescue
. Now Owen's Q function is documented, the implementation code can be inspected by every body, but is it validated? What does this mean?
Of course the author of PowerTOST has checked the results of using it in sample size estimation as far as possible (regarding time, money and other resources like literature tables, access to other power/sample size software and and ...). Really! Nevertheless during the life cycle of PowerTOST up to now cases arose (extremal with respect to the usual use, see NEWS file of PowerTOST) where the implementation lead to implausible or erroneous results. What does this mean with respect to validation? Does this mean we have to validate every of our application cases because we don't know if this is an extremal? If yes how? Extremal cases are usually not tabulated. Other software may use other algorithms/approximations which by themselves call for validation/qualification. Hier beisst sich die Katze in den Schwanz. Two pices of software each giving 2+2=5 (a well known result) validate one another.
I meanwhile think the whole software validation fairy tale is a Chimera invented by the Greeks having nothing to do (the Greeks are nowadays responsible for all)
.Validated - Has been tried once before.
Guernsey McPearson's Drug Development Dictionary
—
Regards,
Detlew
Regards,
Detlew
Complete thread:
- Sample size calculations: SAS vs PowerTOST/FARTSSIE Oiinkie 2011-12-06 14:30
- Sample size calculations: SAS vs PowerTOST/FARTSSIE Helmut 2011-12-06 14:49
- Sample size calculations: SAS vs PowerTOST/FARTSSIE Oiinkie 2011-12-07 15:57
- The power not to know vs PowerTOST/FARTSSIE/... d_labes 2011-12-12 15:38
- Paired designs not that uncommon Helmut 2011-12-12 17:05
- Paired designs not that uncommon ElMaestro 2011-12-13 17:04
- Off topic: software validation Helmut 2011-12-13 18:10
- Off topic: software validation ElMaestro 2011-12-13 20:30
- Off topic: software validation Helmut 2011-12-14 14:14
- Discovery of Owen's Q in SASd_labes 2011-12-14 15:34
- Off topic: software validation ElMaestro 2011-12-13 20:30
- Off topic: software validation Helmut 2011-12-13 18:10
- Paired designs not that uncommon ElMaestro 2011-12-13 17:04
- Paired designs in PowerTOST 0.9-0 Helmut 2011-12-19 20:05
- Paired designs not that uncommon Helmut 2011-12-12 17:05
- The power not to know vs PowerTOST/FARTSSIE/... d_labes 2011-12-12 15:38
- Sample size calculations: SAS vs PowerTOST/FARTSSIE Oiinkie 2011-12-07 15:57
- Sample size calculations: SAS vs PowerTOST/FARTSSIE Helmut 2011-12-06 14:49
