Empiric distributions [PK / PD]
Hi HS,
this thread gave me an idea (sorry for going a bit off topic now):
The whole general use of log transformation is quite unjustified from a biological perspective, but has a lot of appeal from a mathematical perspective in that it rectifies an assumption that must hold for normal dists and which can strictly be said to be violated with untransformed values. But other empirical transformations might have appeal as well.
Someone should therefore make an Al Gore Rhythm which could find the 'best' transformation among a set of possible transformations (this would by nature have to be a limited number of known transformations) on a limited number of datasets and see if there's anything useful, like consensus, coming out for whichever parameters are of interest.
Of course, there's trouble ahead: It might not be easy to define an objective function that gives a clearcut winner. I am myself in uncharted territory here but I would perhaps naïvely start with something like the Shapiro-Wilks statistic or a regression goodness-of-fit statistic from the QQ plot, and then make sure not to compare it between datasets but only within.
Just another useless idea from ElMaestro... I should perhaps just stick to poetry?!?
this thread gave me an idea (sorry for going a bit off topic now):
The whole general use of log transformation is quite unjustified from a biological perspective, but has a lot of appeal from a mathematical perspective in that it rectifies an assumption that must hold for normal dists and which can strictly be said to be violated with untransformed values. But other empirical transformations might have appeal as well.
Someone should therefore make an Al Gore Rhythm which could find the 'best' transformation among a set of possible transformations (this would by nature have to be a limited number of known transformations) on a limited number of datasets and see if there's anything useful, like consensus, coming out for whichever parameters are of interest.
Of course, there's trouble ahead: It might not be easy to define an objective function that gives a clearcut winner. I am myself in uncharted territory here but I would perhaps naïvely start with something like the Shapiro-Wilks statistic or a regression goodness-of-fit statistic from the QQ plot, and then make sure not to compare it between datasets but only within.
Just another useless idea from ElMaestro... I should perhaps just stick to poetry?!?
—
Pass or fail!
ElMaestro
Pass or fail!
ElMaestro
Complete thread:
- Empiric distributions Helmut 2012-03-21 16:18 [PK / PD]
- nitpicking ElMaestro 2012-03-21 17:32
- nitpicking Helmut 2012-03-22 16:06
- Empiric distributions martin 2012-03-21 21:16
- QQ-Plots Helmut 2012-03-22 01:17
- Transform or not transform d_labes 2012-03-22 09:31
- Transform or not transform Helmut 2012-03-22 13:27
- Transform or not transform d_labes 2012-03-22 09:31
- QQ-Plots Helmut 2012-03-22 01:17
- Empiric distributionsElMaestro 2012-03-22 18:16
- Empiric distributions Helmut 2012-03-23 01:50
- nitpicking ElMaestro 2012-03-21 17:32