Next: Box-Cox Transformation
Up: Data Analysis Supplement
Previous: Between-Days Variability
Devising an appropriate estimator for the response ratio was
especially tricky. Each value of GPx activity usually consisted of
five imperfectly measured, unpaired values: three measurements of
activity and two measurements of protein. Thus we estimated
, where
and
are the true values. However, if
and
are each normally distributed with mean 0,
has a Cauchy
distribution [6], which has no finite moments of order one or more,
that is to say, no mean value. In this case the expectation of
would not exist, making its estimation
problematical. Furthermore, Jensen's inequality [6,1] applied to the
reciprocal function and independence of the numerator and denominator
together imply that the naive estimator of the ratio of the sample
averages of
and
is not an unbiased estimator of
. That is,
. Moreover, using as
an estimator the average of ratios obtained directly from the data is
not possible, since the repeated measurements from the numerator and
denominator have no natural pairing. Using an average of ratios from
all possible combinations of numerator and denominator circumvents the
problem of pairing, but then observations from a particular experiment
are no longer independent. Therefore, values were resampled from the
empirical distribution of specific activity from each experiment. For
an experiment with
activity measurements and
protein measurements,
we resampled with replacement
times from the
possible specific
activity measurements, thus obtaining independent replications that
reflected the measurement error inherent in the original values. The
resulting estimate can be had analytically, but it is easier to rely
upon computing methods.
Next: Box-Cox Transformation
Up: Data Analysis Supplement
Previous: Between-Days Variability
regina@stat.stanford.edu