Thanks Edzer,

I've requested Cressie's book from our library (just waiting on it).
My main concern was the many 0 counts. I also was not enthusiastic about odd transformations which then require appropriate back-transforms (I imagine the back transform of the kriging variance gets messy)

I've tried several linear and non-linear combinations....they all do not improve on predictions generated by using OK with the untransformed data. I am confident that the resultant grid outputs do capture the spatial structure quite well. I've also tried a 10 fold cross validation of the kriging model - this seems to give reasonable estimates for mean error, mean squared prediction error and mean square normalized error. I had interpreted this that the variogram model chosen was doing a reasonable job.

Edzer Pebesma wrote:
Hi Dave,

Dave Depew wrote:
Hi all,
A question for the more experienced geostats users....

I have a data set containing 2-3 variables relating to submerged plant characteristics inferred from acoustic survey. The distribution of the % cover variable is bounded (0-100) and highly left skewed (many 0's). The transect spacing is quite even, and I can't seem to notice much difference between a run of ordinary kriging and a variant of RK using a zeroinflated glm of the %cover residuals. None of the other co-variates show much correlation with the data (i.e. bottom depth, x and y). Is this a possible reason why OK and RK seem to give more or less the same predictions?
Well, yes, if there's not much of a trend, then RK will essentially simplify to OK.

my second question relates to transformation of the target variable...in this case zero inflated distributions are difficult to transform. Is it really a requirement of kriging that the data be transformed? or just that it will generally perform better with a target variable with a distribution close to normal?

I believe the argument is along the following lines: kriging is the BLUP in any case, but in case the data are normally distributed (around the trend), the BLUP (or more exactly the BLP, simple kriging) coincides with the conditional expectation, making it the best possible predictor. In other cases, meaning when data are not normally distributed, it is still the best linear predictor, but it may very well be that there are other, better, non-linear predictors that give a result much closer to the best predictor under those circumstances.

If there is a transformation for that data that makes them multivariate Gaussian, then transforming and kriging on that scale is the way to go. A catch that has gotten very little attention is that transformation typically looks at marginal distributions, and not at multivariate distributions, the latter being pretty hard to check with only one realisation of the random field.

Cressie's book is a good source to read this stuff; I've lost my copy when I moved jobs in the spring.
--
Edzer


_______________________________________________
R-sig-Geo mailing list
R-sig-Geo@stat.math.ethz.ch
https://stat.ethz.ch/mailman/listinfo/r-sig-geo

Reply via email to