Re: [R-sig-Geo] kriging question

Edzer Pebesma Tue, 26 Aug 2008 13:40:38 -0700

Dave,

Transformation to a continuous distribution when the data follow a discrete distribution is always messy, and the back-transform may get worse.

While you're at the library, try to pick up Diggle & Ribeiro's Model-based geostatistics; they describe a model-based approach that extends glm models. It seems the most appropriate way for your kind of data. I'm not sure whether the accompanying software (packages geoR/geoRglm) supports zero-inflated Poissons. In case it does, it remains to be seen whether prediction will actually improve substantially.

--
Edzer

Dave Depew wrote:

Thanks Edzer,

I've requested Cressie's book from our library (just waiting on it).
My main concern was the many 0 counts. I also was not enthusiastic about odd transformations which then require appropriate back-transforms (I imagine the back-transform of the kriging variance gets messy).
I've tried several linear and non-linear combinations; none of them improves on the predictions generated by OK with the untransformed data. I am confident that the resultant grid outputs do capture the spatial structure quite well. I've also tried a 10-fold cross validation of the kriging model; this seems to give reasonable estimates for mean error, mean squared prediction error and mean square normalized error. I interpreted this to mean that the chosen variogram model was doing a reasonable job.
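
For readers following the thread, the three cross-validation statistics mentioned above can be sketched as follows. This is only an illustration, not the code used in the thread: the function name cv_summary and the toy numbers are made up, and it assumes each held-out point comes with an observed value, a kriging prediction and a kriging variance.

```python
import numpy as np

def cv_summary(observed, predicted, kriging_variance):
    """Summarise cross-validation residuals (hypothetical helper)."""
    observed = np.asarray(observed, dtype=float)
    predicted = np.asarray(predicted, dtype=float)
    kriging_variance = np.asarray(kriging_variance, dtype=float)

    residual = observed - predicted
    # Residuals scaled by the kriging standard error at each held-out point.
    zscore = residual / np.sqrt(kriging_variance)

    return {
        "mean_error": residual.mean(),     # ideally close to 0 (no bias)
        "mspe": (residual ** 2).mean(),    # smaller is better
        "msne": (zscore ** 2).mean(),      # ideally close to 1
    }

# Toy values, invented for illustration only.
stats = cv_summary(
    observed=[1.0, 2.0, 3.0, 4.0],
    predicted=[1.5, 1.5, 3.5, 3.5],
    kriging_variance=[0.25, 0.25, 0.25, 0.25],
)
print(stats)
```

A mean square normalized error near 1 suggests the kriging variances are of the right size on average. It does not by itself show that the predictor is the best one available for non-Gaussian data.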
Edzer Pebesma wrote:
Hi Dave,

Dave Depew wrote:
Hi all,
A question for the more experienced geostats users....
I have a data set containing 2-3 variables relating to submerged plant characteristics inferred from acoustic survey. The distribution of the % cover variable is bounded (0-100) and highly left skewed (many 0's). The transect spacing is quite even, and I can't seem to notice much difference between a run of ordinary kriging and a variant of RK using a zero-inflated glm of the % cover residuals. None of the other co-variates show much correlation with the data (i.e. bottom depth, x and y). Is this a possible reason why OK and RK seem to give more or less the same predictions?
Well, yes, if there's not much of a trend, then RK will essentially simplify to OK.
My second question relates to transformation of the target variable... in this case zero-inflated distributions are difficult to transform. Is it really a requirement of kriging that the data be transformed? Or just that it will generally perform better with a target variable with a distribution close to normal?
I believe the argument is along the following lines: kriging is the BLUP in any case, but in case the data are normally distributed (around the trend), the BLUP (or more exactly the BLP, simple kriging) coincides with the conditional expectation, making it the best possible predictor. In other cases, meaning when data are not normally distributed, it is still the best linear predictor, but it may very well be that there are other, better, non-linear predictors that give a result much closer to the best predictor under those circumstances.
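
The coincidence of simple kriging with the Gaussian conditional expectation can be checked numerically. The sketch below is an illustration under assumptions that are not from the thread: an exponential covariance with made-up sill and range, a known constant mean, and invented coordinates and values. The first function solves the simple kriging system. The second computes the conditional mean and variance of a joint Gaussian by a different route, through the precision matrix.

```python
import numpy as np

def exp_cov(a, b, sill=1.0, range_par=2.0):
    """Exponential covariance between two sets of 2-D points (assumed model)."""
    d = np.linalg.norm(a[:, None, :] - b[None, :, :], axis=-1)
    return sill * np.exp(-d / range_par)

def simple_kriging(coords, values, target, mean):
    """Best linear predictor with known mean: solve C w = c0."""
    C = exp_cov(coords, coords)
    c0 = exp_cov(coords, target[None, :])[:, 0]
    weights = np.linalg.solve(C, c0)
    pred = mean + weights @ (values - mean)
    var = exp_cov(target[None, :], target[None, :])[0, 0] - weights @ c0
    return pred, var

def gaussian_conditional(coords, values, target, mean):
    """Conditional mean/variance of a joint Gaussian via its precision matrix."""
    all_pts = np.vstack([target[None, :], coords])  # target is index 0
    Q = np.linalg.inv(exp_cov(all_pts, all_pts))
    cond_var = 1.0 / Q[0, 0]
    cond_mean = mean - cond_var * (Q[0, 1:] @ (values - mean))
    return cond_mean, cond_var

# Invented observations and prediction location.
coords = np.array([[0.0, 0.0], [1.0, 0.0], [0.0, 1.0], [2.0, 2.0]])
values = np.array([1.2, 0.7, 1.9, 0.3])
target = np.array([0.5, 0.5])

sk = simple_kriging(coords, values, target, mean=1.0)
gc = gaussian_conditional(coords, values, target, mean=1.0)
print(sk, gc)  # the two pairs agree up to rounding
```

The agreement only says that the linear predictor is optimal if the field really is Gaussian. For zero-inflated data the same weights are still the best linear ones, but nothing guarantees they match the conditional expectation.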
If there is a transformation for that data that makes them multivariate Gaussian, then transforming and kriging on that scale is the way to go. A catch that has gotten very little attention is that transformation typically looks at marginal distributions, and not at multivariate distributions, the latter being pretty hard to check with only one realisation of the random field.
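
To illustrate what a marginal transformation does to data with many zeros, here is a rank-based normal-score transform. It is a sketch, not something proposed in the thread: the function name normal_scores, the tie handling (average ranks) and the toy % cover values are all assumptions.

```python
from statistics import NormalDist
import numpy as np

def normal_scores(values):
    """Map values to standard-normal quantiles by rank (ties share a rank)."""
    values = np.asarray(values, dtype=float)
    n = values.size
    order = np.argsort(values, kind="stable")
    ranks = np.empty(n)
    ranks[order] = np.arange(1, n + 1)
    # Give tied observations their average rank, so they stay tied.
    for v in np.unique(values):
        tied = values == v
        ranks[tied] = ranks[tied].mean()
    probs = (ranks - 0.5) / n
    inv_cdf = NormalDist().inv_cdf
    return np.array([inv_cdf(p) for p in probs])

# Invented % cover values with 60% zeros.
cover = np.array([0, 0, 0, 0, 0, 0, 5, 12, 30, 75], dtype=float)
scores = normal_scores(cover)
print(scores)
```

All the zeros map to one and the same score, so the spike at zero survives the transform and the marginal distribution is still not Gaussian. Even without ties, this kind of transform only looks at the marginal distribution and says nothing about whether the field is multivariate Gaussian.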
Cressie's book is a good source to read this stuff; I've lost my copy when I moved jobs in the spring.
--
Edzer


--
Edzer Pebesma
Institute for Geoinformatics (ifgi), University of Münster,
Weseler Straße 253, 48151 Münster, Germany.  Phone: +49 251
8333081, Fax: +49 251 8339763  http://ifgi.uni-muenster.de/

_______________________________________________
R-sig-Geo mailing list
R-sig-Geo@stat.math.ethz.ch
https://stat.ethz.ch/mailman/listinfo/r-sig-geo
