Re: [R-sig-Geo] Holdout Sampling Adaptive Bandwidth SPGWR

Paul Bidanset Fri, 30 Aug 2013 05:58:13 -0700

Roger,

I think all I would like to know is if it is possible to apply a calibrated
GWR model to a hold-out sample, and if so, what the most accurate way to do
so is. I understand the pitfalls of GWR but would like to learn as much as
I can before progressing to the next spatial methodology I learn in R.



On Fri, Aug 30, 2013 at 3:37 AM, Roger Bivand <roger.biv...@nhh.no> wrote:

> Paul, Luis,
>
> I suspect that your speculations are completely wrong-headed. Please
> provide a reproducible example with a built-in data set, so that there is
> at least minimal clarity in what you are guessing. Note in addition that
> GWR as a technique should not be used for anything other than exploration
> of possible mis-specification in the underlying model with the given data,
> as patterning in coefficients is induced by GWR for simulated covariates
> with no pattern.
>
> Roger
>
>
> On Fri, 30 Aug 2013, Luis Guerra wrote:
>
>  Thank you Luis. When calibrating the adaptive model, using adapt=t in the
>>> bandwidth selection created the proportion you speak of, which then
>>> allowed
>>> me to create a bandwidth matrix using gwr.adapt. However, this has not
>>> worked for me with holdout samples. Have you had success in this regard?
>>>
>>>  Now I get what you mean. Let's show an example:
>>
>> bw <- gwr.sel(var ~ var1, data=yourdata, adapt=TRUE)
>> m <- gwr(var~var1, data=yourdata, adapt=bw, fit.points=newdata)
>>
>> So an adaptative bandwidth (bw) is calculated based on"yourdata", while
>> you
>> are fitting "newdata" later on using that previously found bw. I had not
>> thought about it previously. Let's see whether someone else can help you
>> (us).
>>
>>
>>  I do not know the intended influence of these "fit.points". I would think
>>> that new localized regressions are not calculated, as we're testing the
>>> model and previous data points' ability to predict for these new ones,
>>> but
>>> I could be wrong. My current method, however, is producing much poorer
>>> results with the holdouts, which I am fairly sure is related to my
>>> inability to incorporate the new points necessary bandwidths.
>>>
>>>  Coming back to the previously created example, imagine that "newdata"
>> is a
>> single point that you want to fit. Imagine now that "yourdata" is a sample
>> with 1000 cases. Then you are getting 1000 models with 1000 different
>> intercepts and 1000 different beta values to adjust var1, rigth? Which of
>> all these parameters do you use for fitting "newdata"? And something else,
>> what would happen with "newdata" if it is enough far away from "yourdata"
>> and we would be using a fixed bandwidth?
>>
>>
>>
>>
>>
>>
>>
>>  On Aug 29, 2013 8:56 PM, "Luis Guerra" <luispelay...@gmail.com> wrote:
>>>
>>>  Dear Paul,
>>>>
>>>> I am dealing with this kind of problems right now, and if I am not
>>>> wrong,
>>>> when you want to apply an adaptative bandwidth, you should introduce a
>>>> value for the "adapt" parameter instead of for the "bandwidth"
>>>> parameter.
>>>> This value will be between 0 and 1 and indicates the proportion of cases
>>>> around your regression point that should be included to estimate each
>>>> local
>>>> model. So depending on the amount of points around each case, the model
>>>> will use a different bandwidth for each point to be fitted.
>>>>
>>>> Related to your question, do you know what is the influence of the data
>>>> introduced in the "data" parameter to the data to be fitted (introduced
>>>> in
>>>> the "fit.points" parameter)? I mean, you have to obtain new local models
>>>> (one for each point to be fitted), so I do not understand whether the
>>>> "data" parameter is used somehow...
>>>>
>>>> Best regards,
>>>>
>>>> Luis
>>>>
>>>>
>>>> On Fri, Aug 30, 2013 at 1:26 AM, Paul Bidanset <pbidan...@gmail.com
>>>> >wrote:
>>>>
>>>>  Hi Folks,
>>>>>
>>>>> I was curious if anyone has had experience applying an SPGWR model with
>>>>> an
>>>>> adaptive bandwidth matrix to a holdout or validation sample. I am using
>>>>> the
>>>>> "fit.points" command, which does not seem to allow for a new bandwidth
>>>>> calibrated around the holdout samples XY coordinates. Any direction
>>>>> would
>>>>> be greatly appreciated.  I am also open to other viable methods.
>>>>>
>>>>> Cheers,
>>>>>
>>>>> Paul
>>>>>
>>>>>         [[alternative HTML version deleted]]
>>>>>
>>>>> ______________________________**_________________
>>>>> R-sig-Geo mailing list
>>>>> R-sig-Geo@r-project.org
>>>>> https://stat.ethz.ch/mailman/**listinfo/r-sig-geo<https://stat.ethz.ch/mailman/listinfo/r-sig-geo>
>>>>>
>>>>>
>>>>
>>>>
>>         [[alternative HTML version deleted]]
>>
>> ______________________________**_________________
>> R-sig-Geo mailing list
>> R-sig-Geo@r-project.org
>> https://stat.ethz.ch/mailman/**listinfo/r-sig-geo<https://stat.ethz.ch/mailman/listinfo/r-sig-geo>
>>
>>
> --
> Roger Bivand
> Department of Economics, NHH Norwegian School of Economics,
> Helleveien 30, N-5045 Bergen, Norway.
> voice: +47 55 95 93 55; fax +47 55 95 95 43
> e-mail: roger.biv...@nhh.no
>
>


-- 
Paul Bidanset
(757) 412-9217
pbidan...@gmail.com

        [[alternative HTML version deleted]]

_______________________________________________
R-sig-Geo mailing list
R-sig-Geo@r-project.org
https://stat.ethz.ch/mailman/listinfo/r-sig-geo

Re: [R-sig-Geo] Holdout Sampling Adaptive Bandwidth SPGWR

Reply via email to