Hi Chenliang,
Will we hear from you tomorrow at 10AM Pacific, or in a few weeks when the
call is a better time for Asia-based callers?

-Greg

On Thu, Jan 14, 2016 at 8:18 AM, chenliang wang <hi181904...@msn.com> wrote:

> Cool!  I'd like to join the next discussion.
>
> Best,
> Chenliang Wang
>
>
> On 01/13/2016 06:36 PM, Greg Chase wrote:
>
>> As I said, our next call is not China-friendly:
>>
>> http://mail-archives.apache.org/mod_mbox/incubator-madlib-dev/201601.mbox/%3CCAMg1VtnKB-WoyVqCstfMNCcJVOn2HKQQ6wNfqdovhgnB7zd5cw%40mail.gmail.com%3E
>>
>> This is this Friday, 10AM Pacifc Standard Time which is 2AM Saturday
>> Beijing time.
>>
>> We will arrange a next call in a couple weeks at an Asia friendly time to
>> support contributors in Asia.
>>
>> However, if you make the next call, we will make time for you to talk :)
>>
>> Regards,
>>
>> -Greg
>>
>> On Wed, Jan 13, 2016 at 2:18 AM, Kuien Liu <k...@pivotal.io> wrote:
>>
>> Great, I would like to join it, please send me an invitation if possible.
>>>
>>> Cheers,
>>> Kuien Liu
>>>
>>> On Wed, Jan 13, 2016 at 6:10 PM, Greg Chase <gch...@gmail.com> wrote:
>>>
>>> Perhaps ChenLiang would like to join a call with the MADlib community and
>>>> discuss his contribution?
>>>>
>>>> We have a call this Friday 10AM PST which is not a friendly time for
>>>> China, but we can schedule a next call at a friendlier time.
>>>>
>>>> This email encrypted by tiny buttons & fat thumbs, beta voice
>>>> recognition, and autocorrect on my iPhone.
>>>>
>>>> On Jan 13, 2016, at 1:53 AM, Ivan Novick <inov...@pivotal.io> wrote:
>>>>>
>>>>> Cool!
>>>>>
>>>>> On Wed, Jan 13, 2016 at 5:52 PM, Kuien Liu <k...@pivotal.io> wrote:
>>>>>>
>>>>>> Got it, I think I can have a (f2f) talk with Chenliang Wang, as he was
>>>>>> graduated from an institute of CAS which is not far from our Beijing
>>>>>> office, and I am familiar with his supervisor and lab director. So I
>>>>>>
>>>>> think
>>>>
>>>>> it is highly possible to find him directly in Beijing.
>>>>>>
>>>>>> Cheers,
>>>>>> Kuien Liu
>>>>>>
>>>>>> On Wed, Jan 13, 2016 at 3:05 PM, Ivan Novick <inov...@pivotal.io>
>>>>>>>
>>>>>> wrote:
>>>>
>>>>> Hello ChenLiang,
>>>>>>>
>>>>>>> I have read your description of the interface and to my understanding
>>>>>>> this is a supervised machine learning algorithm that supports
>>>>>>> geometry
>>>>>>> data.  Am I correct?
>>>>>>>
>>>>>>> What could be a good industrial use case for this model for some
>>>>>>> examples?  Could you train a system based on locations and weather to
>>>>>>>
>>>>>> find
>>>>
>>>>> bad signals for cell phone?  Can you provide any real world example
>>>>>>> scenario where this type of model will be useful for end users?
>>>>>>>
>>>>>>> Also I am adding CC to some of my colleagues at work.  Kuien, Max,
>>>>>>> Yandong can you provide any feedback on this proposal from your Point
>>>>>>>
>>>>>> of
>>>>
>>>>> View?
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>> http://mail-archives.apache.org/mod_mbox/incubator-madlib-dev/201601.mbox/%3cblu175-w72199bca72716d8c1a99bf4...@phx.gbl%3E
>>>>
>>>>> Cheers,
>>>>>>> Ivan
>>>>>>>
>>>>>>>
>>>>>>> On Wed, Jan 13, 2016 at 11:20 AM, WangChenLiang <hi181904...@msn.com
>>>>>>> >
>>>>>>> wrote:
>>>>>>>
>>>>>>> Sorry, the link of attachment (http://1drv.ms/1ZjAiCg) is lost in
>>>>>>>>
>>>>>>> the
>>>>
>>>>> previous letter.
>>>>>>>>
>>>>>>>> From: hi181904...@msn.com
>>>>>>>>> To: dev@madlib.incubator.apache.org
>>>>>>>>> Subject: RE: How to contribute a spatial module to MADlib
>>>>>>>>>
>>>>>>>> manipulating
>>>>
>>>>> objects from PostGIS
>>>>>>>>
>>>>>>>>> Date: Wed, 13 Jan 2016 11:09:17 +0800
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> Hi   ,Caleb and Ivan!
>>>>>>>>>    Thanks for your attention and help. I reviewed the previous
>>>>>>>>> draft
>>>>>>>>>
>>>>>>>> and find
>>>>>>>>
>>>>>>>>> something inappropriate. The archive containing the new draft and
>>>>>>>>>
>>>>>>>> example code
>>>>>>>>
>>>>>>>>> is attached in the letter which would be more reasonable  than the
>>>>>>>>>
>>>>>>>> earlier edition.
>>>>>>>>
>>>>>>>>> Please go over the manuscript and give suggestion again .
>>>>>>>>> The following are my answers to Caleb's questions.
>>>>>>>>> - Does this function require PostGIS to also be
>>>>>>>>> installed? If yes, it would be better
>>>>>>>>> if we disable the function if
>>>>>>>>> PostGIS is not present rather than introduce PostGIS
>>>>>>>>> as a dependency. (Similar
>>>>>>>>> to what we do with our requirement on the xml module with our PMML
>>>>>>>>>
>>>>>>>> export
>>>>>>>>
>>>>>>>>> functionality).
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> A:Yes. I am trying to avoid
>>>>>>>>> input any spatial datatypes in the interface of GWR.
>>>>>>>>> But I have no
>>>>>>>>> idea if it is necessary to provide simple alternative when PostGIS
>>>>>>>>>
>>>>>>>> is
>>>>
>>>>> not
>>>>>>>>
>>>>>>>>> available.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> - What are the exact datatypes in the function
>>>>>>>>> definition for regression_location
>>>>>>>>> and prediction_location?
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> A:I changed the datatype
>>>>>>>>> to TEXT as the name of POINT or MULTIPOLYGON
>>>>>>>>> (centroid of
>>>>>>>>> each polygon for estimation for GWR).
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> - In the description it describes
>>>>>>>>> regression_location as "The length of
>>>>>>>>> regression_location must be equal to the length of
>>>>>>>>> source_table", which signals to me that it is likely intended to be
>>>>>>>>>
>>>>>>>> a
>>>>
>>>>> column of the source table? If not then how is
>>>>>>>>> this length represented?
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> A: In the previous
>>>>>>>>> interface, I was trying to input a geometry field which could be
>>>>>>>>> from another
>>>>>>>>> table having different row number. Now, I alter the argument
>>>>>>>>> definition and make it
>>>>>>>>> to TEXT. It must be the name of geometry field in the
>>>>>>>>> source table.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> - You didn't mark regression_location as
>>>>>>>>> (optional). Due to the way Postgres
>>>>>>>>> functions work all optional arguments
>>>>>>>>> must come after all required arguments,
>>>>>>>>> so having a non-optional argument in
>>>>>>>>> the middle of the optional list must be
>>>>>>>>> avoided.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> A:Thanks for
>>>>>>>>> reminding me of this mistake. It is really my fault. The order of
>>>>>>>>> argument is changed in this edition.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> - I haven't read through the literature, but it is
>>>>>>>>> not immediately clear to me why
>>>>>>>>> prediction_location is a parameter to
>>>>>>>>> gwregr_train() rather than gwregr_predict().
>>>>>>>>> Can you provide a brief
>>>>>>>>> description to the way that prediction_location is used in
>>>>>>>>> the model and its
>>>>>>>>> relationship to training and prediction.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> A: Actually,
>>>>>>>>> there are three kinds location data including location of sample
>>>>>>>>>
>>>>>>>> data,
>>>>
>>>>> regression and prediction in the modeling of GWR.
>>>>>>>>>
>>>>>>>>> Locations of sample data indicate where is sample
>>>>>>>>> data. Locations of regression
>>>>>>>>> indicate where regression should be conducted. If
>>>>>>>>> it is identical to data location
>>>>>>>>> (in most instances),diagnostic information can
>>>>>>>>> be calculated.
>>>>>>>>>
>>>>>>>>> Locations of
>>>>>>>>> prediction indicate where coefficients should be predicted. It
>>>>>>>>>
>>>>>>>> should
>>>>
>>>>> be a
>>>>>>>>
>>>>>>>>> parameter for a predict function. Putting regression_location into
>>>>>>>>>
>>>>>>>> training
>>>>>>>>
>>>>>>>>> function is just for omitting kernel arguments and maybe not
>>>>>>>>>
>>>>>>>> appropriate.  In the process of
>>>>>>>>
>>>>>>>>> training, GWR estimates weight and coefficients with distance
>>>>>>>>> between data_loctions and regression_loctions. Then, diagnostic
>>>>>>>>>
>>>>>>>> information are
>>>>>>>>
>>>>>>>>> estimated when these two locations are identical. We can treat
>>>>>>>>>
>>>>>>>> data_locationas regression_location to simplify the process not
>>>>>>>>
>>>>>>> taking
>>>>
>>>>> different locations from
>>>>>>>>
>>>>>>>>> data location in the training step.
>>>>>>>>>
>>>>>>>>>   In the process of
>>>>>>>>> prediction , there are two new information including new
>>>>>>>>> independent variables and new locations. Therefore, coefficients
>>>>>>>>> and
>>>>>>>>>
>>>>>>>> weight
>>>>>>>>
>>>>>>>>> vector must be estimated
>>>>>>>>> again. GWR can
>>>>>>>>> estimate coefficients in any positions
>>>>>>>>> using independent variables of sample data.
>>>>>>>>> If we also provide independent
>>>>>>>>> variables in any positions,we can also obtain
>>>>>>>>> dependent variable in any position. So if we treat coefficients at
>>>>>>>>>
>>>>>>>> prediction_location as a training result to put
>>>>>>>>
>>>>>>>>> coefficients into prediction
>>>>>>>>> directly, it is reasonable to put it into training function. But if
>>>>>>>>>
>>>>>>>> we
>>>>
>>>>> treat it as a part of prediction, it is appropriate to set
>>>>>>>> predicton_location within predict function. And then, prediction
>>>>>>>>
>>>>>>> function
>>>>
>>>>> must require kernel
>>>>>>>>
>>>>>>>>> parameters in addition to new data and locations for prediction.
>>>>>>>>>
>>>>>>>> Maybe
>>>>
>>>>> this way
>>>>>>>>
>>>>>>>>> is more clear
>>>>>>>>> and reasonable, and is similar with others GWR packages in R.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>        I
>>>>>>>>> rewrote the description of interface taking your suggestion into
>>>>>>>>>
>>>>>>>> account. I
>>>>>>>>
>>>>>>>>> moved
>>>>>>>>> prediction_location into predict function and modified
>>>>>>>>> some mistake and
>>>>>>>>> unnecessary arguments.  The new draft of interface design is
>>>>>>>>>
>>>>>>>> attached
>>>>
>>>>> in the
>>>>>>>>
>>>>>>>>> letter.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> Regards,
>>>>>>>>>
>>>>>>>>> ChenLiang Wang
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> From: cwel...@pivotal.io
>>>>>>>>>> Date: Tue, 5 Jan 2016 10:31:20 -0800
>>>>>>>>>> Subject: Re: How to contribute a spatial module to MADlib
>>>>>>>>>>
>>>>>>>>> manipulating objects from PostGIS
>>>>>>>>
>>>>>>>>> To: dev@madlib.incubator.apache.org
>>>>>>>>>>
>>>>>>>>>> Hi ChenLiang,
>>>>>>>>>>
>>>>>>>>>> Thanks for taking the next step to flush this out.
>>>>>>>>>>
>>>>>>>>>> As a whole:
>>>>>>>>>> - naming and basic interface seems consistent with existing
>>>>>>>>>>
>>>>>>>>> conventions.
>>>>>>>>
>>>>>>>>> - names are descriptive.
>>>>>>>>>> - references to the literature is provided.
>>>>>>>>>> - functionality is complementary to the library.
>>>>>>>>>>
>>>>>>>>>> What is not clear to me is:
>>>>>>>>>> - Does this function require PostGIS to also be installed?  If
>>>>>>>>>> yes,
>>>>>>>>>>
>>>>>>>>> it
>>>>>>>>
>>>>>>>>> would be better if we disable the function if PostGIS is not
>>>>>>>>>>
>>>>>>>>> present
>>>>
>>>>> rather
>>>>>>>>
>>>>>>>>> than introduce PostGIS as a dependency.  (Similar to what we do
>>>>>>>>>>
>>>>>>>>> with
>>>>
>>>>> our
>>>>>>>>
>>>>>>>>> requirement on the xml module with our PMML export functionality).
>>>>>>>>>> - What are the exact datatypes in the function definition for
>>>>>>>>>> regression_location and prediction_location?
>>>>>>>>>> - In the description it describes regression_location as "The
>>>>>>>>>>
>>>>>>>>> length
>>>>
>>>>> of
>>>>>>>>
>>>>>>>>> regression_location must be equal to the length of source_table",
>>>>>>>>>>
>>>>>>>>> which
>>>>>>>>
>>>>>>>>> signals to me that it is likely intended to be a column of the
>>>>>>>>>>
>>>>>>>>> source
>>>>
>>>>> table?  If not then how is this length represented?
>>>>>>>>>> - You didn't mark regression_location as (optional).  Due to the
>>>>>>>>>>
>>>>>>>>> way
>>>>
>>>>> Postgres functions work all optional arguments must come after all
>>>>>>>>>>
>>>>>>>>> required
>>>>>>>>
>>>>>>>>> arguments, so having a non-optional argument in the middle of the
>>>>>>>>>>
>>>>>>>>> optional
>>>>>>>>
>>>>>>>>> list must be avoided.
>>>>>>>>>> - I haven't read through the literature, but it is not immediately
>>>>>>>>>>
>>>>>>>>> clear to
>>>>>>>>
>>>>>>>>> me why prediction_location is a parameter to gwregr_train() rather
>>>>>>>>>>
>>>>>>>>> than
>>>>>>>>
>>>>>>>>> gwregr_predict().  Can you provide a brief description to the way
>>>>>>>>>>
>>>>>>>>> that
>>>>>>>>
>>>>>>>>> prediction_location is used in the model and its relationship to
>>>>>>>>>>
>>>>>>>>> training
>>>>>>>>
>>>>>>>>> and prediction.
>>>>>>>>>>
>>>>>>>>>> Regards,
>>>>>>>>>>   Caleb
>>>>>>>>>>
>>>>>>>>>                                     ChenLiang 要与你在 OneDrive
>>>>>>>>>
>>>>>>>> 上共享一个文件。要查看该文件,请单击下面的链接。
>>>>>>>>                                                     gwr4madlib.rar
>>>>>>>>
>>>>>>>
>>>
>

Reply via email to