Can you arrange a time around this week? Please check with Nirmal too. On Sun, Jun 28, 2015 at 9:31 AM, Danula Eranjith <hmdanu...@gmail.com> wrote:
> Hi all, > > No, We haven't done a review yet. > It would be great if we could have one so that I can discuss with you all > and clarify the next steps of the implementation as you mentioned. > > Thanks > Danula > > On Sun, Jun 28, 2015 at 9:25 AM, Supun Sethunga <sup...@wso2.com> wrote: > >> Hi Danula, >> >> Did we have a review for the work done so far? If not, shall we have a >> one? We can clear out any doubts and issues as well.. >> >> Thanks, >> Supun >> >> On Wed, Jun 24, 2015 at 6:42 AM, Nirmal Fernando <nir...@wso2.com> wrote: >> >>> Hi Danula, >>> >>> Thanks for the update, keep them coming. >>> >>> On a JavaRDD you can perform a collect() to get a list, AFAIR. Yes, this >>> is costly, since it would load whole dataset into memory. So, is this an >>> operation which involves multiple rows? >>> >>> On Tue, Jun 23, 2015 at 2:15 PM, Danula Eranjith <hmdanu...@gmail.com> >>> wrote: >>> >>>> Hi Supun, >>>> >>>> I modified the "Fill" operation to add what you mentioned. >>>> >>>> I used a workaround to to implement certain parts of the operations >>>> such as filling with values from rows above and below. >>>> I created a List Implementation using toArray() method in JavaRDD and >>>> then converted it back to a JavaRDD after the operation. >>>> >>>> This will be inefficient (in terms of both memory and time) when >>>> working with very large data sets. But I think its important to have these >>>> features included. Otherwise a user would be left with very limited set of >>>> operations. >>>> >>>> Please let me know if you have a different opinion on this. >>>> >>>> Thanks, >>>> Danula >>>> >>>> On Tue, Jun 16, 2015 at 9:44 AM, Supun Sethunga <sup...@wso2.com> >>>> wrote: >>>> >>>>> Somehow there are issues in implementing certain wrangler functions >>>>>> due to limitations in JavaRDD used in spark >>>>>> e.g. - >>>>>> Fill operation - when filling with values from rows above and below >>>>>> Fold operation >>>>> >>>>> >>>>> Agree, since rows will get executed randomly with spark, inter-row >>>>> operations are not very meaningful. >>>>> But you can slightly modify the implementation of the "Fill" >>>>> operation, such as, to fill values based on an >>>>> expression/static-value/mean >>>>> etc. (not depending on other rows).. >>>>> >>>>> Thanks, >>>>> Supun >>>>> >>>>> On Tue, Jun 16, 2015 at 9:27 AM, Supun Sethunga <sup...@wso2.com> >>>>> wrote: >>>>> >>>>>> Hi Danula, >>>>>> >>>>>> Sorry for the late reply. Have you got the details you were looking >>>>>> for? >>>>>> >>>>>> It would be great if I could get to know which wrangler operations >>>>>>> are important for a user of the ML >>>>>> >>>>>> >>>>>> Other than the ones you have mentioned in the proposal, think its >>>>>> better to have "Translate" operation as well (to create a new column >>>>>> based on an existing column). >>>>>> >>>>>> Thanks, >>>>>> Supun >>>>>> >>>>>> >>>>>> >>>>>> On Thu, Jun 4, 2015 at 10:11 PM, Danula Eranjith <hmdanu...@gmail.com >>>>>> > wrote: >>>>>> >>>>>>> Hi all, >>>>>>> >>>>>>> I am currently working on generating spark transformations related >>>>>>> to the operations available in the data wrangler. >>>>>>> >>>>>>> Data wrangler provides sufficient parameters to re-create these at >>>>>>> spark.I have successfully implemented delete and split operations of >>>>>>> wrangler in spark. >>>>>>> >>>>>>> Once this phase is completed, I can either directly generate these >>>>>>> scripts at wrangler or use the javascript output and convert it to spark >>>>>>> depending on the implementation. >>>>>>> >>>>>>> Somehow there are issues in implementing certain wrangler functions >>>>>>> due to limitations in JavaRDD used in spark >>>>>>> >>>>>>> e.g. - >>>>>>> Fill operation - when filling with values from rows above and below >>>>>>> Fold operation >>>>>>> >>>>>>> It would be great if I could get to know which wrangler operations >>>>>>> are important for a user of the ML >>>>>>> >>>>>>> Thanks, >>>>>>> Danula >>>>>>> >>>>>>> On Wed, Jun 3, 2015 at 8:30 AM, Nirmal Fernando <nir...@wso2.com> >>>>>>> wrote: >>>>>>> >>>>>>>> Hi Danula, >>>>>>>> >>>>>>>> Please send an update of your work thus far. >>>>>>>> >>>>>>>> On Sun, May 10, 2015 at 2:30 PM, Nirmal Fernando <nir...@wso2.com> >>>>>>>> wrote: >>>>>>>> >>>>>>>>> Hi Danula, >>>>>>>>> >>>>>>>>> Welcome to GSoC 15' ! Can you do some research on directly >>>>>>>>> generating spark transformations using Wrangler and come up with a >>>>>>>>> summary ? >>>>>>>>> >>>>>>>>> On Fri, May 8, 2015 at 11:03 AM, Danula Eranjith < >>>>>>>>> hmdanu...@gmail.com> wrote: >>>>>>>>> >>>>>>>>>> Hi all, >>>>>>>>>> >>>>>>>>>> Thank you for selecting my proposal [1] >>>>>>>>>> <https://docs.google.com/document/d/18NFa23CrhXqnHrkl_AuRz3sQ3Axg7SEmiA7l66Hl9_0/edit?usp=sharing> >>>>>>>>>> for GSoC 2015. I am really looking forward to work with you all and >>>>>>>>>> contribute to WSO2. >>>>>>>>>> >>>>>>>>>> I have already completed my primary research on wrangler and >>>>>>>>>> would like to meet you to get feedback on the proposed architecture. >>>>>>>>>> I am >>>>>>>>>> planning to start working on the project before 25th of May. >>>>>>>>>> >>>>>>>>>> Thank you, >>>>>>>>>> Danula >>>>>>>>>> >>>>>>>>>> [1] - >>>>>>>>>> https://docs.google.com/document/d/18NFa23CrhXqnHrkl_AuRz3sQ3Axg7SEmiA7l66Hl9_0/edit?usp=sharing >>>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> -- >>>>>>>>> >>>>>>>>> Thanks & regards, >>>>>>>>> Nirmal >>>>>>>>> >>>>>>>>> Associate Technical Lead - Data Technologies Team, WSO2 Inc. >>>>>>>>> Mobile: +94715779733 >>>>>>>>> Blog: http://nirmalfdo.blogspot.com/ >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> >>>>>>>> Thanks & regards, >>>>>>>> Nirmal >>>>>>>> >>>>>>>> Associate Technical Lead - Data Technologies Team, WSO2 Inc. >>>>>>>> Mobile: +94715779733 >>>>>>>> Blog: http://nirmalfdo.blogspot.com/ >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> *Supun Sethunga* >>>>>> Software Engineer >>>>>> WSO2, Inc. >>>>>> http://wso2.com/ >>>>>> lean | enterprise | middleware >>>>>> Mobile : +94 716546324 >>>>>> >>>>> >>>>> >>>>> >>>>> -- >>>>> *Supun Sethunga* >>>>> Software Engineer >>>>> WSO2, Inc. >>>>> http://wso2.com/ >>>>> lean | enterprise | middleware >>>>> Mobile : +94 716546324 >>>>> >>>> >>>> >>> >>> >>> -- >>> >>> Thanks & regards, >>> Nirmal >>> >>> Associate Technical Lead - Data Technologies Team, WSO2 Inc. >>> Mobile: +94715779733 >>> Blog: http://nirmalfdo.blogspot.com/ >>> >>> >>> >> >> >> -- >> *Supun Sethunga* >> Software Engineer >> WSO2, Inc. >> http://wso2.com/ >> lean | enterprise | middleware >> Mobile : +94 716546324 >> > > -- *Supun Sethunga* Software Engineer WSO2, Inc. http://wso2.com/ lean | enterprise | middleware Mobile : +94 716546324
_______________________________________________ Dev mailing list Dev@wso2.org http://wso2.org/cgi-bin/mailman/listinfo/dev