Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-08-20 Thread Supun Sethunga
Hi Danula, As the pencil down date is tomorrow, shall we have a final demo on the project after that? Glad if you can set up the meeting anytime between 5am - 8am OR after 8pm *EST*. Thanks, Supun On Fri, Aug 14, 2015 at 4:40 AM, Nirmal Fernando nir...@wso2.com wrote: On Fri, Aug 14, 2015

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-08-14 Thread Danula Eranjith
Hi Nirmal, I have changed the structure so that operations could be recorded at one point using the JavaScript and then executed later by calling a method in the Wrangler class. public void test(JavaRDD<String[]> data, String scriptPath) public JavaRDD<String[]> executeOperations(JavaSparkContext

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-08-14 Thread Nirmal Fernando
On Fri, Aug 14, 2015 at 1:25 PM, Danula Eranjith hmdanu...@gmail.com wrote: Hi Nirmal, I have changed the structure so that operations could be recorded at one point using the JavaScript and then executed later by calling a method in the Wrangler class. public void test(JavaRDD<String[]>

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-08-10 Thread Supun Sethunga
Any update? On Fri, Aug 7, 2015 at 10:13 AM, Supun Sethunga sup...@wso2.com wrote: Hi Danula, Sorry I couldn't join the meeting. Can you please share the meeting/review notes? Also the progress on the suggestions and what is left to be done overall? Thanks, Supun On Wed, Aug 5, 2015

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-08-10 Thread Danula Eranjith
Hi Supun, The following points were discussed in the meeting *Integration to ML* We decided to add the wrangler interface as the first step considering the current ML implementation. So the steps from a user's perspective would be as follows - A sample from the dataset will be sent to wrangler

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-08-07 Thread Supun Sethunga
Hi Danula, Sorry I couldn't join the meeting. Can you please share the meeting/review notes? Also the progress on the suggestions and what is left to be done overall? Thanks, Supun On Wed, Aug 5, 2015 at 3:47 AM, Nirmal Fernando nir...@wso2.com wrote: Hi Danula, It should be a

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-08-05 Thread Nirmal Fernando
Hi Danula, It should be a JavaRDD<String[]>, where each row represents the feature vector as a String[]. On Tue, Aug 4, 2015 at 11:51 AM, Danula Eranjith hmdanu...@gmail.com wrote: In other words, what would be the preferred output type for a dataset which is pre-processed by wrangler? As I
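
To illustrate why a String[] row works as a neutral output type, here is a minimal sketch of how a downstream algorithm might turn one such row into a numeric vector. The class and method names are hypothetical and are not taken from the WSO2 ML or wrangler-integration source; in Spark this function would be applied per row with map(), which is omitted here so the example runs on plain Java.

```java
// Hypothetical helper, not part of WSO2 ML: converts one pre-processed row
// (as it would appear inside a JavaRDD<String[]>) into a numeric vector.
final class FeatureRows {

    /**
     * Parses every column of the row as a double.
     * Assumes all columns are numeric; a non-numeric value makes
     * Double.parseDouble throw NumberFormatException.
     */
    static double[] toFeatureVector(String[] row) {
        double[] vector = new double[row.length];
        for (int i = 0; i < row.length; i++) {
            vector[i] = Double.parseDouble(row[i].trim());
        }
        return vector;
    }
}
```

Keeping rows as strings leaves the choice of numeric encoding to each algorithm, at the cost of parsing the values again downstream.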

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-08-04 Thread Danula Eranjith
Hi Nirmal, In ML, what is the preferred way of keeping data in a single row of JavaRDD? As I have figured it depends on the algorithm being used. Danula On Thu, Jul 30, 2015 at 9:14 AM, Nirmal Fernando nir...@wso2.com wrote: Thanks Danula, I'll send an invite. On Wed, Jul 29, 2015 at 10:24

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-08-04 Thread Danula Eranjith
In other words, what would be the preferred output type for a dataset which is pre-processed by wrangler? As I have observed, different algorithms use different JavaRDD types as input (JavaRDD<String>, JavaRDD<Vector>, etc.). On Tue, Aug 4, 2015 at 11:48 AM, Nirmal Fernando nir...@wso2.com wrote: Hi

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-08-04 Thread Nirmal Fernando
Hi Danula, On Tue, Aug 4, 2015 at 11:47 AM, Danula Eranjith hmdanu...@gmail.com wrote: Hi Nirmal, In ML, what is the preferred way of keeping data in a single row of JavaRDD? I didn't quite get your question. Can you elaborate please? As I have figured it depends on the algorithm being

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-07-29 Thread Danula Eranjith
Hi Nirmal, I am available after 1.30pm on Tuesday, Wednesday and Thursday. Danula On Wed, Jul 29, 2015 at 12:10 PM, Nirmal Fernando nir...@wso2.com wrote: Hi Danula, Can we arrange a demo/review sometime next week? Please let me know a few time slots. On Thu, Jul 23, 2015 at 11:47 AM,

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-07-29 Thread Nirmal Fernando
Thanks Danula, I'll send an invite. On Wed, Jul 29, 2015 at 10:24 PM, Danula Eranjith hmdanu...@gmail.com wrote: Hi Nirmal, I am available after 1.30pm on Tuesday, Wednesday and Thursday. Danula On Wed, Jul 29, 2015 at 12:10 PM, Nirmal Fernando nir...@wso2.com wrote: Hi Danula, Can we

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-07-29 Thread Nirmal Fernando
Hi Danula, Can we arrange a demo/review sometime next week? Please let me know a few time slots. On Thu, Jul 23, 2015 at 11:47 AM, Nirmal Fernando nir...@wso2.com wrote: Thanks Danula. On Thu, Jul 23, 2015 at 11:41 AM, Danula Eranjith hmdanu...@gmail.com wrote: You can find the source at

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-07-23 Thread Nirmal Fernando
Thanks Danula. On Thu, Jul 23, 2015 at 11:41 AM, Danula Eranjith hmdanu...@gmail.com wrote: You can find the source at [1] https://github.com/danula/wso2-ml-wrangler-integration. I have to do some refactoring when integrating to ML. [1] -

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-07-19 Thread Danula Eranjith
Hi Nirmal, Would it be possible to get some sample data sets which are more likely to be pre-processed using wrangler? I am currently testing my implementations against small and more general data sets. I have checked datasets available at [1]

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-07-16 Thread Danula Eranjith
Hi all, Sorry for not keeping you in the loop. After considering and experimenting with several options, I am using the JavaScript code generated by wrangler to implement them using Spark. I have used regular expressions to extract the operations, parameters and values and mapped them to spark
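
The regular-expression approach described in this message can be sketched roughly as follows. This is an illustration only, not the project's code: the class name is hypothetical, and the script format (one `w.add(dw.<operation>().<param>(<value>)...)` call per line) is an assumption about what wrangler exports.

```java
import java.util.ArrayList;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical parser; the input format is an assumed shape of the
// JavaScript that wrangler generates, one operation per line.
final class WranglerScriptParser {

    // Operation name, e.g. "split" in "dw.split()".
    private static final Pattern OPERATION = Pattern.compile("dw\\.(\\w+)\\(\\)");
    // Chained setters, e.g. .column(["data"]) or .max(0).
    // Limitation: values that themselves contain parentheses are not matched.
    private static final Pattern PARAMETER = Pattern.compile("\\.(\\w+)\\(([^()]*)\\)");

    /** One recorded operation: its name and its parameter/value pairs in order. */
    static final class Operation {
        final String name;
        final Map<String, String> parameters = new LinkedHashMap<>();

        Operation(String name) {
            this.name = name;
        }
    }

    /** Extracts the operations from the script; lines without an operation are skipped. */
    static List<Operation> parse(String script) {
        List<Operation> operations = new ArrayList<>();
        for (String line : script.split("\\R")) {
            Matcher op = OPERATION.matcher(line);
            if (!op.find()) {
                continue;
            }
            Operation operation = new Operation(op.group(1));
            // Only look for parameters after the operation call itself.
            Matcher param = PARAMETER.matcher(line);
            param.region(op.end(), line.length());
            while (param.find()) {
                operation.parameters.put(param.group(1), param.group(2));
            }
            operations.add(operation);
        }
        return operations;
    }
}
```

Each parsed operation could then be dispatched on its name to the matching Spark transformation. Raw parameter values are kept as strings here, so quotes and brackets would still need to be stripped before use.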

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-07-16 Thread Nirmal Fernando
Thanks Danula. On Thu, Jul 16, 2015 at 10:07 PM, Danula Eranjith hmdanu...@gmail.com wrote: Hi all, Sorry for not keeping you in the loop. After considering and experimenting with several options, I am using the JavaScript code generated by wrangler to implement them using Spark. I have

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-07-15 Thread Supun Sethunga
Hi Danula, Any update on the progress? Did you manage to integrate the transformations with the wrangler? Thanks, On Thu, Jul 2, 2015 at 11:38 AM, Danula Eranjith hmdanu...@gmail.com wrote: Hi all, Update on the current progress of the project and future activities as we discussed at the

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-07-15 Thread Nirmal Fernando
Hi Danula, Please send an update at least every week. On Wed, Jul 15, 2015 at 5:51 PM, Supun Sethunga sup...@wso2.com wrote: Hi Danula, Any update on the progress? Did you manage to integrate the transformations with the wrangler? Thanks, On Thu, Jul 2, 2015 at 11:38 AM, Danula

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-07-02 Thread Danula Eranjith
Hi all, Update on the current progress of the project and future activities as we discussed at the recent meeting. *Current Progress* I have completed the phase of creating spark transformations relevant to operations available in wrangler. Operations implemented - Fill - Split - Drop - Delete

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-27 Thread Supun Sethunga
Can you arrange a time around this week? Please check with Nirmal too. On Sun, Jun 28, 2015 at 9:31 AM, Danula Eranjith hmdanu...@gmail.com wrote: Hi all, No, We haven't done a review yet. It would be great if we could have one so that I can discuss with you all and clarify the next steps

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-27 Thread Danula Eranjith
Correction: Actually I am free only from 12.15 pm to 1.15 pm. But can make myself available from 10.15 am to 12.15 pm if the previous time is not feasible. On Sun, Jun 28, 2015 at 11:28 AM, Danula Eranjith hmdanu...@gmail.com wrote: Actually I am free only from 12.15 am to 1.15 am. But can make

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-27 Thread Nirmal Fernando
Cool, thanks. Will send an invite. On Sun, Jun 28, 2015 at 11:18 AM, Danula Eranjith hmdanu...@gmail.com wrote: Okay Sure. We can have a hangout On Sun, Jun 28, 2015 at 11:15 AM, Nirmal Fernando nir...@wso2.com wrote: It'll be good if we can have it before mid evaluations. If you can't

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-27 Thread Nirmal Fernando
Let us know feasible time slots for tomorrow please. On Sun, Jun 28, 2015 at 11:20 AM, Nirmal Fernando nir...@wso2.com wrote: Cool, thanks. Will send an invite. On Sun, Jun 28, 2015 at 11:18 AM, Danula Eranjith hmdanu...@gmail.com wrote: Okay Sure. We can have a hangout On Sun, Jun 28,

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-27 Thread Danula Eranjith
Hi all, No, We haven't done a review yet. It would be great if we could have one so that I can discuss with you all and clarify the next steps of the implementation as you mentioned. Thanks Danula On Sun, Jun 28, 2015 at 9:25 AM, Supun Sethunga sup...@wso2.com wrote: Hi Danula, Did we have

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-27 Thread Danula Eranjith
Okay Sure. We can have a hangout On Sun, Jun 28, 2015 at 11:15 AM, Nirmal Fernando nir...@wso2.com wrote: It'll be good if we can have it before mid evaluations. If you can't make it to Trace, we can have a hangout? On Sun, Jun 28, 2015 at 11:11 AM, Danula Eranjith hmdanu...@gmail.com

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-27 Thread Nirmal Fernando
+1 shall we have it tomorrow at Trace? On Sun, Jun 28, 2015 at 9:45 AM, Supun Sethunga sup...@wso2.com wrote: Can you arrange a time around this week? Please check with Nirmal too. On Sun, Jun 28, 2015 at 9:31 AM, Danula Eranjith hmdanu...@gmail.com wrote: Hi all, No, We haven't done a

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-27 Thread Danula Eranjith
It would be difficult for me to make it tomorrow. How about Thursday (02/07) at Trace? anytime after 11.30 am would be great. On Sun, Jun 28, 2015 at 10:09 AM, Nirmal Fernando nir...@wso2.com wrote: +1 shall we have it tomorrow at Trace? On Sun, Jun 28, 2015 at 9:45 AM, Supun Sethunga

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-27 Thread Nirmal Fernando
It'll be good if we can have it before mid evaluations. If you can't make it to Trace, we can have a hangout? On Sun, Jun 28, 2015 at 11:11 AM, Danula Eranjith hmdanu...@gmail.com wrote: It would be difficult for me to make it tomorrow. How about Thursday (02/07) at Trace? anytime after 11.30

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-27 Thread Danula Eranjith
Actually I am free only from 12.15 am to 1.15 am. But can make myself available from 10.15 am to 12.15 am if the previous time is not feasible. On Sun, Jun 28, 2015 at 11:21 AM, Nirmal Fernando nir...@wso2.com wrote: Let us know feasible time slots for tomorrow please. On Sun, Jun 28, 2015 at

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-23 Thread Danula Eranjith
Hi Supun, I modified the Fill operation to add what you mentioned. I used a workaround to implement certain parts of the operations such as filling with values from rows above and below. I created a List implementation using the toArray() method in JavaRDD and then converted it back to a JavaRDD
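
The in-memory part of such a workaround might look like the sketch below: once the rows are in an ordered list, a "fill from the row above" pass is a single loop. The class and method names are hypothetical. The steps that pull the rows out of the JavaRDD and turn the result back into one are left out, and treating null or empty strings as missing values is an assumption.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of the list-based step only; moving data between
// the JavaRDD and the list is not shown.
final class FillDown {

    /**
     * Returns a copy of the rows in which missing values (null or empty,
     * by assumption) in the given column are replaced with the closest
     * non-missing value above. Leading missing values are left unchanged.
     */
    static List<String[]> fillDown(List<String[]> rows, int column) {
        List<String[]> filled = new ArrayList<>(rows.size());
        String lastSeen = null;
        for (String[] row : rows) {
            String[] copy = row.clone(); // do not mutate the caller's rows
            String value = copy[column];
            if (value == null || value.isEmpty()) {
                if (lastSeen != null) {
                    copy[column] = lastSeen;
                }
            } else {
                lastSeen = value;
            }
            filled.add(copy);
        }
        return filled;
    }
}
```

This relies on row order, which is why it is done on a list here; it also means the whole dataset has to fit in memory on one machine.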

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-23 Thread Danula Eranjith
Hi all, I have completed implementing the wrangler operations as spark transformations. I am currently working on linking these operations with wrangler. Thanks, Danula On Wed, Jun 17, 2015 at 10:25 AM, Nirmal Fernando nir...@wso2.com wrote: Danula, Can you please send an update on the

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-23 Thread Nirmal Fernando
Hi Danula, Thanks for the update, keep them coming. On a JavaRDD you can perform a collect() to get a list, AFAIR. Yes, this is costly, since it would load the whole dataset into memory. So, is this an operation which involves multiple rows? On Tue, Jun 23, 2015 at 2:15 PM, Danula Eranjith

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-16 Thread Nirmal Fernando
Danula, Can you please send an update on the status of the project? On Tue, Jun 16, 2015 at 9:44 AM, Supun Sethunga sup...@wso2.com wrote: Somehow there are issues in implementing certain wrangler functions due to limitations in JavaRDD used in spark e.g. - Fill operation - when filling

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-15 Thread Supun Sethunga
Hi Danula, Sorry for the late reply. Have you got the details you were looking for? It would be great if I could get to know which wrangler operations are important for a user of the ML. Other than the ones you have mentioned in the proposal, I think it's better to have the Translate operation as

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-15 Thread Supun Sethunga
Somehow there are issues in implementing certain wrangler functions due to limitations in JavaRDD used in spark e.g. - Fill operation - when filling with values from rows above and below Fold operation Agree, since rows will get executed randomly with spark, inter-row operations are not

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-04 Thread Danula Eranjith
Hi all, I am currently working on generating Spark transformations related to the operations available in the data wrangler. Data wrangler provides sufficient parameters to re-create these in Spark. I have successfully implemented the delete and split operations of wrangler in Spark. Once this phase

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-06-02 Thread Nirmal Fernando
Hi Danula, Please send an update of your work thus far. On Sun, May 10, 2015 at 2:30 PM, Nirmal Fernando nir...@wso2.com wrote: Hi Danula, Welcome to GSoC '15! Can you do some research on directly generating Spark transformations using Wrangler and come up with a summary? On Fri, May 8,

Re: [Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-05-10 Thread Nirmal Fernando
Hi Danula, Welcome to GSoC '15! Can you do some research on directly generating Spark transformations using Wrangler and come up with a summary? On Fri, May 8, 2015 at 11:03 AM, Danula Eranjith hmdanu...@gmail.com wrote: Hi all, Thank you for selecting my proposal [1]

[Dev] [GSoC-2015] Data Wrangler extension for WSO2 Machine Learner

2015-05-07 Thread Danula Eranjith
Hi all, Thank you for selecting my proposal [1] https://docs.google.com/document/d/18NFa23CrhXqnHrkl_AuRz3sQ3Axg7SEmiA7l66Hl9_0/edit?usp=sharing for GSoC 2015. I am really looking forward to working with you all and contributing to WSO2. I have already completed my primary research on wrangler and