Hi Maheshakya, As you requested i can change other parameters as well such as feature size(p). Initially i did it with p=3;sure thing. Anyway you can see and run the code if you want. source is at [1]. the test timing is called with random data as you requested if you set args[0] to 1. And you can find the extension and streaming algorithms in gsoc/ directiry[2]. thank you. BR, Mahesh. [1] https://github.com/dananjayamahesh/GSOC2016/blob/master/spark-examples/first-example/src/main/java/org/sparkexample/StreamingLinearRegression.java [2] https://github.com/dananjayamahesh/GSOC2016/tree/master/gsoc
On Sat, Jun 4, 2016 at 10:39 AM, Mahesh Dananjaya <dananjayamah...@gmail.com > wrote: > Hi supun, > Though i pushed it yesterday, there was some problems with the network. > now you can see them in the repo location [1].I added some Matlab plot you > can see the patter there.you can use ml also. Ok sure thing. I can prepare > a report or else blog if you want. files are as follows. The y axis is in > ns and x axis is in batch size. And also i added two pplots as jpegs[2], so > you can easily compare. > lr_timing_1000.txt -> batch size incremented by 1000 > lr_timing_10000.txt -> batch size incremented by 10000 > lr_timing_power10.txt -> batch size incremented by power of 10 > > In here independent variable is only tha batch size.If you want i can send > you making other parameters such as step size, number of iteration, feature > vector size as independent variables. please let me know if you want > further info. thank you. > regards, > Mahesh. > > > [1 > ]https://github.com/dananjayamahesh/GSOC2016/tree/master/spark-examples/first-example/output > [2] > https://github.com/dananjayamahesh/GSOC2016/blob/master/spark-examples/first-example/output/lr_timing_10000.jpg > > On Sat, Jun 4, 2016 at 9:58 AM, Supun Sethunga <sup...@wso2.com> wrote: > >> Hi Mahesh, >> >> I have added those timing reports to my repo [1]. >> >> Whats the file name? :) >> >> Btw, can you compile simple doc (gdoc) with the above results, and bring >> everything to one place? That way it is easy to compare, and keep track. >> >> Thanks, >> Supun >> >> On Fri, Jun 3, 2016 at 7:23 PM, Mahesh Dananjaya < >> dananjayamah...@gmail.com> wrote: >> >>> Hi Maheshkya, >>> I have added those timing reports to my repo [1].please have a look at. >>> three files are there. one is using incremet as 1000 for batch sizes >>> (lr_timing_1000). Otherone is using incremet by 10000 (lr_timing_10000) >>> upto 1 million in both scenarios.you can see the reports and figures in the >>> location [2] in the repo. i also added the streaminglinearregression >>> classes in the repo gsoc folder.thank you. >>> regards, >>> Mahesh. >>> [1]https://github.com/dananjayamahesh/GSOC2016 >>> [2] >>> https://github.com/dananjayamahesh/GSOC2016/tree/master/spark-examples/first-example/output >>> >>> On Mon, May 30, 2016 at 9:24 AM, Maheshakya Wijewardena < >>> mahesha...@wso2.com> wrote: >>> >>>> Hi Mahesh, >>>> >>>> Thank you for the update. I will look into your implementation. >>>> >>>> And i will be able to send you the timing/performances analysis report >>>>> tomorrow for the SGD functions >>>>> >>>> >>>> Great. Sent those asap so that we can proceed. >>>> >>>> Best regards. >>>> >>>> On Sun, May 29, 2016 at 6:56 PM, Mahesh Dananjaya < >>>> dananjayamah...@gmail.com> wrote: >>>> >>>>> >>>>> Hi maheshakay, >>>>> I have implemented the linear regression with cep siddhi event stream >>>>> with taking batch sizes as parameters from the cep. Now we can trying the >>>>> moving window method to. Before that i think i should get your opinion on >>>>> data structures to save the streaming data.please check my repo [1] >>>>> /gsoc/ >>>>> folder there you can find all new things i add.. there in the extension >>>>> folder you can find those extension. And i will be able to send you the >>>>> timing/performances analysis report tomorrow for the SGD functions. thank >>>>> you. >>>>> regards, >>>>> Mahesh. >>>>> [1] https://github.com/dananjayamahesh/GSOC2016/tree/master/gsoc >>>>> >>>>> >>>>> On Fri, May 27, 2016 at 12:56 PM, Mahesh Dananjaya < >>>>> dananjayamah...@gmail.com> wrote: >>>>> >>>>>> Hi maheshkaya, >>>>>> i have written some siddhi extension and trying to develop a one for >>>>>> my one. In time series example in the [1], can you please explain me the >>>>>> input format and query lines in that example for my understanding. >>>>>> >>>>>> from baseballData#timeseries:regress(2, 10000, 0.95, salary, rbi, >>>>>> walks, strikeouts, errors) >>>>>> select * >>>>>> insert into regResults; >>>>>> >>>>>> i just want to knwo how i give a set of data into this extension and >>>>>> what is baseballData. Is it input stream as usual.or any data file?how >>>>>> can >>>>>> i find that data set to create dummy input stream like baseballData? >>>>>> >>>>>> thank you. >>>>>> regards, >>>>>> Mahesh. >>>>>> [1] >>>>>> https://docs.wso2.com/display/CEP400/Writing+a+Custom+Stream+Processor+Extension >>>>>> >>>>>> On Thu, May 26, 2016 at 2:58 PM, Mahesh Dananjaya < >>>>>> dananjayamah...@gmail.com> wrote: >>>>>> >>>>>>> Hi Maheshakya, >>>>>>> today i got the siddhi and debug the math extention. then did some >>>>>>> changes and check. Now i am trying to write same kind of extension in my >>>>>>> code base. so i add dependencies and it was built fine. Now i am trying >>>>>>> to >>>>>>> debug my extension and i did the same thing as i did in previous case. >>>>>>> Cep >>>>>>> is sending data, bu my extension is not firing in relevant break point. >>>>>>> 1. So how can i debug the siddhi extension in my new extension.(you >>>>>>> can see it in my example repoo) >>>>>>> >>>>>>> I think if i do it correctly we can built the extension for our >>>>>>> purpose. And i will send the relevant timing report of SGD algorithms >>>>>>> very >>>>>>> soon as supun was asking me. thank you. >>>>>>> regards, >>>>>>> Mahesh. >>>>>>> >>>>>>> On Tue, May 24, 2016 at 11:07 AM, Maheshakya Wijewardena < >>>>>>> mahesha...@wso2.com> wrote: >>>>>>> >>>>>>>> Also note that there is a calculation interval in the siddhi time >>>>>>>> series regression function[1]. You maybe able get some insight for this >>>>>>>> from that as well. >>>>>>>> >>>>>>>> [1] https://docs.wso2.com/display/CEP400/Regression >>>>>>>> >>>>>>>> On Tue, May 24, 2016 at 11:03 AM, Maheshakya Wijewardena < >>>>>>>> mahesha...@wso2.com> wrote: >>>>>>>> >>>>>>>>> Hi Mahesh, >>>>>>>>> >>>>>>>>> As we discussed offline, we can use similar mechanism to train >>>>>>>>> linear regression models, logistic regression models and k-means >>>>>>>>> clustering >>>>>>>>> models. >>>>>>>>> >>>>>>>>> It is very interesting that i have found that somethings that can >>>>>>>>>> make use of our work. In the cep 4.0 documentation there is a Custom >>>>>>>>>> Stream >>>>>>>>>> Processor Extention program [1]. There is a example of >>>>>>>>>> LinearRegressionStreamProcessor [1]. >>>>>>>>>> >>>>>>>>> >>>>>>>>> As we have to train predictive models with Spark, you can write >>>>>>>>> wrappers around regression/clustering models of Spark. Refer to >>>>>>>>> Siddhi time >>>>>>>>> series regression source codes[1][2]. You can write a streaming linear >>>>>>>>> regression class for ML in a similar fashion by wrapping Spark mllib >>>>>>>>> implementations. You can use the methods "addEvent", "removeEvent", >>>>>>>>> etc. >>>>>>>>> (may have to be changed according to requirements) for the similar >>>>>>>>> purpose. >>>>>>>>> You can introduce trainLinearRegression/LogisticRegression/Kmeans >>>>>>>>> which >>>>>>>>> does a similar thing as in createLinearRegression in those time series >>>>>>>>> functions. In the processData method you can use Spark mllib classes >>>>>>>>> to >>>>>>>>> actually train models and return the model weights, evaluation >>>>>>>>> metrics. So, >>>>>>>>> converting streams into RDDs and retrieving information from the >>>>>>>>> trained >>>>>>>>> models shall happen in this method. >>>>>>>>> >>>>>>>>> In the stream processor extension example, you can retrieve those >>>>>>>>> values then use them to train new models with new batches. >>>>>>>>> Weights/cluster >>>>>>>>> centers maybe passed as initialization parameters for the wrappers. >>>>>>>>> >>>>>>>>> Please note that we have to figure out the best siddhi extension >>>>>>>>> type for this process. In the siddhi query, we define batch size, >>>>>>>>> type of >>>>>>>>> algorithm and number of features (there can be more). After batch size >>>>>>>>> number of events received, train a model and save parameters, return >>>>>>>>> evaluation metric. With the next batch, retrain the model initialized >>>>>>>>> with >>>>>>>>> previously learned parameters. >>>>>>>>> >>>>>>>>> We also may need to test the same scenario with a moving window, >>>>>>>>> but I suspect that that approach may become so slow as a model is >>>>>>>>> trained >>>>>>>>> each time an event is received. So, we may have to change the number >>>>>>>>> of >>>>>>>>> slots the moving window moves at a time (eg: not one by one, but ten >>>>>>>>> by >>>>>>>>> ten). >>>>>>>>> >>>>>>>>> Once this is resolved, majority of the research part will be >>>>>>>>> finished and all we will be left to do is implementing wrappers >>>>>>>>> around the >>>>>>>>> 3 learning algorithms we consider. >>>>>>>>> >>>>>>>>> Best regards. >>>>>>>>> >>>>>>>>> [1] >>>>>>>>> https://github.com/wso2/siddhi/blob/master/modules/siddhi-extensions/timeseries/src/main/java/org/wso2/siddhi/extension/timeseries/linreg/RegressionCalculator.java >>>>>>>>> [2] >>>>>>>>> https://github.com/wso2/siddhi/blob/master/modules/siddhi-extensions/timeseries/src/main/java/org/wso2/siddhi/extension/timeseries/linreg/SimpleLinearRegressionCalculator.java >>>>>>>>> >>>>>>>>> >>>>>>>>> On Sat, May 21, 2016 at 2:55 PM, Mahesh Dananjaya < >>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>> >>>>>>>>>> Hi Maheshkya, >>>>>>>>>> shall we use [1] for our work? i am checking the possibility. >>>>>>>>>> BR, >>>>>>>>>> Mahesh. >>>>>>>>>> [1] >>>>>>>>>> https://docs.wso2.com/display/CEP400/Writing+a+Custom+Stream+Processor+Extension >>>>>>>>>> [2] >>>>>>>>>> https://docs.wso2.com/display/CEP400/Inbuilt+Windows#InbuiltWindows-lengthlength >>>>>>>>>> [3] >>>>>>>>>> https://docs.wso2.com/display/CEP400/Writing+a+Custom+Aggregate+Function >>>>>>>>>> >>>>>>>>>> On Sat, May 21, 2016 at 2:44 PM, Mahesh Dananjaya < >>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>> >>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>> It is very interesting that i have found that somethings that >>>>>>>>>>> can make use of our work. In the cep 4.0 documentation there is a >>>>>>>>>>> Custom >>>>>>>>>>> Stream Processor Extention program [1]. There is a example of >>>>>>>>>>> LinearRegressionStreamProcessor [1] and also i saw >>>>>>>>>>> private int batchSize = 1000000000; i am going through this >>>>>>>>>>> one. >>>>>>>>>>> Please check whether we can use. WIll there be any compatibility >>>>>>>>>>> or support issue? >>>>>>>>>>> regards, >>>>>>>>>>> Mahesh. >>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>>> [1] >>>>>>>>>>> https://docs.wso2.com/display/CEP400/Writing+a+Custom+Stream+Processor+Extension >>>>>>>>>>> >>>>>>>>>>> On Sat, May 21, 2016 at 11:52 AM, Mahesh Dananjaya < >>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>> >>>>>>>>>>>> Hi maheshakya, >>>>>>>>>>>> anyway how can test any siddhi extention after write it without >>>>>>>>>>>> integrating it to cep.can you please explain me the procedure. i am >>>>>>>>>>>> referring to [1] [2] [3] [4]. thank you. >>>>>>>>>>>> BR, >>>>>>>>>>>> Mahesh. >>>>>>>>>>>> >>>>>>>>>>>> [1] >>>>>>>>>>>> https://docs.wso2.com/display/CEP310/Writing+Extensions+to+Siddhi >>>>>>>>>>>> [2] >>>>>>>>>>>> https://docs.wso2.com/display/CEP310/Writing+a+Custom+Function >>>>>>>>>>>> [3] >>>>>>>>>>>> https://docs.wso2.com/display/CEP310/Writing+a+Custom+Window >>>>>>>>>>>> [4] >>>>>>>>>>>> https://docs.wso2.com/display/CEP400/Writing+Extensions+to+Siddhi >>>>>>>>>>>> >>>>>>>>>>>> On Thu, May 19, 2016 at 12:08 PM, Mahesh Dananjaya < >>>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>>> >>>>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>>>> thank you for the feedback. I have add data-sets into repo. >>>>>>>>>>>>> data-sets/lr. I am all right with next week.Now i am writing some >>>>>>>>>>>>> examples >>>>>>>>>>>>> to collect samples and build mini batches and run the algorithms >>>>>>>>>>>>> on those >>>>>>>>>>>>> mini-batches. thank you. will add those into repo soon.I am still >>>>>>>>>>>>> working >>>>>>>>>>>>> on that siddhi extention.i will let you know the progress. >>>>>>>>>>>>> BR, >>>>>>>>>>>>> mahesh. >>>>>>>>>>>>> >>>>>>>>>>>>> On Thu, May 19, 2016 at 11:10 AM, Maheshakya Wijewardena < >>>>>>>>>>>>> mahesha...@wso2.com> wrote: >>>>>>>>>>>>> >>>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>>> >>>>>>>>>>>>>> I've look into your code sample of streaming linear >>>>>>>>>>>>>> regression. Looks good to me, apart from few issues in coding >>>>>>>>>>>>>> practices >>>>>>>>>>>>>> which we can improve when you're doing the implementations in >>>>>>>>>>>>>> carbon-ml and >>>>>>>>>>>>>> during the code reviews. You are using a set of files as >>>>>>>>>>>>>> mini-batches of >>>>>>>>>>>>>> data, right? Can you also send us the datasets you've been >>>>>>>>>>>>>> using. I'd like >>>>>>>>>>>>>> to run this. >>>>>>>>>>>>>> >>>>>>>>>>>>>> does that cep problem is now all right that we were trying to >>>>>>>>>>>>>>> fix. I am still using those pre-build versions. If so i can >>>>>>>>>>>>>>> merge with the >>>>>>>>>>>>>>> latest one. >>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> I'll check this and let you know. >>>>>>>>>>>>>> >>>>>>>>>>>>>> Can we arrange a meeting (preferably in WSO2 offices) in next >>>>>>>>>>>>>> week with ML team members as well. Coding period begins on next >>>>>>>>>>>>>> Monday, so >>>>>>>>>>>>>> it's better to get overall feedback from others and discuss more >>>>>>>>>>>>>> about the >>>>>>>>>>>>>> project. Let me know convenient time slots for you. I'll arrange >>>>>>>>>>>>>> a meeting >>>>>>>>>>>>>> with ML team. >>>>>>>>>>>>>> >>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>> >>>>>>>>>>>>>> On Wed, May 18, 2016 at 9:53 AM, Mahesh Dananjaya < >>>>>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>>> >>>>>>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>>>>>> Ok. I will check it.you have sent me those relevant >>>>>>>>>>>>>>> references and i am working on that thing.thank you. does that >>>>>>>>>>>>>>> cep problem >>>>>>>>>>>>>>> is now all right that we were trying to fix. I am still using >>>>>>>>>>>>>>> those >>>>>>>>>>>>>>> pre-build versions. If so i can merge with the latest >>>>>>>>>>>>>>> one.thanks. >>>>>>>>>>>>>>> BR, >>>>>>>>>>>>>>> Mahesh. >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> On Wed, May 18, 2016 at 9:44 AM, Maheshakya Wijewardena < >>>>>>>>>>>>>>> mahesha...@wso2.com> wrote: >>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> You don't actually have to implement anything in spark >>>>>>>>>>>>>>>> streaming. Try to understand how streaming data is handled in >>>>>>>>>>>>>>>> and the >>>>>>>>>>>>>>>> specifics of the underlying algorithms in streaming. >>>>>>>>>>>>>>>> What we want to do is having the similar algorithms that >>>>>>>>>>>>>>>> support CEP event streams with siddhi. >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> On Wed, May 18, 2016 at 9:38 AM, Mahesh Dananjaya < >>>>>>>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>>>>>>>> Did you check the repo. I will add recent works today.And >>>>>>>>>>>>>>>>> also i was going through the Java docs related to spark >>>>>>>>>>>>>>>>> streaming work. It >>>>>>>>>>>>>>>>> is with that scala API. thank you. >>>>>>>>>>>>>>>>> regards, >>>>>>>>>>>>>>>>> Mahesh. >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> On Tue, May 17, 2016 at 10:11 AM, Mahesh Dananjaya < >>>>>>>>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>>>>>>>>> I have gone through the Java Docs and run some of the >>>>>>>>>>>>>>>>>> Spark examples on spark shell which are paramount improtant >>>>>>>>>>>>>>>>>> for our work. >>>>>>>>>>>>>>>>>> Then i have been writing my codes to check the Linear >>>>>>>>>>>>>>>>>> regression, K means >>>>>>>>>>>>>>>>>> for streaming. please check my git repo [1]. I think now i >>>>>>>>>>>>>>>>>> have to ask on >>>>>>>>>>>>>>>>>> dev regarding the capturing event streams for our work. I >>>>>>>>>>>>>>>>>> will update the >>>>>>>>>>>>>>>>>> recent things on git. check the park-example directory for >>>>>>>>>>>>>>>>>> java. examples >>>>>>>>>>>>>>>>>> run on git shell is not included there. In my case i think i >>>>>>>>>>>>>>>>>> have to build >>>>>>>>>>>>>>>>>> mini batches from data streams that comes as individual >>>>>>>>>>>>>>>>>> samples. Now i am >>>>>>>>>>>>>>>>>> working on some coding to collect mini batches from data >>>>>>>>>>>>>>>>>> streams.thank you. >>>>>>>>>>>>>>>>>> regards, >>>>>>>>>>>>>>>>>> Mahesh. >>>>>>>>>>>>>>>>>> [1]https://github.com/dananjayamahesh/GSOC2016 >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> On Tue, May 17, 2016 at 10:10 AM, Mahesh Dananjaya < >>>>>>>>>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>>>>>>>>>> I have gone through the Java Docs and run some of the >>>>>>>>>>>>>>>>>>> Spark examples on spark shell which are paramount improtant >>>>>>>>>>>>>>>>>>> for our work. >>>>>>>>>>>>>>>>>>> Then i have been writing my codes to check the Linear >>>>>>>>>>>>>>>>>>> regression, K means >>>>>>>>>>>>>>>>>>> for streaming. please check my git repo [1]. I think now i >>>>>>>>>>>>>>>>>>> have to ask on >>>>>>>>>>>>>>>>>>> dev regarding the capturing event streams for our work. I >>>>>>>>>>>>>>>>>>> will update the >>>>>>>>>>>>>>>>>>> recent things on git. check the park-example directory for >>>>>>>>>>>>>>>>>>> java. examples >>>>>>>>>>>>>>>>>>> run on git shell is not included there. In my case i think >>>>>>>>>>>>>>>>>>> i have to build >>>>>>>>>>>>>>>>>>> mini batches from data streams that comes as individual >>>>>>>>>>>>>>>>>>> samples. Now i am >>>>>>>>>>>>>>>>>>> working on some coding to collect mini batches from data >>>>>>>>>>>>>>>>>>> streams.thank you. >>>>>>>>>>>>>>>>>>> regards, >>>>>>>>>>>>>>>>>>> Mahesh. >>>>>>>>>>>>>>>>>>> [1]https://github.com/dananjayamahesh/GSOC2016 >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> On Mon, May 16, 2016 at 1:19 PM, Mahesh Dananjaya < >>>>>>>>>>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>>>>>>>>>>> thank you. i will update the repo today.thank you.i >>>>>>>>>>>>>>>>>>>> changed the carbon ml siddhi extention and see how the >>>>>>>>>>>>>>>>>>>> changes are >>>>>>>>>>>>>>>>>>>> effecting. i will update the progress as soon as >>>>>>>>>>>>>>>>>>>> possible.thank you. i had >>>>>>>>>>>>>>>>>>>> some problem in spark mllib dependency. i was fixing that. >>>>>>>>>>>>>>>>>>>> regards, >>>>>>>>>>>>>>>>>>>> Mahesh. >>>>>>>>>>>>>>>>>>>> p.s: do i need to maintain a blog? >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> On Mon, May 16, 2016 at 10:02 AM, Maheshakya >>>>>>>>>>>>>>>>>>>> Wijewardena <mahesha...@wso2.com> wrote: >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> Sorry for replying late. >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> Thank you for the update. I believe you have done some >>>>>>>>>>>>>>>>>>>>> implementations with with Spark MLLIb algorithms in >>>>>>>>>>>>>>>>>>>>> streaming fashion as we >>>>>>>>>>>>>>>>>>>>> have discussed. If so, can you please share your code in >>>>>>>>>>>>>>>>>>>>> a Github repo. >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> Now i want to implements some machine learning >>>>>>>>>>>>>>>>>>>>>> algorithms with importing mllib and want to run within >>>>>>>>>>>>>>>>>>>>>> your code base >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> For the moment you can try out editing the same class >>>>>>>>>>>>>>>>>>>>> PredictStreamProcessor in the siddhi extension in >>>>>>>>>>>>>>>>>>>>> carbon-ml. Later we will >>>>>>>>>>>>>>>>>>>>> add this separately. You should be able to add >>>>>>>>>>>>>>>>>>>>> org.apache.spark.mllib. >>>>>>>>>>>>>>>>>>>>> classes to there. >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> And i want to see how event streams are coming from >>>>>>>>>>>>>>>>>>>>>> cep. As i think it is not in a RDD format since it is >>>>>>>>>>>>>>>>>>>>>> arriving as the >>>>>>>>>>>>>>>>>>>>>> individual samples. I will send a email to dev asking >>>>>>>>>>>>>>>>>>>>>> about how to get the >>>>>>>>>>>>>>>>>>>>>> streams. >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> Please pay attention to length[1] and lengthbatch[1] >>>>>>>>>>>>>>>>>>>>> inbuilt windows in siddhi. What you need to write are >>>>>>>>>>>>>>>>>>>>> functions similar to >>>>>>>>>>>>>>>>>>>>> a custom aggregate function[2]. >>>>>>>>>>>>>>>>>>>>> When you send the email to dev list, explain your >>>>>>>>>>>>>>>>>>>>> requirement. You need to get a set of event with from a >>>>>>>>>>>>>>>>>>>>> stream with a >>>>>>>>>>>>>>>>>>>>> specified window size (number of events). Then build a >>>>>>>>>>>>>>>>>>>>> model within that >>>>>>>>>>>>>>>>>>>>> function. You also need to retain the data (learned >>>>>>>>>>>>>>>>>>>>> weights, cluster >>>>>>>>>>>>>>>>>>>>> centers, etc.) from the previous window to use in the >>>>>>>>>>>>>>>>>>>>> current window. Ask >>>>>>>>>>>>>>>>>>>>> what can be the most suitable option for this among the >>>>>>>>>>>>>>>>>>>>> set of siddhi >>>>>>>>>>>>>>>>>>>>> extensions given. >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> [1] >>>>>>>>>>>>>>>>>>>>> https://docs.wso2.com/display/CEP400/Inbuilt+Windows#InbuiltWindows-lengthlength >>>>>>>>>>>>>>>>>>>>> [2] >>>>>>>>>>>>>>>>>>>>> https://docs.wso2.com/display/CEP400/Writing+a+Custom+Aggregate+Function >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> On Wed, May 11, 2016 at 1:43 PM, Mahesh Dananjaya < >>>>>>>>>>>>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> ---------- Forwarded message ---------- >>>>>>>>>>>>>>>>>>>>>> From: Mahesh Dananjaya <dananjayamah...@gmail.com> >>>>>>>>>>>>>>>>>>>>>> Date: Wed, May 11, 2016 at 1:43 PM >>>>>>>>>>>>>>>>>>>>>> Subject: Re: [Dev] GSOC2016: [ML][CEP] Predictive >>>>>>>>>>>>>>>>>>>>>> analytic with online data for WSO2 Machine Learner >>>>>>>>>>>>>>>>>>>>>> To: Maheshakya Wijewardena <mahesha...@wso2.com> >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>>>>>>>>>>>>> sorry for not updating. I did what you wanted me to >>>>>>>>>>>>>>>>>>>>>> do. I checked the code base and train functions. I went >>>>>>>>>>>>>>>>>>>>>> through those java >>>>>>>>>>>>>>>>>>>>>> docs. I went through the carbon-ml current >>>>>>>>>>>>>>>>>>>>>> implementation of LG and K-Mean. >>>>>>>>>>>>>>>>>>>>>> And i had Apache Spark and i tried with several >>>>>>>>>>>>>>>>>>>>>> examples. Now i want to >>>>>>>>>>>>>>>>>>>>>> implements some machine learning algorithms with >>>>>>>>>>>>>>>>>>>>>> importing mllib and want >>>>>>>>>>>>>>>>>>>>>> to run within your code base. Can you help me with that. >>>>>>>>>>>>>>>>>>>>>> And i want to see how event streams are coming from >>>>>>>>>>>>>>>>>>>>>> cep. As i think it is not in a RDD format since it is >>>>>>>>>>>>>>>>>>>>>> arriving as the >>>>>>>>>>>>>>>>>>>>>> individual samples. I will send a email to dev asking >>>>>>>>>>>>>>>>>>>>>> about how to get the >>>>>>>>>>>>>>>>>>>>>> streams. I debugged many of those functions in the code >>>>>>>>>>>>>>>>>>>>>> base. So need >>>>>>>>>>>>>>>>>>>>>> further instructions to proceed.thank you. >>>>>>>>>>>>>>>>>>>>>> regards, >>>>>>>>>>>>>>>>>>>>>> Mahesh. >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> On Wed, May 11, 2016 at 10:32 AM, Maheshakya >>>>>>>>>>>>>>>>>>>>>> Wijewardena <mahesha...@wso2.com> wrote: >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>> Any update on your progress? >>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>> On Wed, May 4, 2016 at 8:35 PM, Maheshakya >>>>>>>>>>>>>>>>>>>>>>> Wijewardena <mahesha...@wso2.com> wrote: >>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> is that "Put break points in train methods in >>>>>>>>>>>>>>>>>>>>>>>>> Linear Regression class" means the spark/algorithms/ >>>>>>>>>>>>>>>>>>>>>>>>> LinearRegrassion.java >>>>>>>>>>>>>>>>>>>>>>>>> class in the org.wso2.carbon.ml.core? is that the >>>>>>>>>>>>>>>>>>>>>>>>> correct file? >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> Yes, this is the correct place. >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> You can refer to spark programming guide[1][2] as >>>>>>>>>>>>>>>>>>>>>>>> well as our ML code base when you try those algorithms >>>>>>>>>>>>>>>>>>>>>>>> out. Please try to >>>>>>>>>>>>>>>>>>>>>>>> do rough implementations of the streaming versions of >>>>>>>>>>>>>>>>>>>>>>>> linear regression, >>>>>>>>>>>>>>>>>>>>>>>> logistic regression and k-means clustering as we have >>>>>>>>>>>>>>>>>>>>>>>> discussed in the >>>>>>>>>>>>>>>>>>>>>>>> proposal in plain Java. It's better if you can create >>>>>>>>>>>>>>>>>>>>>>>> a git repo and share >>>>>>>>>>>>>>>>>>>>>>>> your code once you have made some progress. >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> Were you able debug and understand the flow of the >>>>>>>>>>>>>>>>>>>>>>>> ML siddhi extension? I hope you haven't encountered >>>>>>>>>>>>>>>>>>>>>>>> more errors after >>>>>>>>>>>>>>>>>>>>>>>> switching the released version of CEP. >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> Is this Friday okay for you? Afternoon at 2:00 pm? >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> [1] >>>>>>>>>>>>>>>>>>>>>>>> http://spark.apache.org/docs/latest/programming-guide.html >>>>>>>>>>>>>>>>>>>>>>>> [2] >>>>>>>>>>>>>>>>>>>>>>>> http://spark.apache.org/docs/latest/mllib-guide.html >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> On Wed, May 4, 2016 at 1:07 PM, Mahesh Dananjaya < >>>>>>>>>>>>>>>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>>>>>>>>>>>>>>>> I have been looking into some algorithms related >>>>>>>>>>>>>>>>>>>>>>>>> to stochastic gradient descent based >>>>>>>>>>>>>>>>>>>>>>>>> algorithms.anything i should focus >>>>>>>>>>>>>>>>>>>>>>>>> please let me know.Ans also i will be available for >>>>>>>>>>>>>>>>>>>>>>>>> calling this week and >>>>>>>>>>>>>>>>>>>>>>>>> next week.thank you. >>>>>>>>>>>>>>>>>>>>>>>>> BR, >>>>>>>>>>>>>>>>>>>>>>>>> Mahesh. >>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>> On Tue, May 3, 2016 at 5:05 PM, Mahesh Dananjaya < >>>>>>>>>>>>>>>>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>>>>>>>>>>>>>>>>> thank you.that's good. i have been trying to fix >>>>>>>>>>>>>>>>>>>>>>>>>> that for couple of days. please inform me when it >>>>>>>>>>>>>>>>>>>>>>>>>> will be fixed.now i have >>>>>>>>>>>>>>>>>>>>>>>>>> been testing the ML algorithms and trying to >>>>>>>>>>>>>>>>>>>>>>>>>> identify the flow and the >>>>>>>>>>>>>>>>>>>>>>>>>> hierarchy. is that "Put break points in train >>>>>>>>>>>>>>>>>>>>>>>>>> methods in Linear Regression >>>>>>>>>>>>>>>>>>>>>>>>>> class" means the spark/algorithms/ >>>>>>>>>>>>>>>>>>>>>>>>>> LinearRegrassion.java class in the >>>>>>>>>>>>>>>>>>>>>>>>>> org.wso2.carbon.ml.core? is that the correct file? >>>>>>>>>>>>>>>>>>>>>>>>>> And also i am planning to write some programs to >>>>>>>>>>>>>>>>>>>>>>>>>> use apache spark mllib algorithms. and i refer to >>>>>>>>>>>>>>>>>>>>>>>>>> [1] and some wso2 >>>>>>>>>>>>>>>>>>>>>>>>>> documentations to get some idea about ML >>>>>>>>>>>>>>>>>>>>>>>>>> structure.thank you. >>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>> BR, >>>>>>>>>>>>>>>>>>>>>>>>>> Mahesh. >>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>> [1]nirmalfdo.blogspot.com >>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>> On Tue, May 3, 2016 at 4:36 PM, Maheshakya >>>>>>>>>>>>>>>>>>>>>>>>>> Wijewardena <mahesha...@wso2.com> wrote: >>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>> I have checked. It seems the issue you have >>>>>>>>>>>>>>>>>>>>>>>>>>> encountered is cause only in the current >>>>>>>>>>>>>>>>>>>>>>>>>>> development branch of the >>>>>>>>>>>>>>>>>>>>>>>>>>> product-cep. It doesn't identify the ML siddhi >>>>>>>>>>>>>>>>>>>>>>>>>>> extension as an extension. >>>>>>>>>>>>>>>>>>>>>>>>>>> ML siddhi extension works fine in the latest >>>>>>>>>>>>>>>>>>>>>>>>>>> release of CEP (4.1.0) [1]. >>>>>>>>>>>>>>>>>>>>>>>>>>> Until we figure out the reason and come up with >>>>>>>>>>>>>>>>>>>>>>>>>>> a solution, can you use the latest CEP release for >>>>>>>>>>>>>>>>>>>>>>>>>>> your work. It's fine to >>>>>>>>>>>>>>>>>>>>>>>>>>> use that since you haven't started actual >>>>>>>>>>>>>>>>>>>>>>>>>>> development yet. >>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>> [1] >>>>>>>>>>>>>>>>>>>>>>>>>>> http://wso2.com/products/complex-event-processor/ >>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>> On Tue, May 3, 2016 at 3:19 PM, Maheshakya >>>>>>>>>>>>>>>>>>>>>>>>>>> Wijewardena <mahesha...@wso2.com> wrote: >>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>> Is is vital to use those local repo in my >>>>>>>>>>>>>>>>>>>>>>>>>>>>> upcoming implementation? >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> Yes. The remote p2-repo contains the p2-repos >>>>>>>>>>>>>>>>>>>>>>>>>>>> of released versions. What you have to develop on >>>>>>>>>>>>>>>>>>>>>>>>>>>> is the current master of >>>>>>>>>>>>>>>>>>>>>>>>>>>> the carbon-ml and product-ml. You can try out with >>>>>>>>>>>>>>>>>>>>>>>>>>>> the modification I have >>>>>>>>>>>>>>>>>>>>>>>>>>>> suggested. In the meantime, I'll verify whether >>>>>>>>>>>>>>>>>>>>>>>>>>>> the current repos are >>>>>>>>>>>>>>>>>>>>>>>>>>>> working as expected. >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> And also i am trying to debug the carbon-ml >>>>>>>>>>>>>>>>>>>>>>>>>>>>> org.wso2.carbon.ml.core by putting some break >>>>>>>>>>>>>>>>>>>>>>>>>>>>> point in the >>>>>>>>>>>>>>>>>>>>>>>>>>>>> spark/algorithms/Linear Regression >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> It's great that you have started looking at the >>>>>>>>>>>>>>>>>>>>>>>>>>>> implementation of linear regression as well. Put >>>>>>>>>>>>>>>>>>>>>>>>>>>> break points in train >>>>>>>>>>>>>>>>>>>>>>>>>>>> methods in LinearRegression class. This is being >>>>>>>>>>>>>>>>>>>>>>>>>>>> used when you run linear >>>>>>>>>>>>>>>>>>>>>>>>>>>> regression from UI. >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> I can see some comments left behind for >>>>>>>>>>>>>>>>>>>>>>>>>>>>> streaming algo as well.thank you >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> You may be referring to the linear regression >>>>>>>>>>>>>>>>>>>>>>>>>>>> with SGD model. Here, there's no retraining with >>>>>>>>>>>>>>>>>>>>>>>>>>>> streaming data involved. >>>>>>>>>>>>>>>>>>>>>>>>>>>> The SGD with minibatches is used to train the >>>>>>>>>>>>>>>>>>>>>>>>>>>> model with the data set only >>>>>>>>>>>>>>>>>>>>>>>>>>>> once. >>>>>>>>>>>>>>>>>>>>>>>>>>>> What you have to do is create a similar >>>>>>>>>>>>>>>>>>>>>>>>>>>> mechanism to involve streaming data and retrain >>>>>>>>>>>>>>>>>>>>>>>>>>>> models. We will get to that >>>>>>>>>>>>>>>>>>>>>>>>>>>> part once you get comfortable with siddhi >>>>>>>>>>>>>>>>>>>>>>>>>>>> extensions. >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> BTW, is it possible for you to join a call on >>>>>>>>>>>>>>>>>>>>>>>>>>>> this Friday or in the next week. We'll try to >>>>>>>>>>>>>>>>>>>>>>>>>>>> resolve your current issues >>>>>>>>>>>>>>>>>>>>>>>>>>>> and discuss further on project. >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> On Tue, May 3, 2016 at 1:03 PM, Mahesh >>>>>>>>>>>>>>>>>>>>>>>>>>>> Dananjaya <dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>> Hi maheshakya, >>>>>>>>>>>>>>>>>>>>>>>>>>>>> Is it ok to go with p2 repo at >>>>>>>>>>>>>>>>>>>>>>>>>>>>> http://product-dist.wso2.com/p2/carbon/releases/wilkes/features/ >>>>>>>>>>>>>>>>>>>>>>>>>>>>> rather than the P2-repo at >>>>>>>>>>>>>>>>>>>>>>>>>>>>> product-ml/modules/p2-profile/target/p2-repo in >>>>>>>>>>>>>>>>>>>>>>>>>>>>> local repo.What is the impact?.Is is vital to use >>>>>>>>>>>>>>>>>>>>>>>>>>>>> those local repo in my >>>>>>>>>>>>>>>>>>>>>>>>>>>>> upcoming implementation?so i was trying to give >>>>>>>>>>>>>>>>>>>>>>>>>>>>> remote p2 repo to cep built >>>>>>>>>>>>>>>>>>>>>>>>>>>>> by source and trying to debug the cep ml >>>>>>>>>>>>>>>>>>>>>>>>>>>>> extension and got the same error >>>>>>>>>>>>>>>>>>>>>>>>>>>>> as yesterday. But pre-built product is working >>>>>>>>>>>>>>>>>>>>>>>>>>>>> fine. Therefore now i am >>>>>>>>>>>>>>>>>>>>>>>>>>>>> trying as you described in the last email. >>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>> And also i am trying to debug the carbon-ml >>>>>>>>>>>>>>>>>>>>>>>>>>>>> org.wso2.carbon.ml.core by putting some break >>>>>>>>>>>>>>>>>>>>>>>>>>>>> point in the >>>>>>>>>>>>>>>>>>>>>>>>>>>>> spark/algorithms/Linear Regression. I am trying >>>>>>>>>>>>>>>>>>>>>>>>>>>>> to trigger it with >>>>>>>>>>>>>>>>>>>>>>>>>>>>> product-ml project with data set. Does that >>>>>>>>>>>>>>>>>>>>>>>>>>>>> Linear Regression also in UI is >>>>>>>>>>>>>>>>>>>>>>>>>>>>> consuming that spark algorithms or is it in >>>>>>>>>>>>>>>>>>>>>>>>>>>>> another place? I can see some >>>>>>>>>>>>>>>>>>>>>>>>>>>>> comments left behind for streaming algo as >>>>>>>>>>>>>>>>>>>>>>>>>>>>> well.thank you. >>>>>>>>>>>>>>>>>>>>>>>>>>>>> BR, >>>>>>>>>>>>>>>>>>>>>>>>>>>>> Mahesh. >>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>> On Tue, May 3, 2016 at 9:35 AM, Maheshakya >>>>>>>>>>>>>>>>>>>>>>>>>>>>> Wijewardena <mahesha...@wso2.com> wrote: >>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> The earlier error you have mentioned may >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> occur due to the incompatible Siddhi versions in >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> ML p2-repo and CEP, when >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> you add the p2-repo from prodcut-ml you built. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Current siddhi version in product-cep is >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> 3.0.6-SNAPSHOT[1], but in ML it's 3.0.2. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Can you try changing the siddhi.version in >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> carbon-ml/pom.xml to 3.0.6-SNAPSHOT, build >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> carbon-ml, then build product-ml >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> again. After this, add p2-repo as a local >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> repository again to fresh CEP >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> pack and try it out. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> On Mon, May 2, 2016 at 7:02 PM, Mahesh >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Dananjaya <dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> now i remotely debug the CEP extension for >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> ML Prediction. What i did was, i have all the >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> pre-build version of CEP and >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> ML. Therefore i used that built CEP and did the >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> same thing that i was doing >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> with the source code. I think the only change i >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> did was install those >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> packages from remote p2 repo. This was work >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> fine and i debugged the >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> carbon-ml org.wso2.carbon.ml.siddhi.extension >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> as described in the [1]. So >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> now i have to try same thing with the build by >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> source content.thank you. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> [1] >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> https://docs.wso2.com/display/ML110/WSO2+CEP+Extension+for+ML+Predictions#WSO2CEPExtensionforMLPredictions-Siddhisyntaxfortheextension >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> BR, >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Mahesh >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> On Mon, Apr 25, 2016 at 5:49 PM, Maheshakya >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Wijewardena <mahesha...@wso2.com> wrote: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Congratulations and welcome to GSoC 2016. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> You did a great job in preparing the proposal. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Now it's time to dig deep >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> and get started with the project. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> First of all you need to familiarize with >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> the code base. We have agreed to implement >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> this with CEP event streams. We >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> already have a CEP extension for predictions >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> [1][2]. Go through this >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> implementation and familiarize your self with >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> that. You need to understand >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> how: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> 1. Even streams are consumed >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> 2. predictions are made from individual >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> event >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> 3. Results are sent back >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Get WSO2 ML and CEP sources (You may use >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> latest released version of CEP) and build the >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> products. Get both >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> carbon-ml[3] and product-ml[4] masters and >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> create new branches for your >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> work from masters. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> After you build the products, you may need >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> to do remote debugging[5] to understand the >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> flow. So please follow an >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> example of real time prediction with ML with >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> debugging and get some idea. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> The component you need to debug is >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> org.wso2.carbon.ml.siddhi.extension. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Next tasks would be implementing online >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> learning algorithms in plain java with spark >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> ml lib and integrating those >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> to ML. We also need to come up with a proper >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> and detailed architecture to >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> employ those algorithms in ML. Getting >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> familiar with the aforementioned >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> sections would give you some insight on how >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> this should be implemented. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> So please try to get a quick grasp then you >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> can start the implementation. Let us know if >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> you have any questions or you >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> get stuck somewhere. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Also, please always add WSO2 developer's >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> list as well when you communicate with us >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> regarding the project so that you >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> can get opinions and feedback from others as >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> well. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> [1] >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> https://docs.wso2.com/display/ML110/WSO2+CEP+Extension+for+ML+Predictions#WSO2CEPExtensionforMLPredictions-Siddhisyntaxfortheextension >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> [2] >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> https://github.com/wso2/carbon-ml/tree/master/components/extensions/org.wso2.carbon.ml.siddhi.extension >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> [3] https://github.com/wso2/carbon-ml >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> [4] https://github.com/wso2/product-ml >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> [5] >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> https://dzone.com/articles/how-debug-wso2-carbon-kernel >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> On Mon, Apr 25, 2016 at 3:33 PM, Mahesh >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Dananjaya <dananjayamah...@gmail.com> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> wrote: >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Hi, >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> thank you for accepting my GSOC 2016 >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> proposal and i am looking forward for the >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> further instruction and project >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> continuation. thank you very much. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> regards, >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Mahesh. >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>>>>>>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>>>>>>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>>>>>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>>>>>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> _______________________________________________ >>>>>>>>>>>>>>>>>>>>>> Dev mailing list >>>>>>>>>>>>>>>>>>>>>> Dev@wso2.org >>>>>>>>>>>>>>>>>>>>>> http://wso2.org/cgi-bin/mailman/listinfo/dev >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> -- >>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>>> -- >>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>> mahesha...@wso2.com >>>>>>>>> +94711228855 >>>>>>>>> >>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>> mahesha...@wso2.com >>>>>>>> +94711228855 >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>> >>>>>> >>>>> >>>>> >>>> >>>> >>>> -- >>>> Pruthuvi Maheshakya Wijewardena >>>> mahesha...@wso2.com >>>> +94711228855 >>>> >>>> >>>> >>> >> >> >> -- >> *Supun Sethunga* >> Senior Software Engineer >> WSO2, Inc. >> http://wso2.com/ >> lean | enterprise | middleware >> Mobile : +94 716546324 >> > >
_______________________________________________ Dev mailing list Dev@wso2.org http://wso2.org/cgi-bin/mailman/listinfo/dev