Also note that there is a calculation interval in the siddhi time series regression function[1]. You maybe able get some insight for this from that as well.
[1] https://docs.wso2.com/display/CEP400/Regression On Tue, May 24, 2016 at 11:03 AM, Maheshakya Wijewardena < mahesha...@wso2.com> wrote: > Hi Mahesh, > > As we discussed offline, we can use similar mechanism to train linear > regression models, logistic regression models and k-means clustering models. > > It is very interesting that i have found that somethings that can make use >> of our work. In the cep 4.0 documentation there is a Custom Stream >> Processor Extention program [1]. There is a example of >> LinearRegressionStreamProcessor [1]. >> > > As we have to train predictive models with Spark, you can write wrappers > around regression/clustering models of Spark. Refer to Siddhi time series > regression source codes[1][2]. You can write a streaming linear regression > class for ML in a similar fashion by wrapping Spark mllib implementations. > You can use the methods "addEvent", "removeEvent", etc. (may have to be > changed according to requirements) for the similar purpose. You can > introduce trainLinearRegression/LogisticRegression/Kmeans which does a > similar thing as in createLinearRegression in those time series functions. > In the processData method you can use Spark mllib classes to actually train > models and return the model weights, evaluation metrics. So, converting > streams into RDDs and retrieving information from the trained models shall > happen in this method. > > In the stream processor extension example, you can retrieve those values > then use them to train new models with new batches. Weights/cluster centers > maybe passed as initialization parameters for the wrappers. > > Please note that we have to figure out the best siddhi extension type for > this process. In the siddhi query, we define batch size, type of algorithm > and number of features (there can be more). After batch size number of > events received, train a model and save parameters, return evaluation > metric. With the next batch, retrain the model initialized with previously > learned parameters. > > We also may need to test the same scenario with a moving window, but I > suspect that that approach may become so slow as a model is trained each > time an event is received. So, we may have to change the number of slots > the moving window moves at a time (eg: not one by one, but ten by ten). > > Once this is resolved, majority of the research part will be finished and > all we will be left to do is implementing wrappers around the 3 learning > algorithms we consider. > > Best regards. > > [1] > https://github.com/wso2/siddhi/blob/master/modules/siddhi-extensions/timeseries/src/main/java/org/wso2/siddhi/extension/timeseries/linreg/RegressionCalculator.java > [2] > https://github.com/wso2/siddhi/blob/master/modules/siddhi-extensions/timeseries/src/main/java/org/wso2/siddhi/extension/timeseries/linreg/SimpleLinearRegressionCalculator.java > > > On Sat, May 21, 2016 at 2:55 PM, Mahesh Dananjaya < > dananjayamah...@gmail.com> wrote: > >> Hi Maheshkya, >> shall we use [1] for our work? i am checking the possibility. >> BR, >> Mahesh. >> [1] >> https://docs.wso2.com/display/CEP400/Writing+a+Custom+Stream+Processor+Extension >> [2] >> https://docs.wso2.com/display/CEP400/Inbuilt+Windows#InbuiltWindows-lengthlength >> [3] >> https://docs.wso2.com/display/CEP400/Writing+a+Custom+Aggregate+Function >> >> On Sat, May 21, 2016 at 2:44 PM, Mahesh Dananjaya < >> dananjayamah...@gmail.com> wrote: >> >>> Hi Maheshakya, >>> It is very interesting that i have found that somethings that can make >>> use of our work. In the cep 4.0 documentation there is a Custom Stream >>> Processor Extention program [1]. There is a example of >>> LinearRegressionStreamProcessor [1] and also i saw >>> private int batchSize = 1000000000; i am going through this one. >>> Please check whether we can use. WIll there be any compatibility or >>> support issue? >>> regards, >>> Mahesh. >>> >>> >>> [1] >>> https://docs.wso2.com/display/CEP400/Writing+a+Custom+Stream+Processor+Extension >>> >>> On Sat, May 21, 2016 at 11:52 AM, Mahesh Dananjaya < >>> dananjayamah...@gmail.com> wrote: >>> >>>> Hi maheshakya, >>>> anyway how can test any siddhi extention after write it without >>>> integrating it to cep.can you please explain me the procedure. i am >>>> referring to [1] [2] [3] [4]. thank you. >>>> BR, >>>> Mahesh. >>>> >>>> [1] https://docs.wso2.com/display/CEP310/Writing+Extensions+to+Siddhi >>>> [2] https://docs.wso2.com/display/CEP310/Writing+a+Custom+Function >>>> [3] https://docs.wso2.com/display/CEP310/Writing+a+Custom+Window >>>> [4] https://docs.wso2.com/display/CEP400/Writing+Extensions+to+Siddhi >>>> >>>> On Thu, May 19, 2016 at 12:08 PM, Mahesh Dananjaya < >>>> dananjayamah...@gmail.com> wrote: >>>> >>>>> Hi Maheshakya, >>>>> thank you for the feedback. I have add data-sets into repo. >>>>> data-sets/lr. I am all right with next week.Now i am writing some examples >>>>> to collect samples and build mini batches and run the algorithms on those >>>>> mini-batches. thank you. will add those into repo soon.I am still working >>>>> on that siddhi extention.i will let you know the progress. >>>>> BR, >>>>> mahesh. >>>>> >>>>> On Thu, May 19, 2016 at 11:10 AM, Maheshakya Wijewardena < >>>>> mahesha...@wso2.com> wrote: >>>>> >>>>>> Hi Mahesh, >>>>>> >>>>>> I've look into your code sample of streaming linear regression. Looks >>>>>> good to me, apart from few issues in coding practices which we can >>>>>> improve >>>>>> when you're doing the implementations in carbon-ml and during the code >>>>>> reviews. You are using a set of files as mini-batches of data, right? Can >>>>>> you also send us the datasets you've been using. I'd like to run this. >>>>>> >>>>>> does that cep problem is now all right that we were trying to fix. I >>>>>>> am still using those pre-build versions. If so i can merge with the >>>>>>> latest >>>>>>> one. >>>>>> >>>>>> >>>>>> I'll check this and let you know. >>>>>> >>>>>> Can we arrange a meeting (preferably in WSO2 offices) in next week >>>>>> with ML team members as well. Coding period begins on next Monday, so >>>>>> it's >>>>>> better to get overall feedback from others and discuss more about the >>>>>> project. Let me know convenient time slots for you. I'll arrange a >>>>>> meeting >>>>>> with ML team. >>>>>> >>>>>> Best regards. >>>>>> >>>>>> On Wed, May 18, 2016 at 9:53 AM, Mahesh Dananjaya < >>>>>> dananjayamah...@gmail.com> wrote: >>>>>> >>>>>>> Hi Maheshakya, >>>>>>> Ok. I will check it.you have sent me those relevant references and i >>>>>>> am working on that thing.thank you. does that cep problem is now all >>>>>>> right >>>>>>> that we were trying to fix. I am still using those pre-build versions. >>>>>>> If >>>>>>> so i can merge with the latest one.thanks. >>>>>>> BR, >>>>>>> Mahesh. >>>>>>> >>>>>>> On Wed, May 18, 2016 at 9:44 AM, Maheshakya Wijewardena < >>>>>>> mahesha...@wso2.com> wrote: >>>>>>> >>>>>>>> Hi Mahesh, >>>>>>>> >>>>>>>> You don't actually have to implement anything in spark streaming. >>>>>>>> Try to understand how streaming data is handled in and the specifics >>>>>>>> of the >>>>>>>> underlying algorithms in streaming. >>>>>>>> What we want to do is having the similar algorithms that support >>>>>>>> CEP event streams with siddhi. >>>>>>>> >>>>>>>> Best regards. >>>>>>>> >>>>>>>> On Wed, May 18, 2016 at 9:38 AM, Mahesh Dananjaya < >>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>> >>>>>>>>> Hi Maheshakya, >>>>>>>>> Did you check the repo. I will add recent works today.And also i >>>>>>>>> was going through the Java docs related to spark streaming work. It >>>>>>>>> is with >>>>>>>>> that scala API. thank you. >>>>>>>>> regards, >>>>>>>>> Mahesh. >>>>>>>>> >>>>>>>>> On Tue, May 17, 2016 at 10:11 AM, Mahesh Dananjaya < >>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>> >>>>>>>>>> Hi Maheshakya, >>>>>>>>>> I have gone through the Java Docs and run some of the Spark >>>>>>>>>> examples on spark shell which are paramount improtant for our work. >>>>>>>>>> Then i >>>>>>>>>> have been writing my codes to check the Linear regression, K means >>>>>>>>>> for >>>>>>>>>> streaming. please check my git repo [1]. I think now i have to ask >>>>>>>>>> on dev >>>>>>>>>> regarding the capturing event streams for our work. I will update the >>>>>>>>>> recent things on git. check the park-example directory for java. >>>>>>>>>> examples >>>>>>>>>> run on git shell is not included there. In my case i think i have to >>>>>>>>>> build >>>>>>>>>> mini batches from data streams that comes as individual samples. Now >>>>>>>>>> i am >>>>>>>>>> working on some coding to collect mini batches from data >>>>>>>>>> streams.thank you. >>>>>>>>>> regards, >>>>>>>>>> Mahesh. >>>>>>>>>> [1]https://github.com/dananjayamahesh/GSOC2016 >>>>>>>>>> >>>>>>>>>> On Tue, May 17, 2016 at 10:10 AM, Mahesh Dananjaya < >>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>> >>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>> I have gone through the Java Docs and run some of the Spark >>>>>>>>>>> examples on spark shell which are paramount improtant for our work. >>>>>>>>>>> Then i >>>>>>>>>>> have been writing my codes to check the Linear regression, K means >>>>>>>>>>> for >>>>>>>>>>> streaming. please check my git repo [1]. I think now i have to ask >>>>>>>>>>> on dev >>>>>>>>>>> regarding the capturing event streams for our work. I will update >>>>>>>>>>> the >>>>>>>>>>> recent things on git. check the park-example directory for java. >>>>>>>>>>> examples >>>>>>>>>>> run on git shell is not included there. In my case i think i have >>>>>>>>>>> to build >>>>>>>>>>> mini batches from data streams that comes as individual samples. >>>>>>>>>>> Now i am >>>>>>>>>>> working on some coding to collect mini batches from data >>>>>>>>>>> streams.thank you. >>>>>>>>>>> regards, >>>>>>>>>>> Mahesh. >>>>>>>>>>> [1]https://github.com/dananjayamahesh/GSOC2016 >>>>>>>>>>> >>>>>>>>>>> On Mon, May 16, 2016 at 1:19 PM, Mahesh Dananjaya < >>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>> >>>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>>> thank you. i will update the repo today.thank you.i changed the >>>>>>>>>>>> carbon ml siddhi extention and see how the changes are effecting. >>>>>>>>>>>> i will >>>>>>>>>>>> update the progress as soon as possible.thank you. i had some >>>>>>>>>>>> problem in >>>>>>>>>>>> spark mllib dependency. i was fixing that. >>>>>>>>>>>> regards, >>>>>>>>>>>> Mahesh. >>>>>>>>>>>> p.s: do i need to maintain a blog? >>>>>>>>>>>> >>>>>>>>>>>> On Mon, May 16, 2016 at 10:02 AM, Maheshakya Wijewardena < >>>>>>>>>>>> mahesha...@wso2.com> wrote: >>>>>>>>>>>> >>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>> >>>>>>>>>>>>> Sorry for replying late. >>>>>>>>>>>>> >>>>>>>>>>>>> Thank you for the update. I believe you have done some >>>>>>>>>>>>> implementations with with Spark MLLIb algorithms in streaming >>>>>>>>>>>>> fashion as we >>>>>>>>>>>>> have discussed. If so, can you please share your code in a Github >>>>>>>>>>>>> repo. >>>>>>>>>>>>> >>>>>>>>>>>>> Now i want to implements some machine learning algorithms with >>>>>>>>>>>>>> importing mllib and want to run within your code base >>>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> For the moment you can try out editing the same class >>>>>>>>>>>>> PredictStreamProcessor in the siddhi extension in carbon-ml. >>>>>>>>>>>>> Later we will >>>>>>>>>>>>> add this separately. You should be able to add >>>>>>>>>>>>> org.apache.spark.mllib. >>>>>>>>>>>>> classes to there. >>>>>>>>>>>>> >>>>>>>>>>>>> And i want to see how event streams are coming from cep. As i >>>>>>>>>>>>>> think it is not in a RDD format since it is arriving as the >>>>>>>>>>>>>> individual >>>>>>>>>>>>>> samples. I will send a email to dev asking about how to get the >>>>>>>>>>>>>> streams. >>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> Please pay attention to length[1] and lengthbatch[1] inbuilt >>>>>>>>>>>>> windows in siddhi. What you need to write are functions similar >>>>>>>>>>>>> to a custom >>>>>>>>>>>>> aggregate function[2]. >>>>>>>>>>>>> When you send the email to dev list, explain your requirement. >>>>>>>>>>>>> You need to get a set of event with from a stream with a >>>>>>>>>>>>> specified window >>>>>>>>>>>>> size (number of events). Then build a model within that function. >>>>>>>>>>>>> You also >>>>>>>>>>>>> need to retain the data (learned weights, cluster centers, etc.) >>>>>>>>>>>>> from the >>>>>>>>>>>>> previous window to use in the current window. Ask what can be the >>>>>>>>>>>>> most >>>>>>>>>>>>> suitable option for this among the set of siddhi extensions given. >>>>>>>>>>>>> >>>>>>>>>>>>> Best regards. >>>>>>>>>>>>> >>>>>>>>>>>>> [1] >>>>>>>>>>>>> https://docs.wso2.com/display/CEP400/Inbuilt+Windows#InbuiltWindows-lengthlength >>>>>>>>>>>>> [2] >>>>>>>>>>>>> https://docs.wso2.com/display/CEP400/Writing+a+Custom+Aggregate+Function >>>>>>>>>>>>> >>>>>>>>>>>>> On Wed, May 11, 2016 at 1:43 PM, Mahesh Dananjaya < >>>>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> ---------- Forwarded message ---------- >>>>>>>>>>>>>> From: Mahesh Dananjaya <dananjayamah...@gmail.com> >>>>>>>>>>>>>> Date: Wed, May 11, 2016 at 1:43 PM >>>>>>>>>>>>>> Subject: Re: [Dev] GSOC2016: [ML][CEP] Predictive analytic >>>>>>>>>>>>>> with online data for WSO2 Machine Learner >>>>>>>>>>>>>> To: Maheshakya Wijewardena <mahesha...@wso2.com> >>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>>>>> sorry for not updating. I did what you wanted me to do. I >>>>>>>>>>>>>> checked the code base and train functions. I went through those >>>>>>>>>>>>>> java docs. >>>>>>>>>>>>>> I went through the carbon-ml current implementation of LG and >>>>>>>>>>>>>> K-Mean. And i >>>>>>>>>>>>>> had Apache Spark and i tried with several examples. Now i want to >>>>>>>>>>>>>> implements some machine learning algorithms with importing mllib >>>>>>>>>>>>>> and want >>>>>>>>>>>>>> to run within your code base. Can you help me with that. >>>>>>>>>>>>>> And i want to see how event streams are coming from cep. As i >>>>>>>>>>>>>> think it is not in a RDD format since it is arriving as the >>>>>>>>>>>>>> individual >>>>>>>>>>>>>> samples. I will send a email to dev asking about how to get the >>>>>>>>>>>>>> streams. I >>>>>>>>>>>>>> debugged many of those functions in the code base. So need >>>>>>>>>>>>>> further >>>>>>>>>>>>>> instructions to proceed.thank you. >>>>>>>>>>>>>> regards, >>>>>>>>>>>>>> Mahesh. >>>>>>>>>>>>>> >>>>>>>>>>>>>> On Wed, May 11, 2016 at 10:32 AM, Maheshakya Wijewardena < >>>>>>>>>>>>>> mahesha...@wso2.com> wrote: >>>>>>>>>>>>>> >>>>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> Any update on your progress? >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> On Wed, May 4, 2016 at 8:35 PM, Maheshakya Wijewardena < >>>>>>>>>>>>>>> mahesha...@wso2.com> wrote: >>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> is that "Put break points in train methods in Linear >>>>>>>>>>>>>>>>> Regression class" means the spark/algorithms/ >>>>>>>>>>>>>>>>> LinearRegrassion.java class >>>>>>>>>>>>>>>>> in the org.wso2.carbon.ml.core? is that the correct file? >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> Yes, this is the correct place. >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> You can refer to spark programming guide[1][2] as well as >>>>>>>>>>>>>>>> our ML code base when you try those algorithms out. Please try >>>>>>>>>>>>>>>> to do rough >>>>>>>>>>>>>>>> implementations of the streaming versions of linear >>>>>>>>>>>>>>>> regression, logistic >>>>>>>>>>>>>>>> regression and k-means clustering as we have discussed in the >>>>>>>>>>>>>>>> proposal in >>>>>>>>>>>>>>>> plain Java. It's better if you can create a git repo and share >>>>>>>>>>>>>>>> your code >>>>>>>>>>>>>>>> once you have made some progress. >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> Were you able debug and understand the flow of the ML >>>>>>>>>>>>>>>> siddhi extension? I hope you haven't encountered more errors >>>>>>>>>>>>>>>> after >>>>>>>>>>>>>>>> switching the released version of CEP. >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> Is this Friday okay for you? Afternoon at 2:00 pm? >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> [1] >>>>>>>>>>>>>>>> http://spark.apache.org/docs/latest/programming-guide.html >>>>>>>>>>>>>>>> [2] http://spark.apache.org/docs/latest/mllib-guide.html >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> On Wed, May 4, 2016 at 1:07 PM, Mahesh Dananjaya < >>>>>>>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>>>>>>>> I have been looking into some algorithms related to >>>>>>>>>>>>>>>>> stochastic gradient descent based algorithms.anything i >>>>>>>>>>>>>>>>> should focus please >>>>>>>>>>>>>>>>> let me know.Ans also i will be available for calling this >>>>>>>>>>>>>>>>> week and next >>>>>>>>>>>>>>>>> week.thank you. >>>>>>>>>>>>>>>>> BR, >>>>>>>>>>>>>>>>> Mahesh. >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> On Tue, May 3, 2016 at 5:05 PM, Mahesh Dananjaya < >>>>>>>>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>>>>>>>>> thank you.that's good. i have been trying to fix that for >>>>>>>>>>>>>>>>>> couple of days. please inform me when it will be fixed.now i >>>>>>>>>>>>>>>>>> have been >>>>>>>>>>>>>>>>>> testing the ML algorithms and trying to identify the flow >>>>>>>>>>>>>>>>>> and the >>>>>>>>>>>>>>>>>> hierarchy. is that "Put break points in train methods in >>>>>>>>>>>>>>>>>> Linear Regression >>>>>>>>>>>>>>>>>> class" means the spark/algorithms/ LinearRegrassion.java >>>>>>>>>>>>>>>>>> class in the >>>>>>>>>>>>>>>>>> org.wso2.carbon.ml.core? is that the correct file? >>>>>>>>>>>>>>>>>> And also i am planning to write some programs to use >>>>>>>>>>>>>>>>>> apache spark mllib algorithms. and i refer to [1] and some >>>>>>>>>>>>>>>>>> wso2 >>>>>>>>>>>>>>>>>> documentations to get some idea about ML structure.thank you. >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> BR, >>>>>>>>>>>>>>>>>> Mahesh. >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> [1]nirmalfdo.blogspot.com >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> On Tue, May 3, 2016 at 4:36 PM, Maheshakya Wijewardena < >>>>>>>>>>>>>>>>>> mahesha...@wso2.com> wrote: >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> I have checked. It seems the issue you have encountered >>>>>>>>>>>>>>>>>>> is cause only in the current development branch of the >>>>>>>>>>>>>>>>>>> product-cep. It >>>>>>>>>>>>>>>>>>> doesn't identify the ML siddhi extension as an extension. >>>>>>>>>>>>>>>>>>> ML siddhi >>>>>>>>>>>>>>>>>>> extension works fine in the latest release of CEP (4.1.0) >>>>>>>>>>>>>>>>>>> [1]. >>>>>>>>>>>>>>>>>>> Until we figure out the reason and come up with a >>>>>>>>>>>>>>>>>>> solution, can you use the latest CEP release for your work. >>>>>>>>>>>>>>>>>>> It's fine to >>>>>>>>>>>>>>>>>>> use that since you haven't started actual development yet. >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> [1] http://wso2.com/products/complex-event-processor/ >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> On Tue, May 3, 2016 at 3:19 PM, Maheshakya Wijewardena < >>>>>>>>>>>>>>>>>>> mahesha...@wso2.com> wrote: >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> Is is vital to use those local repo in my upcoming >>>>>>>>>>>>>>>>>>>>> implementation? >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> Yes. The remote p2-repo contains the p2-repos of >>>>>>>>>>>>>>>>>>>> released versions. What you have to develop on is the >>>>>>>>>>>>>>>>>>>> current master of the >>>>>>>>>>>>>>>>>>>> carbon-ml and product-ml. You can try out with the >>>>>>>>>>>>>>>>>>>> modification I have >>>>>>>>>>>>>>>>>>>> suggested. In the meantime, I'll verify whether the >>>>>>>>>>>>>>>>>>>> current repos are >>>>>>>>>>>>>>>>>>>> working as expected. >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> And also i am trying to debug the carbon-ml >>>>>>>>>>>>>>>>>>>>> org.wso2.carbon.ml.core by putting some break point in the >>>>>>>>>>>>>>>>>>>>> spark/algorithms/Linear Regression >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> It's great that you have started looking at the >>>>>>>>>>>>>>>>>>>> implementation of linear regression as well. Put break >>>>>>>>>>>>>>>>>>>> points in train >>>>>>>>>>>>>>>>>>>> methods in LinearRegression class. This is being used when >>>>>>>>>>>>>>>>>>>> you run linear >>>>>>>>>>>>>>>>>>>> regression from UI. >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> I can see some comments left behind for streaming algo >>>>>>>>>>>>>>>>>>>>> as well.thank you >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> You may be referring to the linear regression with SGD >>>>>>>>>>>>>>>>>>>> model. Here, there's no retraining with streaming data >>>>>>>>>>>>>>>>>>>> involved. The SGD >>>>>>>>>>>>>>>>>>>> with minibatches is used to train the model with the data >>>>>>>>>>>>>>>>>>>> set only once. >>>>>>>>>>>>>>>>>>>> What you have to do is create a similar mechanism to >>>>>>>>>>>>>>>>>>>> involve streaming data and retrain models. We will get to >>>>>>>>>>>>>>>>>>>> that part once >>>>>>>>>>>>>>>>>>>> you get comfortable with siddhi extensions. >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> BTW, is it possible for you to join a call on this >>>>>>>>>>>>>>>>>>>> Friday or in the next week. We'll try to resolve your >>>>>>>>>>>>>>>>>>>> current issues and >>>>>>>>>>>>>>>>>>>> discuss further on project. >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> On Tue, May 3, 2016 at 1:03 PM, Mahesh Dananjaya < >>>>>>>>>>>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> Hi maheshakya, >>>>>>>>>>>>>>>>>>>>> Is it ok to go with p2 repo at >>>>>>>>>>>>>>>>>>>>> http://product-dist.wso2.com/p2/carbon/releases/wilkes/features/ >>>>>>>>>>>>>>>>>>>>> rather than the P2-repo at >>>>>>>>>>>>>>>>>>>>> product-ml/modules/p2-profile/target/p2-repo in >>>>>>>>>>>>>>>>>>>>> local repo.What is the impact?.Is is vital to use those >>>>>>>>>>>>>>>>>>>>> local repo in my >>>>>>>>>>>>>>>>>>>>> upcoming implementation?so i was trying to give remote p2 >>>>>>>>>>>>>>>>>>>>> repo to cep built >>>>>>>>>>>>>>>>>>>>> by source and trying to debug the cep ml extension and >>>>>>>>>>>>>>>>>>>>> got the same error >>>>>>>>>>>>>>>>>>>>> as yesterday. But pre-built product is working fine. >>>>>>>>>>>>>>>>>>>>> Therefore now i am >>>>>>>>>>>>>>>>>>>>> trying as you described in the last email. >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> And also i am trying to debug the carbon-ml >>>>>>>>>>>>>>>>>>>>> org.wso2.carbon.ml.core by putting some break point in the >>>>>>>>>>>>>>>>>>>>> spark/algorithms/Linear Regression. I am trying to >>>>>>>>>>>>>>>>>>>>> trigger it with >>>>>>>>>>>>>>>>>>>>> product-ml project with data set. Does that Linear >>>>>>>>>>>>>>>>>>>>> Regression also in UI is >>>>>>>>>>>>>>>>>>>>> consuming that spark algorithms or is it in another >>>>>>>>>>>>>>>>>>>>> place? I can see some >>>>>>>>>>>>>>>>>>>>> comments left behind for streaming algo as well.thank you. >>>>>>>>>>>>>>>>>>>>> BR, >>>>>>>>>>>>>>>>>>>>> Mahesh. >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> On Tue, May 3, 2016 at 9:35 AM, Maheshakya Wijewardena >>>>>>>>>>>>>>>>>>>>> <mahesha...@wso2.com> wrote: >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> The earlier error you have mentioned may occur due to >>>>>>>>>>>>>>>>>>>>>> the incompatible Siddhi versions in ML p2-repo and CEP, >>>>>>>>>>>>>>>>>>>>>> when you add the >>>>>>>>>>>>>>>>>>>>>> p2-repo from prodcut-ml you built. >>>>>>>>>>>>>>>>>>>>>> Current siddhi version in product-cep is >>>>>>>>>>>>>>>>>>>>>> 3.0.6-SNAPSHOT[1], but in ML it's 3.0.2. >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> Can you try changing the siddhi.version in >>>>>>>>>>>>>>>>>>>>>> carbon-ml/pom.xml to 3.0.6-SNAPSHOT, build carbon-ml, >>>>>>>>>>>>>>>>>>>>>> then build product-ml >>>>>>>>>>>>>>>>>>>>>> again. After this, add p2-repo as a local repository >>>>>>>>>>>>>>>>>>>>>> again to fresh CEP >>>>>>>>>>>>>>>>>>>>>> pack and try it out. >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> On Mon, May 2, 2016 at 7:02 PM, Mahesh Dananjaya < >>>>>>>>>>>>>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>> Hi Maheshakya, >>>>>>>>>>>>>>>>>>>>>>> now i remotely debug the CEP extension for ML >>>>>>>>>>>>>>>>>>>>>>> Prediction. What i did was, i have all the pre-build >>>>>>>>>>>>>>>>>>>>>>> version of CEP and ML. >>>>>>>>>>>>>>>>>>>>>>> Therefore i used that built CEP and did the same thing >>>>>>>>>>>>>>>>>>>>>>> that i was doing >>>>>>>>>>>>>>>>>>>>>>> with the source code. I think the only change i did was >>>>>>>>>>>>>>>>>>>>>>> install those >>>>>>>>>>>>>>>>>>>>>>> packages from remote p2 repo. This was work fine and i >>>>>>>>>>>>>>>>>>>>>>> debugged the >>>>>>>>>>>>>>>>>>>>>>> carbon-ml org.wso2.carbon.ml.siddhi.extension as >>>>>>>>>>>>>>>>>>>>>>> described in the [1]. So >>>>>>>>>>>>>>>>>>>>>>> now i have to try same thing with the build by source >>>>>>>>>>>>>>>>>>>>>>> content.thank you. >>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>> [1] >>>>>>>>>>>>>>>>>>>>>>> https://docs.wso2.com/display/ML110/WSO2+CEP+Extension+for+ML+Predictions#WSO2CEPExtensionforMLPredictions-Siddhisyntaxfortheextension >>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>> BR, >>>>>>>>>>>>>>>>>>>>>>> Mahesh >>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>> On Mon, Apr 25, 2016 at 5:49 PM, Maheshakya >>>>>>>>>>>>>>>>>>>>>>> Wijewardena <mahesha...@wso2.com> wrote: >>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> Hi Mahesh, >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> Congratulations and welcome to GSoC 2016. You did a >>>>>>>>>>>>>>>>>>>>>>>> great job in preparing the proposal. Now it's time to >>>>>>>>>>>>>>>>>>>>>>>> dig deep and get >>>>>>>>>>>>>>>>>>>>>>>> started with the project. >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> First of all you need to familiarize with the code >>>>>>>>>>>>>>>>>>>>>>>> base. We have agreed to implement this with CEP event >>>>>>>>>>>>>>>>>>>>>>>> streams. We already >>>>>>>>>>>>>>>>>>>>>>>> have a CEP extension for predictions [1][2]. Go >>>>>>>>>>>>>>>>>>>>>>>> through this implementation >>>>>>>>>>>>>>>>>>>>>>>> and familiarize your self with that. You need to >>>>>>>>>>>>>>>>>>>>>>>> understand how: >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> 1. Even streams are consumed >>>>>>>>>>>>>>>>>>>>>>>> 2. predictions are made from individual event >>>>>>>>>>>>>>>>>>>>>>>> 3. Results are sent back >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> Get WSO2 ML and CEP sources (You may use latest >>>>>>>>>>>>>>>>>>>>>>>> released version of CEP) and build the products. Get >>>>>>>>>>>>>>>>>>>>>>>> both carbon-ml[3] and >>>>>>>>>>>>>>>>>>>>>>>> product-ml[4] masters and create new branches for your >>>>>>>>>>>>>>>>>>>>>>>> work from masters. >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> After you build the products, you may need to do >>>>>>>>>>>>>>>>>>>>>>>> remote debugging[5] to understand the flow. So please >>>>>>>>>>>>>>>>>>>>>>>> follow an example of >>>>>>>>>>>>>>>>>>>>>>>> real time prediction with ML with debugging and get >>>>>>>>>>>>>>>>>>>>>>>> some idea. The >>>>>>>>>>>>>>>>>>>>>>>> component you need to debug is >>>>>>>>>>>>>>>>>>>>>>>> org.wso2.carbon.ml.siddhi.extension. >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> Next tasks would be implementing online learning >>>>>>>>>>>>>>>>>>>>>>>> algorithms in plain java with spark ml lib and >>>>>>>>>>>>>>>>>>>>>>>> integrating those to ML. We >>>>>>>>>>>>>>>>>>>>>>>> also need to come up with a proper and detailed >>>>>>>>>>>>>>>>>>>>>>>> architecture to employ >>>>>>>>>>>>>>>>>>>>>>>> those algorithms in ML. Getting familiar with the >>>>>>>>>>>>>>>>>>>>>>>> aforementioned sections >>>>>>>>>>>>>>>>>>>>>>>> would give you some insight on how this should be >>>>>>>>>>>>>>>>>>>>>>>> implemented. >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> So please try to get a quick grasp then you can >>>>>>>>>>>>>>>>>>>>>>>> start the implementation. Let us know if you have any >>>>>>>>>>>>>>>>>>>>>>>> questions or you get >>>>>>>>>>>>>>>>>>>>>>>> stuck somewhere. >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> Also, please always add WSO2 developer's list as >>>>>>>>>>>>>>>>>>>>>>>> well when you communicate with us regarding the >>>>>>>>>>>>>>>>>>>>>>>> project so that you can get >>>>>>>>>>>>>>>>>>>>>>>> opinions and feedback from others as well. >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> Best regards. >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> [1] >>>>>>>>>>>>>>>>>>>>>>>> https://docs.wso2.com/display/ML110/WSO2+CEP+Extension+for+ML+Predictions#WSO2CEPExtensionforMLPredictions-Siddhisyntaxfortheextension >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> [2] >>>>>>>>>>>>>>>>>>>>>>>> https://github.com/wso2/carbon-ml/tree/master/components/extensions/org.wso2.carbon.ml.siddhi.extension >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> [3] https://github.com/wso2/carbon-ml >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> [4] https://github.com/wso2/product-ml >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> [5] >>>>>>>>>>>>>>>>>>>>>>>> https://dzone.com/articles/how-debug-wso2-carbon-kernel >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> On Mon, Apr 25, 2016 at 3:33 PM, Mahesh Dananjaya < >>>>>>>>>>>>>>>>>>>>>>>> dananjayamah...@gmail.com> wrote: >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>> Hi, >>>>>>>>>>>>>>>>>>>>>>>>> thank you for accepting my GSOC 2016 proposal and >>>>>>>>>>>>>>>>>>>>>>>>> i am looking forward for the further instruction and >>>>>>>>>>>>>>>>>>>>>>>>> project continuation. >>>>>>>>>>>>>>>>>>>>>>>>> thank you very much. >>>>>>>>>>>>>>>>>>>>>>>>> regards, >>>>>>>>>>>>>>>>>>>>>>>>> Mahesh. >>>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>>> >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> -- >>>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> >>>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>>> _______________________________________________ >>>>>>>>>>>>>> Dev mailing list >>>>>>>>>>>>>> Dev@wso2.org >>>>>>>>>>>>>> http://wso2.org/cgi-bin/mailman/listinfo/dev >>>>>>>>>>>>>> >>>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> -- >>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>>>>>>> mahesha...@wso2.com >>>>>>>>>>>>> +94711228855 >>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>>> >>>>>>>>>>>> >>>>>>>>>>> >>>>>>>>>> >>>>>>>>> >>>>>>>> >>>>>>>> >>>>>>>> -- >>>>>>>> Pruthuvi Maheshakya Wijewardena >>>>>>>> mahesha...@wso2.com >>>>>>>> +94711228855 >>>>>>>> >>>>>>>> >>>>>>>> >>>>>>> >>>>>> >>>>>> >>>>>> -- >>>>>> Pruthuvi Maheshakya Wijewardena >>>>>> mahesha...@wso2.com >>>>>> +94711228855 >>>>>> >>>>>> >>>>>> >>>>> >>>> >>> >> > > > -- > Pruthuvi Maheshakya Wijewardena > mahesha...@wso2.com > +94711228855 > > > -- Pruthuvi Maheshakya Wijewardena mahesha...@wso2.com +94711228855
_______________________________________________ Dev mailing list Dev@wso2.org http://wso2.org/cgi-bin/mailman/listinfo/dev