Hi Maheshakya,
I have gone through the Javadocs and run some of the Spark examples in the Spark shell that are of paramount importance for our work. I have then been writing my own code to try out linear regression and k-means for streaming; please check my Git repo [1]. I think I now have to ask on dev about capturing event streams for our work. I will push my recent changes to Git. Check the park-example directory for the Java code; the examples I ran in the shell are not included there. In my case I think I have to build mini-batches from data streams that arrive as individual samples, so I am now working on code to collect mini-batches from the data streams. Thank you.
Regards,
Mahesh.
[1] https://github.com/dananjayamahesh/GSOC2016
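A minimal sketch of the mini-batch collection described above, in plain Java with no Spark or Siddhi dependency. The class name `MiniBatchCollector`, its methods, and the `double[]` sample representation are all hypothetical and are not taken from the repo or from carbon-ml; the sketch only illustrates buffering individually arriving samples until a fixed batch size is reached and then handing the full batch to a callback.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.function.Consumer;

/**
 * Hypothetical helper (not part of carbon-ml or Siddhi): buffers samples that
 * arrive one at a time and emits them as fixed-size mini-batches.
 */
final class MiniBatchCollector {
    private final int batchSize;
    private final Consumer<List<double[]>> onBatch;
    private List<double[]> buffer;

    MiniBatchCollector(int batchSize, Consumer<List<double[]>> onBatch) {
        if (batchSize <= 0) {
            throw new IllegalArgumentException("batchSize must be positive");
        }
        this.batchSize = batchSize;
        this.onBatch = onBatch;
        this.buffer = new ArrayList<>(batchSize);
    }

    /** Adds one sample; when the buffer is full, passes the batch to the callback. */
    void add(double[] sample) {
        // Copy the sample so later changes by the caller do not alter the batch.
        buffer.add(sample.clone());
        if (buffer.size() == batchSize) {
            List<double[]> full = buffer;
            // Start a fresh buffer before invoking the callback.
            buffer = new ArrayList<>(batchSize);
            onBatch.accept(full);
        }
    }

    /** Number of samples waiting for the current batch to fill up. */
    int pending() {
        return buffer.size();
    }
}
```

With a batch size of 3 and seven incoming samples, the callback would receive two batches of three and one sample would stay pending. Inside the callback, a batch could then be handed to a training step, for example after converting it to an RDD with `JavaSparkContext.parallelize`; that wiring is left out here because it depends on how the events are obtained from CEP.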
On Tue, May 17, 2016 at 10:10 AM, Mahesh Dananjaya <dananjayamah...@gmail.com> wrote:

> Hi Maheshakya,
> I have gone through the Javadocs and run some of the Spark examples in the
> Spark shell that are of paramount importance for our work. I have then been
> writing my own code to try out linear regression and k-means for streaming;
> please check my Git repo [1]. I think I now have to ask on dev about
> capturing event streams for our work. I will push my recent changes to
> Git. Check the park-example directory for the Java code; the examples I ran
> in the shell are not included there. In my case I think I have to build
> mini-batches from data streams that arrive as individual samples, so I am now
> working on code to collect mini-batches from the data streams. Thank you.
> Regards,
> Mahesh.
> [1] https://github.com/dananjayamahesh/GSOC2016
>
> On Mon, May 16, 2016 at 1:19 PM, Mahesh Dananjaya <dananjayamah...@gmail.com> wrote:
>
>> Hi Maheshakya,
>> Thank you. I will update the repo today. I changed the carbon-ml
>> Siddhi extension to see how the changes take effect. I will update you on
>> my progress as soon as possible. I had a problem with the Spark MLlib
>> dependency, which I was fixing.
>> Regards,
>> Mahesh.
>> P.S. Do I need to maintain a blog?
>>
>> On Mon, May 16, 2016 at 10:02 AM, Maheshakya Wijewardena <mahesha...@wso2.com> wrote:
>>
>>> Hi Mahesh,
>>>
>>> Sorry for replying late.
>>>
>>> Thank you for the update. I believe you have done some implementations
>>> with Spark MLlib algorithms in a streaming fashion, as we discussed.
>>> If so, can you please share your code in a GitHub repo?
>>>
>>>> Now I want to implement some machine learning algorithms by importing
>>>> MLlib and run them within your code base.
>>>
>>> For the moment you can try editing the same class,
>>> PredictStreamProcessor, in the Siddhi extension in carbon-ml. Later we will
>>> add this separately. You should be able to add org.apache.spark.mllib.
>>> classes there.
>>>
>>>> And I want to see how event streams come from CEP. I think they are
>>>> not in RDD format, since they arrive as individual samples. I
>>>> will send an email to dev asking how to get the streams.
>>>
>>> Please pay attention to the length[1] and lengthBatch[1] inbuilt windows in
>>> Siddhi. What you need to write are functions similar to a custom aggregate
>>> function[2].
>>> When you send the email to the dev list, explain your requirement. You need
>>> to get a set of events from a stream with a specified window size
>>> (number of events), then build a model within that function. You also need
>>> to retain the data (learned weights, cluster centers, etc.) from the
>>> previous window to use in the current window. Ask which of the available
>>> Siddhi extension types is the most suitable option for this.
>>>
>>> Best regards.
>>>
>>> [1] https://docs.wso2.com/display/CEP400/Inbuilt+Windows#InbuiltWindows-lengthlength
>>> [2] https://docs.wso2.com/display/CEP400/Writing+a+Custom+Aggregate+Function
>>>
>>> On Wed, May 11, 2016 at 1:43 PM, Mahesh Dananjaya <dananjayamah...@gmail.com> wrote:
>>>
>>>> ---------- Forwarded message ----------
>>>> From: Mahesh Dananjaya <dananjayamah...@gmail.com>
>>>> Date: Wed, May 11, 2016 at 1:43 PM
>>>> Subject: Re: [Dev] GSOC2016: [ML][CEP] Predictive analytic with online
>>>> data for WSO2 Machine Learner
>>>> To: Maheshakya Wijewardena <mahesha...@wso2.com>
>>>>
>>>> Hi Maheshakya,
>>>> Sorry for not updating. I did what you asked me to do. I checked the
>>>> code base and the train functions, and went through the Javadocs. I went
>>>> through the current carbon-ml implementation of LG and k-means. I also
>>>> have Apache Spark and tried several examples. Now I want to implement
>>>> some machine learning algorithms by importing MLlib and run them
>>>> within your code base. Can you help me with that?
>>>> And I want to see how event streams come from CEP. I think they are
>>>> not in RDD format, since they arrive as individual samples. I
>>>> will send an email to dev asking how to get the streams. I debugged
>>>> many of the functions in the code base, so I need further instructions to
>>>> proceed. Thank you.
>>>> Regards,
>>>> Mahesh.
>>>>
>>>> On Wed, May 11, 2016 at 10:32 AM, Maheshakya Wijewardena <mahesha...@wso2.com> wrote:
>>>>
>>>>> Hi Mahesh,
>>>>>
>>>>> Any update on your progress?
>>>>>
>>>>> Best regards.
>>>>>
>>>>> On Wed, May 4, 2016 at 8:35 PM, Maheshakya Wijewardena <mahesha...@wso2.com> wrote:
>>>>>
>>>>>> Hi Mahesh,
>>>>>>
>>>>>>> Does "Put break points in train methods in Linear Regression
>>>>>>> class" mean the spark/algorithms/LinearRegression.java class in
>>>>>>> org.wso2.carbon.ml.core? Is that the correct file?
>>>>>>
>>>>>> Yes, this is the correct place.
>>>>>>
>>>>>> You can refer to the Spark programming guide[1][2] as well as our ML code
>>>>>> base when you try those algorithms out. Please try to do rough
>>>>>> implementations of the streaming versions of linear regression, logistic
>>>>>> regression and k-means clustering in plain Java, as we discussed in the
>>>>>> proposal. It's better if you can create a Git repo and share your code
>>>>>> once you have made some progress.
>>>>>>
>>>>>> Were you able to debug and understand the flow of the ML Siddhi
>>>>>> extension? I hope you haven't encountered more errors after switching to
>>>>>> the released version of CEP.
>>>>>>
>>>>>> Is this Friday okay for you? Afternoon at 2:00 pm?
>>>>>>
>>>>>> Best regards.
>>>>>>
>>>>>> [1] http://spark.apache.org/docs/latest/programming-guide.html
>>>>>> [2] http://spark.apache.org/docs/latest/mllib-guide.html
>>>>>>
>>>>>> On Wed, May 4, 2016 at 1:07 PM, Mahesh Dananjaya <dananjayamah...@gmail.com> wrote:
>>>>>>
>>>>>>> Hi Maheshakya,
>>>>>>> I have been looking into algorithms based on stochastic
>>>>>>> gradient descent. If there is anything I should focus on, please let me
>>>>>>> know. I will also be available for a call this week and next week.
>>>>>>> Thank you.
>>>>>>> BR,
>>>>>>> Mahesh.
>>>>>>>
>>>>>>> On Tue, May 3, 2016 at 5:05 PM, Mahesh Dananjaya <dananjayamah...@gmail.com> wrote:
>>>>>>>
>>>>>>>> Hi Maheshakya,
>>>>>>>> Thank you, that's good. I have been trying to fix that for a couple of
>>>>>>>> days. Please inform me when it is fixed. I have now been testing the ML
>>>>>>>> algorithms and trying to identify the flow and the hierarchy. Does "Put
>>>>>>>> break points in train methods in Linear Regression class" mean the
>>>>>>>> spark/algorithms/LinearRegression.java class in
>>>>>>>> org.wso2.carbon.ml.core? Is that the correct file?
>>>>>>>> I am also planning to write some programs that use Apache Spark
>>>>>>>> MLlib algorithms. I refer to [1] and some WSO2 documentation to get
>>>>>>>> an idea of the ML structure. Thank you.
>>>>>>>>
>>>>>>>> BR,
>>>>>>>> Mahesh.
>>>>>>>>
>>>>>>>> [1] nirmalfdo.blogspot.com
>>>>>>>>
>>>>>>>> On Tue, May 3, 2016 at 4:36 PM, Maheshakya Wijewardena <mahesha...@wso2.com> wrote:
>>>>>>>>
>>>>>>>>> Hi Mahesh,
>>>>>>>>>
>>>>>>>>> I have checked. It seems the issue you have encountered occurs
>>>>>>>>> only in the current development branch of product-cep. It doesn't
>>>>>>>>> identify the ML Siddhi extension as an extension. The ML Siddhi
>>>>>>>>> extension works fine in the latest release of CEP (4.1.0) [1].
>>>>>>>>> Until we figure out the reason and come up with a solution, can
>>>>>>>>> you use the latest CEP release for your work? It's fine to use that
>>>>>>>>> since you haven't started actual development yet.
>>>>>>>>>
>>>>>>>>> Best regards.
>>>>>>>>>
>>>>>>>>> [1] http://wso2.com/products/complex-event-processor/
>>>>>>>>>
>>>>>>>>> On Tue, May 3, 2016 at 3:19 PM, Maheshakya Wijewardena <mahesha...@wso2.com> wrote:
>>>>>>>>>
>>>>>>>>>> Hi Mahesh,
>>>>>>>>>>
>>>>>>>>>>> Is it vital to use the local repo in my upcoming
>>>>>>>>>>> implementation?
>>>>>>>>>>
>>>>>>>>>> Yes. The remote p2-repo contains the p2-repos of released
>>>>>>>>>> versions. What you have to develop on is the current master of
>>>>>>>>>> carbon-ml and product-ml. You can try the modification I have
>>>>>>>>>> suggested. In the meantime, I'll verify whether the current repos are
>>>>>>>>>> working as expected.
>>>>>>>>>>
>>>>>>>>>>> And also I am trying to debug the carbon-ml
>>>>>>>>>>> org.wso2.carbon.ml.core by putting some break points in
>>>>>>>>>>> spark/algorithms/Linear Regression
>>>>>>>>>>
>>>>>>>>>> It's great that you have started looking at the implementation of
>>>>>>>>>> linear regression as well. Put break points in train methods in
>>>>>>>>>> LinearRegression class. This is used when you run linear regression
>>>>>>>>>> from the UI.
>>>>>>>>>>
>>>>>>>>>>> I can see some comments left behind for streaming algo as
>>>>>>>>>>> well. Thank you
>>>>>>>>>>
>>>>>>>>>> You may be referring to the linear regression with SGD model.
>>>>>>>>>> Here, there's no retraining with streaming data involved. SGD with
>>>>>>>>>> mini-batches is used to train the model with the data set only once.
>>>>>>>>>> What you have to do is create a similar mechanism that takes in
>>>>>>>>>> streaming data and retrains models. We will get to that part once you
>>>>>>>>>> get comfortable with Siddhi extensions.
>>>>>>>>>>
>>>>>>>>>> BTW, is it possible for you to join a call this Friday or
>>>>>>>>>> next week? We'll try to resolve your current issues and discuss the
>>>>>>>>>> project further.
>>>>>>>>>>
>>>>>>>>>> Best regards.
>>>>>>>>>>
>>>>>>>>>> On Tue, May 3, 2016 at 1:03 PM, Mahesh Dananjaya <dananjayamah...@gmail.com> wrote:
>>>>>>>>>>
>>>>>>>>>>> Hi Maheshakya,
>>>>>>>>>>> Is it OK to go with the p2 repo at
>>>>>>>>>>> http://product-dist.wso2.com/p2/carbon/releases/wilkes/features/
>>>>>>>>>>> rather than the p2-repo at
>>>>>>>>>>> product-ml/modules/p2-profile/target/p2-repo in the
>>>>>>>>>>> local repo? What is the impact? Is it vital to use the local repo
>>>>>>>>>>> in my upcoming implementation? I was trying to give the remote p2
>>>>>>>>>>> repo to the CEP built from source and to debug the CEP ML
>>>>>>>>>>> extension, and I got the same error as yesterday. But the pre-built
>>>>>>>>>>> product works fine. Therefore I am now trying what you described
>>>>>>>>>>> in the last email.
>>>>>>>>>>>
>>>>>>>>>>> And also I am trying to debug the carbon-ml
>>>>>>>>>>> org.wso2.carbon.ml.core by putting some break points in
>>>>>>>>>>> spark/algorithms/Linear Regression. I am trying to trigger it from
>>>>>>>>>>> the product-ml project with a data set. Does linear regression in
>>>>>>>>>>> the UI also use those Spark algorithms, or is it in another place?
>>>>>>>>>>> I can see some comments left behind for streaming algo as well.
>>>>>>>>>>> Thank you.
>>>>>>>>>>> BR,
>>>>>>>>>>> Mahesh.
>>>>>>>>>>>
>>>>>>>>>>> On Tue, May 3, 2016 at 9:35 AM, Maheshakya Wijewardena <mahesha...@wso2.com> wrote:
>>>>>>>>>>>
>>>>>>>>>>>> Hi Mahesh,
>>>>>>>>>>>>
>>>>>>>>>>>> The earlier error you mentioned may occur due to
>>>>>>>>>>>> incompatible Siddhi versions in the ML p2-repo and CEP when you
>>>>>>>>>>>> add the p2-repo from the product-ml you built.
>>>>>>>>>>>> The current Siddhi version in product-cep is 3.0.6-SNAPSHOT[1], but
>>>>>>>>>>>> in ML it's 3.0.2.
>>>>>>>>>>>>
>>>>>>>>>>>> Can you try changing the siddhi.version in carbon-ml/pom.xml to
>>>>>>>>>>>> 3.0.6-SNAPSHOT, build carbon-ml, then build product-ml again?
>>>>>>>>>>>> After this, add the p2-repo as a local repository again to a fresh
>>>>>>>>>>>> CEP pack and try it out.
>>>>>>>>>>>>
>>>>>>>>>>>> Best regards.
>>>>>>>>>>>>
>>>>>>>>>>>> On Mon, May 2, 2016 at 7:02 PM, Mahesh Dananjaya <dananjayamah...@gmail.com> wrote:
>>>>>>>>>>>>
>>>>>>>>>>>>> Hi Maheshakya,
>>>>>>>>>>>>> I have now remotely debugged the CEP extension for ML prediction.
>>>>>>>>>>>>> I have pre-built versions of both CEP and ML, so I used
>>>>>>>>>>>>> the pre-built CEP and did the same thing I was doing with the
>>>>>>>>>>>>> source code. I think the only change I made was installing the
>>>>>>>>>>>>> packages from the remote p2 repo. This worked fine, and I debugged
>>>>>>>>>>>>> the carbon-ml org.wso2.carbon.ml.siddhi.extension as described in
>>>>>>>>>>>>> [1]. So now I have to try the same thing with the build from
>>>>>>>>>>>>> source. Thank you.
>>>>>>>>>>>>>
>>>>>>>>>>>>> [1] https://docs.wso2.com/display/ML110/WSO2+CEP+Extension+for+ML+Predictions#WSO2CEPExtensionforMLPredictions-Siddhisyntaxfortheextension
>>>>>>>>>>>>>
>>>>>>>>>>>>> BR,
>>>>>>>>>>>>> Mahesh
>>>>>>>>>>>>>
>>>>>>>>>>>>> On Mon, Apr 25, 2016 at 5:49 PM, Maheshakya Wijewardena <mahesha...@wso2.com> wrote:
>>>>>>>>>>>>>
>>>>>>>>>>>>>> Hi Mahesh,
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Congratulations and welcome to GSoC 2016. You did a great job
>>>>>>>>>>>>>> in preparing the proposal. Now it's time to dig deep and get
>>>>>>>>>>>>>> started with the project.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> First of all you need to familiarize yourself with the code base.
>>>>>>>>>>>>>> We have agreed to implement this with CEP event streams. We
>>>>>>>>>>>>>> already have a CEP extension for predictions [1][2]. Go through
>>>>>>>>>>>>>> this implementation and familiarize yourself with it. You need
>>>>>>>>>>>>>> to understand how:
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> 1. Event streams are consumed
>>>>>>>>>>>>>> 2. Predictions are made from individual events
>>>>>>>>>>>>>> 3. Results are sent back
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Get the WSO2 ML and CEP sources (you may use the latest released
>>>>>>>>>>>>>> version of CEP) and build the products. Get both the carbon-ml[3]
>>>>>>>>>>>>>> and product-ml[4] masters and create new branches for your work
>>>>>>>>>>>>>> from the masters.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> After you build the products, you may need to do remote
>>>>>>>>>>>>>> debugging[5] to understand the flow. So please follow an example
>>>>>>>>>>>>>> of real-time prediction with ML while debugging to get some idea.
>>>>>>>>>>>>>> The component you need to debug is
>>>>>>>>>>>>>> org.wso2.carbon.ml.siddhi.extension.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> The next tasks would be implementing online learning algorithms
>>>>>>>>>>>>>> in plain Java with Spark MLlib and integrating those into ML. We
>>>>>>>>>>>>>> also need to come up with a proper and detailed architecture to
>>>>>>>>>>>>>> employ those algorithms in ML. Getting familiar with the
>>>>>>>>>>>>>> aforementioned sections would give you some insight into how this
>>>>>>>>>>>>>> should be implemented.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> So please try to get a quick grasp; then you can start the
>>>>>>>>>>>>>> implementation. Let us know if you have any questions or you get
>>>>>>>>>>>>>> stuck somewhere.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Also, please always add the WSO2 developers' list when
>>>>>>>>>>>>>> you communicate with us regarding the project, so that you can
>>>>>>>>>>>>>> get opinions and feedback from others as well.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> Best regards.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> [1] https://docs.wso2.com/display/ML110/WSO2+CEP+Extension+for+ML+Predictions#WSO2CEPExtensionforMLPredictions-Siddhisyntaxfortheextension
>>>>>>>>>>>>>> [2] https://github.com/wso2/carbon-ml/tree/master/components/extensions/org.wso2.carbon.ml.siddhi.extension
>>>>>>>>>>>>>> [3] https://github.com/wso2/carbon-ml
>>>>>>>>>>>>>> [4] https://github.com/wso2/product-ml
>>>>>>>>>>>>>> [5] https://dzone.com/articles/how-debug-wso2-carbon-kernel
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> On Mon, Apr 25, 2016 at 3:33 PM, Mahesh Dananjaya <dananjayamah...@gmail.com> wrote:
>>>>>>>>>>>>>>
>>>>>>>>>>>>>>> Hi,
>>>>>>>>>>>>>>> Thank you for accepting my GSoC 2016 proposal. I am
>>>>>>>>>>>>>>> looking forward to further instructions and to continuing
>>>>>>>>>>>>>>> the project. Thank you very much.
>>>>>>>>>>>>>>> Regards,
>>>>>>>>>>>>>>> Mahesh.
>>>>>>>>>>>>>>
>>>>>>>>>>>>>> --
>>>>>>>>>>>>>> Pruthuvi Maheshakya Wijewardena
>>>>>>>>>>>>>> mahesha...@wso2.com
>>>>>>>>>>>>>> +94711228855
>>>>
>>>> _______________________________________________
>>>> Dev mailing list
>>>> Dev@wso2.org
>>>> http://wso2.org/cgi-bin/mailman/listinfo/dev
_______________________________________________
Dev mailing list
Dev@wso2.org
http://wso2.org/cgi-bin/mailman/listinfo/dev