Hi Heshani,

Thank you for the interest for this project.

WSO2 Machine Learner supports several learning algorithms at the moment[1].
This project intends is to leverage the existing algorithms in WSO2 Machine
Learner to support streaming data. As an initiative, first you can get an
idea about what WSO2 Machine Learner does and how it operates. You can
download WSO2 Machine Learner from product page[2] and the the source code
[3]. ML is using Apache Spark MLLib[4] for its' algorithms so it's better
to read and understand what it does as well.

In order to get an idea about the deliverables and the scope of this
project, try to understand how Spark streaming[5] (see examples) handles
streaming data. Also, have a look in the streaming algorithms[6][7]
supported by MLLib. There are two approaches discussed to employ
incremental learning in ML in the project proposals page. These streaming
algorithms can be directly used in the first approach. For the other
approach, the your implementation should contain a procedure to create mini
batches from streaming data with relevant sizes (i.e. a moving window) and
do periodic retraining of the same algorithm.

To start with the project, you will need to come up with a suitable plan
and an architecture first.

Please watch the video referenced in the proposal (reference: 5). It will
help you getting a better idea about machine learning algorithms with
streaming data.

Let us know if you need any help with these.

Best regards

[1] https://docs.wso2.com/display/ML110/Machine+Learner+Algorithms
[2] http://wso2.com/products/machine-learner/
[3]
https://docs.wso2.com/display/ML110/Building+from+Source#BuildingfromSource-Downloadingthesourcecheckout
[4] https://spark.apache.org/docs/1.4.1/mllib-guide.html
[5] https://spark.apache.org/docs/1.4.1/streaming-programming-guide.html
[6]
https://spark.apache.org/docs/1.4.1/mllib-linear-methods.html#streaming-linear-regression
[7]
https://spark.apache.org/docs/1.4.1/mllib-clustering.html#streaming-k-means

On Mon, Feb 29, 2016 at 2:54 PM, Heshani Herath <heshani7.her...@gmail.com>
wrote:

> Hi,
>
>
> https://docs.wso2.com/display/GSoC/Project+Proposals+for+2016#ProjectProposalsfor2016-Proposal6:[ML]PredictiveanalyticswithonlinedataforWSO2MachineLearner
>
> I'm a 4th year undergraduate from SLIIT faculty of computing(specializing
> in Software Engineering) who is interested in doing the aforementioned
> project as the final year research. I would like to know more details on
> this topic and the procedure to be followed when implementing it. Please be
> kind enough to reply as soon as possible.
>
> Thank you
>
> --
> Best Regards,
> Heshani Herath
>



-- 
Pruthuvi Maheshakya Wijewardena
mahesha...@wso2.com
+94711228855
_______________________________________________
Dev mailing list
Dev@wso2.org
http://wso2.org/cgi-bin/mailman/listinfo/dev

Reply via email to