Hey Aditya,

First of all, welcome to the community. We'd love to have you contribute.
The new algorithms framework is certainly a 'target rich environment'. Since you are already familiar with DBSCAN, why not start there?

If you check out:
https://github.com/apache/mahout/tree/master/math-scala/src/main/scala/org/apache/mahout/math/algorithms
you'll see in general what our framework looks like.

You'll need to create a ClassificationModel trait similar to:
https://github.com/apache/mahout/blob/master/math-scala/src/main/scala/org/apache/mahout/math/algorithms/regression/RegressorModel.scala

Then you'll extend the ClassificationModel with DBSCAN (or possibly some intermediate trait, as LinearRegressionModel does before OLS).

Perhaps I should have started by asking: how well do you know Scala?

Anyway, those are good places to get started! Let me know if I can help.

tg

Trevor Grant
Data Scientist
https://github.com/rawkintrevo
http://stackexchange.com/users/3002022/rawkintrevo
http://trevorgrant.org

*"Fortunate is he, who is able to know the causes of things." -Virgil*

On Wed, Feb 22, 2017 at 4:23 PM, Aditya <adityasarma...@gmail.com> wrote:

> Hello everyone!
>
> I'm a senior year computer science student from Birla Institute of
> Technology and Science, India. I have experience in fields like Data Mining
> and Machine Learning. Apart from doing basic coursework, which included Data
> Mining, Parallel Computing, and Machine Learning, I have also worked on
> research projects where I built scalable DBSCAN-like clustering algorithms.
>
> I have gone through the Apache Mahout website and was wondering if I
> could *contribute to Mahout in terms of algorithm development / improving
> existing algorithms.*
>
> I would be grateful if you could provide me with a starting point from
> which I can pick up and understand the Mahout ecosystem. I have no previous
> experience working with Apache Mahout or Spark, but I have worked with
> the MapReduce model before (though I haven't used Hadoop).
>
> I wish to work full time during the summer and take part in the Google Summer
> of Code 2017 program by contributing to Apache Mahout.
>
> Awaiting your replies!
>
> Cheers!
> Aditya