I'm working with my colleagues at CollabNet, who have expressed interest in providing us with some EC2 time for this sort of testing. They are working on an EC2 deployment of Hadoop using their CUBiT machine allocation environment, and the quid pro quo would be that we help them exercise this tool. We have not yet worked out the administrative details, but I will keep the list posted on progress.

I also think it is important that we build up some example datasets to test our existing code, and I intend to devote some energy to this soon.

Jeff

Jeff Eastman, Ph.D.
Windward Solutions Inc.
+1.415.298.0023
http://windwardsolutions.com
http://jeffeastman.blogspot.com

> -----Original Message-----
> From: Samee Zahur [mailto:[EMAIL PROTECTED]
> Sent: Friday, March 28, 2008 10:43 PM
> To: mahout-dev@lucene.apache.org
> Subject: Undergrad stud interested in GSoC
>
> Hello,
> I have read through the NIPS paper and the March archive for this mailing
> list, and I feel I can implement some of the algorithms (as permitted
> by time) described in the NIPS paper. Being an undergrad student
> interested in the field of data-intensive machine learning techniques
> and applications, I am interested in implementing these algorithms as
> a way of getting an exposure into this field.
>
> Even though I have already applied to work in this SoC, I do have one
> question though. When coding, how am I expected to test the
> effectiveness of my algorithms without running it on a multicore
> platform? Or do I simply assume that a sufficiently sensible
> application of M/R will allow Hadoop to take care of scalability? What
> is the usual development platform used here? Sorry if such questions
> seem a bit silly, but it is in order to gain the experience in such
> development that I want to work in this project.
>
> And about the application for SoC, I selected the ASF as the mentoring
> organisation - how do I make sure that someone from Mahout reviews it?
>
> Initially for the SoC, I want to implement LWLR, NN and PCA, but later
> beyond the GSoC I want to continue contributing to this project in
> other ways I figure out once I gain familiarity with the scope of this
> project.
>
> Samee