Re: Interested in GSOC

2011-03-26 Thread Vinicius Barrox
Thanks for the tips. I'm going through the code and javadocs right now, I will let you know when I have any doubts. I'm not sure to which part of Lucene I'm intending to right a proposal yet, but search/query and query parsing sounds interesting. On Fri, Mar 25, 2011 at 7:15 PM, Adriano

Re: Interested in GSOC

2011-03-25 Thread Simon Willnauer
Hey there, welcome to Lucene :), good to hear you are interested in Lucene and GSoC! On Fri, Mar 25, 2011 at 4:49 AM, Vinicius Paes de barros viniciuspaesdebar...@yahoo.com.br wrote: Hi there, I heard about GSOC from a friend of mine at college and I decide I want to participate this year.

Re: Interested in GSOC

2011-03-25 Thread Adriano Crestani
Hi Vinicius, Welcome to Lucene! I think a good place to look for internal design documentation is the javadoc package summary. Here is an example: [1], each package usually has its own detailed summary. I hope it helps ;) [1] -

Interested in GSOC

2011-03-24 Thread Vinicius Paes de barros
Hi there, I heard about GSOC from a friend of mine at college and I decide I want to participate this year. I already used Lucene before, so Lucene sounds like a good place to start. I went through the JIRA projects, but I couldn't find something I feel like writing a proposal to, maybe I don't

Re: Introduction for student interested in GSoC

2009-03-31 Thread David Hall
Here's a followup proposal (submitted to GSOC's site. I will add it to the wiki, but I'm having trouble accessing it right now) Thanks! -- David Title/Summary: Distributed Latent Dirichlet Allocation Student: David Hall Student e-mail: d...@cs.stanford.edu Student Major: Symbolic Systems/

Re: Introduction for student interested in GSoC

2009-03-25 Thread Ted Dunning
David, You are right that this is veering a little bit away from Mahout's central focus. We will have to beg a bit of forgiveness on that. I have a question for you and some hints about useful directions. First, is is possible for Scala to move the byte code or other representation of a

Re: Introduction for student interested in GSoC

2009-03-25 Thread Ted Dunning
Groovy closures are just objects as well, but they can't easily be serialized because they can capture references to other objects which are unlikely to exist on the far machine. Can you say more about the compiler plugin? Or provide a pointer? Also, in your example here, how would you deal

Re: Introduction for student interested in GSoC

2009-03-25 Thread Ted Dunning
One very nice thing that Cascading allows in the logical flow is that it allows groups and joins to be expressed which it then translates and schedules reasonably well into MR programs in which the appropriate functions are all collected as you suggest. On Wed, Mar 25, 2009 at 1:23 PM, David Hall

Re: Introduction for student interested in GSoC

2009-03-24 Thread Ted Dunning
This sounds fantastic. I think that your scala code is interesting, but your thoughts on LDA are much more so. I tried doing a similar simplification of map-reduce program writing using groovy and found that in spite of even smaller programs than you quote for word-count, that the benefits in

Re: Introduction for student interested in GSoC

2009-03-24 Thread David Hall
On Tue, Mar 24, 2009 at 4:15 PM, Ted Dunning ted.dunn...@gmail.com wrote: This sounds fantastic. I think that your scala code is interesting, but your thoughts on LDA are much more so.  I tried doing a similar simplification of map-reduce program writing using groovy and found that in spite

Re: Introduction for student interested in GSoC

2009-03-24 Thread David Hall
On Tue, Mar 24, 2009 at 4:34 PM, David Hall d...@cs.stanford.edu wrote: On Tue, Mar 24, 2009 at 4:15 PM, Ted Dunning ted.dunn...@gmail.com wrote: It would also be interesting to see how you might attack semi-supervised multi-task learning using a well-founded Bayesian approach.  For a

Re: Undergrad stud interested in GSoC

2008-04-05 Thread Isabel Drost
On Friday 04 April 2008, Samee Zahur wrote: I do see others at least including signature.asc as an attachment at times. I would guess there is some upper limit on the size of attachments. Don't know for sure though. Isabel -- You cannot use your friends and have them too. |\

Re: Undergrad stud interested in GSoC

2008-03-30 Thread Isabel Drost
On Saturday 29 March 2008, Samee Zahur wrote: Being an undergrad student interested in the field of data-intensive machine learning techniques and applications, I am interested in implementing these algorithms as a way of getting an exposure into this field. Great. Nice to have you here.

RE: Undergrad stud interested in GSoC

2008-03-30 Thread Jeff Eastman
. +1.415.298.0023 http://windwardsolutions.com http://jeffeastman.blogspot.com -Original Message- From: Samee Zahur [mailto:[EMAIL PROTECTED] Sent: Friday, March 28, 2008 10:43 PM To: mahout-dev@lucene.apache.org Subject: Undergrad stud interested in GSoC Hello, I have read through

RE: Undergrad stud interested in GSoC

2008-03-30 Thread Jeff Eastman
-Original Message- From: Isabel Drost [mailto:[EMAIL PROTECTED] Sent: Sunday, March 30, 2008 1:24 PM To: mahout-dev@lucene.apache.org Subject: Re: Undergrad stud interested in GSoC On Sunday 30 March 2008, Jeff Eastman wrote: I'm working with my colleagues at CollabNet who have expressed

Undergrad stud interested in GSoC

2008-03-28 Thread Samee Zahur
Hello, I have read through nips paper and the march archive for this mailing list, and I feel I can implement some of the algorithms (as permitted by time) described in the nips paper. Being an undergrad student interested in the field of data-intensive machine learning techniques and