Choosing appropriate values for T1 and T2 for canopy clustering

2011-04-12 Thread Madhusudan Joshi
I have been using k-means to cluster some data. For the initial clusters I used random seeds, but that resulted in most of the documents (more than 70%) being assigned to a single cluster. So I want to use canopy clustering to create the initial clusters, but I am having problems selecting suitable values for T1
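For readers unfamiliar with the two thresholds, here is a minimal sketch of the general canopy idea in plain Python. It is not Mahout's implementation: the function names, the Euclidean distance, and the sample points are all assumptions made for illustration. T1 is the loose threshold (points within T1 of a center join its canopy) and T2 is the tight threshold (points within T2 of a center can no longer become centers themselves), with T1 > T2.

```python
import math

def euclidean(a, b):
    """Plain Euclidean distance; an assumed choice, any distance measure works."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))

def canopy_centers(points, t1, t2, distance=euclidean):
    """Illustrative canopy pass (hypothetical helper, not a Mahout API).

    Returns a list of (center, members) pairs.
    """
    if t1 <= t2:
        raise ValueError("T1 must be greater than T2")
    remaining = list(points)
    canopies = []
    while remaining:
        # Take the next unclaimed point as a new canopy center.
        center = remaining.pop(0)
        members = [center]
        still_remaining = []
        for p in remaining:
            d = distance(center, p)
            if d < t1:
                # Loosely close: the point belongs to this canopy.
                members.append(p)
            if d >= t2:
                # Not tightly close: the point may still seed another canopy.
                still_remaining.append(p)
        remaining = still_remaining
        canopies.append((center, members))
    return canopies

# Made-up sample data for illustration only.
points = [(0.0, 0.0), (0.5, 0.0), (1.5, 0.0), (10.0, 10.0), (10.2, 10.1)]
for t2 in (1.0, 2.0):
    print(t2, len(canopy_centers(points, t1=3.0, t2=t2)))
```

In this sketch, raising T2 removes more points from the candidate list at each step and so yields fewer canopies, while T1 only controls how much the canopies overlap. One common starting point is to look at a sample of pairwise distances in the data and pick T2 near the distance at which points should count as belonging to the same initial cluster.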

Re: Is any more detailed documentation about the sgd logistic regression example.

2011-04-12 Thread Ted Dunning
The book is all there. All that is happening now are tiny edits and final production formatting. On Tue, Apr 12, 2011 at 9:22 PM, wrote: > Eric the book is still being written, but you can buy the interim PDF > version from the site. It seems quite complete (save for a few typos here > and ther

Re: Is any more detailed documentation about the sgd logistic regression example.

2011-04-12 Thread Ted Dunning
Yes. That's the one. The hard copy should be out before long. The final passes by the production editors are happening now. On Tue, Apr 12, 2011 at 9:19 PM, Eric Charles wrote: > You were talking about 'Mahout in Action' book. > I suppose you were referring about the EBook version. > Hard copy

Re: Is any more detailed documentation about the sgd logistic regression example.

2011-04-12 Thread Chris Schilling
Yeah, it works for me. Nice lecture! On Apr 12, 2011, at 9:03 PM, Ted Dunning wrote: > Pity. Don't think I can help. Talk to your internet provider. > > On Tue, Apr 12, 2011 at 7:28 PM, Xiaobo Gu wrote: > >> On Wed, Apr 13, 2011 at 1:03 AM, Ted Dunning >> wrote: >>> This lecture might help

Re: Is any more detailed documentation about the sgd logistic regression example.

2011-04-12 Thread pcollins
Eric the book is still being written, but you can buy the interim PDF version from the site. It seems quite complete (save for a few typos here and there). The publisher will email you with updates as the document chapters are being finalized. You also have the option of having the dead-tree editi

Re: Is any more detailed documentation about the sgd logistic regression example.

2011-04-12 Thread Eric Charles
Hi Ted, The video and PDF are accessible from here. Very instructive. Tks! You were talking about the 'Mahout in Action' book. I suppose you were referring to the ebook version. Hard copies are not yet available as far as I can read on http://www.manning.com/owen/. Any idea on the shipping date? Tks, -

Re: Is any more detailed documentation about the sgd logistic regression example.

2011-04-12 Thread Ted Dunning
Pity. Don't think I can help. Talk to your internet provider. On Tue, Apr 12, 2011 at 7:28 PM, Xiaobo Gu wrote: > On Wed, Apr 13, 2011 at 1:03 AM, Ted Dunning > wrote: > > This lecture might help > > some: > http://www.meetup.com/LA-HUG/pages/Video_from_March_16th_LA-HUG_Ted_Dunning_Mahout >

Re: Is any more detailed documentation about the sgd logistic regression example.

2011-04-12 Thread Xiaobo Gu
On Wed, Apr 13, 2011 at 1:03 AM, Ted Dunning wrote: > This lecture might help > some: http://www.meetup.com/LA-HUG/pages/Video_from_March_16th_LA-HUG_Ted_Dunning_Mahout Thanks, but I can't access the URL. > On Tue, Apr 12, 2011 at 10:02 AM, Ted Dunning wrote: >> >> Can you be more specific abou

Re: Is any more detailed documentation about the sgd logistic regression example.

2011-04-12 Thread Ted Dunning
This lecture might help some: http://www.meetup.com/LA-HUG/pages/Video_from_March_16th_LA-HUG_Ted_Dunning_Mahout On Tue, Apr 12, 2011 at 10:02 AM, Ted Dunning wrote: > Can you be more specific about what you have and what you want? > > The book Mahout in Action provides quite a lot of details wi

Re: Is any more detailed documentation about the sgd logistic regression example.

2011-04-12 Thread Ted Dunning
Can you be more specific about what you have and what you want? The book Mahout in Action provides quite a lot of details with sample code for a server farm. The TrainNewsGroups example provides code that you can copy. Do you have these resources? Do you want more? Did you want more theory? O

Is any more detailed documentation about the sgd logistic regression example.

2011-04-12 Thread Xiaobo Gu
Hi, Documents about sgd logistic regression itself are welcome too. Regards, Xiaobo Gu

Re: Oops - props file gets ahead of source tree

2011-04-12 Thread Sebastian Schelter
Argh, I must have forgotten to add the eval packages when committing the last patch from MAHOUT-542. Thanks for pointing that out, I'll fix it right away. --sebastian On 12.04.2011 09:57, Sean Owen wrote: Sebastian were these just some local classes? they came in in your last commit to that fil

Re: Oops - props file gets ahead of source tree

2011-04-12 Thread Sean Owen
Sebastian, were these just some local classes? They came in with your last commit to that file, and it looks like they aren't in the tree. On Tue, Apr 12, 2011 at 5:55 AM, Lance Norskog wrote: > In the very latest git pull: > > InMemoryFactorizationEvaluator, ParallelFactorizationEvaluator and > Dataset

Re: More for the refactor wish-list

2011-04-12 Thread Sean Owen
You can get an Iterator from the List. Indeed it's a tradeoff of flexibility for the caller versus work for the implementation. It seems like not such a big deal to make the implementations provide a whole List for the caller. After all, the caller said how many things it wanted, and the number is