Re: Demos/Tutorials

2008-03-26 Thread Isabel Drost
On Wednesday 26 March 2008, Grant Ingersoll wrote: > On Mar 24, 2008, at 4:48 PM, Isabel Drost wrote: > > I think Mahout is not really suitable to build demos that explain > > the inner workings of the algorithms implemented. > > I agree, but as we develop, we will probably have programmer's guides

Re: Demos/Tutorials

2008-03-26 Thread Grant Ingersoll
On Mar 24, 2008, at 4:48 PM, Isabel Drost wrote: On Thursday 20 March 2008, Grant Ingersoll wrote: In the longer run, intro to ML would be cool, but there is lots available on that. I think Mahout is not really suitable to build demos that explain the inner workings of the algorithms im

Re: Demos/Tutorials

2008-03-24 Thread Isabel Drost
On Thursday 20 March 2008, Grant Ingersoll wrote: > In the longer run, intro to ML would be cool, but there is lots available > on that. I think Mahout is not really suitable to build demos that explain the inner workings of the algorithms implemented. > I don't think it should be that large,

Re: Demos/Tutorials

2008-03-20 Thread Grant Ingersoll
On Mar 20, 2008, at 9:15 AM, Grant Ingersoll wrote: On Mar 19, 2008, at 9:56 PM, Karl Wettin wrote: Grant Ingersoll skrev: Now that we have some code in place for clustering, I think it would be cool to put together some examples/demos of real world problems. Things like clustering text

Re: Demos/Tutorials

2008-03-20 Thread Grant Ingersoll
On Mar 19, 2008, at 9:56 PM, Karl Wettin wrote: Grant Ingersoll skrev: Now that we have some code in place for clustering, I think it would be cool to put together some examples/demos of real world problems. Things like clustering text (perhaps we can use the wikipedia download or the re

Re: Demos/Tutorials

2008-03-20 Thread Andrzej Bialecki
Karl Wettin wrote: Grant Ingersoll skrev: Now that we have some code in place for clustering, I think it would be cool to put together some examples/demos of real world problems. Things like clustering text (perhaps we can use the wikipedia download or the reuters download that Lucene contrib

Re: Demos/Tutorials

2008-03-19 Thread Karl Wettin
Grant Ingersoll skrev: Now that we have some code in place for clustering, I think it would be cool to put together some examples/demos of real world problems. Things like clustering text (perhaps we can use the wikipedia download or the reuters download that Lucene contrib/benchmark uses) or

RE: Demos/Tutorials

2008-03-18 Thread Jeff Eastman
I've been using the canopy clustering to cluster Apache log time slices by URL frequency. Typical results indicate several big clusters with the "business as usual" access patterns in them and then several small clusters with the unusual patterns. It's a little difficult to interpret beyond that bu

Re: Demos/Tutorials

2008-03-18 Thread Grant Ingersoll
Yeah, I hear you there. I have a project I am working on that will require me to generate examples, but it is a couple of weeks away. The gene expression stuff is great. Text based ones would be really cool too. I haven't done too much clustering work (other than using Dawid's excellent

Re: Demos/Tutorials

2008-03-18 Thread Dawid Weiss
This is absolutely necessary, if not for just showing off with the project, then certainly for verification of correctness of algorithms inside it. I will certainly hop in to such a subtask to the extent of my current available time resources (not much, sadly). D. Grant Ingersoll wrote: No

Re: Demos/Tutorials

2008-03-18 Thread Isabel Drost
On Monday 17 March 2008, Grant Ingersoll wrote: > Now that we have some code in place for clustering, I think it would > be cool to put together some examples/demos of real world problems. One idea I thought of reading the proposal of Allen: I think it might also be great, if people using - or tr

Re: Demos/Tutorials

2008-03-18 Thread Isabel Drost
On Monday 17 March 2008, Allen Day wrote: > I'll be trying out Mahout to do some microarray gene expression > clustering pretty soon. I would be happy to do a small write-up. That sounds really great. Would be a great demo for applications apart from obvious tasks in the area of clustering texts

Re: Demos/Tutorials

2008-03-17 Thread Allen Day
Hi, I'll be trying out Mahout to do some microarray gene expression clustering pretty soon. I would be happy to do a small write-up. -Allen On Mon, Mar 17, 2008 at 7:41 AM, Grant Ingersoll <[EMAIL PROTECTED]> wrote: > Now that we have some code in place for clustering, I think it would > be co