On Wednesday 26 March 2008, Grant Ingersoll wrote:
> On Mar 24, 2008, at 4:48 PM, Isabel Drost wrote:
> > I think Mahout is not really suitable to build demos that explain
> > the inner workings of the algorithms implemented.
>
> I agree, but as we develop, we will probably have programmer's guides
On Mar 24, 2008, at 4:48 PM, Isabel Drost wrote:
On Thursday 20 March 2008, Grant Ingersoll wrote:
In the longer run, intro to ML would be cool, but there is lots
available
on that.
I think Mahout is not really suitable to build demos that explain
the inner
workings of the algorithms im
On Thursday 20 March 2008, Grant Ingersoll wrote:
> In the longer run, intro to ML would be cool, but there is lots available
> on that.
I think Mahout is not really suitable to build demos that explain the inner
workings of the algorithms implemented.
> I don't think it should be that large,
On Mar 20, 2008, at 9:15 AM, Grant Ingersoll wrote:
On Mar 19, 2008, at 9:56 PM, Karl Wettin wrote:
Grant Ingersoll skrev:
Now that we have some code in place for clustering, I think it
would be cool to put together some examples/demos of real world
problems. Things like clustering text
On Mar 19, 2008, at 9:56 PM, Karl Wettin wrote:
Grant Ingersoll skrev:
Now that we have some code in place for clustering, I think it
would be cool to put together some examples/demos of real world
problems. Things like clustering text (perhaps we can use the
wikipedia download or the re
Karl Wettin wrote:
Grant Ingersoll skrev:
Now that we have some code in place for clustering, I think it would
be cool to put together some examples/demos of real world problems.
Things like clustering text (perhaps we can use the wikipedia download
or the reuters download that Lucene contrib
Grant Ingersoll skrev:
Now that we have some code in place for clustering, I think it would be
cool to put together some examples/demos of real world problems. Things
like clustering text (perhaps we can use the wikipedia download or the
reuters download that Lucene contrib/benchmark uses) or
I've been using the canopy clustering to cluster Apache log time slices by
URL frequency. Typical results indicate several big clusters with the
"business as usual" access patterns in them and then several small clusters
with the unusual patterns. It's a little difficult to interpret beyond that
bu
Yeah, I hear you there. I have a project I am working on that will
require me to generate examples, but it is a couple of weeks away.
The gene expression stuff is great. Text based ones would be really
cool too. I haven't done too much clustering work (other than using
Dawid's excellent
This is absolutely necessary, if not for just showing off with the project, then
certainly for verification of correctness of algorithms inside it.
I will certainly hop in to such a subtask to the extent of my current available
time resources (not much, sadly).
D.
Grant Ingersoll wrote:
No
On Monday 17 March 2008, Grant Ingersoll wrote:
> Now that we have some code in place for clustering, I think it would
> be cool to put together some examples/demos of real world problems.
One idea I thought of reading the proposal of Allen: I think it might also be
great, if people using - or tr
On Monday 17 March 2008, Allen Day wrote:
> I'll be trying out Mahout to do some microarray gene expression
> clustering pretty soon. I would be happy to do a small write-up.
That sounds really great. Would be a great demo for applications apart from
obvious tasks in the area of clustering texts
Hi,
I'll be trying out Mahout to do some microarray gene expression
clustering pretty soon. I would be happy to do a small write-up.
-Allen
On Mon, Mar 17, 2008 at 7:41 AM, Grant Ingersoll <[EMAIL PROTECTED]> wrote:
> Now that we have some code in place for clustering, I think it would
> be co
13 matches
Mail list logo