Un-observing in a Canopy

2012-03-03 Thread Benson Margulies
So, I'm messing around with a clustering problem where I want to move items from cluster to cluster and maintain centroids. Lazy person that I am, I am using the Canopy class. But it looks to me like I have to recreate the Canopy when I pull something out. Have I missed anything?

Re: Washing machines - Mahout algorithm advice

2012-03-03 Thread Zoltán Tóth-Czifra
Sean, Jack, the PR is an excellent idea, I was not even thinking about it, but a tweaked version of the PageRank algorithm can absolutely work for this, and is relatively easy to implement. Ted, straight question, thank you. You are right, I'm not working for a washing powder company nor a laundry

Re: Washing machines - Mahout algorithm advice

2012-03-03 Thread Ted Dunning
And further linear Markov chains can be expressed as matrix products which can be computed efficiently using SVD's. Zoltan, is this literally the problem that you are working on? Or is this a shadow of the problem that you are interested in? On Sat, Mar 3, 2012 at 9:55 AM, Jack Tanner wrote: >

DistributedRowMatrix - FileNotFoundException

2012-03-03 Thread PEDRO MANUEL JIMENEZ RODRIGUEZ
Hi everyone! I'm trying to use DistributedRowMatrix in my class code but I'm getting the same error all the time: "FileNotFoundException" I have put a file in my hdfs directory under /user/hduser/diffuse. And I run the progam with "diffuse" as input and output directory. The code looks like:

Re: Washing machines - Mahout algorithm advice

2012-03-03 Thread Jack Tanner
It may further help to note that PageRank is a special case of a Markov chain, and this washing system may well be described by a Markov chain model. On 3/3/2012 12:35 PM, Sean Owen wrote: I answered on SO: The only thing I can think of that sounds like this problem is PageRank. It's computed

Re: Washing machines - Mahout algorithm advice

2012-03-03 Thread Sean Owen
I answered on SO: The only thing I can think of that sounds like this problem is PageRank. It's computed by a sort of iterative simluation. Each page has some influence (color) which flows via its links (socks its washed with) and at some point the page influence reaches a steady state (final colo

Washing machines - Mahout algorithm advice

2012-03-03 Thread Zoltán Tóth-Czifra
Hi, I posted this on stack overflow, but I've been informed that the mailing list is the appropriate forum to ask questions like this. What I need is actually just a hint where I can start, as I'm not sure which direction to go. Although this is not a classic machine learning problem, AFAIK Mahou