Also, this algorithm is inherently linear in time in the size of the input. That means it is feasible on large data.
On Sun, Nov 22, 2009 at 9:07 AM, Jake Mannix <[email protected]> wrote: > The machinery to do the above in parallel on "ridiculously big" data on > Hadoop > should be coming in soon with some of the stuff I'm working on contributing > to Mahout. > -- Ted Dunning, CTO DeepDyve
