I would say try it first :) If everything works at this size, then we can think about a record attempt :P
Robin

On Mon, Mar 1, 2010 at 9:21 AM, Jake Mannix <jake.man...@gmail.com> wrote:
> So a little under a billion nonzero entries. A nice test, but not quite
> record breaking yet!
>
> -jake
>
> On Feb 28, 2010 7:33 PM, "Robin Anil" <robin.a...@gmail.com> wrote:
>
> 12 GB uncompressed. I am uploading to s3 at the moment
>
> regex :)
>
> s3://mahout-wikipedia/unigram-tfidf-vectors/part-0000[0-9]
>
> On Mon, Mar 1, 2010 at 8:56 AM, Jake Mannix <jake.man...@gmail.com> wrote:
>
> > What's the final size...