heheh. it should give me a better error than permission denied, jeez. that makes it look like it does exist, but I can't get in, fooey!
-jake On Sat, Feb 27, 2010 at 1:36 PM, Robin Anil <robin.a...@gmail.com> wrote: > ahaa, as i suspected. Look at the end.. its a regex. there are 6 chunks > > > On Sun, Feb 28, 2010 at 3:04 AM, Jake Mannix <jake.man...@gmail.com> > wrote: > > > Er, the one you posted! > > > > > > > > > http://mahout-wikipedia.s3.amazonaws.com/wikipedia-jan-2010-seqfile-deflate-chunk-[0-5] > > > > < > > > http://mahout-wikipedia.s3.amazonaws.com/wikipedia-jan-2010-seqfile-deflate-chunk-[0-5] > > > > > > > > > On Sat, Feb 27, 2010 at 1:30 PM, Robin Anil <robin.a...@gmail.com> > wrote: > > > > > Can you give the url you tried > > > > > > > > > On Sun, Feb 28, 2010 at 2:59 AM, Jake Mannix <jake.man...@gmail.com> > > > wrote: > > > > > > > Hey Robin, that http url gives me a permission denied response... I'm > > not > > > > too S3 savvy, not sure if I'm checking on it right... > > > > > > > > On Sat, Feb 27, 2010 at 12:40 PM, Robin Anil <robin.a...@gmail.com> > > > wrote: > > > > > > > > > Its uploaded here and its public. I will monitor usage and see if > my > > > > > credits > > > > > dont get run out easily, then i will take it down and wait for > amazon > > > to > > > > > give me more credits. > > > > > > > > > > Its wikipedia docid => wikitext. You can run the vectorizer over > > this. > > > > Use > > > > > either the wikipedia analyzer or the standard analyzer. > > > > > > > > > > > > > > > > > > > > > > > > > http://mahout-wikipedia.s3.amazonaws.com/wikipedia-jan-2010-seqfile-deflate-chunk-[0-5] > > > > > > s3://mahout-wikipedia/wikipedia-jan-2010-seqfile-deflate-chunk-[0-5] > > > > > > > > > > > > > > > Robin > > > > > > > > > > > > > > >