Fabulous work! There are obviously a lot of local modifications to be done for nutch + gora + accumulo to work. So feel free to propose these to upstream nutch and gora.
It should feel good to run the web crawl, and store the results on accumulo. Cheers, Enis On Tue, Feb 28, 2012 at 6:24 PM, Mattmann, Chris A (388J) < chris.a.mattm...@jpl.nasa.gov> wrote: > UMMM wow! > > That's awesome Jason! Thanks so much! > > Cheers, > Chris > > On Feb 28, 2012, at 5:41 PM, Jason Trost wrote: > > > Blog post for anyone who's interested. I cover a basic howto for > > getting Nutch to use Apache Gora to store web crawl data in Accumulo. > > > > Let me know if you have any questions. > > > > Accumulo, Nutch, and GORA > > http://www.covert.io/post/18414889381/accumulo-nutch-and-gora > > > > --Jason > > > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > Chris Mattmann, Ph.D. > Senior Computer Scientist > NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA > Office: 171-266B, Mailstop: 171-246 > Email: chris.a.mattm...@nasa.gov > WWW: http://sunset.usc.edu/~mattmann/ > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > Adjunct Assistant Professor, Computer Science Department > University of Southern California, Los Angeles, CA 90089 USA > ++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ > >