Bixo may have some useful components. The thrust is different, but some of the pieces are similar.
http://bixo.101tec.com/

On Mon, Feb 28, 2011 at 7:57 PM, Mark Kerzner <markkerz...@gmail.com> wrote:

> Well, it's more complex than that. I packed all files (or selected
> directories) into zip files, and those zip files go into HDFS, and they are
> processed from there.
>
> Mark
>
> On Mon, Feb 28, 2011 at 9:53 PM, Greg Roelofs <roel...@yahoo-inc.com>
> wrote:
>
> > Mark Kerzner <markkerz...@gmail.com> wrote:
> >
> > > I am working on an open-source project that would be using
> > > Hadoop/HDFS/HBase/Tika/Lucene and would make all files on a hard drive
> > > searchable.
> >
> > _A_ hard drive? Hadoop? Seems like a bad match.
> >
> > Greg
> >
>