On Sat, Nov 29, 2008 at 7:26 PM, Jon Baer <[EMAIL PROTECTED]> wrote: > HadoopEntityProcessor for the DIH? Reading data from Hadoop with DIH could be really cool There are a few very useful ones which are required badly. Most useful one would be a TikaEntityProcessor.
But I do not see it solving the scalability problem (the original post). > > Ive wondered about this as they make HadoopCluster LiveCDs and EC2 have > images but best way to make use of them is always a challenge. > > - Jon > > On Nov 29, 2008, at 3:34 AM, Erik Hatcher wrote: > >> >> On Nov 28, 2008, at 8:38 PM, Yonik Seeley wrote: >>> >>> Or, it would be relatively trivial to write a Lucene program >>> to merge the indexes. >> >> FYI, such a tool exists in Lucene's API already: >> >> >> >> <http://lucene.apache.org/java/2_4_0/api/org/apache/lucene/misc/IndexMergeTool.html> >> >> Erik >> > > -- --Noble Paul