Has anyone taken a look at using Hadoop RPC for enabling distributed
Lucene?  I am thinking it would implement the Searchable interface and use
Java serialization to stay compatible with the current RMI version.  That
somewhat defeats the purpose of using Hadoop RPC and serialization; however,
Hadoop RPC scales far beyond what RMI can at the networking level.  RMI uses
a thread per socket and reportedly has latency issues.  Hadoop RPC uses NIO
and is proven to scale to thousands of servers.  Serialization unfortunately
must be used with Lucene due to the Weight, Query and Filter classes.  There
could be an extended version of Searchable that allows passing Weight,
Query, and Filter classes that implement Hadoop's Writable interface if a
user wants to bypass using serialization.
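
To make the last idea concrete, here is a rough sketch of what a Writable
query might look like.  This is only an illustration under assumptions: the
Writable interface below is a local stand-in that copies the two method
signatures of org.apache.hadoop.io.Writable so the example compiles without
Hadoop, and WritableTermQuery, toBytes and fromBytes are hypothetical names,
not existing Lucene or Hadoop classes.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;

public class WritableQuerySketch {

    // Local stand-in with the same two methods as org.apache.hadoop.io.Writable,
    // declared here only so the sketch compiles without Hadoop on the classpath.
    public interface Writable {
        void write(DataOutput out) throws IOException;
        void readFields(DataInput in) throws IOException;
    }

    // Hypothetical term query (NOT a Lucene class) that writes its own fields
    // instead of relying on java.io.Serializable.
    public static class WritableTermQuery implements Writable {
        public String field = "";
        public String text = "";
        public float boost = 1.0f;

        // A no-arg constructor lets the receiver create an empty instance
        // and then fill it in with readFields().
        public WritableTermQuery() {}

        public WritableTermQuery(String field, String text, float boost) {
            this.field = field;
            this.text = text;
            this.boost = boost;
        }

        @Override
        public void write(DataOutput out) throws IOException {
            out.writeUTF(field);
            out.writeUTF(text);
            out.writeFloat(boost);
        }

        @Override
        public void readFields(DataInput in) throws IOException {
            // Fields must be read in the same order they were written.
            field = in.readUTF();
            text = in.readUTF();
            boost = in.readFloat();
        }
    }

    // Encode a Writable into a byte array, as an RPC layer would before sending.
    public static byte[] toBytes(Writable w) throws IOException {
        ByteArrayOutputStream buffer = new ByteArrayOutputStream();
        try (DataOutputStream out = new DataOutputStream(buffer)) {
            w.write(out);
        }
        return buffer.toByteArray();
    }

    // Populate an existing Writable from bytes received over the wire.
    public static void fromBytes(Writable w, byte[] data) throws IOException {
        try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(data))) {
            w.readFields(in);
        }
    }

    public static void main(String[] args) throws IOException {
        WritableTermQuery sent = new WritableTermQuery("body", "hadoop", 2.0f);
        byte[] wire = toBytes(sent);

        WritableTermQuery received = new WritableTermQuery();
        fromBytes(received, wire);

        System.out.println(received.field + ":" + received.text
                + "^" + received.boost + " (" + wire.length + " bytes)");
    }
}
```

An extended Searchable could then accept objects like this directly, while
the plain Searchable methods keep using Java serialization for anything that
is not Writable.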
