I was using the hadoop.contrib.index code and was able to make a distributed Lucene index
Could then search over that index while it is still in hdfs Does the Katta project uses same hadoop index-contrib to index the documents ? I found that there are also contributions solr-1395 and solr-1301 ( solr -hadoop ) what would be the best approach to begin with Thanks
