Can't really deal with the security issues, but... The resulting indexes created by MRIT are just plain vanilla Solr/Lucene indexes. All the --go-live step does is issue a MERGEINDEXES command from the core where they live to the directory MRIT leaves them in, you might get some joy there, see: https://cwiki.apache.org/confluence/display/solr/CoreAdmin+API#CoreAdminAPI-MERGEINDEXES
Or you can copy them around by hand and start Solr. You have to be _really_ sure that you get the right index for each replica though, if you get the index intended for a replica on shard1 on a replica for shard2 it's A Bad Thing. Best, Erick On Tue, Jul 26, 2016 at 10:52 AM, Archana Satheesh Kumar <aku...@kogentix.com> wrote: > Hi, > > > I was trying to use the Mapreduce Indexer tool from cloudera, to index my > data in Hive table using Solr. > > > hadoop jar /path/to/lib/solr/contrib/mr/search-mr-*-job.jar > org.apache.solr.hadoop.MapReduceIndexerTool -Djute.maxbuffer=<buff > size>--morphline-file /path/to/morphlines.conf --output-dir > hdfs://path/to/output/dir --reducers -1 --mappers -1 --verbose --go-live > --zk-host <zookeeperHostIP>:2181/solr --shards 2 --collection <collection > name> hdfs://location/of/hive/table > > My MR job runs successfully and I am able to view _SUCCESS flag in the > specified output loc > > hadoop fs -ls /path/to/output/results > Found 2 items > -rwxrwx--x+ 3 hive hive 0 2016-07-26 11:35 > /path/to/output/results/_SUCCESS > drwxrwx--x+ - hive hive 0 2016-07-26 11:20 > /path/to/output/results/part-00000 > > But my go-live option is not working. > > Exception: > java.util.concurrent.ExecutionException: > org.apache.solr.client.solrj.impl.HttpSolrServer$RemoteSolrException: > Expected mime type application/octet-stream but got text/html > > I also tried using jaas-client.conf > > Client { > com.sun.security.auth.module.Krb5LoginModule required > useKeyTab=false > useTicketCache=true > principal="<MyID@DOMAIN>"; > }; > > So, before executing the Mapreduce job, HADOOP_OPTS was set to the > jass-client.conf > export > HADOOP_OPTS="-Djava.security.auth.login.config=/path/to/jaas-client.conf" > > > 1. What could be the issue? > 2. Am I missing something? > 3. Since I have my data indexed locally, is there a way to perform the > go-live option separately? > > Thanks in advance > > Archana > > > > >