RE: Need help in Tika on SolrCloud
Is there anyone to help me with my issue? Your help is much appreciated I figured out the problem but need solution In my below data-config file tikaConfig.xml is not recognized by zookeeper ( processor="TikaEntityProcessor" tikaConfig="tikaConfig.xml" ) -Original Message- From: Anatharaman, Srinatha (Contractor) [mailto:srinatha_ananthara...@comcast.com] Sent: Wednesday, February 01, 2017 11:51 AM To: solr-user@lucene.apache.org Subject: RE: Need help in Tika on SolrCloud Hi All, I see below code which is causing my code not to work in SolrCloud @Override public String getConfigDir() { throw new ZooKeeperException( ErrorCode.SERVER_ERROR, "ZkSolrResourceLoader does not support getConfigDir() - likely, what you are trying to do is not supported in ZooKeeper mode"); } https://github.com/apache/lucene-solr/blob/branch_6_3/solr/core/src/java/org/apache/solr/cloud/ZkSolrResourceLoader.java Can someone help me with work around ERROR : 2017-02-01 16:39:55.932 ERROR (Thread-20) [c:dsearch s:shard2 r:core_node3 x:dsearch_shard2_replica2] o.a.s.h.d.DataImporter Full Import failed:java.lang.RuntimeException: java.lang.RuntimeException: org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load Tika Config Processing Document # 1 at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:270) at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:416) at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:475) at org.apache.solr.handler.dataimport.DataImporter.lambda$runAsync$0(DataImporter.java:458) at java.lang.Thread.run(Thread.java:745) Caused by: java.lang.RuntimeException: org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load Tika Config Processing Document # 1 at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:416) at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:329) at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:232) ... 4 more Caused by: org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load Tika Config Processing Document # 1 at org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:69) at org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:96) at org.apache.solr.handler.dataimport.EntityProcessorBase.init(EntityProcessorBase.java:60) at org.apache.solr.handler.dataimport.TikaEntityProcessor.init(TikaEntityProcessor.java:76) at org.apache.solr.handler.dataimport.EntityProcessorWrapper.init(EntityProcessorWrapper.java:75) at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:433) at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:516) at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:414) ... 6 more Caused by: org.apache.solr.common.cloud.ZooKeeperException: ZkSolrResourceLoader does not support getConfigDir() - likely, what you are trying to do is not supported in ZooKeeper mode at org.apache.solr.cloud.ZkSolrResourceLoader.getConfigDir(ZkSolrResourceLoader.java:151) at org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:91) ... 12 more Thanks, ~Sri From: Anatharaman, Srinatha (Contractor) Sent: Wednesday, February 01, 2017 10:04 AM To: 'solr-user@lucene.apache.org' <solr-user@lucene.apache.org<mailto:solr-user@lucene.apache.org>> Subject: Need help in Tika on SolrCloud Hi, I am new to Solr, I have implemented Solr on single node and my code is working good When I move the same code to SolrCloud it fails (I made few changes for SolrCloud) I am trying to load data using Dataimporthandler but it throws error as below 2017-02-01 03:23:07.727 ERROR (Thread-18) [c:dsearch s:shard2 r:core_node1 x:dsearch_shard2_replica1] o.a.s.h.d.DataImporter Full Import failed:java.lang.RuntimeException: java.lang.RuntimeException: org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load Tika Config Processing Document # 1 at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:270) at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:416) at org.apache.s
RE: Need help in Tika on SolrCloud
Hi All, I see below code which is causing my code not to work in SolrCloud @Override public String getConfigDir() { throw new ZooKeeperException( ErrorCode.SERVER_ERROR, "ZkSolrResourceLoader does not support getConfigDir() - likely, what you are trying to do is not supported in ZooKeeper mode"); } https://github.com/apache/lucene-solr/blob/branch_6_3/solr/core/src/java/org/apache/solr/cloud/ZkSolrResourceLoader.java Can someone help me with work around ERROR : 2017-02-01 16:39:55.932 ERROR (Thread-20) [c:dsearch s:shard2 r:core_node3 x:dsearch_shard2_replica2] o.a.s.h.d.DataImporter Full Import failed:java.lang.RuntimeException: java.lang.RuntimeException: org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load Tika Config Processing Document # 1 at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:270) at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:416) at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:475) at org.apache.solr.handler.dataimport.DataImporter.lambda$runAsync$0(DataImporter.java:458) at java.lang.Thread.run(Thread.java:745) Caused by: java.lang.RuntimeException: org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load Tika Config Processing Document # 1 at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:416) at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:329) at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:232) ... 4 more Caused by: org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load Tika Config Processing Document # 1 at org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:69) at org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:96) at org.apache.solr.handler.dataimport.EntityProcessorBase.init(EntityProcessorBase.java:60) at org.apache.solr.handler.dataimport.TikaEntityProcessor.init(TikaEntityProcessor.java:76) at org.apache.solr.handler.dataimport.EntityProcessorWrapper.init(EntityProcessorWrapper.java:75) at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:433) at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:516) at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:414) ... 6 more Caused by: org.apache.solr.common.cloud.ZooKeeperException: ZkSolrResourceLoader does not support getConfigDir() - likely, what you are trying to do is not supported in ZooKeeper mode at org.apache.solr.cloud.ZkSolrResourceLoader.getConfigDir(ZkSolrResourceLoader.java:151) at org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:91) ... 12 more Thanks, ~Sri From: Anatharaman, Srinatha (Contractor) Sent: Wednesday, February 01, 2017 10:04 AM To: 'solr-user@lucene.apache.org'Subject: Need help in Tika on SolrCloud Hi, I am new to Solr, I have implemented Solr on single node and my code is working good When I move the same code to SolrCloud it fails (I made few changes for SolrCloud) I am trying to load data using Dataimporthandler but it throws error as below 2017-02-01 03:23:07.727 ERROR (Thread-18) [c:dsearch s:shard2 r:core_node1 x:dsearch_shard2_replica1] o.a.s.h.d.DataImporter Full Import failed:java.lang.RuntimeException: java.lang.RuntimeException: org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load Tika Config Processing Document # 1 at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:270) at org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:416) at org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:475) at org.apache.solr.handler.dataimport.DataImporter.lambda$runAsync$0(DataImporter.java:458) at java.lang.Thread.run(Thread.java:745) Caused by: java.lang.RuntimeException: org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load Tika Config Processing Document # 1 at org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:416) at org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:329) at org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:232) ... 4 more Caused by: org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load Tika Config Processing Document # 1 at org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:69) at
Re: Need help in Tika on SolrCloud
Not quite sure. Are all the directories you expect the Tika docs to be in available to the Solr node? Sidestepping your question, I would be very reluctant to use Tika in SolrCloud mode because you're then putting all of the heavy-duty processing on nodes that also serve queries. I have the same reservation about stand-alone FWIW, but it's worse in SolrCloud. I strongly recommend you do the Tika parsing from a client and send the resulting Solr doc to SorlCloud, I predict you'll eventually do that anyway. Here's a skeletal program that does that in SolrJ: https://lucidworks.com/2012/02/14/indexing-with-solrj/ It also shows indexing from a DB, but that's easy enough to rip out. Best, Erick On Wed, Feb 1, 2017 at 7:04 AM, Anatharaman, Srinatha (Contractor)wrote: > Hi, > > > > I am new to Solr, I have implemented Solr on single node and my code is > working good > > When I move the same code to SolrCloud it fails (I made few changes for > SolrCloud) > > > > I am trying to load data using Dataimporthandler but it throws error as > below > > > > 2017-02-01 03:23:07.727 ERROR (Thread-18) [c:dsearch s:shard2 r:core_node1 > x:dsearch_shard2_replica1] o.a.s.h.d.DataImporter Full Import > failed:java.lang.RuntimeException: java.lang.RuntimeException: > org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to > load Tika Config Processing Document # 1 > > at > org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:270) > > at > org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:416) > > at > org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:475) > > at > org.apache.solr.handler.dataimport.DataImporter.lambda$runAsync$0(DataImporter.java:458) > > at java.lang.Thread.run(Thread.java:745) > > Caused by: java.lang.RuntimeException: > org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to > load Tika Config Processing Document # 1 > > at > org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:416) > > at > org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:329) > > at > org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:232) > > ... 4 more > > Caused by: org.apache.solr.handler.dataimport.DataImportHandlerException: > Unable to load Tika Config Processing Document # 1 > > at > org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:69) > > at > org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:96) > > at > org.apache.solr.handler.dataimport.EntityProcessorBase.init(EntityProcessorBase.java:60) > > at > org.apache.solr.handler.dataimport.TikaEntityProcessor.init(TikaEntityProcessor.java:76) > > at > org.apache.solr.handler.dataimport.EntityProcessorWrapper.init(EntityProcessorWrapper.java:75) > > at > org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:433) > > at > org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:516) > > at > org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:414) > > ... 6 more > > Caused by: org.apache.solr.common.cloud.ZooKeeperException: > ZkSolrResourceLoader does not support getConfigDir() - likely, what you are > trying to do is not supported in ZooKeeper mode > > at > org.apache.solr.cloud.ZkSolrResourceLoader.getConfigDir(ZkSolrResourceLoader.java:151) > > at > org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:91) > > ... 12 more > > > > > > I have attached the code for your reference > > Could you please help me with the solution > > > > Regards, > > ~Sri > > > > > >