RE: Need help in Tika on SolrCloud

2017-02-01 Thread Anatharaman, Srinatha (Contractor)
Is there anyone to help me with my issue?

Your help is much appreciated



I figured out the problem but need solution

In my below data-config file tikaConfig.xml is not recognized by zookeeper (
   processor="TikaEntityProcessor" tikaConfig="tikaConfig.xml" )





















































-Original Message-
From: Anatharaman, Srinatha (Contractor) 
[mailto:srinatha_ananthara...@comcast.com]
Sent: Wednesday, February 01, 2017 11:51 AM
To: solr-user@lucene.apache.org
Subject: RE: Need help in Tika on SolrCloud



Hi All,



I see below code which is causing my code not to work in SolrCloud

  @Override





  public String getConfigDir() {





throw new ZooKeeperException(





ErrorCode.SERVER_ERROR,





"ZkSolrResourceLoader does not support getConfigDir() - likely, what 
you are trying to do is not supported in ZooKeeper mode");





  }







https://github.com/apache/lucene-solr/blob/branch_6_3/solr/core/src/java/org/apache/solr/cloud/ZkSolrResourceLoader.java



Can someone help me with work around



ERROR :

2017-02-01 16:39:55.932 ERROR (Thread-20) [c:dsearch s:shard2 r:core_node3 
x:dsearch_shard2_replica2] o.a.s.h.d.DataImporter Full Import 
failed:java.lang.RuntimeException: java.lang.RuntimeException: 
org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load 
Tika Config Processing Document # 1

at 
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:270)

at 
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:416)

at 
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:475)

at 
org.apache.solr.handler.dataimport.DataImporter.lambda$runAsync$0(DataImporter.java:458)

at java.lang.Thread.run(Thread.java:745)

Caused by: java.lang.RuntimeException: 
org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load 
Tika Config Processing Document # 1

at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:416)

at 
org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:329)

at 
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:232)

... 4 more

Caused by: org.apache.solr.handler.dataimport.DataImportHandlerException: 
Unable to load Tika Config Processing Document # 1

at 
org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:69)

at 
org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:96)

at 
org.apache.solr.handler.dataimport.EntityProcessorBase.init(EntityProcessorBase.java:60)

at 
org.apache.solr.handler.dataimport.TikaEntityProcessor.init(TikaEntityProcessor.java:76)

at 
org.apache.solr.handler.dataimport.EntityProcessorWrapper.init(EntityProcessorWrapper.java:75)

at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:433)

at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:516)

at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:414)

... 6 more

Caused by: org.apache.solr.common.cloud.ZooKeeperException: 
ZkSolrResourceLoader does not support getConfigDir() - likely, what you are 
trying to do is not supported in ZooKeeper mode

at 
org.apache.solr.cloud.ZkSolrResourceLoader.getConfigDir(ZkSolrResourceLoader.java:151)

at 
org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:91)

... 12 more





Thanks,

~Sri



From: Anatharaman, Srinatha (Contractor)

Sent: Wednesday, February 01, 2017 10:04 AM

To: 'solr-user@lucene.apache.org' 
<solr-user@lucene.apache.org<mailto:solr-user@lucene.apache.org>>

Subject: Need help in Tika on SolrCloud



Hi,



I am new to Solr, I have implemented Solr on single node and my code is working 
good When I move the same code to SolrCloud it fails (I made few changes for 
SolrCloud)



I am trying to load data using Dataimporthandler but it throws error as below



2017-02-01 03:23:07.727 ERROR (Thread-18) [c:dsearch s:shard2 r:core_node1 
x:dsearch_shard2_replica1] o.a.s.h.d.DataImporter Full Import 
failed:java.lang.RuntimeException: java.lang.RuntimeException: 
org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load 
Tika Config Processing Document # 1

at 
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:270)

at 
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:416)

at 
org.apache.s

RE: Need help in Tika on SolrCloud

2017-02-01 Thread Anatharaman, Srinatha (Contractor)
Hi All,

I see below code which is causing my code not to work in SolrCloud
  @Override


  public String getConfigDir() {


throw new ZooKeeperException(


ErrorCode.SERVER_ERROR,


"ZkSolrResourceLoader does not support getConfigDir() - likely, what 
you are trying to do is not supported in ZooKeeper mode");


  }



https://github.com/apache/lucene-solr/blob/branch_6_3/solr/core/src/java/org/apache/solr/cloud/ZkSolrResourceLoader.java

Can someone help me with work around

ERROR :
2017-02-01 16:39:55.932 ERROR (Thread-20) [c:dsearch s:shard2 r:core_node3 
x:dsearch_shard2_replica2] o.a.s.h.d.DataImporter Full Import 
failed:java.lang.RuntimeException: java.lang.RuntimeException: 
org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load 
Tika Config Processing Document # 1
at 
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:270)
at 
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:416)
at 
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:475)
at 
org.apache.solr.handler.dataimport.DataImporter.lambda$runAsync$0(DataImporter.java:458)
at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.RuntimeException: 
org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load 
Tika Config Processing Document # 1
at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:416)
at 
org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:329)
at 
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:232)
... 4 more
Caused by: org.apache.solr.handler.dataimport.DataImportHandlerException: 
Unable to load Tika Config Processing Document # 1
at 
org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:69)
at 
org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:96)
at 
org.apache.solr.handler.dataimport.EntityProcessorBase.init(EntityProcessorBase.java:60)
at 
org.apache.solr.handler.dataimport.TikaEntityProcessor.init(TikaEntityProcessor.java:76)
at 
org.apache.solr.handler.dataimport.EntityProcessorWrapper.init(EntityProcessorWrapper.java:75)
at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:433)
at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:516)
at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:414)
... 6 more
Caused by: org.apache.solr.common.cloud.ZooKeeperException: 
ZkSolrResourceLoader does not support getConfigDir() - likely, what you are 
trying to do is not supported in ZooKeeper mode
at 
org.apache.solr.cloud.ZkSolrResourceLoader.getConfigDir(ZkSolrResourceLoader.java:151)
at 
org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:91)
... 12 more


Thanks,
~Sri

From: Anatharaman, Srinatha (Contractor)
Sent: Wednesday, February 01, 2017 10:04 AM
To: 'solr-user@lucene.apache.org' 
Subject: Need help in Tika on SolrCloud

Hi,

I am new to Solr, I have implemented Solr on single node and my code is working 
good
When I move the same code to SolrCloud it fails (I made few changes for 
SolrCloud)

I am trying to load data using Dataimporthandler but it throws error as below

2017-02-01 03:23:07.727 ERROR (Thread-18) [c:dsearch s:shard2 r:core_node1 
x:dsearch_shard2_replica1] o.a.s.h.d.DataImporter Full Import 
failed:java.lang.RuntimeException: java.lang.RuntimeException: 
org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load 
Tika Config Processing Document # 1
at 
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:270)
at 
org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:416)
at 
org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:475)
at 
org.apache.solr.handler.dataimport.DataImporter.lambda$runAsync$0(DataImporter.java:458)
at java.lang.Thread.run(Thread.java:745)
Caused by: java.lang.RuntimeException: 
org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to load 
Tika Config Processing Document # 1
at 
org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:416)
at 
org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:329)
at 
org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:232)
... 4 more
Caused by: org.apache.solr.handler.dataimport.DataImportHandlerException: 
Unable to load Tika Config Processing Document # 1
at 
org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:69)
at 

Re: Need help in Tika on SolrCloud

2017-02-01 Thread Erick Erickson
Not quite sure. Are all the directories you expect the Tika docs to be
in available to the Solr node?

Sidestepping your question, I would be very reluctant to use Tika in
SolrCloud mode because you're then putting all of the heavy-duty
processing on nodes that also serve queries. I have the same
reservation about stand-alone FWIW, but it's worse in SolrCloud.

I strongly recommend you do the Tika parsing from a client and send
the resulting Solr doc to SorlCloud, I predict you'll eventually do
that anyway. Here's a skeletal program that does that in SolrJ:

https://lucidworks.com/2012/02/14/indexing-with-solrj/

It also shows indexing from a DB, but that's easy enough to rip out.

Best,
Erick

On Wed, Feb 1, 2017 at 7:04 AM, Anatharaman, Srinatha (Contractor)
 wrote:
> Hi,
>
>
>
> I am new to Solr, I have implemented Solr on single node and my code is
> working good
>
> When I move the same code to SolrCloud it fails (I made few changes for
> SolrCloud)
>
>
>
> I am trying to load data using Dataimporthandler but it throws error as
> below
>
>
>
> 2017-02-01 03:23:07.727 ERROR (Thread-18) [c:dsearch s:shard2 r:core_node1
> x:dsearch_shard2_replica1] o.a.s.h.d.DataImporter Full Import
> failed:java.lang.RuntimeException: java.lang.RuntimeException:
> org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to
> load Tika Config Processing Document # 1
>
> at
> org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:270)
>
> at
> org.apache.solr.handler.dataimport.DataImporter.doFullImport(DataImporter.java:416)
>
> at
> org.apache.solr.handler.dataimport.DataImporter.runCmd(DataImporter.java:475)
>
> at
> org.apache.solr.handler.dataimport.DataImporter.lambda$runAsync$0(DataImporter.java:458)
>
> at java.lang.Thread.run(Thread.java:745)
>
> Caused by: java.lang.RuntimeException:
> org.apache.solr.handler.dataimport.DataImportHandlerException: Unable to
> load Tika Config Processing Document # 1
>
> at
> org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:416)
>
> at
> org.apache.solr.handler.dataimport.DocBuilder.doFullDump(DocBuilder.java:329)
>
> at
> org.apache.solr.handler.dataimport.DocBuilder.execute(DocBuilder.java:232)
>
> ... 4 more
>
> Caused by: org.apache.solr.handler.dataimport.DataImportHandlerException:
> Unable to load Tika Config Processing Document # 1
>
> at
> org.apache.solr.handler.dataimport.DataImportHandlerException.wrapAndThrow(DataImportHandlerException.java:69)
>
> at
> org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:96)
>
> at
> org.apache.solr.handler.dataimport.EntityProcessorBase.init(EntityProcessorBase.java:60)
>
> at
> org.apache.solr.handler.dataimport.TikaEntityProcessor.init(TikaEntityProcessor.java:76)
>
> at
> org.apache.solr.handler.dataimport.EntityProcessorWrapper.init(EntityProcessorWrapper.java:75)
>
> at
> org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:433)
>
> at
> org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:516)
>
> at
> org.apache.solr.handler.dataimport.DocBuilder.buildDocument(DocBuilder.java:414)
>
> ... 6 more
>
> Caused by: org.apache.solr.common.cloud.ZooKeeperException:
> ZkSolrResourceLoader does not support getConfigDir() - likely, what you are
> trying to do is not supported in ZooKeeper mode
>
> at
> org.apache.solr.cloud.ZkSolrResourceLoader.getConfigDir(ZkSolrResourceLoader.java:151)
>
> at
> org.apache.solr.handler.dataimport.TikaEntityProcessor.firstInit(TikaEntityProcessor.java:91)
>
> ... 12 more
>
>
>
>
>
> I have attached the code for your reference
>
> Could you please help me with the solution
>
>
>
> Regards,
>
> ~Sri
>
>
>
>
>
>