Re: Solr hanging when extracting a some broken .doc files

2013-12-19 Thread Charlie Hull
On 18/12/2013 09:03, Alexandre Rafalovitch wrote: Charlie, Does it mean you are talking to it from a client program? Or are you running Tika in a listen/server mode and building some adapters for standard Solr processes? If we're writing indexers in Python we usually run Tika as a server -
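A minimal sketch of what a Python indexer talking to a separately running Tika server might look like. This is not Charlie's actual code, which the thread does not show. It assumes the stock tika-server is listening on its default port 9998 and exposes the `PUT /tika` endpoint that returns plain text; the names `TIKA_URL`, `build_request` and `extract_text` are made up for illustration. The point is that extraction happens outside the Solr process and every call has a timeout, so a broken .doc file costs one skipped document instead of a hung indexer.

```python
import http.client
import urllib.error
import urllib.request
from typing import Optional

# Assumption: tika-server running locally on its default port.
TIKA_URL = "http://localhost:9998/tika"


def build_request(data: bytes, url: str = TIKA_URL) -> urllib.request.Request:
    """Build a PUT request that asks Tika for a plain-text rendering of `data`."""
    return urllib.request.Request(
        url,
        data=data,
        method="PUT",
        headers={"Accept": "text/plain"},
    )


def extract_text(data: bytes, url: str = TIKA_URL, timeout: float = 30.0) -> Optional[str]:
    """Return the extracted text, or None if Tika fails or does not answer in time."""
    try:
        with urllib.request.urlopen(build_request(data, url), timeout=timeout) as resp:
            return resp.read().decode("utf-8", errors="replace")
    except (OSError, http.client.HTTPException):
        # URLError, HTTPError and socket timeouts are all OSError subclasses.
        # The caller can log the file name and move on to the next document.
        return None
```

An indexer would call `extract_text(open(path, "rb").read())` and only send the result to Solr when it is not `None`. Note that the client-side timeout only frees the indexer; a parser thread stuck inside the Tika server may still need the server process to be restarted.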

Re: Solr hanging when extracting a some broken .doc files

2013-12-19 Thread Raymond Wiker
On Thu, Dec 19, 2013 at 10:01 AM, Charlie Hull char...@flax.co.uk wrote: On 18/12/2013 09:03, Alexandre Rafalovitch wrote: Charlie, Does it mean you are talking to it from a client program? Or are you running Tika in a listen/server mode and building some adapters for standard Solr processes?

Re: Solr hanging when extracting a some broken .doc files

2013-12-19 Thread Augusto Camarotti
Hey Andrea! Thanks for answering. The complete stack trace follows below (the other one is just the same): I'm going to try that modification of the logging level, but I'm seriously considering debugging Tika and trying to fix it myself.

Re: Solr hanging when extracting a some broken .doc files

2013-12-18 Thread Charlie Hull
On 17/12/2013 15:29, Augusto Camarotti wrote: Hi guys, I'm having a problem with Solr when trying to index some broken .doc files. I have set up a test case using Solr to index all the files the users save on the shared directories of the company that I work for and Solr is hanging when

Re: Solr hanging when extracting a some broken .doc files

2013-12-18 Thread Alexandre Rafalovitch
Charlie, Does it mean you are talking to it from a client program? Or are you running Tika in a listen/server mode and building some adapters for standard Solr processes? Regards, Alex. Personal website: http://www.outerthoughts.com/ LinkedIn: http://www.linkedin.com/in/alexandrerafalovitch -

Solr hanging when extracting a some broken .doc files

2013-12-17 Thread Augusto Camarotti
Hi guys, I'm having a problem with Solr when trying to index some broken .doc files. I have set up a test case using Solr to index all the files the users save on the shared directories of the company that I work for and Solr is hanging when trying to index this file in particular (the one

Re: Solr hanging when extracting a some broken .doc files

2013-12-17 Thread Andrea Gazzarini
Hi Augusto, I don't believe the mailing list allows attachments. Could you please post the complete stack trace? In addition, set the logging level of the Tika classes to FINEST in the Solr console; that may be helpful. Best, Andrea On 17 Dec 2013 16:30, Augusto Camarotti augu...@prpb.mpf.gov.br wrote: