subject:"Bug in ExtractingRequestHandler"

Bug in ExtractingRequestHandler

2016-06-10 Thread Gilbert Boyreau

Hello, I think there's a bug in the |ExtractingRequestHandler|Handler (Tika parser). Some tika's exception are not catch, and the handler return a 0 status, indicating no problem's with that content. I give a look at the code (Solr 5.1, ExtractingDocumentLoader:221), only TikaException are c

Re: bug in ExtractingRequestHandler with PDFs and metadata field Category

2011-07-07 Thread Juan Grande

Hi Andras, I added metadata_ so all PDF metadata fields > should be saved in solr as "metadata_something" fields. > The problem is that the "Category" metadata field from the PDF for some > reason is not prefixed with "metadata_" and > solr will merge the "Category" field I have in the schema with

bug in ExtractingRequestHandler with PDFs and metadata field Category

2011-07-07 Thread Andras Balogh

Hi, I think this is a bug but before reporting to issue tracker I thought I will ask it here first. So the problem is I have a PDF file which among other metadata fields like Author, CreatedDate etc. has a metadata field Category (I can see all metadata fields with tika-app.jar started in