[ 
https://issues.apache.org/jira/browse/TIKA-2403?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16068557#comment-16068557
 ] 

Tim Allison commented on TIKA-2403:
-----------------------------------

Thank you for the ping.  Are you able to share the triggering document with us?
If not publicly, can you send it to me privately?  If that won't work, we'll
try to find some other way of diagnosing this.

> Elasticsearch 5.2.2 - Ingest Node - PDF - Parsing Issue
> -------------------------------------------------------
>
>                 Key: TIKA-2403
>                 URL: https://issues.apache.org/jira/browse/TIKA-2403
>             Project: Tika
>          Issue Type: Bug
>            Reporter: Boopathi
>
> We are using Elasticsearch 5.2.2 for full-text search. With the help of an
> ingest node, we are able to parse the content of files that Tika supports. We
> are facing an issue while parsing the content of some PDF files. The content
> of the file is parsed successfully, but the output also includes some
> additional terms that are not part of the document's content (see [sample screen
> shot|https://www.screencast.com/t/AQWK9Rzvrdo8]). Kindly let me know what the
> reason for this is and how it can be fixed.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
