[jira] [Commented] (TIKA-1857) Enhance PDFParser to extract text from XFA forms

Tim Allison (JIRA) Fri, 19 Feb 2016 04:22:51 -0800

    [ 
https://issues.apache.org/jira/browse/TIKA-1857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15154162#comment-15154162
 ]


Tim Allison commented on TIKA-1857:
-----------------------------------

No problem at all.  I think this will take some time for me to get 
right...there's no rush. :)

Do I understand correctly then: no matter whether static or dynamic, try to 
pull data from XFA; if that doesn't exist, fall back to the AcroForm?

Also, is there an obvious way to determine static vs. dynamic aside from 
checking to see if there are fields in the AcroForm?

Thank you, again!

> Enhance PDFParser to extract text from XFA forms
> ------------------------------------------------
>
>                 Key: TIKA-1857
>                 URL: https://issues.apache.org/jira/browse/TIKA-1857
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>            Reporter: Pascal Essiembre
>            Priority: Trivial
>              Labels: patch
>             Fix For: 1.13
>
>         Attachments: 041617_filled_out.pdf, xfa_in_govdocs1.txt
>
>
> Extract text from PDF Forms (XFA).  Information about XFA: 
> https://en.wikipedia.org/wiki/XFA



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

[jira] [Commented] (TIKA-1857) Enhance PDFParser to extract text from XFA forms

Reply via email to