[ https://issues.apache.org/jira/browse/TIKA-1857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15154162#comment-15154162 ]
Tim Allison commented on TIKA-1857:
-----------------------------------

No problem at all. I think this will take some time for me to get right...there's no rush. :)

Do I understand correctly then: no matter whether static or dynamic, try to pull data from XFA; if that doesn't exist, fall back to the AcroForm?

Also, is there an obvious way to determine static vs. dynamic aside from checking to see if there are fields in the AcroForm?

Thank you, again!

> Enhance PDFParser to extract text from XFA forms
> ------------------------------------------------
>
>                 Key: TIKA-1857
>                 URL: https://issues.apache.org/jira/browse/TIKA-1857
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>            Reporter: Pascal Essiembre
>            Priority: Trivial
>              Labels: patch
>             Fix For: 1.13
>
>         Attachments: 041617_filled_out.pdf, xfa_in_govdocs1.txt
>
>
> Extract text from PDF Forms (XFA). Information about XFA:
> https://en.wikipedia.org/wiki/XFA

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)