[ 
https://issues.apache.org/jira/browse/TIKA-1857?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15154274#comment-15154274
 ] 

Tim Allison commented on TIKA-1857:
-----------------------------------

Doh! Sorry.  I was looking at PDXFAResource.  Thank you, again.

bq. PDF 2.0 as there XFA is deprecated 

Oh, no...I guess we could copy/paste from the current PDFBox if XFA goes away 
in PDFBox...less than ideal. I don't see deprecation tags in PDXFAResource or 
PDAcroForm's {{getXFA()}}...which XFA handling might go away?

> Enhance PDFParser to extract text from XFA forms
> ------------------------------------------------
>
>                 Key: TIKA-1857
>                 URL: https://issues.apache.org/jira/browse/TIKA-1857
>             Project: Tika
>          Issue Type: Improvement
>          Components: parser
>            Reporter: Pascal Essiembre
>            Priority: Trivial
>              Labels: patch
>             Fix For: 1.13
>
>         Attachments: 041617_filled_out.pdf, xfa_in_govdocs1.txt
>
>
> Extract text from PDF Forms (XFA).  Information about XFA: 
> https://en.wikipedia.org/wiki/XFA



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to