[ https://issues.apache.org/jira/browse/TIKA-810?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Jeremy Anderson updated TIKA-810: --------------------------------- Attachment: pdfbox-1.7.0.diff Upgraded to 1.7.0 in revision 1213227 as of 2011-12-12. Change is to TestCase where annotation text extraction is now off by default in PDFBox. (Appeared to be on in 1.6.0 release but no longer is in 1.7.0 daily) Note, a proper fix may be required to change the Tika PDF Parser to turn on annotation extraction by default and then modify the test case appropriately. Or to submit a fix in PDF box to have 1.7.0 behave the same as 1.6.0. > Upgrade to PDFbox 1.7.0 as available > ------------------------------------ > > Key: TIKA-810 > URL: https://issues.apache.org/jira/browse/TIKA-810 > Project: Tika > Issue Type: Improvement > Components: parser > Affects Versions: 1.0 > Reporter: Jeremy Anderson > Priority: Minor > Attachments: pdfbox-1.7.0.diff > > > This isssue is to track upgrading the PDFbox dependency 1.7.0 Final once it's > available, and the daily build before then -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira