[
https://issues.apache.org/jira/browse/PDFBOX-1132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jukka Zitting resolved PDFBOX-1132.
-----------------------------------
Resolution: Fixed
Fix Version/s: 1.7.0
Assignee: Jukka Zitting
Patches committed in revisions 1182880 and 1182881.
> Add Tika parsers for PDF and TTF
> --------------------------------
>
> Key: PDFBOX-1132
> URL: https://issues.apache.org/jira/browse/PDFBOX-1132
> Project: PDFBox
> Issue Type: New Feature
> Components: FontBox, Parsing
> Reporter: Jukka Zitting
> Assignee: Jukka Zitting
> Fix For: 1.7.0
>
> Attachments: 0001-PDFBOX-1132-Add-Tika-parser-classes.patch,
> 0002-PDFBOX-1132-Add-Tika-parser-classes.patch
>
>
> The PDF and TTF parsers in Apache Tika rely more on improvements in PDFBox
> than on those in Tika, so it would make more sense for that code to reside
> inside Apache PDFBox.
> Having the code inside PDFBox would allow for tighter integration with PDFBox
> internals and avoid need to wait for an official PDFBox release before new
> features can be used inside the PDF and TTF parsers.
> To do this, I'd migrate the code PDF and TTF parser classes and related test
> cases and files from Tika to the PDFBox and FontBox components. We'd add an
> optional dependency to tika-core to these components, so people who don't use
> or need Tika wouldn't be affected.
> I'll attach a patch with the proposed changes.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira