[
https://issues.apache.org/jira/browse/PDFBOX-3058?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14973149#comment-14973149
]
Tilman Hausherr edited comment on PDFBOX-3058 at 10/25/15 12:32 PM:
--------------------------------------------------------------------
excel file of my work from yesterdayr, my comments are in column Q.
- Some files can be extracted in 1.8, but not in 2.0 and not in Adobe Reader
- Some files cannot be opened with Adobe Reader but with PDFBox 1.8 and PDF.JS
-
was (Author: tilman):
excel file of my work from yesterday (see last column)
Some files can be extracted in 1.8, but not in 2.0 and not in Adobe Reader, see
my comments in column Q.
> Support TIKA Migration to PDFBox 2.0
> ------------------------------------
>
> Key: PDFBOX-3058
> URL: https://issues.apache.org/jira/browse/PDFBOX-3058
> Project: PDFBox
> Issue Type: Bug
> Affects Versions: 2.0.0
> Reporter: Maruan Sahyoun
> Attachments: content_diffs-1.8-to-2.0.xlsx
>
>
> This issue is to track fixing issues which came up as part of TIKA-1285
> (Upgrade to PDFBox 2.0.0 when available) mainly
> - new exceptions compared to PDFBox 1.8.x
> - regressions in text extraction
> - lower quality text extraction
> There should be individual issues to track tasks/bugs arising from that.
--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]