[
https://issues.apache.org/jira/browse/PDFBOX-1138?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Brad Reynolds updated PDFBOX-1138:
----------------------------------
Priority: Blocker
Description:
Text extraction fails for some PDFs under the
following circumstances:
- page is in landscape format
- setSortByPosition is true
was:
[imported from SourceForge]
http://sourceforge.net/tracker/index.php?group_id=78314&atid=552832&aid=1410876
Originally submitted by lenuweit on 2006-01-20 07:22.
Text extraction fails for some PDFs (see attached one
generated by PS printer/Ghostscript) under the
following circumstances:
- page is in landscape format
- setSortByPosition is true
Extraction works fine if page is in portrait format.
[attachment on SourceForge]
http://sourceforge.net/tracker/download.php?group_id=78314&atid=552832&aid=1410876&file_id=164169
testpdfbox.pdf (application/pdf), 5442 bytes
sample PDF (1 page in landscape)
Affects Version/s: 1.6.0
This appears to have been reported before a long time ago but it was attached
as a duplicate to a bug that has been resolved. I'm seeing this problem with
1.6.
> CLONE - Text extraction fails for pages in landscape format
> -----------------------------------------------------------
>
> Key: PDFBOX-1138
> URL: https://issues.apache.org/jira/browse/PDFBOX-1138
> Project: PDFBox
> Issue Type: Bug
> Components: Text extraction
> Affects Versions: 1.6.0
> Reporter: Brad Reynolds
> Priority: Blocker
>
> Text extraction fails for some PDFs under the
> following circumstances:
> - page is in landscape format
> - setSortByPosition is true
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators:
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira