[jira] [Updated] (PDFBOX-1886) Merge Function strips OCR layer in acrobat

2014-12-09 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler updated PDFBOX-1886:
---
Fix Version/s: 2.1.0

> Merge Function strips OCR layer in acrobat
> --
>
> Key: PDFBOX-1886
> URL: https://issues.apache.org/jira/browse/PDFBOX-1886
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 1.8.4
>Reporter: adam brin
> Fix For: 2.1.0
>
> Attachments: cover_page4818280580458469287.pdf, page1.pdf, 
> santa-cruz-flats-project-part-2 (1).pdf
>
>
> We use the PDFMergerUtility to add cover pages to documents automatically. 
> We're finding that when we do so, it strips the OCR data from the source of 
> the merged files.
> {code}
> PDFMergerUtility merger = new PDFMergerUtility();
> File outputFile = File.createTempFile();
> merger.setDestinationStream(new FileOutputStream(outputFile));
> for (File file : files) {
> merger.addSource(file);
> }
> merger.mergeDocuments();
> return outputFile;
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (PDFBOX-1886) Merge Function strips OCR layer in acrobat

2014-02-08 Thread John Hewson (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

John Hewson updated PDFBOX-1886:


Component/s: Utilities

> Merge Function strips OCR layer in acrobat
> --
>
> Key: PDFBOX-1886
> URL: https://issues.apache.org/jira/browse/PDFBOX-1886
> Project: PDFBox
>  Issue Type: Bug
>  Components: Utilities
>Affects Versions: 1.8.4
>Reporter: adam brin
> Attachments: cover_page4818280580458469287.pdf, page1.pdf, 
> santa-cruz-flats-project-part-2 (1).pdf
>
>
> We use the PDFMergerUtility to add cover pages to documents automatically. 
> We're finding that when we do so, it strips the OCR data from the source of 
> the merged files.
> {code}
> PDFMergerUtility merger = new PDFMergerUtility();
> File outputFile = File.createTempFile();
> merger.setDestinationStream(new FileOutputStream(outputFile));
> for (File file : files) {
> merger.addSource(file);
> }
> merger.mergeDocuments();
> return outputFile;
> {code}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


[jira] [Updated] (PDFBOX-1886) Merge Function strips OCR layer in acrobat

2014-02-05 Thread adam brin (JIRA)

 [ 
https://issues.apache.org/jira/browse/PDFBOX-1886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

adam brin updated PDFBOX-1886:
--

Attachment: cover_page4818280580458469287.pdf
santa-cruz-flats-project-part-2 (1).pdf
page1.pdf

page1 represents a single page document w/OCR layer from Acrobat; 
santa-cruz-flats-project-part-2 represents the merged result; cover_page*** is 
the cover page.

> Merge Function strips OCR layer in acrobat
> --
>
> Key: PDFBOX-1886
> URL: https://issues.apache.org/jira/browse/PDFBOX-1886
> Project: PDFBox
>  Issue Type: Bug
>Affects Versions: 1.8.4
>Reporter: adam brin
> Attachments: cover_page4818280580458469287.pdf, page1.pdf, 
> santa-cruz-flats-project-part-2 (1).pdf
>
>
> We use the PDFMergerUtility to add cover pages to documents automatically. 
> We're finding that when we do so, it strips the OCR data from the source of 
> the merged files.
> {code}
> PDFMergerUtility merger = new PDFMergerUtility();
> File outputFile = File.createTempFile();
> merger.setDestinationStream(new FileOutputStream(outputFile));
> for (File file : files) {
> merger.addSource(file);
> }
> merger.mergeDocuments();
> return outputFile;
> {code}



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)