[jira] [Updated] (PDFBOX-1886) Merge Function strips OCR layer in acrobat
     [ https://issues.apache.org/jira/browse/PDFBOX-1886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Andreas Lehmkühler updated PDFBOX-1886:
---------------------------------------
    Fix Version/s: 2.1.0

> Merge Function strips OCR layer in acrobat
> ------------------------------------------
>
>                 Key: PDFBOX-1886
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1886
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Utilities
>    Affects Versions: 1.8.4
>            Reporter: adam brin
>             Fix For: 2.1.0
>
>         Attachments: cover_page4818280580458469287.pdf, page1.pdf, santa-cruz-flats-project-part-2 (1).pdf
>
>
> We use the PDFMergerUtility to add cover pages to documents automatically. We're finding that when we do so, it strips the OCR data from the source of the merged files.
> {code}
> PDFMergerUtility merger = new PDFMergerUtility();
> File outputFile = File.createTempFile();
> merger.setDestinationStream(new FileOutputStream(outputFile));
> for (File file : files) {
>     merger.addSource(file);
> }
> merger.mergeDocuments();
> return outputFile;
> {code}

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
[jira] [Updated] (PDFBOX-1886) Merge Function strips OCR layer in acrobat
     [ https://issues.apache.org/jira/browse/PDFBOX-1886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

John Hewson updated PDFBOX-1886:
--------------------------------
    Component/s: Utilities

> Merge Function strips OCR layer in acrobat
> ------------------------------------------
>
>                 Key: PDFBOX-1886
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1886
>             Project: PDFBox
>          Issue Type: Bug
>          Components: Utilities
>    Affects Versions: 1.8.4
>            Reporter: adam brin
>         Attachments: cover_page4818280580458469287.pdf, page1.pdf, santa-cruz-flats-project-part-2 (1).pdf
[jira] [Updated] (PDFBOX-1886) Merge Function strips OCR layer in acrobat
     [ https://issues.apache.org/jira/browse/PDFBOX-1886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

adam brin updated PDFBOX-1886:
------------------------------
    Attachment: cover_page4818280580458469287.pdf
                santa-cruz-flats-project-part-2 (1).pdf
                page1.pdf

page1 represents a single page document w/OCR layer from Acrobat; santa-cruz-flats-project-part-2 represents the merged result; cover_page*** is the cover page.

> Merge Function strips OCR layer in acrobat
> ------------------------------------------
>
>                 Key: PDFBOX-1886
>                 URL: https://issues.apache.org/jira/browse/PDFBOX-1886
>             Project: PDFBox
>          Issue Type: Bug
>    Affects Versions: 1.8.4
>            Reporter: adam brin
>         Attachments: cover_page4818280580458469287.pdf, page1.pdf, santa-cruz-flats-project-part-2 (1).pdf