AW: Save a pdf online

2020-01-22 Thread Tilman Hausherr
save() works with files and output streams. To upload to a website is not a file name and is completely different and depends on the website, e.g. ftp, sftp, web UI etc.. This is outside of PDFBox, sorry. You'll have to find out what your web hoster offers. Tilman

Re: can not run application in pdfbox-examples-2.0.18.jar

2020-01-21 Thread Tilman Hausherr
The example subproject is just example source code. There are maybe 50 different "main" there. Depending on your IDE, you can run the files directly there. Or copy what you need in your own projects. Tilman Am 21.01.2020 um 21:37 schrieb Phanu Choomsit: When I try to execute

Re: PDFStreamEngine swallows errors when jbig2-imageio is not installed

2020-01-10 Thread Tilman Hausherr
Hi, Call ImageIO.getImageReadersByFormatName() and see what result you get. Tilman Am 09.01.2020 um 01:00 schrieb Eduardo Quintanilla: Hello, I would like to know if the Apache PDFBox JBIG2 plugin is loaded to solve a weird bug where the plugin is not present and the JAR jbig2-imageio is

Re: Parsing order issue

2020-01-10 Thread Tilman Hausherr
le columns in a page, as it follows an order of left->right, and top->down.  Pls see attached images. Is it possible you can advise me on how to deal with such case? Thanks so much. Luke On Tue, 7 Jan 2020 at 13:00, Tilman Hausherr <mailto:thaush...@t-online.de>> wrote

Re: Parsing order issue

2020-01-06 Thread Tilman Hausherr
[2nd attempt] From my understanding, when you want to use sortbyposition in tika, you need to have a segment like this: ...             true         ... so your whole file would be like:             application/pdf               

Re: Java Heap Space Issue

2020-01-01 Thread Tilman Hausherr
Hi, Try using "MemoryUsageSetting.setupTempFileOnly()" as 2nd parameter when loading your document. And you can slightly lessen the memory footprint by not using "data". Just pass a FileInputStream to PDEmbeddedFile(). This is poorly documented / there is a javadoc bug - the input stream is

Re: Extracting graphics primitives by subclassing PageDrawer

2019-12-31 Thread Tilman Hausherr
Am 31.12.2019 um 10:46 schrieb Peter Murray-Rust: First, many thanks for the quick response and your patience. I On Tue, Dec 31, 2019 at 5:49 AM Tilman Hausherr wrote: Can we simplify this question and focus on one single thing? Is the core of the question that the circle on the bottom left

Re: Extracting graphics primitives by subclassing PageDrawer

2019-12-30 Thread Tilman Hausherr
Can we simplify this question and focus on one single thing? Is the core of the question that the circle on the bottom left is filled with red color, but it shouldn't be? THat |PDGraphicsstroke:AWTfill:java.awt.Color[r=255,g=0,b=0] should be ||PDGraphicsstroke:AWTfill:null| || ? If so,

Re: PDListBox Not Displayed Correctly After setValue()

2019-12-30 Thread Tilman Hausherr
the bug by setting a fixed font size for the listbox. Tilman On Mon, Dec 30, 2019 at 12:27 PM Tilman Hausherr wrote: Hi, Please share the file. Preferably not the sharehoster from last time. Use dropbox / google drive, or filedropper.com. Tilman Am 30.12.2019 um 17:21 schrieb Robert

Re: PDListBox Not Displayed Correctly After setValue()

2019-12-30 Thread Tilman Hausherr
Hi, Please share the file. Preferably not the sharehoster from last time. Use dropbox / google drive, or filedropper.com. Tilman Am 30.12.2019 um 17:21 schrieb Robert Pepersack: Hi all, Thanks in advance for your help. I have created a PDF file using Foxit PhantomPDF. I'm setting the

Re: Delete an Image

2019-12-28 Thread Tilman Hausherr
Get the RemoveAllText.java example from the source code download, then modify it so that it catches the "Do" operator. For example "/Im4 Do". Here "Im4" is the name of the image. Rewrite the content stream without these two tokens. If you want to delete the image from the resources (because it

Re: Date Field Contents Not Visible

2019-12-24 Thread Tilman Hausherr
Am 24.12.2019 um 22:27 schrieb Robert Pepersack: The file from Foxit causes an exception to be thrown. What exception? Please include the stack trace. Also make sure to update to 2.0.18. Tilman - To unsubscribe, e-mail:

Re: Date Field Contents Not Visible

2019-12-24 Thread Tilman Hausherr
e) { throw new AssistBusinessException(e); } } } On Tue, Dec 24, 2019 at 10:43 AM Tilman Hausherr wrote: Am 23.12.2019 um 22:09 schrieb Robert Pepersack: 2. Run my Java program that loads the .pdf file, populates its form fields, and saves it to a new file on my local disk. What code did you

Re: Date Field Contents Not Visible

2019-12-24 Thread Tilman Hausherr
Am 23.12.2019 um 22:09 schrieb Robert Pepersack: 2. Run my Java program that loads the .pdf file, populates its form fields, and saves it to a new file on my local disk. What code did you use to populate the date field, and what PDFBox version did you use (the current version is 2.0.18)? I'm

Re: Date Field Contents Not Visible

2019-12-23 Thread Tilman Hausherr
Please upload the PDF to a sharehoster, most attachments are blocked. Tilman Am 23.12.2019 um 22:09 schrieb Robert Pepersack: Hi, I have used PDFBox to create a .pdf file from a .pdf file that I created in Adobe Acrobat Pro DC version 2019.021.20061. Here is the process I've been working on:

Re: Single directory as source of all available fonts

2019-12-21 Thread Tilman Hausherr
Am 20.12.2019 um 17:58 schrieb Guillem Saint-Germes: Hello there, In order to be system agnostic when deploying a service which use PDFBox 2.0.16, I would like to use a single specific directory as the source of all available fonts (regardless of the system fonts). Looking at the code it seems

Re: Parsing order issue

2019-12-20 Thread Tilman Hausherr
I answered, asked to have a look at your file (upload to a sharehoster), and mentioned that your config file is suspicious. I found your file (it was in the moderation mail), and that is a typical case where the PDF order is different to the visual order. That is what the sort option in

Re: Parsing order issue

2019-12-20 Thread Tilman Hausherr
I answered, asked to have a look at your file (upload to a sharehoster), and mentioned that your config file is suspicious. Tilman Am 20.12.2019 um 19:06 schrieb Lu Sun: Dear PDFBox Dev Team, Hope this message finds you well. Just wanted to raise this for your attention. Please can you

Re: Alignment Issue

2019-12-19 Thread Tilman Hausherr
I got your file through moderation and it works for me, here's with field "Player 1" set http://www.filedropper.com/line-up-22-saved_1 Tilman - To unsubscribe, e-mail: users-unsubscr...@pdfbox.apache.org For additional

Re: Could not instantiate SVGImageReader

2019-12-18 Thread Tilman Hausherr
Am 18.12.2019 um 19:12 schrieb Wade Polk: This is very strange… I started seeing this error pop up. I never saw it before… I even reverted my code back a few days when the error wasn’t present but it’s still there! What’s really weird is that it appears to still function fine after this. Anyone

Re: Parsing order issue

2019-12-17 Thread Tilman Hausherr
I already answered... we need the PDF. But... about the config:             image/jpeg   application/pdf   class="org.apache.tika.parser.executable.ExecutableParser"/>                   application/pdf       Is this a correct setting for PDFs in tika? I notice

Re: Pdf upload error

2019-12-17 Thread Tilman Hausherr
Hi, Please tell more details. When did you get this error? What did you do (you mention "upload")? What jdk version are you using? Did it come when downloading from the browser, or when running it, and which parameters did you use? Also use the current version, which is 2.0.17. I don't

Re: Is it possible to set the line height in a PDTextField?

2019-12-16 Thread Tilman Hausherr
s really advanced, this won't be done in an hour or two IMHO. Tilman On Mon, Dec 16, 2019 at 8:21 PM Tilman Hausherr wrote: Hi, Change the rectangle of the annotation widget(s). Tilman Am 17.12.2019 um 02:31 schrieb Jesse Zhou: Hello, I'm trying to set the line height in a PDTextField

Re: Is it possible to set the line height in a PDTextField?

2019-12-16 Thread Tilman Hausherr
Hi, Change the rectangle of the annotation widget(s). Tilman Am 17.12.2019 um 02:31 schrieb Jesse Zhou: Hello, I'm trying to set the line height in a PDTextField. I'm able to figure out how to configure the font size, but not the line height. I'm doing this because we need to be able to fit

Re: Alignment Issue

2019-12-15 Thread Tilman Hausherr
Hi, Please upload your PDF to a sharehoster. Tilman Am 16.12.2019 um 08:22 schrieb Baskar Murugan: Hi Team, I've found some PDFs where if you just type in a value in PDF reader, it will be left-aligned, but if we use pdfbox to set the value in the form, the text is center-aligned. I

Re: PDFBox 1.8.10 Question

2019-12-12 Thread Tilman Hausherr
figure out a way to replicate the signature image that you see when signing in Acrobat (just using a random jpg). It is possible to put a signature into an empty signature field in 2.0 with the code I posted yesterday. I haven't tested it for 1.8. Tilman -Original Message----- From: Tilma

Re: PDFBox 1.8.10 Question

2019-12-12 Thread Tilman Hausherr
on signPDF() is where the incremental save is for the signature. -Original Message----- From: Tilman Hausherr Sent: December 12, 2019 12:21 PM To: users@pdfbox.apache.org Subject: Re: PDFBox 1.8.10 Question Am 12.12.2019 um 14:31 schrieb Wade Polk: https://drive.google.

Re: PDFBox 1.8.10 Question

2019-12-12 Thread Tilman Hausherr
and that did have an incremental save. Tilman -Original Message- From: Tilman Hausherr Sent: December 11, 2019 11:27 PM To: users@pdfbox.apache.org Subject: Re: PDFBox 1.8.10 Question You responded to the wrong thread, so I am moving this back here... I can't see your code except two

AW: RE: PDFBox 1.8.10 Question

2019-12-12 Thread Tilman Hausherr
I try to do both at the same time I break one or the other. -Original Message- From: Tilman Hausherr mailto:thaush...@t-online.de> > Sent: December 11, 2019 11:27 PM To: users@pdfbox.apache.org <mailto:users@pdfbox.apache.org> Subject: Re: PDFBox 1.8.10 Question You responded t

Re: PDFBox 1.8.10 Question

2019-12-11 Thread Tilman Hausherr
'm missing something, any help is appreciated. -Original Message----- From: Tilman Hausherr Sent: December 11, 2019 1:46 PM To: users@pdfbox.apache.org Subject: Re: Page boxes differ between Acrobat and PDFBox Ouch, I've known about this, but put it on low priority because I've never seen s

Re: Page boxes differ between Acrobat and PDFBox

2019-12-11 Thread Tilman Hausherr
Ouch, I've known about this, but put it on low priority because I've never seen such a file before. I've opened https://issues.apache.org/jira/browse/PDFBOX-4706 I'm not yet sure what to do, but COSName.USER_UNIT is the minimum. Getter/Setter probably too. Tilman Am 11.12.2019 um 18:31

Re: PDFBox 1.8.10 Question

2019-12-11 Thread Tilman Hausherr
Am 11.12.2019 um 14:03 schrieb Wade Polk: 3 days in trying to learn PDFBox 1.8.10 source code examples. I think I finally got it to run, CreateVisibleSignatures.java. The output was the following indicating I need to pass in some arguments: java

Re: PDFBox 1.8.10 Question

2019-12-10 Thread Tilman Hausherr
;*groupId*>com.levigo.jbig2 <*artifactId*>levigo-jbig2-imageio <*version*>1.6.2 <*scope*>test -----Original Message- From: Tilman Hausherr Sent: December 10, 2019 8:50 AM To: users@pdfbox.apache.org Subject: AW: Re: PDFBox 1.8.10 Question or download the sourc

Re: Question about Checkboxes

2019-12-10 Thread Tilman Hausherr
Hi, I recommend the CreateCheckBox.java example from the source code download. That is very close to what Adobe is doing. (But all the answers by mkl are great too) I use only the "normal" appearance, that is the minimum needed. Having them empty is probably worse than not having them at

AW: Re: PDFBox 1.8.10 Question

2019-12-10 Thread Tilman Hausherr
or download the sources for that version and search for CreateVisibleSignature.java and first get it to run with your certificate / pin / keystore / PKI device etc. Tilman Gesendet mit der Telekom Mail App

Re: PDFBox 2.0.17 Fonts on Linux

2019-12-04 Thread Tilman Hausherr
oid involving the System Administrators to find the correct installers that will contain these 14 fonts. I'd rather assemble this myself and provide it for use on the server. Not sure if the newer versions support property files to load .TTF fonts. Thanks Derryl. On Wed, Dec 4, 2019 at 1:55 PM

Re: PDFBox 2.0.17 Fonts on Linux

2019-12-04 Thread Tilman Hausherr
Hi, Please download  the fonts and put them into the font directory of your OS. https://www.pcworld.com/article/2863497/how-to-install-microsoft-fonts-in-linux-office-suites.html Tilman Am 04.12.2019 um 15:52 schrieb Derryl Varghese: First post on this list. my SO question -

Re: TextPosition.equal() fails after getDir()

2019-12-01 Thread Tilman Hausherr
Fixed in https://issues.apache.org/jira/browse/PDFBOX-4701 Tilman Am 27.11.2019 um 17:47 schrieb Esteban R: This is a simplification of a real life problem: when I compare two TextPositions, equality changes after calling getDir() in one of the TextPositions. Code to reproduce:

Re: TextPosition.equal() fails after getDir()

2019-12-01 Thread Tilman Hausherr
Ouch. This is because "direction" is used in equality and it isn't needed, because "matrix" is used as well. Tilman Am 27.11.2019 um 17:47 schrieb Esteban R: This is a simplification of a real life problem: when I compare two TextPositions, equality changes after calling getDir() in one of

Re: NullPointerException in PDSimpleFont.isStandard14()

2019-11-29 Thread Tilman Hausherr
Hi, This was fixed but not yet released: https://issues.apache.org/jira/browse/PDFBOX-4682 Until then, check that the font is not type 3 before calling isStandard14(). Tilman Am 29.11.2019 um 20:31 schrieb Esteban R: I get a NullPointerException while trying to find out if a particular font

Re: After adding validation information, Adobe Reader says Signature is not LTV enabled

2019-11-26 Thread Tilman Hausherr
documents? I hope I now explained a little better. :-) Best regards, Predrag Sent from my ZX Spectrum 48K. Original message From: Tilman Hausherr Date: 26/11/2019 21:42 (GMT+01:00) To: users@pdfbox.apache.org Subject: Re: After adding validation information, Adobe Reader says

Re: After adding validation information, Adobe Reader says Signature is not LTV enabled

2019-11-26 Thread Tilman Hausherr
somewhat on this matter? Thank you again and best regards, Predrag -Original Message- From: Tilman Hausherr [mailto:thaush...@t-online.de] Sent: Sunday, November 24, 2019 11:41 AM To: users@pdfbox.apache.org Subject: Re: After adding validation information, Adobe Reader says Signature

Re: After adding validation information, Adobe Reader says Signature is not LTV enabled

2019-11-24 Thread Tilman Hausherr
endless loop or recursions. Tilman Am 19.11.2019 um 10:55 schrieb Predrag Stojković: That is valuable information regarding TSA. Thank you again for your great efforts in resolving this issue. :) Best regards, Predrag -Original Message- From: Tilman Hausherr [mailto:thaush...@t-online.de

Re: debugger graphic symbols documentation?

2019-11-20 Thread Tilman Hausherr
Hi, There isn't. If there is a number, e.g. [1 0 r] then it's indirect. The blue <<>> is a dictionary, the violet one with the length is a stream. Tilman PS the image didn't get through. Am 20.11.2019 um 21:15 schrieb Jason Pyeron: I was looking for the documentation on the debugger, where

Re: ExceptionInInitializerError from LCMS / ICC Profile

2019-11-18 Thread Tilman Hausherr
Am 18.11.2019 um 01:20 schrieb Charles Duffy: Howdy -- I'm unable to initialize the PDDocument class in PDFBox 2.0.17, as it throws an ExceptionInInitializerError from inside ICC_Profile code on first reference. I'm using AdoptOpenJDK 11.0.4, but also (to my great surprise!) reproduced the

Re: After adding validation information, Adobe Reader says Signature is not LTV enabled

2019-11-17 Thread Tilman Hausherr
Am 14.11.2019 um 20:48 schrieb Tilman Hausherr: My next step will be to sign and timestamp and validate a file with freetsa to see if this works. (this weekend, hopefully) So I had a look... freetsa does have a self-signed root, but it isn't returned when signing. https://safestamper.com

Re: After adding validation information, Adobe Reader says Signature is not LTV enabled

2019-11-14 Thread Tilman Hausherr
Yeah I will. Thanks also to Marc for your analysis! Tilman Thank you again and best regards, Predrag -Original Message----- From: Tilman Hausherr [mailto:thaush...@t-online.de] Sent: Thursday, November 14, 2019 8:49 PM To: users@pdfbox.apache.org Subject: Re: After adding validation informat

Re: After adding validation information, Adobe Reader says Signature is not LTV enabled

2019-11-14 Thread Tilman Hausherr
Am 14.11.2019 um 08:26 schrieb Tilman Hausherr: I will work on this topic again tonight, hopefully. Current status: I found out why the OCSP problem didn't happen, this is because of a problem with recent BC versions. I changed that and now I hit the OCSP error, and it recovers. and uses CRL

Re: PDLineAppearanceHandler

2019-11-14 Thread Tilman Hausherr
Am 14.11.2019 um 09:21 schrieb Jens Bruhn-Hansel: Hi, thanks for your support. We are using PDFBox 2.0.16 and it looks like PDAnnotationLine.constructAppearances() makes the annotation rectangle a lot larger when increasing the border width to values like 20, ... The rectangle doesn't shrink

Re: Parsing huge PDF (400Mb, 2700 pages)

2019-11-14 Thread Tilman Hausherr
The PDF can be much bigger than 3GB when decompressed. What you could try 1) using a scratch file (will be even slower) when opening the document 2) the on-demand parser, see https://issues.apache.org/jira/browse/PDFBOX-4569 there is a branch on the svn server, you have to build from source.

Re: After adding validation information, Adobe Reader says Signature is not LTV enabled

2019-11-13 Thread Tilman Hausherr
Am 13.11.2019 um 22:11 schrieb Predrag Stojković: protected static String getCrlUrlFromExtensionValue(byte[] extensionValue) throws IOException Please use this (this one doesn't abort) instead and retry. I will work on this topic again tonight, hopefully. protected static String

Re: After adding validation information, Adobe Reader says Signature is not LTV enabled

2019-11-13 Thread Tilman Hausherr
Am 13.11.2019 um 01:45 schrieb Predrag Stojković: Hello all. I’m trying to use Apache PDFBox 2.0.17 to ellectronically sign some documents, including LTV information. I have followed the examples provided in class CreateSignature to sign the document, and in class AddValidationInformation

Re: Extract images and get occurrence of same image

2019-11-13 Thread Tilman Hausherr
Am 13.11.2019 um 09:33 schrieb Damien Levasseur: Hello all, When i extract images (version 2.0.17, using PDResources, COSName, PDXObject, PDImageXObject), i correctly get all distinct images, but same image is extracted only once. In the pdf file i'm trying to work on, there is one image

Re: After adding validation information, Adobe Reader says Signature is not LTV enabled

2019-11-12 Thread Tilman Hausherr
Hi, Attachments don't get through. You need to upload the files to a sharehoster. Tilman Am 13.11.2019 um 01:45 schrieb Predrag Stojković: Hello all. I’m trying to use Apache PDFBox 2.0.17 to ellectronically sign some documents, including LTV information. I have followed the examples

Re: default resources and appearance

2019-11-11 Thread Tilman Hausherr
need to set the "resetContent" parameter to true. That is the "fifth parameter" I mentioned. re those "labels", what could be done is to see how another PDF does it, by looking at it with PDFDebugger. Tilman Dan. On Monday, November 11, 2019, 01:28:38 P

Re: slf4j class not found exception

2019-11-11 Thread Tilman Hausherr
Hi, PDFBox does not require slf4j, so the import isn't needed unless you use it yourself. Maybe something else uses it. The exception NoClassDefFoundError means that slf4j jar file(s) is missing in your class path. So you need to check your project whether it is there or not. Did you create

Re: default resources and appearance

2019-11-11 Thread Tilman Hausherr
those "labels", what could be done is to see how another PDF does it, by looking at it with PDFDebugger. Tilman Dan. On Monday, November 11, 2019, 01:28:38 PM EST, Tilman Hausherr wrote: Am 11.11.2019 um 19:18 schrieb Dan Forth:   Th

Re: default resources and appearance

2019-11-11 Thread Tilman Hausherr
catch (IOException e1) { // TODO Auto-generated catch block e1.printStackTrace(); } try { textFieldHomePhone.setValue("(###)-###-"); } catch (IOException e1) { // TODO Auto-generated catch block e1.printStackTrace(); } On Monday, November 11, 2019, 01:11:38 PM EST, Ti

Re: default resources and appearance

2019-11-11 Thread Tilman Hausherr
Hi, There is an example on how to change the default appearance in existing fields https://stackoverflow.com/questions/47995062/pdfbox-api-how-to-change-font-to-handle-cyrillic-values-in-an-acroform-field and see also the CreateSimpleFormWithEmbeddedFont.java example. Maybe this helps.

Re: Unable to find documentation on using PDRadioButton.setValue with regards to PDDocument.saveIncremental (selected appearance not show in PDF viewer)

2019-11-07 Thread Tilman Hausherr
Am 07.11.2019 um 18:35 schrieb Jason Pyeron: If I have anything re-usable, will make a pull request for examples. Rather open an issue in JIRA, the github is read only. re radio buttons, see also in the examples subproject CreateRadioButtons.java. I'm not sure if your question is about

Re: Unable to find documentation on using PDRadioButton.setValue with regards to PDDocument.saveIncremental (selected appearance not show in PDF viewer)

2019-11-07 Thread Tilman Hausherr
Am 07.11.2019 um 14:07 schrieb Jason Pyeron: Most of the form is saving and displaying just fine, since we carefully call setNeedToBeUpdated(true) after setting fields. Radio buttons seem to require a bit more. The actual value is there, but the display appearance is blank when going from

AW: PDFBox library

2019-11-07 Thread Tilman Hausherr
please try sort: https://pdfbox.apache.org/docs/2.0.7/javadocs/org/apache/pdfbox/text/PDFTextStripper.html#setSortByPosition(boolean)

Re: Memory issues

2019-11-06 Thread Tilman Hausherr
Am 06.11.2019 um 08:25 schrieb Ralf Baumert: Hello list, I'm trying to render rather large pdf files with pdfBox (current) and I'm running into memory issues. I created the PDDocument with .setupTempFileOnly() and I can see it's creating a scratch file. However it still consumes loads of

Re: Problem with AddValidationInformation.validateSignature

2019-11-05 Thread Tilman Hausherr
Am 05.11.2019 um 13:29 schrieb Predrag Stojković: Hello, I'm using Apache PDFBox to sign a PDF document, and to add validation information. For that purpose I've used the examples provided in classes CreateSignature and AddValidationInformation. Creating signature works fine, but there is an

Re: Rendering only the annotations, possible?

2019-11-04 Thread Tilman Hausherr
Am 05.11.2019 um 06:54 schrieb Tilman Hausherr: i.e. temporarly moving /Contents away and put it back. Alternatively try creating a new PDPage object with the same MediaBox + CropBox than the old one. Tilman

Re: Rendering only the annotations, possible?

2019-11-04 Thread Tilman Hausherr
age that has no content stream, i.e. temporarly moving /Contents away and put it back. Tilman  jens Am 01.11.2019 um 17:43 schrieb Tilman Hausherr: Hello Jens, This can be done by modifying PageDrawer.java or make your own. Override PageDrawer.processPage() to do nothing (see the sourc

Re: Apache pdfbox multiple indian language support

2019-11-01 Thread Tilman Hausherr
Hello Priyank, Only english is really supported, sorry about that. Complex scripts are not supported, there was an effort on Bengali but it has stalled. https://issues.apache.org/jira/browse/PDFBOX-4189 Tilman Am 01.11.2019 um 10:15 schrieb Priyanka Bansal: Hello, We want to take a tool to

Re: Rendering only the annotations, possible?

2019-11-01 Thread Tilman Hausherr
Hello Jens, This can be done by modifying PageDrawer.java or make your own. Override PageDrawer.processPage() to do nothing (see the source code of PageDrawer.drawPage() to see why). Read the class javadoc of PageDrawer because of the difficulties to subclass it. Tilman Am 01.11.2019 um

Re: Exact PDF text - add it back as an annotation

2019-11-01 Thread Tilman Hausherr
Am 01.11.2019 um 15:53 schrieb John Lussmyer: On Tue Oct 29 21:59:57 PDT 2019 thaush...@t-online.de said: IIRC tesseract can do this. Not as annotation, but as invisible font. As far as I can tell, it does it the same way that other programs do. It's added to the content stream, mixed with all

Re: Exact PDF text - add it back as an annotation

2019-10-29 Thread Tilman Hausherr
Am 30.10.2019 um 03:48 schrieb John Lussmyer: I have a bunch of PDF files that have had an OCR package run against them. The problem is that it adds the text to the normal Page content, and tries to position the recognized text at the location in the image it was found. So the text is mixed

Re: PDFBox - Extracting information about keyword/word count and search from multiple PDF files.

2019-10-22 Thread Tilman Hausherr
Am 22.10.2019 um 15:31 schrieb alist...@seznam.cz: Hello, I was looking at your tool for managing and extracting data from .pdf documents and I’d like to ask you following. Does your library allow keyword search/count from multiple .pdf format as well as counting all words (also, is it

Re: performance issue in v 2.0.17: PDImageXObject.getImage() + MemoryUsageSetting.setupTempFileOnly

2019-10-21 Thread Tilman Hausherr
Am 21.10.2019 um 18:09 schrieb Esteban R: I have tried with v 2.0.18. It still contains a call to file.length() in the following if: if (fileLen != raf.length() || fileLen != file.length()) Performance has improved compared with v.2.0.17 but is still slower than

Re: performance issue in v 2.0.17: PDImageXObject.getImage() + MemoryUsageSetting.setupTempFileOnly

2019-10-21 Thread Tilman Hausherr
le to v.2.0.16 in v.2.0.17 by now. Esteban ____ De: Tilman Hausherr Enviado: viernes, 18 de octubre de 2019 18:07 Para: users@pdfbox.apache.org Asunto: Re: performance issue in v 2.0.17: PDImageXObject.getImage() + MemoryUsageSetting.setupTempFileOnly Hi, Could you try

Re: Export PDAnnotationLink as external XFDF File

2019-10-21 Thread Tilman Hausherr
Hi, I don't think this is supported at all. I looked at it and only acroform export is supported. You could of course do it yourself, if you have a model export file. It will probably be a lot of work, I think I saw one a few months ago. Tilman Am 21.10.2019 um 14:39 schrieb Arun Singh:

Re: performance issue in v 2.0.17: PDImageXObject.getImage() + MemoryUsageSetting.setupTempFileOnly

2019-10-18 Thread Tilman Hausherr
n't use temp files (maybe the machine was more loaded during the tests?) Columns: v2.0.16_with_tempfile v2.0.16_without_tempfile v2.0.17_with_tempfile v2_0_17_without_tempfile 1635 1410 4304 2581 1642 1528 4304 2585 1652 1373 4483 2662 1652 1461 4556 2727 1684 1441 4626 2835 1833 1339 4362 2610 1863 1641 4436

Re: How can I use PDF Template and simply fill in the fields by identifier/name?

2019-10-17 Thread Tilman Hausherr
Am 17.10.2019 um 15:50 schrieb Oscar Vargas: Is there any sample/test/example out there that I can use to start? Thanks in advance! Please download the source code, go to the example subproject and then go to the "org.apache.pdfbox.examples.interactive.form" package. I don't know what you

Re: PDF BOX ( Replace Text Jar)

2019-10-15 Thread Tilman Hausherr
Am 16.10.2019 um 07:23 schrieb Alagarsamy P: Hi Team, I am a coldfusion developer. I want to replace existed text with new text in PDF. I've got the link via google ( https://stackoverflow.com/questions/35420609/pdfbox-2-0-rc3-find-and-replace-text ).I've tried this java code to

Re: Assistance upgrading PDFBox from v1.8 to v2.0

2019-10-15 Thread Tilman Hausherr
I don't want to learn testcomplete, but assuming this is about converting PDFs to images, see here: https://stackoverflow.com/questions/23326562/apache-pdfbox-convert-pdf-to-images/23327024#23327024 So what you have to do is to change these lines into your language: |newPDFRenderer(document);

Re: PDFBOX

2019-10-15 Thread Tilman Hausherr
Am 15.10.2019 um 07:41 schrieb Alagarsamy P: Hi Teams, My PDF doesn't having form fields. But i want to update/Replace existed text with new text .Is there any possibilities ?. Not really, see https://pdfbox.apache.org/2.0/migration.html#why-was-the-replacetext-example-removed

Re: PDFBOX

2019-10-14 Thread Tilman Hausherr
Am 14.10.2019 um 11:30 schrieb Alagarsamy P: Hi teams, I have a Non Editable PDF. I can be able to read that PDF text using getText(). But i want to replace particular text instead of existed text.There is any possibilities to replace/update text in NonEditable PDF? Not really (not

Re: Reg: Pdf to Png Doubt

2019-10-05 Thread Tilman Hausherr
Hi, PDFBox doesn't support converting all pages into one huge image file. But it is answered here (see also my comment): https://stackoverflow.com/questions/58014451/ Tilman Am 05.10.2019 um 10:47 schrieb Murugesan, Sathesh (TNQ Software): Hi Team, I  have a doubt. I am using following

Re: performance issue in v 2.0.17: PDImageXObject.getImage() + MemoryUsageSetting.setupTempFileOnly

2019-09-26 Thread Tilman Hausherr
Ouch... Might be the workaround implemented for this issue: https://issues.apache.org/jira/browse/PDFBOX-4601 Tilman Am 26.09.2019 um 21:39 schrieb Esteban R: Hello. I'm getting a timeout in one of my tests after upgrading to v 2.0.17: PDImageXObject.getImage() takes more than 1:10 minutes

Re: question on characters not showing

2019-09-26 Thread Tilman Hausherr
Because that is the content. The "greek" annotation has this:

Re: Need Help on Check Box Background Color

2019-09-25 Thread Tilman Hausherr
und and borderColor to preview in all the browser compatibility. I need like below attached image for all the three browser edgeOutPutwithTextCheckbox.PNG If any help on this will greatly appreciated.. On Tue, Sep 24, 2019 at 10:38 PM Tilman Hausherr mailto:thaush...@t-online.de>&

Re: Rogue characters corrupting AcroForm object

2019-09-24 Thread Tilman Hausherr
://www.imgload.org/image/5IcLx I've no problem moving to the latest release of PDFBox, but the problem has only occurred in three files. It could be months before I see the problem again. CheersFrancis -From: "Tilman Hausherr" To: users@pdfbox.apa

Re: Question about JBIG2ImageReader usage

2019-09-23 Thread Tilman Hausherr
witter <https://twitter.com/trumpetinc> On Fri, Sep 20, 2019 at 9:28 PM Tilman Hausherr wrote: I wonder if the PDF can be displayed with PDFDebugger. If no => bug. If yes, then you should debug this to see what calls are done, and whether you have the same data input. Your calls seem to be OK,

Re: Rogue characters corrupting AcroForm object

2019-09-23 Thread Tilman Hausherr
Hi, The current version is 2.0.17. Please retry with that one and upload screenshots and PDF on a sharehoster. Tilman Am 23.09.2019 um 21:09 schrieb francisgra...@ca.rr.com: Hi, I've used PDFBox (2.0.7) to develop an interactive PDF document. When complete, the user uploads the PDF to an

Re: Question about JBIG2ImageReader usage

2019-09-20 Thread Tilman Hausherr
I wonder if the PDF can be displayed with PDFDebugger. If no => bug. If yes, then you should debug this to see what calls are done, and whether you have the same data input. Your calls seem to be OK, they look similar to those I did when I debugged something in the jbig2 reader (link is before

Re: Finding a Box containing text

2019-09-20 Thread Tilman Hausherr
Am 20.09.2019 um 10:07 schrieb STAMPF Lukas: but I had to realize that I don’t know PDF Operators well enough to locate the box. Please have a look at this answer https://stackoverflow.com/questions/38931422/ it shows how to catch the lines of a PDF. Tilman

Re: Need Help on Check Box Background Color

2019-09-20 Thread Tilman Hausherr
Please have a look at the CreateCheckBox.java example. This is done with the PDAppearanceCharacteristicsDictionary: PDAppearanceCharacteristicsDictionary appearanceCharacteristics = new PDAppearanceCharacteristicsDictionary(new COSDictionary()); appearanceCharacteristics.setBorderColour(new

Re: warning „Using fallback font LiberationSans“ when using the 14 standard fonts for pdf creation only

2019-09-16 Thread Tilman Hausherr
Am 16.09.2019 um 14:35 schrieb Jens Hohmuth: Hi, we‘re using PdfBox 2.x successfully to create PDF documents. We only use the 14 Adobe Standard fonts which are not embedded in the PDF. Everything works well. However, in the server environment we see a couple of the following warnings: WARN

Re: Problem when adding images

2019-09-15 Thread Tilman Hausherr
Am 15.09.2019 um 07:53 schrieb Martin:       InputStream is = file.getContent().getInputStream();   BufferedImage awtImage = ImageIO.read( is );

Re: Problem when adding images

2019-09-15 Thread Tilman Hausherr
Am 15.09.2019 um 07:53 schrieb Martin: when I paint an image on a pdfpage where a image already exists in the template pdf the image from the template get’s invisible after reading this sentence again I think I understand the problem - you wanted to keep what existed before. Please change

Re: Problem when adding images

2019-09-15 Thread Tilman Hausherr
Please share the source and result PDFs. Also check whether you had any log messages. And output "awtImage" to see what its height and width are. Also check whether the image contains anything. Tilman Am 15.09.2019 um 07:53 schrieb Martin: Hi, hope you can help me fixing the following

Re: Hardware Acceleration

2019-09-03 Thread Tilman Hausherr
Some more ideas: - I have about 30 files that are slow or were slow (but many still are), if you like to investigate for "opportunities" - the one I mentioned yesterday renders in better speed with PDF.js. I've never heard that they use the GPU, usually they are faster than us - video about PDF

Re: Java Docs for PDFBox 2.0.0 & 2.0.16

2019-09-03 Thread Tilman Hausherr
Am 04.09.2019 um 01:52 schrieb Joseph McCay: I have searched online with Google, and I searched the website. I can't find any Javadoc downloads. Where can I get the Javadocs? Highest one I found is 2.0.8. https://pdfbox.apache.org/docs/2.0.8/javadocs/ There are no javadoc downloads as jar

Re: Detect wether pdf contains JBIG2 encoded images

2019-09-03 Thread Tilman Hausherr
Am 03.09.2019 um 13:07 schrieb Peter Pinnau: Is there an easy way to determine if a pdf contains any JBIG2, JPEG2000 ... images. I mean all image types which are not supported by plain pdfbox renderer. It would be sufficient to get that information from the first page. I know that I could

Re: Font replacement

2019-08-21 Thread Tilman Hausherr
Am 21.08.2019 um 12:53 schrieb Евгений Король: Hello. I would like to replace all unknown fonts in PDF to standard fonts (installed in Windows). How i can do it? Probably not at all. This would be extremely difficult, you can't just replace them. The codes used in a PDF are not always the

Re: Multithreaded PDF Rendering

2019-08-14 Thread Tilman Hausherr
Am 14.08.2019 um 16:36 schrieb STAMPF Lukas: Hello, After seemingly random errors, I stumbled upon

Re: PDAnnotationRubberStamp

2019-08-06 Thread Tilman Hausherr
It's not implemented. Try "SBApproved". That one is supported by Adobe Reader, it will draw the stamp. Tilman Am 06.08.2019 um 12:17 schrieb Jens Bruhn-Hansel: Hi, using PDFBox 2.0.16 I am trying to add a simple (no image) RubberStamp annotation to a PDF document. My current code is:    

<    3   4   5   6   7   8   9   10   11   12   >