There was a case[1] where we found that PDDocument.load(input, true) /* second parameter is to force parsing[2] */ threw an exception while not forcing it to load, didn't throw any exception. This is because there's some code which handles non-conforming PDFs which have garbage at the end of the file. With all of the code we now have to handle non-conforming PDFs in the normal load process, throwing an exception on load seems to be fairly rare. Forcing it to load doesn't seem to help things, and in this case, it actually backfired. We already deal with many corrupt pdf objects and continue parsing without this option.
Should we mark the force option as deprecated? It seems like the answer is "yes", but I wanted to bring it up for discussion before just going ahead and making the change. [1] https://issues.apache.org/jira/browse/PDFBOX-813 [2] http://pdfbox.apache.org/apidocs/org/apache/pdfbox/pdmodel/PDDocument.html#load%28java.net.URL,%20boolean%29 ---- Thanks, Adam ? Click here to submit conditions This email and any content within or attached hereto from Sun West Mortgage Company, Inc. is confidential and/or legally privileged. The information is intended only for the use of the individual or entity named on this email. If you are not the intended recipient, you are hereby notified that any disclosure, copying, distribution or the taking of any action in reliance on the contents of this email information is strictly prohibited, and that the documents should be returned to this office immediately by email. Receipt by anyone other than the intended recipient is not a waiver of any privilege. Please do not include your social security number, account number, or any other personal or financial information in the content of the email. Should you have any questions, please call (800) 453 7884.
