[jira] [Comment Edited] (PDFBOX-4384) PDF/A Document Validation out of memory

2018-11-27 Thread Tilman Hausherr (JIRA)


[ 
https://issues.apache.org/jira/browse/PDFBOX-4384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16700645#comment-16700645
 ] 

Tilman Hausherr edited comment on PDFBOX-4384 at 11/27/18 4:43 PM:
---

This is now a different strategy, the page tree validation process is aborted. 
It is still possible that other errors appear after that. It is also possible 
that it doesn't abort because there is a super-complicated page. This is too 
difficult to change.

To configure, call {{document.getContext().getConfig().setMaxErrors(xxx)}}.

(snapshot available at link previously mentioned)


was (Author: tilman):
This is now a different strategy, the page tree validation process is aborted. 
It is still possible that other errors appear after that. It is also possible 
that it doesn't abort because there is a super-complicated page. This is too 
difficult to change.

To configure, call {{document.getContext().getConfig().setMaxErrors(xxx)}}.

(snapshot will be available soon)

> PDF/A Document Validation out of memory
> ---
>
> Key: PDFBOX-4384
> URL: https://issues.apache.org/jira/browse/PDFBOX-4384
> Project: PDFBox
>  Issue Type: Bug
>  Components: Preflight
>Affects Versions: 2.0.8, 2.0.12
>Reporter: Vincenzo Mangiapanello
>Priority: Major
>
> Hi everyone,
> validating a customer PDF file, using
> {code:java}
> document.validate(){code}
> we recognise that if the file itself has an enormous numbers of validation 
> errors, the process goes to OutOfMemory and at the end the we get the GC 
> error.
> In our case the file has more than 550.000 errors. So we cannot go head with 
> the conversion to PDF/A. 
> To avoid this kind of error it could be useful to configure a max number of 
> validation errors to stop the process if this value has been reached.
> We cannot attach the original document, because it is a customer's file.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Comment Edited] (PDFBOX-4384) PDF/A Document Validation out of memory

2018-11-27 Thread Tilman Hausherr (JIRA)


[ 
https://issues.apache.org/jira/browse/PDFBOX-4384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16700645#comment-16700645
 ] 

Tilman Hausherr edited comment on PDFBOX-4384 at 11/27/18 4:18 PM:
---

This is now a different strategy, the page tree validation process is aborted. 
It is still possible that other errors appear after that. It is also possible 
that it doesn't abort because there is a super-complicated page. This is too 
difficult to change.

To configure, call {{document.getContext().getConfig().setMaxErrors(xxx)}}.

(snapshot will be available soon)


was (Author: tilman):
This is now a different strategy, the page tree validation process is aborted. 
It is still possible that other errors appear after that. It is also possible 
that it doesn't abort because there is a super-complicated page. This is too 
difficult to change.

To configure, call {{document.getContext().getConfig().setMaxErrors(xxx)}}.

> PDF/A Document Validation out of memory
> ---
>
> Key: PDFBOX-4384
> URL: https://issues.apache.org/jira/browse/PDFBOX-4384
> Project: PDFBox
>  Issue Type: Bug
>  Components: Preflight
>Affects Versions: 2.0.8, 2.0.12
>Reporter: Vincenzo Mangiapanello
>Priority: Major
>
> Hi everyone,
> validating a customer PDF file, using
> {code:java}
> document.validate(){code}
> we recognise that if the file itself has an enormous numbers of validation 
> errors, the process goes to OutOfMemory and at the end the we get the GC 
> error.
> In our case the file has more than 550.000 errors. So we cannot go head with 
> the conversion to PDF/A. 
> To avoid this kind of error it could be useful to configure a max number of 
> validation errors to stop the process if this value has been reached.
> We cannot attach the original document, because it is a customer's file.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Comment Edited] (PDFBOX-4384) PDF/A Document Validation out of memory

2018-11-26 Thread Vincenzo Mangiapanello (JIRA)


[ 
https://issues.apache.org/jira/browse/PDFBOX-4384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16698681#comment-16698681
 ] 

Vincenzo Mangiapanello edited comment on PDFBOX-4384 at 11/26/18 9:49 AM:
--

Hi [~tilman],

the exception isn't raised, but the validation process isn't stop. So the file 
validation take an enormous time (at 25 minutes we stopped the process). 
 If is it possible, it would be useful to exit from the validation process if 
the maximum is reached. At the same time configuring the error limit it could 
be useful to optimise the validation according to the context where it is used.

Thank you for your time.

 


was (Author: burzillibus):
Hi [~tilman],

the exception isn't raised, but the validation process isn't stop. So the file 
validation take an enormous time (at 25 minutes we stopped the process). 
If it is possible, it would be useful to exit from the validation process if 
the maximum is reached. At the same time configuring the error limit it could 
be useful to optimise the validation according to the context where it is used.

Thank you for your time.

 

> PDF/A Document Validation out of memory
> ---
>
> Key: PDFBOX-4384
> URL: https://issues.apache.org/jira/browse/PDFBOX-4384
> Project: PDFBox
>  Issue Type: Bug
>  Components: Preflight
>Affects Versions: 2.0.8, 2.0.12
>Reporter: Vincenzo Mangiapanello
>Priority: Major
>
> Hi everyone,
> validating a customer PDF file, using
> {code:java}
> document.validate(){code}
> we recognise that if the file itself has an enormous numbers of validation 
> errors, the process goes to OutOfMemory and at the end the we get the GC 
> error.
> In our case the file has more than 550.000 errors. So we cannot go head with 
> the conversion to PDF/A. 
> To avoid this kind of error it could be useful to configure a max number of 
> validation errors to stop the process if this value has been reached.
> We cannot attach the original document, because it is a customer's file.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Comment Edited] (PDFBOX-4384) PDF/A Document Validation out of memory

2018-11-26 Thread Vincenzo Mangiapanello (JIRA)


[ 
https://issues.apache.org/jira/browse/PDFBOX-4384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16698681#comment-16698681
 ] 

Vincenzo Mangiapanello edited comment on PDFBOX-4384 at 11/26/18 9:49 AM:
--

Hi [~tilman],

the exception isn't raised, but the validation process isn't stop. So the file 
validation take an enormous time (at 25 minutes we stopped the process). 
 If it is possible, it would be useful to exit from the validation process if 
the maximum is reached. At the same time configuring the error limit it could 
be useful to optimise the validation according to the context where it is used.

Thank you for your time.

 


was (Author: burzillibus):
Hi [~tilman],

the exception isn't raised, but the validation process isn't stop. So the file 
validation take an enormous time (at 25 minutes we stopped the process). 
 If is it possible, it would be useful to exit from the validation process if 
the maximum is reached. At the same time configuring the error limit it could 
be useful to optimise the validation according to the context where it is used.

Thank you for your time.

 

> PDF/A Document Validation out of memory
> ---
>
> Key: PDFBOX-4384
> URL: https://issues.apache.org/jira/browse/PDFBOX-4384
> Project: PDFBox
>  Issue Type: Bug
>  Components: Preflight
>Affects Versions: 2.0.8, 2.0.12
>Reporter: Vincenzo Mangiapanello
>Priority: Major
>
> Hi everyone,
> validating a customer PDF file, using
> {code:java}
> document.validate(){code}
> we recognise that if the file itself has an enormous numbers of validation 
> errors, the process goes to OutOfMemory and at the end the we get the GC 
> error.
> In our case the file has more than 550.000 errors. So we cannot go head with 
> the conversion to PDF/A. 
> To avoid this kind of error it could be useful to configure a max number of 
> validation errors to stop the process if this value has been reached.
> We cannot attach the original document, because it is a customer's file.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org