[jira] [Commented] (PDFBOX-5682) Long/permanent hang in PDFBox 3.x

2023-09-18 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17766567#comment-17766567 ] Tim Allison commented on PDFBOX-5682: - Wow. Thank you! > Long/permanent hang in PDFBox 3.x >

[jira] [Comment Edited] (PDFBOX-5682) Long/permanent hang in PDFBox 3.x

2023-09-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17764228#comment-17764228 ] Tim Allison edited comment on PDFBOX-5682 at 9/12/23 2:41 PM: -- This is the

[jira] [Comment Edited] (PDFBOX-5682) Long/permanent hang in PDFBox 3.x

2023-09-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17764225#comment-17764225 ] Tim Allison edited comment on PDFBOX-5682 at 9/12/23 2:41 PM: -- Thank you,

[jira] [Commented] (PDFBOX-5682) Long/permanent hang in PDFBox 3.x

2023-09-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17764228#comment-17764228 ] Tim Allison commented on PDFBOX-5682: - This is the part from that document that is, erm,

[jira] [Commented] (PDFBOX-5682) Long/permanent hang in PDFBox 3.x

2023-09-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17764225#comment-17764225 ] Tim Allison commented on PDFBOX-5682: - Thank you, [~lehmi]. In Tika, we initially copied PDFBox's

[jira] [Commented] (PDFBOX-5682) Long/permanent hang in PDFBox 3.x

2023-09-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17763903#comment-17763903 ] Tim Allison commented on PDFBOX-5682: - Both files spend quite a bit of time in

[jira] [Commented] (PDFBOX-5682) Long/permanent hang in PDFBox 3.x

2023-09-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17763904#comment-17763904 ] Tim Allison commented on PDFBOX-5682: - It looks like that causes a full parse of the file? >

[jira] [Updated] (PDFBOX-5682) Long/permanent hang in PDFBox 3.x

2023-09-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5682?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5682: Summary: Long/permanent hang in PDFBox 3.x (was: Long/permanent hang i n PDFBox 3.x) >

[jira] [Created] (PDFBOX-5682) Long/permanent hang i n PDFBox 3.x

2023-09-11 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5682: --- Summary: Long/permanent hang i n PDFBox 3.x Key: PDFBOX-5682 URL: https://issues.apache.org/jira/browse/PDFBOX-5682 Project: PDFBox Issue Type: Bug

[jira] [Commented] (PDFBOX-5681) ConcurrentModificationException in getObjectsByType() in 3.x

2023-09-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17763759#comment-17763759 ] Tim Allison commented on PDFBOX-5681: - When I run the demo code in PDFBox trunk with logging on, I

[jira] [Commented] (PDFBOX-5681) ConcurrentModificationException in getObjectsByType() in 3.x

2023-09-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5681?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17763754#comment-17763754 ] Tim Allison commented on PDFBOX-5681: - I initially thought this was a threading issue, but it isn't.

[jira] [Created] (PDFBOX-5681) ConcurrentModificationException in getObjectsByType() in 3.x

2023-09-11 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5681: --- Summary: ConcurrentModificationException in getObjectsByType() in 3.x Key: PDFBOX-5681 URL: https://issues.apache.org/jira/browse/PDFBOX-5681 Project: PDFBox

[jira] [Updated] (PDFBOX-5681) ConcurrentModificationException in getObjectsByType() in 3.x

2023-09-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5681: Affects Version/s: 3.0.0 PDFBox > ConcurrentModificationException in getObjectsByType() in 3.x >

[jira] [Updated] (PDFBOX-5681) ConcurrentModificationException in getObjectsByType() in 3.x

2023-09-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5681: Description: [~tilman]'s regression testing turned up this exception when we integrate PDFBox

[jira] [Updated] (PDFBOX-5681) ConcurrentModificationException in getObjectsByType() in 3.x

2023-09-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5681?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5681: Issue Type: Bug (was: Task) > ConcurrentModificationException in getObjectsByType() in 3.x >

[jira] [Updated] (PDFBOX-5595) Slight regression on corrupt bug tracker file

2023-05-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5595?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5595: Description: I'm not sure this is a regression, and apologies if you already dealt with this

[jira] [Created] (PDFBOX-5595) Slight regression on corrupt bug tracker file

2023-05-05 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5595: --- Summary: Slight regression on corrupt bug tracker file Key: PDFBOX-5595 URL: https://issues.apache.org/jira/browse/PDFBOX-5595 Project: PDFBox Issue Type:

[jira] [Updated] (PDFBOX-5550) reduce number of open files

2022-12-05 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5550?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5550: Summary: reduce number of open files (was: redcuce number of open files) > reduce number of open

[jira] [Commented] (PDFBOX-5540) export:text creates jibberish / malformed output

2022-11-17 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5540?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17635337#comment-17635337 ] Tim Allison commented on PDFBOX-5540: - Should I kick that off now? > export:text creates jibberish

[jira] [Commented] (PDFBOX-5501) Jempbox is slow on xmp with large event histories

2022-09-10 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5501?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17602789#comment-17602789 ] Tim Allison commented on PDFBOX-5501: - Thank you! > Jempbox is slow on xmp with large event

[jira] [Resolved] (PDFBOX-5501) Jempbox is slow on xmp with large event histories

2022-09-08 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5501?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved PDFBOX-5501. - Resolution: Not A Problem Y. I just also confirmed that this is fixed in 1.8.17-SNAPSHOT.

[jira] [Created] (PDFBOX-5501) Jempbox is slow on xmp with large event histories

2022-09-08 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5501: --- Summary: Jempbox is slow on xmp with large event histories Key: PDFBOX-5501 URL: https://issues.apache.org/jira/browse/PDFBOX-5501 Project: PDFBox Issue Type:

[jira] [Commented] (PDFBOX-5490) Add reconstruction information to the PDDocument

2022-08-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17578904#comment-17578904 ] Tim Allison commented on PDFBOX-5490: - Y. Completely understand. I don't want to impede 3.0.0.

[jira] [Commented] (PDFBOX-5490) Add reconstruction information to the PDDocument

2022-08-11 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17578510#comment-17578510 ] Tim Allison commented on PDFBOX-5490: - My initial request would be for whether or not the xref table

[jira] [Commented] (PDFBOX-5490) Add reconstruction information to the PDDocument

2022-08-10 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17578129#comment-17578129 ] Tim Allison commented on PDFBOX-5490: - Oh, that looks great. > Add reconstruction information to

[jira] [Commented] (PDFBOX-5490) Add reconstruction information to the PDDocument

2022-08-10 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5490?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17578055#comment-17578055 ] Tim Allison commented on PDFBOX-5490: - A Listener would be great. Any mechanism that would allow

[jira] [Updated] (PDFBOX-5490) Add reconstruction information to the PDDocument

2022-08-10 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5490?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5490: Component/s: Parsing > Add reconstruction information to the PDDocument >

[jira] [Created] (PDFBOX-5490) Add reconstruction information to the PDDocument

2022-08-10 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5490: --- Summary: Add reconstruction information to the PDDocument Key: PDFBOX-5490 URL: https://issues.apache.org/jira/browse/PDFBOX-5490 Project: PDFBox Issue Type:

[jira] [Updated] (PDFBOX-5431) New NPE in xmpbox parser in trunk

2022-05-10 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5431: Description: I noticed a new NPE in one of our test files on Tika when I recently built PDFBox's

[jira] [Updated] (PDFBOX-5431) New NPE in xmpbox parser in trunk

2022-05-10 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5431: Component/s: XmpBox > New NPE in xmpbox parser in trunk > - > >

[jira] [Updated] (PDFBOX-5431) New NPE in xmpbox parser in trunk

2022-05-10 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5431?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5431: Affects Version/s: 3.0.0 PDFBox > New NPE in xmpbox parser in trunk >

[jira] [Created] (PDFBOX-5431) New NPE in xmpbox parser in trunk

2022-05-10 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5431: --- Summary: New NPE in xmpbox parser in trunk Key: PDFBOX-5431 URL: https://issues.apache.org/jira/browse/PDFBOX-5431 Project: PDFBox Issue Type: Task

[jira] [Commented] (PDFBOX-5415) Infinite loop in ExtractText in 2.x branch on a specific pdf

2022-04-14 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17522531#comment-17522531 ] Tim Allison commented on PDFBOX-5415: - An answer on the Tika side. Yes, parsing is dangerous and

[jira] [Commented] (PDFBOX-5415) Infinite loop in ExtractText in 2.x branch on a specific pdf

2022-04-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17521382#comment-17521382 ] Tim Allison commented on PDFBOX-5415: - Michael Demey's diagnosis:

[jira] [Updated] (PDFBOX-5415) Infinite loop in ExtractText in 2.x branch on a specific pdf

2022-04-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5415: Affects Version/s: 2.0.26 > Infinite loop in ExtractText in 2.x branch on a specific pdf >

[jira] [Updated] (PDFBOX-5415) Infinite loop in ExtractText in 2.x branch on a specific pdf

2022-04-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5415?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5415: Component/s: Parsing > Infinite loop in ExtractText in 2.x branch on a specific pdf >

[jira] [Created] (PDFBOX-5415) Infinite loop in ExtractText in 2.x branch on a specific pdf

2022-04-12 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5415: --- Summary: Infinite loop in ExtractText in 2.x branch on a specific pdf Key: PDFBOX-5415 URL: https://issues.apache.org/jira/browse/PDFBOX-5415 Project: PDFBox

[jira] [Resolved] (PDFBOX-5396) Add maven enforcer rule to ensure that JAVA_HOME is set

2022-04-07 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison resolved PDFBOX-5396. - Fix Version/s: 2.0.26 Resolution: Fixed > Add maven enforcer rule to ensure that

[jira] [Commented] (PDFBOX-5401) A carefully crafted pdf can trigger an infinite loop while parsing

2022-03-25 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17512474#comment-17512474 ] Tim Allison commented on PDFBOX-5401: - bq. Hi, I didn't test these samples on PDFBOX 2.0 Sorry, my

[jira] [Comment Edited] (PDFBOX-5401) A carefully crafted pdf can trigger an infinite loop while parsing

2022-03-25 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17512397#comment-17512397 ] Tim Allison edited comment on PDFBOX-5401 at 3/25/22, 4:38 PM: --- I

[jira] [Comment Edited] (PDFBOX-5401) A carefully crafted pdf can trigger an infinite loop while parsing

2022-03-25 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17512397#comment-17512397 ] Tim Allison edited comment on PDFBOX-5401 at 3/25/22, 2:07 PM: --- Can

[jira] [Commented] (PDFBOX-5401) A carefully crafted pdf can trigger an infinite loop while parsing

2022-03-25 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5401?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17512397#comment-17512397 ] Tim Allison commented on PDFBOX-5401: - Can confirm behavior with the last 2.0.26-SNAPSHOT I used for

[jira] [Commented] (PDFBOX-5396) Add maven enforcer rule to ensure that JAVA_HOME is set

2022-03-21 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5396?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17509892#comment-17509892 ] Tim Allison commented on PDFBOX-5396: - This is not a problem in trunk. > Add maven enforcer rule to

[jira] [Updated] (PDFBOX-5396) Add maven enforcer rule to ensure that JAVA_HOME is set

2022-03-21 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5396?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5396: Description: I recently stubbed my toe on this one again. At least in the 2.x branch, the

[jira] [Created] (PDFBOX-5396) Add maven enforcer rule to ensure that JAVA_HOME is set

2022-03-21 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5396: --- Summary: Add maven enforcer rule to ensure that JAVA_HOME is set Key: PDFBOX-5396 URL: https://issues.apache.org/jira/browse/PDFBOX-5396 Project: PDFBox Issue

[jira] [Created] (PDFBOX-5358) Add support for UTF-8 in strings

2022-01-06 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5358: --- Summary: Add support for UTF-8 in strings Key: PDFBOX-5358 URL: https://issues.apache.org/jira/browse/PDFBOX-5358 Project: PDFBox Issue Type: Improvement

[jira] [Commented] (PDFBOX-5164) Create portable collection PDF

2021-04-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17326042#comment-17326042 ] Tim Allison commented on PDFBOX-5164: - Thank you, [~tilman]! > Create portable collection PDF >

[jira] [Commented] (PDFBOX-5164) Create portable collection PDF

2021-04-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5164?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17325972#comment-17325972 ] Tim Allison commented on PDFBOX-5164: - Sorry to hijack this, but I wanted to confirm with

[jira] [Updated] (PDFBOX-5164) Create portable collection PDF

2021-04-20 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5164: Attachment: tika-output.json > Create portable collection PDF > -- >

[jira] [Commented] (PDFBOX-5166) Implement RichMedia annotation

2021-04-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17324082#comment-17324082 ] Tim Allison commented on PDFBOX-5166: - Ha @bitsgalore has an example of subtype=Screen. Yay!

[jira] [Commented] (PDFBOX-5166) Implement RichMedia annotation

2021-04-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17324048#comment-17324048 ] Tim Allison commented on PDFBOX-5166: - Are those also streams in subtype=RichMedia or do we need to

[jira] [Comment Edited] (PDFBOX-5166) Implement RichMedia annotation

2021-04-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17324002#comment-17324002 ] Tim Allison edited comment on PDFBOX-5166 at 4/16/21, 6:07 PM: --- Extraction

[jira] [Commented] (PDFBOX-5166) Implement RichMedia annotation

2021-04-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17324002#comment-17324002 ] Tim Allison commented on PDFBOX-5166: - Extraction only, yes...for our purposes on Tika, we wouldn't

[jira] [Updated] (PDFBOX-5166) Implement RichMedia annotation

2021-04-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5166: Issue Type: New Feature (was: Task) > Implement RichMedia annotation >

[jira] [Comment Edited] (PDFBOX-5165) Exceedingly slow processing of XMPSchemaMediaManagement's getHistory in JempBox

2021-04-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17323831#comment-17323831 ] Tim Allison edited comment on PDFBOX-5165 at 4/16/21, 1:52 PM: --- Thank you

[jira] [Commented] (PDFBOX-5165) Exceedingly slow processing of XMPSchemaMediaManagement's getHistory in JempBox

2021-04-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17323831#comment-17323831 ] Tim Allison commented on PDFBOX-5165: - Unless there are needs on other projects, we have no

[jira] [Updated] (PDFBOX-5166) Implement RichMedia annotation

2021-04-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5166?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5166: Priority: Minor (was: Major) > Implement RichMedia annotation > -- >

[jira] [Commented] (PDFBOX-5166) Implement RichMedia annotation

2021-04-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5166?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17323809#comment-17323809 ] Tim Allison commented on PDFBOX-5166: - Completely unsurprisingly, [~tilman] has already shown how to

[jira] [Created] (PDFBOX-5166) Implement RichMedia annotation

2021-04-16 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5166: --- Summary: Implement RichMedia annotation Key: PDFBOX-5166 URL: https://issues.apache.org/jira/browse/PDFBOX-5166 Project: PDFBox Issue Type: Task

[jira] [Commented] (PDFBOX-5165) Exceedingly slow processing of XMPSchemaMediaManagement's getHistory in JempBox

2021-04-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5165?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17322323#comment-17322323 ] Tim Allison commented on PDFBOX-5165: - I realize that Jempbox is out dated, but we're still using it

[jira] [Created] (PDFBOX-5165) Exceedingly slow processing of XMPSchemaMediaManagement

2021-04-15 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5165: --- Summary: Exceedingly slow processing of XMPSchemaMediaManagement Key: PDFBOX-5165 URL: https://issues.apache.org/jira/browse/PDFBOX-5165 Project: PDFBox Issue

[jira] [Updated] (PDFBOX-5165) Exceedingly slow processing of XMPSchemaMediaManagement's getHistory in JempBox

2021-04-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5165: Summary: Exceedingly slow processing of XMPSchemaMediaManagement's getHistory in JempBox (was:

[jira] [Comment Edited] (PDFBOX-5158) Infinite loop on corrupted PDF in 3.0.0-SNAPSHOT

2021-04-09 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17317514#comment-17317514 ] Tim Allison edited comment on PDFBOX-5158 at 4/9/21, 1:36 PM: -- Which in

[jira] [Commented] (PDFBOX-5158) Infinite loop on corrupted PDF in 3.0.0-SNAPSHOT

2021-04-08 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17317514#comment-17317514 ] Tim Allison commented on PDFBOX-5158: - Which in turn led me to find a bug in Tika's integration with

[jira] [Commented] (PDFBOX-5158) Infinite loop on corrupted PDF in 3.0.0-SNAPSHOT

2021-04-08 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17317509#comment-17317509 ] Tim Allison commented on PDFBOX-5158: - Y, I get your stacktrace with a file, but I get an infinite

[jira] [Commented] (PDFBOX-5158) Infinite loop on corrupted PDF in 3.0.0-SNAPSHOT

2021-04-08 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17317499#comment-17317499 ] Tim Allison commented on PDFBOX-5158: - Hmmm...will try to replicate with pure PDFBox. Thank you! >

[jira] [Updated] (PDFBOX-5158) Infinite loop on corrupted PDF in 3.0.0-SNAPSHOT

2021-04-08 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5158: Description: I found a bunch of files that had a "read too many EOFs", which is a safety check

[jira] [Created] (PDFBOX-5158) Infinite loop on corrupted PDF in 3.0.0-SNAPSHOT

2021-04-08 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5158: --- Summary: Infinite loop on corrupted PDF in 3.0.0-SNAPSHOT Key: PDFBOX-5158 URL: https://issues.apache.org/jira/browse/PDFBOX-5158 Project: PDFBox Issue Type:

[jira] [Created] (PDFBOX-5153) New flatefilter exception on Tika unit test files with 3.0.0-RC1

2021-04-06 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5153: --- Summary: New flatefilter exception on Tika unit test files with 3.0.0-RC1 Key: PDFBOX-5153 URL: https://issues.apache.org/jira/browse/PDFBOX-5153 Project: PDFBox

[jira] [Commented] (PDFBOX-5128) Support parsing non standardized XMP

2021-03-17 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303522#comment-17303522 ] Tim Allison commented on PDFBOX-5128: - The process hasn't finished, but I'm dumping the files here:

[jira] [Comment Edited] (PDFBOX-5128) Support parsing non standardized XMP

2021-03-17 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303391#comment-17303391 ] Tim Allison edited comment on PDFBOX-5128 at 3/17/21, 1:01 PM: --- Side

[jira] [Updated] (PDFBOX-5128) Support parsing non standardized XMP

2021-03-17 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5128: Attachment: image-2021-03-17-09-00-57-653.png > Support parsing non standardized XMP >

[jira] [Commented] (PDFBOX-5128) Support parsing non standardized XMP

2021-03-17 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17303391#comment-17303391 ] Tim Allison commented on PDFBOX-5128: - Side note...I'm looking at the EOFs for my xmp byte scanner,

[jira] [Commented] (PDFBOX-5128) Support parsing non standardized XMP

2021-03-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302946#comment-17302946 ] Tim Allison commented on PDFBOX-5128: - [~msahyoun] ... does the attached look about right?  If so,

[jira] [Updated] (PDFBOX-5128) Support parsing non standardized XMP

2021-03-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5128: Attachment: PDFBOX.zip > Support parsing non standardized XMP >

[jira] [Commented] (PDFBOX-5133) Failing testFlattenPDFBox2469Filled on Ubuntu

2021-03-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302784#comment-17302784 ] Tim Allison commented on PDFBOX-5133: - +1 that's how I got the rest of the build to work on Ubuntu. 

[jira] [Commented] (PDFBOX-5133) Failing testFlattenPDFBox2469Filled on Ubuntu

2021-03-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302700#comment-17302700 ] Tim Allison commented on PDFBOX-5133: - [~msahyoun] failed the build on Ubuntu.  I had no problems

[jira] [Updated] (PDFBOX-5133) Failing testFlattenPDFBox2469Filled on Ubuntu

2021-03-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5133: Attachment: out-testPDF_acroForm.pdf-7.png-diff.png out-testPDF_acroForm.pdf-7.png

[jira] [Updated] (PDFBOX-5133) Failing testFlattenPDFBox2469Filled on Ubuntu

2021-03-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5133: Attachment: out-testPDF_acroForm.pdf > Failing testFlattenPDFBox2469Filled on Ubuntu >

[jira] [Commented] (PDFBOX-5133) Failing testFlattenPDFBox2469Filled on Ubuntu

2021-03-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5133?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17302596#comment-17302596 ] Tim Allison commented on PDFBOX-5133: - I _think_ I attached the right files to help with diagnosis. 

[jira] [Updated] (PDFBOX-5133) Failing testFlattenPDFBox2469Filled on Ubuntu

2021-03-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5133: Attachment: in-testPDF_acroForm.pdf-7.png > Failing testFlattenPDFBox2469Filled on Ubuntu >

[jira] [Updated] (PDFBOX-5133) Failing testFlattenPDFBox2469Filled on Ubuntu

2021-03-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5133: Attachment: (was: image-2021-03-16-10-57-14-639.png) > Failing testFlattenPDFBox2469Filled on

[jira] [Updated] (PDFBOX-5133) Failing testFlattenPDFBox2469Filled on Ubuntu

2021-03-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5133: Attachment: (was: image-2021-03-16-10-57-14-489.png) > Failing testFlattenPDFBox2469Filled on

[jira] [Updated] (PDFBOX-5133) Failing testFlattenPDFBox2469Filled on Ubuntu

2021-03-16 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-5133: Attachment: (was: testPDF_acroForm.pdf-7.png) > Failing testFlattenPDFBox2469Filled on Ubuntu

[jira] [Created] (PDFBOX-5133) Failing testFlattenPDFBox2469Filled on Ubuntu

2021-03-16 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5133: --- Summary: Failing testFlattenPDFBox2469Filled on Ubuntu Key: PDFBOX-5133 URL: https://issues.apache.org/jira/browse/PDFBOX-5133 Project: PDFBox Issue Type:

[jira] [Commented] (PDFBOX-5127) Multithreading issue in JempBox's DateConverter

2021-03-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5127?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17300589#comment-17300589 ] Tim Allison commented on PDFBOX-5127: - My personal pref would be to generate SimpleDateFormat

[jira] [Commented] (PDFBOX-5128) Support parsing non standardized XMP

2021-03-12 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5128?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17300365#comment-17300365 ] Tim Allison commented on PDFBOX-5128: - I’ll scrape xmp out of our regression corpus. I should retain

[jira] [Created] (PDFBOX-5127) Multithreading issue in JempBox's DateConverter

2021-03-12 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5127: --- Summary: Multithreading issue in JempBox's DateConverter Key: PDFBOX-5127 URL: https://issues.apache.org/jira/browse/PDFBOX-5127 Project: PDFBox Issue Type:

[jira] [Commented] (PDFBOX-3953) StackOverflowError in org.apache.pdfbox.pdmodel.PDPageTree.getKids

2020-11-04 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-3953?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17226417#comment-17226417 ] Tim Allison commented on PDFBOX-3953: - Related? > StackOverflowError in

[jira] [Created] (PDFBOX-5009) Corrupt PDF can lead to a StackOverflow

2020-11-04 Thread Tim Allison (Jira)
Tim Allison created PDFBOX-5009: --- Summary: Corrupt PDF can lead to a StackOverflow Key: PDFBOX-5009 URL: https://issues.apache.org/jira/browse/PDFBOX-5009 Project: PDFBox Issue Type: Task

[jira] [Commented] (PDFBOX-4623) COSParser: Infinite recursion

2020-02-14 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17037202#comment-17037202 ] Tim Allison commented on PDFBOX-4623: - Adding a page tree infinite loop. > COSParser: Infinite

[jira] [Comment Edited] (PDFBOX-4623) COSParser: Infinite recursion

2020-02-14 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17037202#comment-17037202 ] Tim Allison edited comment on PDFBOX-4623 at 2/14/20 6:51 PM: -- Adding a

[jira] [Updated] (PDFBOX-4623) COSParser: Infinite recursion

2020-02-14 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-4623?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tim Allison updated PDFBOX-4623: Attachment: loop_in_page_tree.pdf > COSParser: Infinite recursion > -

[jira] [Commented] (PDFBOX-4768) Unable to extract text from PDF

2020-02-07 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-4768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17032556#comment-17032556 ] Tim Allison commented on PDFBOX-4768: - To complement Tilman's points...qpdf complains about this

[jira] [Commented] (PDFBOX-4737) Text extraction is gibberish

2020-01-15 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-4737?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17016209#comment-17016209 ] Tim Allison commented on PDFBOX-4737: - The following reinforces points already made, I think. >On

[jira] [Commented] (PDFBOX-4549) No Unicode mapping

2020-01-08 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-4549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010747#comment-17010747 ] Tim Allison commented on PDFBOX-4549: - And then there's this gem on content masking attacks: 

[jira] [Commented] (PDFBOX-4549) No Unicode mapping

2020-01-08 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-4549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17010741#comment-17010741 ] Tim Allison commented on PDFBOX-4549: - These are good points [~mkl].  See e.g.:

[jira] [Commented] (PDFBOX-4549) No Unicode mapping

2020-01-06 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-4549?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17009133#comment-17009133 ] Tim Allison commented on PDFBOX-4549: - Perhaps tika-eval's out of vocabulary statistic?  Or

[jira] [Commented] (PDFBOX-4715) Need to add release version for maven-compiler-plugin

2019-12-19 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-4715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1763#comment-1763 ] Tim Allison commented on PDFBOX-4715: - {noformat} [ERROR] error: release version 6 not supported

[jira] [Commented] (PDFBOX-4715) Need to add release version for maven-compiler-plugin

2019-12-19 Thread Tim Allison (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-4715?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=1757#comment-1757 ] Tim Allison commented on PDFBOX-4715: - Added requireJavaVersion in 2.x branch.   > Need to add
