[jira] [Commented] (PDFBOX-5454) Add a setter for `PageDrawer.graphics` member variable

2022-06-16 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555356#comment-17555356 ] Tilman Hausherr commented on PDFBOX-5454: - Your initial suggestion sounded like a good idea, I

[jira] [Commented] (PDFBOX-4892) Improve code quality (4)

2022-06-16 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555225#comment-17555225 ] ASF subversion and git services commented on PDFBOX-4892: - Commit

[jira] [Commented] (PDFBOX-4892) Improve code quality (4)

2022-06-16 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555210#comment-17555210 ] ASF subversion and git services commented on PDFBOX-4892: - Commit 1901986 from

[jira] [Commented] (PDFBOX-4892) Improve code quality (4)

2022-06-16 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-4892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555209#comment-17555209 ] ASF subversion and git services commented on PDFBOX-4892: - Commit 1901985 from

[jira] [Comment Edited] (PDFBOX-5460) Deadlock in TrueTypeFont and RAFDataStream

2022-06-16 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17554892#comment-17554892 ] Tilman Hausherr edited comment on PDFBOX-5460 at 6/16/22 4:18 PM: --

[jira] [Commented] (PDFBOX-5460) Deadlock in TrueTypeFont and RAFDataStream

2022-06-16 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555178#comment-17555178 ] ASF subversion and git services commented on PDFBOX-5460: - Commit 1901984 from

[jira] [Commented] (PDFBOX-5460) Deadlock in TrueTypeFont and RAFDataStream

2022-06-16 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555177#comment-17555177 ] ASF subversion and git services commented on PDFBOX-5460: - Commit 1901983 from

Re: text extraction regression tests for 3.x?

2022-06-16 Thread Tilman Hausherr
Am 15.06.2022 um 12:19 schrieb Tim Allison: Reports are here: https://corpora.tika.apache.org/base/reports/pdfbox-3-20220614.tgz govdocs1/372/372582.pdf commoncrawl3/KH/KHDACXIPFMWP632LZ3S4TRRSZPDGHGM5 commoncrawl3/VN/VNCWMY6Y4C3XYWA65CQPPSNZSY6OQEEA have lost text. But the first one is a

[jira] [Commented] (PDFBOX-5460) Deadlock in TrueTypeFont and RAFDataStream

2022-06-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5460?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555022#comment-17555022 ] Andreas Lehmkühler commented on PDFBOX-5460: [~Ram Lakshmanan] Just out of curiosity: what

Re: text extraction regression tests for 3.x?

2022-06-16 Thread Andreas Lehmkuehler
Am 15.06.22 um 13:07 schrieb Tim Allison: In "parse_time_millis_details.xlsx", there are some that took much longer in 3.x during the multithreaded run but do not show much of a difference singlethreaded...likely accidents of resources available at parse time. Overall, the sum of processing

Re: text extraction regression tests for 3.x?

2022-06-16 Thread Andreas Lehmkuehler
Am 15.06.22 um 12:19 schrieb Tim Allison: Reports are here: https://corpora.tika.apache.org/base/reports/pdfbox-3-20220614.tgz @Tim thanks again Looks like there aren't any new exceptions in 3.0.0 at all, ergo we are good to target a new release :-) Andreas On Mon, Jun 13, 2022 at 4:54

[jira] [Commented] (PDFBOX-5451) Avoid copying byte array for COSString

2022-06-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17555003#comment-17555003 ] Andreas Lehmkühler commented on PDFBOX-5451: [~msahyoun] My first idea was to make COSString