Re: PDFBox 3.0.4 regression tests

2025-01-16 Thread Andreas Lehmkühler
Hi all, @Tilman thanks for the analysis Looks like we are ready to go for the next 3.0.x release. I'm planing to cut the 3.0.4 release next Monday if nobody objects ANderas Am 13.01.25 um 17:35 schrieb Tilman Hausherr: On 13.01.2025 14:23, Tilman Hausherr wrote: On 12.01.2025 16:52, Tilman

Re: [ANNOUNCE] Apache PDFBox 2.0.33 released

2025-01-16 Thread Andreas Lehmkühler
HI Craig, thanks for the pointer. I've fixed all mentioned links and the new version of the download page should be up an running by now. Cheers Andreas Am 17.01.25 um 01:50 schrieb Craig Russell: Dear Andreas, Congratulations on the release, and thank you for following announcement and d

Re: [ANNOUNCE] Apache PDFBox 2.0.33 released

2025-01-16 Thread Craig Russell
Dear Andreas, Congratulations on the release, and thank you for following announcement and download requirements. The download page https://pdfbox.apache.org/download.html needs a bit of work. The artifact links properly use closer.lua, but the links to KEYS, signatures, and checksums must be

[jira] [Closed] (PDFBOX-5865) Show ASN.1 decoded Contents for Signature-Dictionary

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5865?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5865. -- > Show ASN.1 decoded Contents for Signature-Dictionary > -

[jira] [Closed] (PDFBOX-5869) Checkstyle

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5869. -- > Checkstyle > -- > > Key: PDFBOX-5869 > URL: http

[jira] [Closed] (PDFBOX-5225) Flattening removes all annotations when widget annotation has no page

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5225?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5225. -- > Flattening removes all annotations when widget annotation has no page >

[jira] [Closed] (PDFBOX-5890) BDC sequence with resource reference instead of with MCID

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5890. -- > BDC sequence with resource reference instead of with MCID >

[jira] [Closed] (PDFBOX-5924) Icons of text annotations sometimes too large

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5924?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5924. -- > Icons of text annotations sometimes too large >

[jira] [Closed] (PDFBOX-5906) IOException when reading isolated "+"

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5906. -- > IOException when reading isolated "+" > - > >

[jira] [Closed] (PDFBOX-5025) BaseParser fails when a number is followed by a string starting with 'e'

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5025?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5025. -- > BaseParser fails when a number is followed by a string starting with 'e' > -

[jira] [Closed] (PDFBOX-5868) PDFBox not extracting text of non-latin languages(tamil, bengali) properly but adobe reader's save as text does

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5868?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5868. -- > PDFBox not extracting text of non-latin languages(tamil, bengali) properly > but adobe

JIRA notification flood

2025-01-16 Thread Andreas Lehmkühler
Hi, sorry for the noise. I've closed the JIRA tickets concerning 2.0.33 after the release and was pretty sure that I'd deactivated the mail notifications. Obviously something went wrong, most likely on my side. Andreas - To

[jira] [Closed] (PDFBOX-3774) Incorrectly extracted text (broken words)

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-3774?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-3774. -- > Incorrectly extracted text (broken words) > - >

[jira] [Closed] (PDFBOX-5932) NPE in PagePane.mouseMoved()

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5932?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5932. -- > NPE in PagePane.mouseMoved() > > > Key: PDF

[jira] [Closed] (PDFBOX-5657) SMaskInData not supported for JPX images

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5657?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5657. -- > SMaskInData not supported for JPX images > > >

[jira] [Closed] (PDFBOX-5307) Image lost on page render

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5307?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5307. -- > Image lost on page render > - > > Key: PDFBOX-53

[jira] [Closed] (PDFBOX-3690) Character positions shifted

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-3690?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-3690. -- > Character positions shifted > --- > > Key: PDFBO

[jira] [Closed] (PDFBOX-5874) Change Loglevel from Warn to info when rebuilding font cache

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5874. -- > Change Loglevel from Warn to info when rebuilding font cache > -

[jira] [Closed] (PDFBOX-5891) Empty constructor for PDViewerPreferences

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5891. -- > Empty constructor for PDViewerPreferences > - >

[jira] [Closed] (PDFBOX-4601) in AWS lambda pdf merge giving error as Error in pdf consolidation: Expected scratch file size of 196608 but found 192512

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-4601?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-4601. -- > in AWS lambda pdf merge giving error as Error in pdf consolidation: Expected > scratch

[jira] [Closed] (PDFBOX-4743) Long rendering time of fonts in a specific PDF

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-4743?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-4743. -- > Long rendering time of fonts in a specific PDF > ---

[jira] [Closed] (PDFBOX-5930) Add font name to PrintTextLocations

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5930?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5930. -- > Add font name to PrintTextLocations > --- > >

[jira] [Closed] (PDFBOX-5917) Particular PDF fails on renderImageWithDPI call

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5917. -- > Particular PDF fails on renderImageWithDPI call > --

[jira] [Closed] (PDFBOX-5929) Remove orphan annotations in structure tree

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5929. -- > Remove orphan annotations in structure tree > --

[jira] [Closed] (PDFBOX-5797) Kid Widget /DA is ignored in setDefaultAppearance() call

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5797?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5797. -- > Kid Widget /DA is ignored in setDefaultAppearance() call > -

[jira] [Closed] (PDFBOX-5884) Support OCG visibility expressions

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5884. -- > Support OCG visibility expressions > -- > >

[jira] [Closed] (PDFBOX-5914) Implement PDF 2.0 dash phase clarification (2)

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5914?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5914. -- > Implement PDF 2.0 dash phase clarification (2) > ---

[jira] [Closed] (PDFBOX-5928) Orphan page check doesn't check annotation destinations

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5928. -- > Orphan page check doesn't check annotation destinations > --

[jira] [Closed] (PDFBOX-5889) Support long values for COSInteger objects

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5889?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5889. -- > Support long values for COSInteger objects > --

[jira] [Closed] (PDFBOX-5852) Hi CPU and memory usage when converting a PDF with type 4 shading

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5852?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5852. -- > Hi CPU and memory usage when converting a PDF with type 4 shading >

[jira] [Closed] (PDFBOX-5911) Calculate dpi dynamically when printing with raster

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5911. -- > Calculate dpi dynamically when printing with raster > --

[jira] [Closed] (PDFBOX-5908) IllegalArgumentException: capacity < 0: (-75475220 < 0) in RandomAccessReadBuffer constructor

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5908?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5908. -- > IllegalArgumentException: capacity < 0: (-75475220 < 0) in > RandomAccessReadBuffer con

[jira] [Closed] (PDFBOX-5913) FontBox spawns a `cmd` subprocess to read an environment variable (on Windows)

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5913. -- > FontBox spawns a `cmd` subprocess to read an environment variable (on Windows) > ---

[jira] [Closed] (PDFBOX-3345) Improve detection whether printing or viewing

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-3345?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-3345. -- > Improve detection whether printing or viewing >

[jira] [Closed] (PDFBOX-5892) Add check of /P to PDFMergerUtilityTest

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5892. -- > Add check of /P to PDFMergerUtilityTest > > >

[jira] [Closed] (PDFBOX-5920) PDType0Font return invalid space width

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5920. -- > PDType0Font return invalid space width > -- > >

[jira] [Closed] (PDFBOX-5882) The pattern created with PDFBox shows inconsistent colors between Safari and Adobe.

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5882?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5882. -- > The pattern created with PDFBox shows inconsistent colors between Safari and > Adobe. >

[jira] [Closed] (PDFBOX-5933) ArrayIndexOutOfBoundsException in CMap.toInt()

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5933?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5933. -- > ArrayIndexOutOfBoundsException in CMap.toInt() > ---

[jira] [Closed] (PDFBOX-5881) CVE for Lucene libraries

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5881. -- > CVE for Lucene libraries > > > Key: PDFBOX-5881

[jira] [Closed] (PDFBOX-5872) Support imageio-jnr / imageio-openjpeg library for JPEG2000 decoding

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5872?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5872. -- > Support imageio-jnr / imageio-openjpeg library for JPEG2000 decoding > -

[jira] [Closed] (PDFBOX-5887) Add page getter/setter to PDObjectReference

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5887?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5887. -- > Add page getter/setter to PDObjectReference > --

[jira] [Closed] (PDFBOX-5905) Many ZapfDingbats symbols do not appear when page is rendered.

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5905. -- > Many ZapfDingbats symbols do not appear when page is rendered. > ---

[jira] [Closed] (PDFBOX-5854) PDFBox is unable to remove ID

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5854. -- > PDFBox is unable to remove ID > - > > Key: P

[jira] [Closed] (PDFBOX-5880) PDF render blank page: The end of the stream doesn't point to the correct offset, using workaround to read the stream, stream start position: 196, length: 0, expected end

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5880. -- > PDF render blank page: The end of the stream doesn't point to the correct > offset, usi

[jira] [Closed] (PDFBOX-1529) Exchange hard-coded values for variables and provide command-line options in TextToPDF component

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-1529. -- > Exchange hard-coded values for variables and provide command-line options in > TextToPD

[jira] [Closed] (PDFBOX-5870) [PATCH] Detect CMYK image without relying on metadata

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5870. -- > [PATCH] Detect CMYK image without relying on metadata >

[jira] [Closed] (PDFBOX-5487) extra whitespaces when extracting Arabic text

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5487?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5487. -- > extra whitespaces when extracting Arabic text >

[jira] [Closed] (PDFBOX-5896) StackOverflowError in PDFieldFactory.findFieldType

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5896. -- > StackOverflowError in PDFieldFactory.findFieldType > ---

[jira] [Closed] (PDFBOX-4627) Wrong color of uncolored tiling pattern

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-4627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-4627. -- > Wrong color of uncolored tiling pattern > --- > >

[jira] [Closed] (PDFBOX-5879) Regression from PDFBOX-5841: Text extraction with rotation magic fails for PDF with multiple content streams in a page

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5879?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5879. -- > Regression from PDFBOX-5841: Text extraction with rotation magic fails for > PDF with m

[jira] [Closed] (PDFBOX-5866) Unable to load password protected pdf

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5866?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5866. -- > Unable to load password protected pdf > -- > >

[jira] [Closed] (PDFBOX-5895) support Markdown extraction from the command line

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5895. -- > support Markdown extraction from the command line >

[jira] [Closed] (PDFBOX-5900) ClassCastException in AnnotationValidator

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5900. -- > ClassCastException in AnnotationValidator > - >

[jira] [Closed] (PDFBOX-5902) The CPU usage of a PDF file with a size of 85.6 MB is abnormal

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5902. -- > The CPU usage of a PDF file with a size of 85.6 MB is abnormal > ---

[jira] [Closed] (PDFBOX-4718) OutOfMemoryError - during renderImageWithDPI

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-4718?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-4718. -- > OutOfMemoryError - during renderImageWithDPI > -

[jira] [Closed] (PDFBOX-5054) Type3 font is not rendered

2025-01-16 Thread Jira
[ https://issues.apache.org/jira/browse/PDFBOX-5054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andreas Lehmkühler closed PDFBOX-5054. -- > Type3 font is not rendered > -- > > Key: PDFBOX-

[RESULT][VOTE] Release Apache PDFBox 2.0.33

2025-01-16 Thread Andreas Lehmkühler
Am 13.01.25 um 17:26 schrieb Andreas Lehmkühler: Please vote on releasing this package as Apache PDFBox 2.0.33. +1 Tilman Hausherr +1 Maruan Sahyoun +1 Timo Boehme +1 Andreas Lehmkühler Thanks for your support and help!! I'm going to push the release out. Andreas

[jira] [Commented] (PDFBOX-5660) Improve code quality (5)

2025-01-16 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17913605#comment-17913605 ] ASF subversion and git services commented on PDFBOX-5660: - Commi

[jira] [Commented] (PDFBOX-5660) Improve code quality (5)

2025-01-16 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17913603#comment-17913603 ] ASF subversion and git services commented on PDFBOX-5660: - Commi

[jira] [Commented] (PDFBOX-5660) Improve code quality (5)

2025-01-16 Thread ASF subversion and git services (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5660?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17913602#comment-17913602 ] ASF subversion and git services commented on PDFBOX-5660: - Commi