[jira] [Commented] (PDFBOX-3452) IOException at org.apache.pdfbox.pdfparser.BaseParser.readStringNumber

2020-11-02 Thread Yauheni Salopiy (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17224480#comment-17224480 ] Yauheni Salopiy commented on PDFBOX-3452: - Hi [~tilman], [~msahyoun], Thank You! Best

[jira] [Commented] (PDFBOX-3451) IOException at org.apache.pdfbox.pdfparser.BaseParser.readLong

2020-11-02 Thread Yauheni Salopiy (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-3451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17224478#comment-17224478 ] Yauheni Salopiy commented on PDFBOX-3451: - Hi [~tilman], [~msahyoun], Thank You! Best

[jira] [Commented] (PDFBOX-3449) NullPointerException at org.apache.pdfbox.pdmodel.PDPageTree.isPageTreeNode

2020-11-02 Thread Yauheni Salopiy (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-3449?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17224474#comment-17224474 ] Yauheni Salopiy commented on PDFBOX-3449: - Hi [~msahyoun], [~tilman], Thank You! Best Regards,

[jira] [Commented] (PDFBOX-3796) Content of different table cells concatenated on text extraction in some cases

2017-05-19 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3796?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16017671#comment-16017671 ] Yauheni Salopiy commented on PDFBOX-3796: - Hi [~tilman], Thank You. We are using Apache Tika for

[jira] [Updated] (PDFBOX-3796) Content of different table cells concatenated on text extraction in some cases

2017-05-18 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3796: Description: Content of different table cells concatenated on text extraction in some

[jira] [Updated] (PDFBOX-3796) Content of different table cells concatenated on text extraction in some cases

2017-05-18 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3796: Labels: table text_extraction (was: ) > Content of different table cells concatenated on

[jira] [Updated] (PDFBOX-3796) Content of different table cells concatenated on text extraction in some cases

2017-05-18 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3796: Description: Content of different table cells concatenated on text extraction in some

[jira] [Updated] (PDFBOX-3796) Content of different table cells concatenated on text extraction in some cases

2017-05-18 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3796: Attachment: fdl_relpub_foi_dailyre0313172017_3.0.txt

[jira] [Updated] (PDFBOX-3796) Content of different table cells concatenated on text extraction in some cases

2017-05-18 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3796: Attachment: fdl_relpub_foi_dailyre0313172017.pdf > Content of different table cells

[jira] [Created] (PDFBOX-3796) Content of different table cells concatenated on text extraction in some cases

2017-05-18 Thread Yauheni Salopiy (JIRA)
Yauheni Salopiy created PDFBOX-3796: --- Summary: Content of different table cells concatenated on text extraction in some cases Key: PDFBOX-3796 URL: https://issues.apache.org/jira/browse/PDFBOX-3796

[jira] [Commented] (PDFBOX-3452) IOException at org.apache.pdfbox.pdfparser.BaseParser.readStringNumber

2016-08-03 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15406193#comment-15406193 ] Yauheni Salopiy commented on PDFBOX-3452: - [~tilman], You are right and such document are not

[jira] [Commented] (PDFBOX-3452) IOException at org.apache.pdfbox.pdfparser.BaseParser.readStringNumber

2016-08-03 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15405584#comment-15405584 ] Yauheni Salopiy commented on PDFBOX-3452: - Hi [~tilman], Thank You for the investigation. Is it

[jira] [Comment Edited] (PDFBOX-3451) IOException at org.apache.pdfbox.pdfparser.BaseParser.readLong

2016-08-03 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15405581#comment-15405581 ] Yauheni Salopiy edited comment on PDFBOX-3451 at 8/3/16 8:47 AM: - Hi

[jira] [Commented] (PDFBOX-3451) IOException at org.apache.pdfbox.pdfparser.BaseParser.readLong

2016-08-03 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15405581#comment-15405581 ] Yauheni Salopiy commented on PDFBOX-3451: - Hi [~tilman], Thank You for the investigation. Is it

[jira] [Commented] (PDFBOX-3448) NullPointerException at org.apache.pdfbox.pdmodel.common.COSArrayList.convertFloatCOSArrayToList

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15404332#comment-15404332 ] Yauheni Salopiy commented on PDFBOX-3448: - Hi [~tilman], Thank You very much! Best Regards,

[jira] [Commented] (PDFBOX-3450) ArrayIndexOutOfBoundsException at org.apache.fontbox.cmap.CMapParser.increment

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15404276#comment-15404276 ] Yauheni Salopiy commented on PDFBOX-3450: - Hi [~tilman], Thank You very much for efficiency :)

[jira] [Updated] (PDFBOX-3450) ArrayIndexOutOfBoundsException at org.apache.fontbox.cmap.CMapParser.increment

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3450: Component/s: Text extraction > ArrayIndexOutOfBoundsException at

[jira] [Updated] (PDFBOX-3448) NullPointerException at org.apache.pdfbox.pdmodel.common.COSArrayList.convertFloatCOSArrayToList

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3448: Component/s: Text extraction > NullPointerException at >

[jira] [Updated] (PDFBOX-3449) NullPointerException at org.apache.pdfbox.pdmodel.PDPageTree.isPageTreeNode

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3449: Component/s: Text extraction > NullPointerException at

[jira] [Updated] (PDFBOX-3452) IOException at org.apache.pdfbox.pdfparser.BaseParser.readStringNumber

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3452: Description: Apache Tika 1.14-SNAPSHOT (PDF Box 2.0.2) throws following exception on text

[jira] [Updated] (PDFBOX-3452) IOException at org.apache.pdfbox.pdfparser.BaseParser.readStringNumber

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3452?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3452: Attachment: 95s-0316-rpt0242-21-appendix-16-f-vol177.pdf

[jira] [Created] (PDFBOX-3452) IOException at org.apache.pdfbox.pdfparser.BaseParser.readStringNumber

2016-08-02 Thread Yauheni Salopiy (JIRA)
Yauheni Salopiy created PDFBOX-3452: --- Summary: IOException at org.apache.pdfbox.pdfparser.BaseParser.readStringNumber Key: PDFBOX-3452 URL: https://issues.apache.org/jira/browse/PDFBOX-3452

[jira] [Updated] (PDFBOX-3451) IOException at org.apache.pdfbox.pdfparser.BaseParser.readLong

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3451: Attachment: PDFBOX-3451_LOG.txt att3x1l.pdf > IOException at

[jira] [Updated] (PDFBOX-3451) IOException at org.apache.pdfbox.pdfparser.BaseParser.readLong

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3451: Description: Apache Tika 1.14-SNAPSHOT (PDF Box 2.0.2) throws following exception on text

[jira] [Created] (PDFBOX-3451) IOException at org.apache.pdfbox.pdfparser.BaseParser.readLong

2016-08-02 Thread Yauheni Salopiy (JIRA)
Yauheni Salopiy created PDFBOX-3451: --- Summary: IOException at org.apache.pdfbox.pdfparser.BaseParser.readLong Key: PDFBOX-3451 URL: https://issues.apache.org/jira/browse/PDFBOX-3451 Project: PDFBox

[jira] [Updated] (PDFBOX-3450) ArrayIndexOutOfBoundsException at org.apache.fontbox.cmap.CMapParser.increment

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3450: Component/s: FontBox > ArrayIndexOutOfBoundsException at

[jira] [Updated] (PDFBOX-3448) NullPointerException at org.apache.pdfbox.pdmodel.common.COSArrayList.convertFloatCOSArrayToList

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3448: Description: A number of valid PDF documents failing in Apache Tika 1.14-SNAPSHOT (PDF Box

[jira] [Updated] (PDFBOX-3449) NullPointerException at org.apache.pdfbox.pdmodel.PDPageTree.isPageTreeNode

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3449: Description: A number of valid PDF documents failing in Apache Tika 1.14-SNAPSHOT (PDF Box

[jira] [Updated] (PDFBOX-3450) ArrayIndexOutOfBoundsException at org.apache.fontbox.cmap.CMapParser.increment

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3450: Description: Apache Tika 1.14-SNAPSHOT (PDF Box 2.0.2) throws following exception on text

[jira] [Updated] (PDFBOX-3450) ArrayIndexOutOfBoundsException at org.apache.fontbox.cmap.CMapParser.increment

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3450: Attachment: PDFBOX-3450_LOG.txt xenazine_080213.pdf >

[jira] [Created] (PDFBOX-3450) ArrayIndexOutOfBoundsException at org.apache.fontbox.cmap.CMapParser.increment

2016-08-02 Thread Yauheni Salopiy (JIRA)
Yauheni Salopiy created PDFBOX-3450: --- Summary: ArrayIndexOutOfBoundsException at org.apache.fontbox.cmap.CMapParser.increment Key: PDFBOX-3450 URL: https://issues.apache.org/jira/browse/PDFBOX-3450

[jira] [Updated] (PDFBOX-3449) NullPointerException at org.apache.pdfbox.pdmodel.PDPageTree.isPageTreeNode

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3449: Attachment: R2521CP_20121112.pdf R2464CP_20121123.pdf

[jira] [Created] (PDFBOX-3449) NullPointerException at org.apache.pdfbox.pdmodel.PDPageTree.isPageTreeNode

2016-08-02 Thread Yauheni Salopiy (JIRA)
Yauheni Salopiy created PDFBOX-3449: --- Summary: NullPointerException at org.apache.pdfbox.pdmodel.PDPageTree.isPageTreeNode Key: PDFBOX-3449 URL: https://issues.apache.org/jira/browse/PDFBOX-3449

[jira] [Updated] (PDFBOX-3448) NullPointerException at org.apache.pdfbox.pdmodel.common.COSArrayList.convertFloatCOSArrayToList

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3448: Description: A number of PDF documents failing in Apache Tika 1.14-SNAPSHOT (PDF Box

[jira] [Updated] (PDFBOX-3448) NullPointerException at org.apache.pdfbox.pdmodel.common.COSArrayList.convertFloatCOSArrayToList

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3448: Attachment: PDFBOX-3448_LOG.txt > NullPointerException at >

[jira] [Updated] (PDFBOX-3448) NullPointerException at org.apache.pdfbox.pdmodel.common.COSArrayList.convertFloatCOSArrayToList

2016-08-02 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3448: Attachment: 130429hospauthalbanydoughccrequestadmiss.pdf

[jira] [Created] (PDFBOX-3448) NullPointerException at org.apache.pdfbox.pdmodel.common.COSArrayList.convertFloatCOSArrayToList

2016-08-02 Thread Yauheni Salopiy (JIRA)
Yauheni Salopiy created PDFBOX-3448: --- Summary: NullPointerException at org.apache.pdfbox.pdmodel.common.COSArrayList.convertFloatCOSArrayToList Key: PDFBOX-3448 URL:

[jira] [Comment Edited] (PDFBOX-3189) java.io.IOException is thrown from both NonSequentialPDFParser and PDFParser

2016-01-12 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15094640#comment-15094640 ] Yauheni Salopiy edited comment on PDFBOX-3189 at 1/12/16 7:48 PM: -- Hi

[jira] [Comment Edited] (PDFBOX-3189) java.io.IOException is thrown from both NonSequentialPDFParser and PDFParser

2016-01-12 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15094640#comment-15094640 ] Yauheni Salopiy edited comment on PDFBOX-3189 at 1/12/16 7:53 PM: -- Hi

[jira] [Commented] (PDFBOX-3189) java.io.IOException is thrown from both NonSequentialPDFParser and PDFParser

2016-01-12 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3189?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15094640#comment-15094640 ] Yauheni Salopiy commented on PDFBOX-3189: - Hi [~tilman], Thank You for Your investigation. Yes,

[jira] [Updated] (PDFBOX-3189) java.io.IOException is thrown from both NonSequentialPDFParser and PDFParser

2016-01-11 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3189: Attachment: PDFBOX-3189_StackTrace.txt obannual35_2015.pdf >

[jira] [Updated] (PDFBOX-3189) java.io.IOException is thrown from both NonSequentialPDFParser and PDFParser

2016-01-11 Thread Yauheni Salopiy (JIRA)
[ https://issues.apache.org/jira/browse/PDFBOX-3189?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yauheni Salopiy updated PDFBOX-3189: Description: On parsing of complex PDF document both NonSequentialPDFParser and PDFParser

[jira] [Created] (PDFBOX-3189) java.io.IOException is thrown from both NonSequentialPDFParser and PDFParser

2016-01-11 Thread Yauheni Salopiy (JIRA)
Yauheni Salopiy created PDFBOX-3189: --- Summary: java.io.IOException is thrown from both NonSequentialPDFParser and PDFParser Key: PDFBOX-3189 URL: https://issues.apache.org/jira/browse/PDFBOX-3189