[jira] [Commented] (PDFBOX-3911) Handle new line characters in single line text fields

2017-09-24 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178250#comment-16178250
 ] 

ASF subversion and git services commented on PDFBOX-3911:
-

Commit 1809524 from [~msahyoun] in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1809524 ]

PDFBOX-3911: ignore line breaking and similar characters in single line text 
fields

> Handle new line characters in single line text fields
> -
>
> Key: PDFBOX-3911
> URL: https://issues.apache.org/jira/browse/PDFBOX-3911
> Project: PDFBox
>  Issue Type: Improvement
>  Components: AcroForm
>Affects Versions: 2.0.7, 3.0.0
>Reporter: Maruan Sahyoun
>Assignee: Maruan Sahyoun
>Priority: Minor
>  Labels: Appearance
> Fix For: 2.0.8, 3.0.0
>
>
> As discussed on the users mailing list Adobe will generate an appearance 
> stream for text containing new line characters in singe line fields. There 
> are two ways this is currently done
> - for manual data entry new line characters are ignored
> - when setting the data via an API and enforcing Adobe Reader to generate the 
> appearance stream multiline output will be achieved
> We'd need to decide which mode we'd like to replicate



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3911) Handle new line characters in single line text fields

2017-09-24 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3911?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178249#comment-16178249
 ] 

ASF subversion and git services commented on PDFBOX-3911:
-

Commit 1809523 from [~msahyoun] in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1809523 ]

PDFBOX-3911: ignore line breaking and similar characters in single line text 
fields

> Handle new line characters in single line text fields
> -
>
> Key: PDFBOX-3911
> URL: https://issues.apache.org/jira/browse/PDFBOX-3911
> Project: PDFBox
>  Issue Type: Improvement
>  Components: AcroForm
>Affects Versions: 2.0.7, 3.0.0
>Reporter: Maruan Sahyoun
>Assignee: Maruan Sahyoun
>Priority: Minor
>  Labels: Appearance
> Fix For: 2.0.8, 3.0.0
>
>
> As discussed on the users mailing list Adobe will generate an appearance 
> stream for text containing new line characters in singe line fields. There 
> are two ways this is currently done
> - for manual data entry new line characters are ignored
> - when setting the data via an API and enforcing Adobe Reader to generate the 
> appearance stream multiline output will be achieved
> We'd need to decide which mode we'd like to replicate



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Comment Edited] (PDFBOX-3906) Contributing the JBig2 ImageIO Plugin to PDFBox​

2017-09-24 Thread JIRA

[ 
https://issues.apache.org/jira/browse/PDFBOX-3906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178230#comment-16178230
 ] 

Andreas Lehmkühler edited comment on PDFBOX-3906 at 9/24/17 2:35 PM:
-

I've finally called for a 
[vote|https://lists.apache.org/thread.html/0a72b795cb22e461aa25faa043a266b4ad49c2e90f2d27a807d734d0@%3Cgeneral.incubator.apache.org%3E]
 (lazy consensus) on general@incubator to get the ip-clearance. Sorry for the 
delay.


was (Author: lehmi):
I've finally call for a 
[vote|https://lists.apache.org/thread.html/0a72b795cb22e461aa25faa043a266b4ad49c2e90f2d27a807d734d0@%3Cgeneral.incubator.apache.org%3E]
 (lazy consensus) on general@incubator to get the ip-clearance. Sorry for the 
delay.

> Contributing the JBig2 ImageIO Plugin to PDFBox​
> 
>
> Key: PDFBOX-3906
> URL: https://issues.apache.org/jira/browse/PDFBOX-3906
> Project: PDFBox
>  Issue Type: Task
>Reporter: Jörg Henne
> Attachments: jbig2-imageio.tgz, Re_JBIG2bitstreamTestfiles.eml
>
>
> Levigo solutions GmbH donates the Java ImageIO-Plugin for the JBIG2 to the 
> PDFBox project. The Plugin is currently hosted at 
> https://github.com/levigo/jbig2-imageio and has already been prepared for 
> integration. 
> The steps completed so far are:
> - ​IP vetting for contributions by non-levigo developers
> - Merging/application of all pending pull requests
> - Update of the project structure in anticipation of the new home:
> -- package names
> -- license headers
> -- license files
> -- README.md
> -- release notes
> -- Maven project information
> A tgz containing the source code has been attached:
> - It is based on commit 483aab3eb9bbc02f6995a637155adf6b922ed0c0 
> (https://github.com/levigo/jbig2-imageio/commit/483aab3eb9bbc02f6995a637155adf6b922ed0c0).
>  
> - Its SHA1 is 0e07111b4bf7f5a51bf0fdd903f02f082ea3bf65



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3906) Contributing the JBig2 ImageIO Plugin to PDFBox​

2017-09-24 Thread JIRA

[ 
https://issues.apache.org/jira/browse/PDFBOX-3906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178230#comment-16178230
 ] 

Andreas Lehmkühler commented on PDFBOX-3906:


I've finally call for a 
[vote|https://lists.apache.org/thread.html/0a72b795cb22e461aa25faa043a266b4ad49c2e90f2d27a807d734d0@%3Cgeneral.incubator.apache.org%3E]
 (lazy consensus) on general@incubator to get the ip-clearance. Sorry for the 
delay.

> Contributing the JBig2 ImageIO Plugin to PDFBox​
> 
>
> Key: PDFBOX-3906
> URL: https://issues.apache.org/jira/browse/PDFBOX-3906
> Project: PDFBox
>  Issue Type: Task
>Reporter: Jörg Henne
> Attachments: jbig2-imageio.tgz, Re_JBIG2bitstreamTestfiles.eml
>
>
> Levigo solutions GmbH donates the Java ImageIO-Plugin for the JBIG2 to the 
> PDFBox project. The Plugin is currently hosted at 
> https://github.com/levigo/jbig2-imageio and has already been prepared for 
> integration. 
> The steps completed so far are:
> - ​IP vetting for contributions by non-levigo developers
> - Merging/application of all pending pull requests
> - Update of the project structure in anticipation of the new home:
> -- package names
> -- license headers
> -- license files
> -- README.md
> -- release notes
> -- Maven project information
> A tgz containing the source code has been attached:
> - It is based on commit 483aab3eb9bbc02f6995a637155adf6b922ed0c0 
> (https://github.com/levigo/jbig2-imageio/commit/483aab3eb9bbc02f6995a637155adf6b922ed0c0).
>  
> - Its SHA1 is 0e07111b4bf7f5a51bf0fdd903f02f082ea3bf65



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Comment Edited] (PDFBOX-3906) Contributing the JBig2 ImageIO Plugin to PDFBox​

2017-09-24 Thread JIRA

[ 
https://issues.apache.org/jira/browse/PDFBOX-3906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16165073#comment-16165073
 ] 

Andreas Lehmkühler edited comment on PDFBOX-3906 at 9/24/17 1:59 PM:
-

I've did some more research. Here is the current status of the testfiles:

+official and public records from the US+
- 002.jb2, 006.jb2
- 2012311000*.jb2 (11 files in total)

Representations of public U.S. government documents should be in the public 
domain and could be included 

+Lewinsky+
- 003.jb2

One page from an [official 
report|https://www.gpo.gov/fdsys/pkg/GPO-CDOC-106sdoc3/pdf/GPO-CDOC-106sdoc3-2.pdf]
 concerning the Lewinsky scandal. Should be in the public domain as well

+University of Britisch Columbia+
- 007.jb2, 042_*.jb2 (23 files in total): all files contain the same single 
page from the CCITT spec)
- amb_1.jb2, amb_2.jb2: a dithered low resolution image of Ally McBeal aka 
Callista Flockhart

We have the written permission from the generator of these files to use them. 
All files contain some copyrighted stuff. As we are not interested in the 
"visual content" at all and IMHO no one else might find a use case to benefit 
from the "visual content" itself, it shall be safe to claim fair use for that 
files.
I'm going to create a JIRA-ticket to clarify that.

+Constitution+
- 005.jb2

Seems to be 1 page from an older version of "The Constitution of the United 
States of America, Analyis and Interpretation"
[Here|https://www.gpo.gov/fdsys/pkg/GPO-CONAN-REV-2016/content-detail.html] is 
a more recent version

+ITU+
- sampledata_page1.jb2
- sampledata_page2.jb2
- sampledata_page3.jb2

Bitstreams which are reproduced as hex-dumps within the T.88 specification 
document. The ITU grants us a license to use them but with a non-commercial 
restriction.
-I'm going to create a JIRA-ticket to see if that is an issue or if we can make 
an exception as these files are used for tests only.- I'm afraid it doesn't 
matter how we use those files, we must not include them unless the ITU grants 
us to use them at least for tests without any restriction. 

+Libertarianism: A Primer+
- 004.jb2

2 pages from "Libertarianism: A Primer" by David Boaz, a copyrighted work from 
1997. We might claim fair use as well, as these are just 2 pages from a 300 
pages book and we are not interested in the content itself. I'm going to create 
a JIRA-ticket to clarify that.

In the end there might be some files which must not be included in our repo. We 
have to decide wether we host them somewhere outside like the the isator files 
or if we simply omit them


was (Author: lehmi):
I've did some more research. Here is the current status of the testfiles:

+official and public records from the US+
- 002.jb2, 006.jb2
- 2012311000*.jb2 (11 files in total)

Representations of public U.S. government documents should be in the public 
domain and could be included 

+Lewinsky+
- 003.jb2

One page from an [official 
report|https://www.gpo.gov/fdsys/pkg/GPO-CDOC-106sdoc3/pdf/GPO-CDOC-106sdoc3-2.pdf]
 concerning the Lewinsky scandal. Should be in the public domain as well

+University of Britisch Columbia+
- 007.jb2, 042_*.jb2 (23 files in total): all files contain the same single 
page from the CCITT spec)
- amb_1.jb2, amb_2.jb2: a dithered low resolution image of Ally McBeal aka 
Callista Flockhart

We have the written permission from the generator of these files to use them. 
All files contain some copyrighted stuff. As we are not interested in the 
"visual content" at all and IMHO no one else might find a use case to benefit 
from the "visual content" itself, it shall be safe to claim fair use for that 
files.
I'm going to create a JIRA-ticket to clarify that.

+Constitution+
- 005.jb2

Seems to be 1 page from an older version of "The Constitution of the United 
States of America, Analyis and Interpretation"
[Here|https://www.gpo.gov/fdsys/pkg/GPO-CONAN-REV-2016/content-detail.html] is 
a more recent version

+ITU+
- sampledata_page1.jb2
- sampledata_page2.jb2
- sampledata_page3.jb2

Bitstreams which are reproduced as hex-dumps within the T.88 specification 
document. The ITU grants us a license to use them but with a non-commercial 
restriction.
I'm going to create a JIRA-ticket to see if that is an issue or if we can make 
an exception as these files are used for tests only. 

+Libertarianism: A Primer+
- 004.jb2

2 pages from "Libertarianism: A Primer" by David Boaz, a copyrighted work from 
1997. We might claim fair use as well, as these are just 2 pages from a 300 
pages book and we are not interested in the content itself. I'm going to create 
a JIRA-ticket to clarify that.

In the end there might be some files which must not be included in our repo. We 
have to decide wether we host them somewhere outside like the the isator files 
or if we simply omit them

> Contributing the JBig2 ImageIO Plugin to PDFBox​
> 

[jira] [Commented] (PDFBOX-3906) Contributing the JBig2 ImageIO Plugin to PDFBox​

2017-09-24 Thread JIRA

[ 
https://issues.apache.org/jira/browse/PDFBOX-3906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178213#comment-16178213
 ] 

Andreas Lehmkühler commented on PDFBOX-3906:


I've created LEGAL-331 to clarify if we are allowed to include the vcs history 
of the plugin and LEGAL-332 about the fair use issue.

> Contributing the JBig2 ImageIO Plugin to PDFBox​
> 
>
> Key: PDFBOX-3906
> URL: https://issues.apache.org/jira/browse/PDFBOX-3906
> Project: PDFBox
>  Issue Type: Task
>Reporter: Jörg Henne
> Attachments: jbig2-imageio.tgz, Re_JBIG2bitstreamTestfiles.eml
>
>
> Levigo solutions GmbH donates the Java ImageIO-Plugin for the JBIG2 to the 
> PDFBox project. The Plugin is currently hosted at 
> https://github.com/levigo/jbig2-imageio and has already been prepared for 
> integration. 
> The steps completed so far are:
> - ​IP vetting for contributions by non-levigo developers
> - Merging/application of all pending pull requests
> - Update of the project structure in anticipation of the new home:
> -- package names
> -- license headers
> -- license files
> -- README.md
> -- release notes
> -- Maven project information
> A tgz containing the source code has been attached:
> - It is based on commit 483aab3eb9bbc02f6995a637155adf6b922ed0c0 
> (https://github.com/levigo/jbig2-imageio/commit/483aab3eb9bbc02f6995a637155adf6b922ed0c0).
>  
> - Its SHA1 is 0e07111b4bf7f5a51bf0fdd903f02f082ea3bf65



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-24 Thread JIRA

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178167#comment-16178167
 ] 

Andreas Lehmkühler commented on PDFBOX-3934:


The former brute force search doesn't work with truncated files as it omits any 
valid content after the last found startxref entry. Unfortunately the 
regression from PDFBOX-3318 is back.

> Page missing
> 
>
> Key: PDFBOX-3934
> URL: https://issues.apache.org/jira/browse/PDFBOX-3934
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.5, 2.0.6, 2.0.7
>Reporter: Tilman Hausherr
>Assignee: Andreas Lehmkühler
>  Labels: regression
> Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, 
> PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf
>
>
> The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-24 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178166#comment-16178166
 ] 

Tilman Hausherr commented on PDFBOX-3934:
-

PDFBOX-3318 fails now with "root cannot be null".

> Page missing
> 
>
> Key: PDFBOX-3934
> URL: https://issues.apache.org/jira/browse/PDFBOX-3934
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.5, 2.0.6, 2.0.7
>Reporter: Tilman Hausherr
>Assignee: Andreas Lehmkühler
>  Labels: regression
> Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, 
> PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf
>
>
> The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-24 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178155#comment-16178155
 ] 

ASF subversion and git services commented on PDFBOX-3934:
-

Commit 1809500 from [~lehmi] in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1809500 ]

PDFBOX-3934: removed brute force search for last startxref entry fall back to 
rebuildTrailer instead, improved garbage detection

> Page missing
> 
>
> Key: PDFBOX-3934
> URL: https://issues.apache.org/jira/browse/PDFBOX-3934
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.5, 2.0.6, 2.0.7
>Reporter: Tilman Hausherr
>Assignee: Andreas Lehmkühler
>  Labels: regression
> Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, 
> PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf
>
>
> The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3934) Page missing

2017-09-24 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3934?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178156#comment-16178156
 ] 

ASF subversion and git services commented on PDFBOX-3934:
-

Commit 1809501 from [~lehmi] in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1809501 ]

PDFBOX-3934: removed brute force search for last startxref entry fall back to 
rebuildTrailer instead, improved garbage detection

> Page missing
> 
>
> Key: PDFBOX-3934
> URL: https://issues.apache.org/jira/browse/PDFBOX-3934
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.5, 2.0.6, 2.0.7
>Reporter: Tilman Hausherr
>Assignee: Andreas Lehmkühler
>  Labels: regression
> Attachments: BCZSFNQAB62TUBURWG6B3ZOZCG5IH46P.pdf, 
> PDFBOX-3934-KTUUMJNQ7NYGJJMEDGSY5OEU76G6JX2V.pdf
>
>
> The first page (with "iéseg") was in 2.0.4 but is no longer there since 2.0.5.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3913) Japanese URI improperly encoded

2017-09-24 Thread JIRA

[ 
https://issues.apache.org/jira/browse/PDFBOX-3913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178147#comment-16178147
 ] 

Andreas Lehmkühler commented on PDFBOX-3913:


Me neither, however "my" eclipse complained about it and it's fixed now. :)

> Japanese URI improperly encoded
> ---
>
> Key: PDFBOX-3913
> URL: https://issues.apache.org/jira/browse/PDFBOX-3913
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 2.0.7
>Reporter: Roman
>Assignee: Tilman Hausherr
> Fix For: 2.0.8, 3.0.0
>
> Attachments: business.pdf
>
>
> *PDActionURI.getURI()* function returns bad result for the first link 
> annotation in the attached document [^business.pdf] (this doc works fine in 
> Acrobat Reader).
> We are using PdfBox 2.0.5
> {code}
> //..
> //Iterates by all annotations on the first page of doc, searches for first 
> annotation of type PDAnnotationLink
> //..
>   PDAnnotationLink link = (PDAnnotationLink) annotation;
>   PDAction action = link.getAction();
> //this would return buggy result: http://組匶替綎.com/
> String badURI = ((PDActionURI) action).getURI();
> //although, here fine result is returned: http://経営承継.com/
> String goodURI = new 
> String(((COSString)action.getCOSObject().getDictionaryObject("URI")).getBytes());
> {code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3913) Japanese URI improperly encoded

2017-09-24 Thread Tilman Hausherr (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178145#comment-16178145
 ] 

Tilman Hausherr commented on PDFBOX-3913:
-

Thanks... I have no idea how this happened. And why the test didn't fail on the 
CI server.

> Japanese URI improperly encoded
> ---
>
> Key: PDFBOX-3913
> URL: https://issues.apache.org/jira/browse/PDFBOX-3913
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 2.0.7
>Reporter: Roman
>Assignee: Tilman Hausherr
> Fix For: 2.0.8, 3.0.0
>
> Attachments: business.pdf
>
>
> *PDActionURI.getURI()* function returns bad result for the first link 
> annotation in the attached document [^business.pdf] (this doc works fine in 
> Acrobat Reader).
> We are using PdfBox 2.0.5
> {code}
> //..
> //Iterates by all annotations on the first page of doc, searches for first 
> annotation of type PDAnnotationLink
> //..
>   PDAnnotationLink link = (PDAnnotationLink) annotation;
>   PDAction action = link.getAction();
> //this would return buggy result: http://組匶替綎.com/
> String badURI = ((PDActionURI) action).getURI();
> //although, here fine result is returned: http://経営承継.com/
> String goodURI = new 
> String(((COSString)action.getCOSObject().getDictionaryObject("URI")).getBytes());
> {code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-3913) Japanese URI improperly encoded

2017-09-24 Thread ASF subversion and git services (JIRA)

[ 
https://issues.apache.org/jira/browse/PDFBOX-3913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16178141#comment-16178141
 ] 

ASF subversion and git services commented on PDFBOX-3913:
-

Commit 1809485 from [~lehmi] in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1809485 ]

PDFBOX-3913: fix package of test class

> Japanese URI improperly encoded
> ---
>
> Key: PDFBOX-3913
> URL: https://issues.apache.org/jira/browse/PDFBOX-3913
> Project: PDFBox
>  Issue Type: Bug
>  Components: PDModel
>Affects Versions: 2.0.7
>Reporter: Roman
>Assignee: Tilman Hausherr
> Fix For: 2.0.8, 3.0.0
>
> Attachments: business.pdf
>
>
> *PDActionURI.getURI()* function returns bad result for the first link 
> annotation in the attached document [^business.pdf] (this doc works fine in 
> Acrobat Reader).
> We are using PdfBox 2.0.5
> {code}
> //..
> //Iterates by all annotations on the first page of doc, searches for first 
> annotation of type PDAnnotationLink
> //..
>   PDAnnotationLink link = (PDAnnotationLink) annotation;
>   PDAction action = link.getAction();
> //this would return buggy result: http://組匶替綎.com/
> String badURI = ((PDActionURI) action).getURI();
> //although, here fine result is returned: http://経営承継.com/
> String goodURI = new 
> String(((COSString)action.getCOSObject().getDictionaryObject("URI")).getBytes());
> {code}



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Assigned] (PDFBOX-3936) IllegalArgumentException: root cannot be null with truncated file (2)

2017-09-24 Thread JIRA

 [ 
https://issues.apache.org/jira/browse/PDFBOX-3936?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Andreas Lehmkühler reassigned PDFBOX-3936:
--

Assignee: Andreas Lehmkühler

> IllegalArgumentException: root cannot be null with truncated file (2)
> -
>
> Key: PDFBOX-3936
> URL: https://issues.apache.org/jira/browse/PDFBOX-3936
> Project: PDFBox
>  Issue Type: Bug
>  Components: Parsing
>Affects Versions: 2.0.7
>Reporter: Tilman Hausherr
>Assignee: Andreas Lehmkühler
>  Labels: regression
> Attachments: PDFBOX-3936-BF4G4JVMMC4I7RDNIPQQF6DQSVKSVVXJ.pdf
>
>
> {code}
> Exception in thread "main" java.lang.IllegalArgumentException: root cannot be 
> null
> at org.apache.pdfbox.pdmodel.PDPageTree.(PDPageTree.java:75)
> at 
> org.apache.pdfbox.pdmodel.PDDocumentCatalog.getPages(PDDocumentCatalog.java:129)
> {code}
> This was OK with 2.0.6.
> [~talli...@apache.org] please move that file to the "truncated" subdir.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org