[jira] [Resolved] (PDFBOX-5028) Partial field names must not contain period characters
[ https://issues.apache.org/jira/browse/PDFBOX-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tilman Hausherr resolved PDFBOX-5028. - Assignee: Tilman Hausherr Resolution: Fixed > Partial field names must not contain period characters > -- > > Key: PDFBOX-5028 > URL: https://issues.apache.org/jira/browse/PDFBOX-5028 > Project: PDFBox > Issue Type: Bug > Components: AcroForm >Affects Versions: 1.8.16, 2.0.21 >Reporter: Tilman Hausherr >Assignee: Tilman Hausherr >Priority: Major > Fix For: 1.8.17, 2.0.22, 3.0.0 PDFBox > > -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Updated] (PDFBOX-5028) Partial field names must not contain period characters
[ https://issues.apache.org/jira/browse/PDFBOX-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tilman Hausherr updated PDFBOX-5028: Affects Version/s: 1.8.16 > Partial field names must not contain period characters > -- > > Key: PDFBOX-5028 > URL: https://issues.apache.org/jira/browse/PDFBOX-5028 > Project: PDFBox > Issue Type: Bug > Components: AcroForm >Affects Versions: 1.8.16, 2.0.21 >Reporter: Tilman Hausherr >Priority: Major > Fix For: 2.0.22, 3.0.0 PDFBox > > -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Updated] (PDFBOX-5028) Partial field names must not contain period characters
[ https://issues.apache.org/jira/browse/PDFBOX-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tilman Hausherr updated PDFBOX-5028: Fix Version/s: 1.8.17 > Partial field names must not contain period characters > -- > > Key: PDFBOX-5028 > URL: https://issues.apache.org/jira/browse/PDFBOX-5028 > Project: PDFBox > Issue Type: Bug > Components: AcroForm >Affects Versions: 1.8.16, 2.0.21 >Reporter: Tilman Hausherr >Priority: Major > Fix For: 1.8.17, 2.0.22, 3.0.0 PDFBox > > -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-5028) Partial field names must not contain period characters
[ https://issues.apache.org/jira/browse/PDFBOX-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17239871#comment-17239871 ] ASF subversion and git services commented on PDFBOX-5028: - Commit 1883888 from Tilman Hausherr in branch 'pdfbox/branches/2.0' [ https://svn.apache.org/r1883888 ] PDFBOX-5028: partial field names must not contain period characters > Partial field names must not contain period characters > -- > > Key: PDFBOX-5028 > URL: https://issues.apache.org/jira/browse/PDFBOX-5028 > Project: PDFBox > Issue Type: Bug > Components: AcroForm >Affects Versions: 2.0.21 >Reporter: Tilman Hausherr >Priority: Major > Fix For: 2.0.22, 3.0.0 PDFBox > > -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-5028) Partial field names must not contain period characters
[ https://issues.apache.org/jira/browse/PDFBOX-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17239872#comment-17239872 ] ASF subversion and git services commented on PDFBOX-5028: - Commit 1883889 from Tilman Hausherr in branch 'pdfbox/branches/1.8' [ https://svn.apache.org/r1883889 ] PDFBOX-5028: partial field names must not contain period characters > Partial field names must not contain period characters > -- > > Key: PDFBOX-5028 > URL: https://issues.apache.org/jira/browse/PDFBOX-5028 > Project: PDFBox > Issue Type: Bug > Components: AcroForm >Affects Versions: 2.0.21 >Reporter: Tilman Hausherr >Priority: Major > Fix For: 2.0.22, 3.0.0 PDFBox > > -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-5028) Partial field names must not contain period characters
[ https://issues.apache.org/jira/browse/PDFBOX-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17239870#comment-17239870 ] ASF subversion and git services commented on PDFBOX-5028: - Commit 1883887 from Tilman Hausherr in branch 'pdfbox/trunk' [ https://svn.apache.org/r1883887 ] PDFBOX-5028: partial field names must not contain period characters > Partial field names must not contain period characters > -- > > Key: PDFBOX-5028 > URL: https://issues.apache.org/jira/browse/PDFBOX-5028 > Project: PDFBox > Issue Type: Bug > Components: AcroForm >Affects Versions: 2.0.21 >Reporter: Tilman Hausherr >Priority: Major > Fix For: 2.0.22, 3.0.0 PDFBox > > -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Created] (PDFBOX-5029) Tika - Issues extracting Arabic script from pdf
Christian created PDFBOX-5029: -- Summary: Tika - Issues extracting Arabic script from pdf Key: PDFBOX-5029 URL: https://issues.apache.org/jira/browse/PDFBOX-5029 Project: PDFBox Issue Type: Bug Environment: Windows - Anaconda / Spyder Reporter: Christian Attachments: extracting_text_asian_pdf.py, test.pdf, test_scraped.utf8 I'm working on building a corpus of Uygur texts and some of the content is coming from pdf files. I wrote a short python script to scrape text from pdf using tika-python. The script is Arabic, and the output looks good but there is one major problem: there are many missing spaces between words and I really do not know how to address this issue. I am attaching a pdf file, the script to scrape its text and the output (test_scraped.utf8). Thanks in advance for your help. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Created] (PDFBOX-5028) Partial field names must not contain period characters
Tilman Hausherr created PDFBOX-5028: --- Summary: Partial field names must not contain period characters Key: PDFBOX-5028 URL: https://issues.apache.org/jira/browse/PDFBOX-5028 Project: PDFBox Issue Type: Bug Components: AcroForm Affects Versions: 2.0.21 Reporter: Tilman Hausherr Fix For: 2.0.22, 3.0.0 PDFBox -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Updated] (PDFBOX-5027) Protect/Encrypt PDF with multiple certificates on command line
[ https://issues.apache.org/jira/browse/PDFBOX-5027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tilman Hausherr updated PDFBOX-5027: Fix Version/s: 3.0.0 PDFBox 2.0.22 > Protect/Encrypt PDF with multiple certificates on command line > -- > > Key: PDFBOX-5027 > URL: https://issues.apache.org/jira/browse/PDFBOX-5027 > Project: PDFBox > Issue Type: Improvement > Components: Crypto >Reporter: jakatal >Priority: Trivial > Fix For: 2.0.22, 3.0.0 PDFBox > > Original Estimate: 6h > Remaining Estimate: 6h > > Hi, > PDFBox has (obviously) the ability to protect a file with several > certificates by adding teh recipient's certificates one after another: > > > {code:java} > //Class PublicKeyProtectionPolicy has > public void addRecipient(PublicKeyRecipient recipient) > {recipients.add(recipient);} > {code} > For the commandline tool functionality, it just offers "-cert" with the > option to add a SINGLE certificate. I expect that in most serious use cases > actually two certificates are used to protect the document (the actual > recipient and the creator who wants to be able still to open the document as > well). > > I propose to extend the command line functionality (Encrypt.java) by having > an iteration through several cert files, e.g. separated by special character. > > Thanks. > -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Updated] (PDFBOX-5027) Protect/Encrypt PDF with multiple certificates on command line
[ https://issues.apache.org/jira/browse/PDFBOX-5027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Tilman Hausherr updated PDFBOX-5027: Affects Version/s: 2.0.21 > Protect/Encrypt PDF with multiple certificates on command line > -- > > Key: PDFBOX-5027 > URL: https://issues.apache.org/jira/browse/PDFBOX-5027 > Project: PDFBox > Issue Type: Improvement > Components: Crypto >Affects Versions: 2.0.21 >Reporter: jakatal >Priority: Trivial > Fix For: 2.0.22, 3.0.0 PDFBox > > Original Estimate: 6h > Remaining Estimate: 6h > > Hi, > PDFBox has (obviously) the ability to protect a file with several > certificates by adding teh recipient's certificates one after another: > > > {code:java} > //Class PublicKeyProtectionPolicy has > public void addRecipient(PublicKeyRecipient recipient) > {recipients.add(recipient);} > {code} > For the commandline tool functionality, it just offers "-cert" with the > option to add a SINGLE certificate. I expect that in most serious use cases > actually two certificates are used to protect the document (the actual > recipient and the creator who wants to be able still to open the document as > well). > > I propose to extend the command line functionality (Encrypt.java) by having > an iteration through several cert files, e.g. separated by special character. > > Thanks. > -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org
[jira] [Commented] (PDFBOX-4848) Automate building website without local install
[ https://issues.apache.org/jira/browse/PDFBOX-4848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17239719#comment-17239719 ] ASF subversion and git services commented on PDFBOX-4848: - Commit a9a7397987f11da73d536c3dd9a8c879f519bd4b in pdfbox-docs's branch refs/heads/master from Maruan Sahyoun [ https://gitbox.apache.org/repos/asf?p=pdfbox-docs.git;h=a9a7397 ] PDFBOX-4848: ensure sass changes are picked up in preview > Automate building website without local install > --- > > Key: PDFBOX-4848 > URL: https://issues.apache.org/jira/browse/PDFBOX-4848 > Project: PDFBox > Issue Type: Improvement > Components: Documentation >Reporter: Maruan Sahyoun >Assignee: Maruan Sahyoun >Priority: Minor > > As discussed on the dev mailing list we are looking to utilize the [git - > .asf.yaml > features|https://cwiki.apache.org/confluence/display/INFRA/git+-+.asf.yaml+features] > and/or other capabilities to simplify building the website without the need > to install the site generation locally. -- This message was sent by Atlassian Jira (v8.3.4#803005) - To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org For additional commands, e-mail: dev-h...@pdfbox.apache.org