[jira] [Resolved] (PDFBOX-5028) Partial field names must not contain period characters

2020-11-27 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr resolved PDFBOX-5028.
-
  Assignee: Tilman Hausherr
Resolution: Fixed

> Partial field names must not contain period characters
> --
>
> Key: PDFBOX-5028
> URL: https://issues.apache.org/jira/browse/PDFBOX-5028
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm
>Affects Versions: 1.8.16, 2.0.21
>Reporter: Tilman Hausherr
>Assignee: Tilman Hausherr
>Priority: Major
> Fix For: 1.8.17, 2.0.22, 3.0.0 PDFBox
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-5028) Partial field names must not contain period characters

2020-11-27 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-5028:

Affects Version/s: 1.8.16

> Partial field names must not contain period characters
> --
>
> Key: PDFBOX-5028
> URL: https://issues.apache.org/jira/browse/PDFBOX-5028
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm
>Affects Versions: 1.8.16, 2.0.21
>Reporter: Tilman Hausherr
>Priority: Major
> Fix For: 2.0.22, 3.0.0 PDFBox
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-5028) Partial field names must not contain period characters

2020-11-27 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-5028:

Fix Version/s: 1.8.17

> Partial field names must not contain period characters
> --
>
> Key: PDFBOX-5028
> URL: https://issues.apache.org/jira/browse/PDFBOX-5028
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm
>Affects Versions: 1.8.16, 2.0.21
>Reporter: Tilman Hausherr
>Priority: Major
> Fix For: 1.8.17, 2.0.22, 3.0.0 PDFBox
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5028) Partial field names must not contain period characters

2020-11-27 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17239871#comment-17239871
 ] 

ASF subversion and git services commented on PDFBOX-5028:
-

Commit 1883888 from Tilman Hausherr in branch 'pdfbox/branches/2.0'
[ https://svn.apache.org/r1883888 ]

PDFBOX-5028: partial field names must not contain period characters

> Partial field names must not contain period characters
> --
>
> Key: PDFBOX-5028
> URL: https://issues.apache.org/jira/browse/PDFBOX-5028
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm
>Affects Versions: 2.0.21
>Reporter: Tilman Hausherr
>Priority: Major
> Fix For: 2.0.22, 3.0.0 PDFBox
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5028) Partial field names must not contain period characters

2020-11-27 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17239872#comment-17239872
 ] 

ASF subversion and git services commented on PDFBOX-5028:
-

Commit 1883889 from Tilman Hausherr in branch 'pdfbox/branches/1.8'
[ https://svn.apache.org/r1883889 ]

PDFBOX-5028: partial field names must not contain period characters

> Partial field names must not contain period characters
> --
>
> Key: PDFBOX-5028
> URL: https://issues.apache.org/jira/browse/PDFBOX-5028
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm
>Affects Versions: 2.0.21
>Reporter: Tilman Hausherr
>Priority: Major
> Fix For: 2.0.22, 3.0.0 PDFBox
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-5028) Partial field names must not contain period characters

2020-11-27 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-5028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17239870#comment-17239870
 ] 

ASF subversion and git services commented on PDFBOX-5028:
-

Commit 1883887 from Tilman Hausherr in branch 'pdfbox/trunk'
[ https://svn.apache.org/r1883887 ]

PDFBOX-5028: partial field names must not contain period characters

> Partial field names must not contain period characters
> --
>
> Key: PDFBOX-5028
> URL: https://issues.apache.org/jira/browse/PDFBOX-5028
> Project: PDFBox
>  Issue Type: Bug
>  Components: AcroForm
>Affects Versions: 2.0.21
>Reporter: Tilman Hausherr
>Priority: Major
> Fix For: 2.0.22, 3.0.0 PDFBox
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Created] (PDFBOX-5029) Tika - Issues extracting Arabic script from pdf

2020-11-27 Thread Christian (Jira)
Christian  created PDFBOX-5029:
--

 Summary: Tika - Issues extracting Arabic script from pdf
 Key: PDFBOX-5029
 URL: https://issues.apache.org/jira/browse/PDFBOX-5029
 Project: PDFBox
  Issue Type: Bug
 Environment: Windows - Anaconda / Spyder
Reporter: Christian 
 Attachments: extracting_text_asian_pdf.py, test.pdf, test_scraped.utf8

I'm working on building a corpus of Uygur texts and some of the content is 
coming from pdf files. I wrote a short python script to scrape text from pdf 
using tika-python. The script is Arabic, and the output looks good but there is 
one major problem: there are many missing spaces between words and I really do 
not know how to address this issue. I am attaching a pdf file, the script to 
scrape its text and the output (test_scraped.utf8). Thanks in advance for your 
help.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Created] (PDFBOX-5028) Partial field names must not contain period characters

2020-11-27 Thread Tilman Hausherr (Jira)
Tilman Hausherr created PDFBOX-5028:
---

 Summary: Partial field names must not contain period characters
 Key: PDFBOX-5028
 URL: https://issues.apache.org/jira/browse/PDFBOX-5028
 Project: PDFBox
  Issue Type: Bug
  Components: AcroForm
Affects Versions: 2.0.21
Reporter: Tilman Hausherr
 Fix For: 2.0.22, 3.0.0 PDFBox






--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-5027) Protect/Encrypt PDF with multiple certificates on command line

2020-11-27 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-5027:

Fix Version/s: 3.0.0 PDFBox
   2.0.22

> Protect/Encrypt PDF with multiple certificates on command line
> --
>
> Key: PDFBOX-5027
> URL: https://issues.apache.org/jira/browse/PDFBOX-5027
> Project: PDFBox
>  Issue Type: Improvement
>  Components: Crypto
>Reporter: jakatal
>Priority: Trivial
> Fix For: 2.0.22, 3.0.0 PDFBox
>
>   Original Estimate: 6h
>  Remaining Estimate: 6h
>
> Hi,
> PDFBox has (obviously) the ability to protect a file with several 
> certificates by adding teh recipient's certificates one after another:
>  
>  
> {code:java}
> //Class PublicKeyProtectionPolicy has 
> public void addRecipient(PublicKeyRecipient recipient)
> {recipients.add(recipient);}
> {code}
> For the commandline tool functionality, it just offers "-cert" with the 
> option to add a SINGLE certificate. I expect that in most serious use cases 
> actually two certificates are used to protect the document (the actual 
> recipient and the creator who wants to be able still to open the document as 
> well).
>  
> I propose to extend the command line functionality (Encrypt.java) by having 
> an iteration through several cert files, e.g. separated by special character.
>  
> Thanks.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Updated] (PDFBOX-5027) Protect/Encrypt PDF with multiple certificates on command line

2020-11-27 Thread Tilman Hausherr (Jira)


 [ 
https://issues.apache.org/jira/browse/PDFBOX-5027?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tilman Hausherr updated PDFBOX-5027:

Affects Version/s: 2.0.21

> Protect/Encrypt PDF with multiple certificates on command line
> --
>
> Key: PDFBOX-5027
> URL: https://issues.apache.org/jira/browse/PDFBOX-5027
> Project: PDFBox
>  Issue Type: Improvement
>  Components: Crypto
>Affects Versions: 2.0.21
>Reporter: jakatal
>Priority: Trivial
> Fix For: 2.0.22, 3.0.0 PDFBox
>
>   Original Estimate: 6h
>  Remaining Estimate: 6h
>
> Hi,
> PDFBox has (obviously) the ability to protect a file with several 
> certificates by adding teh recipient's certificates one after another:
>  
>  
> {code:java}
> //Class PublicKeyProtectionPolicy has 
> public void addRecipient(PublicKeyRecipient recipient)
> {recipients.add(recipient);}
> {code}
> For the commandline tool functionality, it just offers "-cert" with the 
> option to add a SINGLE certificate. I expect that in most serious use cases 
> actually two certificates are used to protect the document (the actual 
> recipient and the creator who wants to be able still to open the document as 
> well).
>  
> I propose to extend the command line functionality (Encrypt.java) by having 
> an iteration through several cert files, e.g. separated by special character.
>  
> Thanks.
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org



[jira] [Commented] (PDFBOX-4848) Automate building website without local install

2020-11-27 Thread ASF subversion and git services (Jira)


[ 
https://issues.apache.org/jira/browse/PDFBOX-4848?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17239719#comment-17239719
 ] 

ASF subversion and git services commented on PDFBOX-4848:
-

Commit a9a7397987f11da73d536c3dd9a8c879f519bd4b in pdfbox-docs's branch 
refs/heads/master from Maruan Sahyoun
[ https://gitbox.apache.org/repos/asf?p=pdfbox-docs.git;h=a9a7397 ]

PDFBOX-4848: ensure sass changes are picked up in preview


> Automate building website without local install
> ---
>
> Key: PDFBOX-4848
> URL: https://issues.apache.org/jira/browse/PDFBOX-4848
> Project: PDFBox
>  Issue Type: Improvement
>  Components: Documentation
>Reporter: Maruan Sahyoun
>Assignee: Maruan Sahyoun
>Priority: Minor
>
> As discussed on the dev mailing list we are looking to utilize the [git - 
> .asf.yaml 
> features|https://cwiki.apache.org/confluence/display/INFRA/git+-+.asf.yaml+features]
>  and/or other capabilities to simplify building the website without the need 
> to install the site generation locally.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

-
To unsubscribe, e-mail: dev-unsubscr...@pdfbox.apache.org
For additional commands, e-mail: dev-h...@pdfbox.apache.org