[jira] [Commented] (PDFBOX-5029) Tika - Issues extracting Arabic script from pdf

2021-01-07 Thread Christian (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17260786#comment-17260786 ] Christian commented on PDFBOX-5029: Hi Tilman, first of all Happy New Year - I have been very busy

[jira] [Commented] (PDFBOX-5029) Tika - Issues extracting Arabic script from pdf

2020-12-02 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17242621#comment-17242621 ] Tilman Hausherr commented on PDFBOX-5029: Could you please tell what segment in the PDF is

[jira] [Commented] (PDFBOX-5029) Tika - Issues extracting Arabic script from pdf

2020-12-01 Thread Christian (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17241476#comment-17241476 ] Christian commented on PDFBOX-5029: Hi Tilman, in your "sorted" files there are spaces between

[jira] [Commented] (PDFBOX-5029) Tika - Issues extracting Arabic script from pdf

2020-11-29 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17240465#comment-17240465 ] Tilman Hausherr commented on PDFBOX-5029: No I did not use the script. I don't have python

[jira] [Commented] (PDFBOX-5029) Tika - Issues extracting Arabic script from pdf

2020-11-29 Thread Christian (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17240363#comment-17240363 ] Christian commented on PDFBOX-5029: Also, what is the difference between the sorted and not-sorted

[jira] [Commented] (PDFBOX-5029) Tika - Issues extracting Arabic script from pdf

2020-11-29 Thread Christian (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17240362#comment-17240362 ] Christian commented on PDFBOX-5029: Thanks Tilman, will do - tomorrow I will be in touch with a

[jira] [Commented] (PDFBOX-5029) Tika - Issues extracting Arabic script from pdf

2020-11-28 Thread Tilman Hausherr (Jira)
[ https://issues.apache.org/jira/browse/PDFBOX-5029?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17239942#comment-17239942 ] Tilman Hausherr commented on PDFBOX-5029: I attached 4 files. IMHO spaces are there. Can you