[jira] [Commented] (TIKA-713) Tika can not parse all of the persian pdf files

2011-10-31 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13140148#comment-13140148 ] Robert Muir commented on TIKA-713: -- Thanks for uploading another test file Ahmad, we'll

[jira] [Commented] (TIKA-722) Arabic PDF doesn't extract correctly

2011-10-03 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13119403#comment-13119403 ] Robert Muir commented on TIKA-722: -- Actually in this case the original TTF font (AxtManal)

[jira] [Commented] (TIKA-721) UTF16-LE not detected

2011-10-02 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13119038#comment-13119038 ] Robert Muir commented on TIKA-721: -- {quote} Finally, for the valid code points, I count how

[jira] [Commented] (TIKA-713) Tika can not parse all of the persian pdf files

2011-10-02 Thread Robert Muir (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-713?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13119060#comment-13119060 ] Robert Muir commented on TIKA-713: -- I created PDFBOX-1127 for this with some screenshots