[jira] [Created] (TIKA-2524) Apache Tika returns empty string when parsing text from XPS files

2017-12-11 Thread Peter Davies (JIRA)
Peter Davies created TIKA-2524: Summary: Apache Tika returns empty string when parsing text from XPS files Key: TIKA-2524 URL: https://issues.apache.org/jira/browse/TIKA-2524 Project: Tika Iss

[jira] [Updated] (TIKA-2524) Apache Tika returns empty string when parsing text from XPS files

2017-12-11 Thread Peter Davies (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Davies updated TIKA-2524: Description: When we parse XPS files using the AutoParser we always get an empty string. If we use Defa

[jira] [Updated] (TIKA-2524) Apache Tika returns empty string when parsing text from XPS files

2017-12-11 Thread Peter Davies (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2524?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Davies updated TIKA-2524: Attachment: doc_xps.xps > Apache Tika returns empty string when parsing text from XPS files

[jira] [Commented] (TIKA-2524) Create/integrate a parser for XPS

2017-12-11 Thread Peter Davies (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2524?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16285951#comment-16285951 ] Peter Davies commented on TIKA-2524: Good to see we were using the AutoParser correctly

[jira] [Created] (TIKA-2640) MS Word document checkboxes and dropdowns not fully converted to text

2018-05-02 Thread Peter Davies (JIRA)
Peter Davies created TIKA-2640: Summary: MS Word document checkboxes and dropdowns not fully converted to text Key: TIKA-2640 URL: https://issues.apache.org/jira/browse/TIKA-2640 Project: Tika

[jira] [Updated] (TIKA-2640) MS Word document checkboxes and dropdowns not fully converted to text

2018-05-03 Thread Peter Davies (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-2640?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Peter Davies updated TIKA-2640: Description: When we use Tika to parse the text from a Microsoft Word document (.doc) file with a chec