Peter Davies created TIKA-2524: ---------------------------------- Summary: Apache Tika returns empty string when parsing text from XPS files Key: TIKA-2524 URL: https://issues.apache.org/jira/browse/TIKA-2524 Project: Tika Issue Type: Bug Components: parser Affects Versions: 1.16 Reporter: Peter Davies
When we parse XPS files using the AutoParser we always get an empty string. If we use DefaultDetector.detect() it correctly detects the MediaType as "application/vnd.ms-xpsdocument". This page https://tika.apache.org/1.16/formats.html suggests that XPS (application/vnd.ms-xpsdocument) is supported however. -- This message was sent by Atlassian JIRA (v6.4.14#64029)