Grzegorz Kaczmarczyk created TIKA-1171:
------------------------------------------
Summary: Invalid characters in text extracted from *.ppt files
Key: TIKA-1171
URL: https://issues.apache.org/jira/browse/TIKA-1171
Project: Tika
Issue Type: Bug
Affects Versions: 1.4, 1.3
Reporter: Grzegorz Kaczmarczyk
Attachments: output.txt, tika-ppt-bug.tar.gz
Since tika 1.3 in text extracted from *.ppt files some unwanted asterisks
occurs. I'm attaching simple sample project that reproduces that bug.
--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira