[ https://issues.apache.org/jira/browse/TIKA-1257?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Hong-Thai Nguyen updated TIKA-1257:
-----------------------------------
    Attachment: tika-doc-control-char.png
                5f01ae23-9e6e-4faa-808a-f78dbb20cc71.doc

> MS Word Filter out control characters on output
> -----------------------------------------------
>
>                 Key: TIKA-1257
>                 URL: https://issues.apache.org/jira/browse/TIKA-1257
>             Project: Tika
>          Issue Type: Bug
>          Components: parser
>    Affects Versions: 1.4
>            Reporter: Hong-Thai Nguyen
>             Fix For: 1.6
>
>         Attachments: 5f01ae23-9e6e-4faa-808a-f78dbb20cc71.doc, tika-doc-control-char.png
>
>
> Control characters appear in the output, mostly in the index table, and cannot be displayed. We
> should filter them out.

--
This message was sent by Atlassian JIRA
(v6.2#6252)