[
https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Arjohn Kampman updated TIKA-782:
Attachment: logo.zip
Bingo, found one in the published Enron data. It's an RTF with the Enron logo.
[
https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Arjohn Kampman updated TIKA-782:
Attachment: bin3.patch
New patch with the requested changes.
> Add support for parsi
[
https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Arjohn Kampman updated TIKA-782:
Attachment: bin2.patch
improved patch
> Add support for parsing binary data in RTF f
[
https://issues.apache.org/jira/browse/TIKA-782?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Arjohn Kampman updated TIKA-782:
Attachment: bin.patch
Patch adding \bin support.
> Add support for parsing binary da
[
https://issues.apache.org/jira/browse/TIKA-781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Arjohn Kampman updated TIKA-781:
Description: The RTF parser should ignore control words like \par, \line
and \tab when these occur in
[
https://issues.apache.org/jira/browse/TIKA-781?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Arjohn Kampman updated TIKA-781:
Attachment: tika781.patch
> RTF parser should ignore most control words in ignore groups
> --
[
https://issues.apache.org/jira/browse/TIKA-777?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Arjohn Kampman updated TIKA-777:
Description:
Tika's RTF parser processes the following rtf document incorrectly, applying
the wrong