[jira] [Created] (TIKA-631) Improve handling of Outlook emails which contain html, and those with non-unicode text bodies

2011-04-01 Thread Nick Burch (JIRA)
Improve handling of Outlook emails which contain html, and those with non-unicode text bodies
Key: TIKA-631
URL: https://issues.apache.org/jira/browse/TIKA-631

Build failed in Jenkins: Tika-trunk #505

2011-04-01 Thread Apache Hudson Server
See Changes:
[nick] TIKA-631 - Stub out the work for improving the outlook parsing WRT html body content and better encoding detection
[nick] TIKA-631 - Sample Chinese outlook file
-- Started

[jira] [Updated] (TIKA-632) Rtf parsing ignores links

2011-04-01 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-632?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nick Burch updated TIKA-632:
Attachment: test.rtf
> Rtf parsing ignores links
> -
>
> Key: TIKA-63

[jira] [Created] (TIKA-632) Rtf parsing ignores links

2011-04-01 Thread Nick Burch (JIRA)
Rtf parsing ignores links
Key: TIKA-632
URL: https://issues.apache.org/jira/browse/TIKA-632
Project: Tika
Issue Type: Bug
Components: parser
Affects Versions: 0.9
Reporter: Nick Burch

[jira] [Updated] (TIKA-631) Improve handling of Outlook emails which contain html, and those with non-unicode text bodies

2011-04-01 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nick Burch updated TIKA-631:
Attachment: OutlookRTFStub.patch
The attached patch OutlookRTFStub.patch requires POI 3.8 beta 2, and fetches

[jira] [Updated] (TIKA-631) Improve handling of Outlook emails which contain html, and those with non-unicode text bodies

2011-04-01 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-631?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nick Burch updated TIKA-631:
Attachment: OutlookHtmlRtf.patch
OutlookHtmlRtf.patch requires POI 3.8 beta 3, but enables the use of the HTM