[jira] Commented: (TIKA-461) RFC822 messages not parsed

2010-11-30 Thread Benjamin Douglas (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965154#action_12965154 ] Benjamin Douglas commented on TIKA-461: Following a discussion with Julien, I am attac

[jira] Updated: (TIKA-461) RFC822 messages not parsed

2010-11-30 Thread Benjamin Douglas (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Douglas updated TIKA-461: Attachment: TIKA-461-tests-1.patch

[jira] Updated: (TIKA-461) RFC822 messages not parsed

2010-11-30 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Julien Nioche updated TIKA-461: Attachment: testRFC822-multipart Test document for mail parsing with multiparts, text + html representa

[jira] Commented: (TIKA-461) RFC822 messages not parsed

2010-11-30 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965271#action_12965271 ] Julien Nioche commented on TIKA-461: Benjamin, thanks for your patch. Could you generate

[jira] Commented: (TIKA-461) RFC822 messages not parsed

2010-11-30 Thread Benjamin Douglas (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965279#action_12965279 ] Benjamin Douglas commented on TIKA-461: Did you try patch -p1? The test patch was mean

[jira] Commented: (TIKA-461) RFC822 messages not parsed

2010-11-30 Thread Julien Nioche (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965286#action_12965286 ] Julien Nioche commented on TIKA-461: patch -p1 failed peb...@lucid-vostro:/data/tika$ p

[jira] Updated: (TIKA-461) RFC822 messages not parsed

2010-11-30 Thread Benjamin Douglas (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Douglas updated TIKA-461: Attachment: (was: TIKA-461-tests-1.patch)

[jira] Updated: (TIKA-461) RFC822 messages not parsed

2010-11-30 Thread Benjamin Douglas (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benjamin Douglas updated TIKA-461: Attachment: TIKA-461-plus-tests-1.patch Sorry about that. I am attaching a cumulative diff from s

[jira] Commented: (TIKA-560) Improve detection of .mht, Foxmail, and OOXML files

2010-11-30 Thread Antoni Mylka (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965428#action_12965428 ] Antoni Mylka commented on TIKA-560: It seems that when applying changes to tika-mimetypes.

[jira] Issue Comment Edited: (TIKA-560) Improve detection of .mht, Foxmail, and OOXML files

2010-11-30 Thread Antoni Mylka (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965428#action_12965428 ] Antoni Mylka edited comment on TIKA-560 at 11/30/10 3:49 PM: It

[jira] Commented: (TIKA-389) Garbled metadata when dealing with encrypted PDF files.

2010-11-30 Thread Michel Tremblay (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-389?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965470#action_12965470 ] Michel Tremblay commented on TIKA-389: I have the same problem here using the command l

[jira] Created: (TIKA-562) In tika-mimetypes.xml OpenXML types should have x-tika-ooxml as their parent

2010-11-30 Thread Antoni Mylka (JIRA)
In tika-mimetypes.xml OpenXML types should have x-tika-ooxml as their parent Key: TIKA-562 URL: https://issues.apache.org/jira/browse/TIKA-562 Project: Tika Issue T

[jira] Updated: (TIKA-562) In tika-mimetypes.xml OpenXML types should have x-tika-ooxml as their parent

2010-11-30 Thread Antoni Mylka (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoni Mylka updated TIKA-562: Attachment: ooxml-children.patch

[jira] Updated: (TIKA-563) .vor files are Staroffice Templates, not Staroffice Writer documents

2010-11-30 Thread Antoni Mylka (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoni Mylka updated TIKA-563: Attachment: staroffice.5.2.templates.zip staroffice-templates.patch A patch and some exam

[jira] Created: (TIKA-563) .vor files are Staroffice Templates, not Staroffice Writer documents

2010-11-30 Thread Antoni Mylka (JIRA)
.vor files are Staroffice Templates, not Staroffice Writer documents Key: TIKA-563 URL: https://issues.apache.org/jira/browse/TIKA-563 Project: Tika Issue Type: Bug

[jira] Commented: (TIKA-560) Improve detection of .mht, Foxmail, and OOXML files

2010-11-30 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965500#action_12965500 ] Nick Burch commented on TIKA-560: I skipped one which looked to be incorrectly switching an

[jira] Commented: (TIKA-562) In tika-mimetypes.xml OpenXML types should have x-tika-ooxml as their parent

2010-11-30 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965501#action_12965501 ] Nick Burch commented on TIKA-562: Do you have some example files for these? (If they're inc

[jira] Created: (TIKA-564) Support returning original markup in BoilerpipeContentHandler

2010-11-30 Thread Ken Krugler (JIRA)
Support returning original markup in BoilerpipeContentHandler Key: TIKA-564 URL: https://issues.apache.org/jira/browse/TIKA-564 Project: Tika Issue Type: Improvement Com

[jira] Updated: (TIKA-564) Support returning original markup in BoilerpipeContentHandler

2010-11-30 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Krugler updated TIKA-564: Attachment: TIKA-564.patch Patch sponsored by Mashlogic - thanks!

[jira] Resolved: (TIKA-564) Support returning original markup in BoilerpipeContentHandler

2010-11-30 Thread Ken Krugler (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-564?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ken Krugler resolved TIKA-564. Resolution: Fixed Committed: http://svn.apache.org/viewvc?view=revision&revision=1040841

[jira] Commented: (TIKA-562) In tika-mimetypes.xml OpenXML types should have x-tika-ooxml as their parent

2010-11-30 Thread Antoni Mylka (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965590#action_12965590 ] Antoni Mylka commented on TIKA-562: Your unit tests test identification by name and by dat

[jira] Commented: (TIKA-560) Improve detection of .mht, Foxmail, and OOXML files

2010-11-30 Thread Antoni Mylka (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-560?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12965592#action_12965592 ] Antoni Mylka commented on TIKA-560: It's about detecting the testEXCEL.xlsb file with plai

[jira] Resolved: (TIKA-560) Improve detection of .mht, Foxmail, and OOXML files

2010-11-30 Thread Antoni Mylka (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoni Mylka resolved TIKA-560. Resolution: Fixed This is fixed from my POV, if you don't want to accept null stream in ContainerAware