[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-04-16 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13971622#comment-13971622 ] Tim Allison commented on TIKA-1010: --- Great to hear. Thank you for your help in submittin

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-04-16 Thread Chris Bamford (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13970597#comment-13970597 ] Chris Bamford commented on TIKA-1010: - Tim I have done a lot of testing now and am ver

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-04-12 Thread Peter Hamelberg (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13967649#comment-13967649 ] Peter Hamelberg commented on TIKA-1010: --- The RTF objdata destination contains the obj

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-04-10 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13965289#comment-13965289 ] Tim Allison commented on TIKA-1010: --- Interesting... Y, my untested belief is that with t

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-04-10 Thread Luis Filipe Nassif (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13965283#comment-13965283 ] Luis Filipe Nassif commented on TIKA-1010: -- If you extract the embedded xls file f

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-04-07 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13962059#comment-13962059 ] Tim Allison commented on TIKA-1010: --- Hmmm... In the zip file from April 3, there should b

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-04-07 Thread Chris Bamford (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13961821#comment-13961821 ] Chris Bamford commented on TIKA-1010: - Thanks Tim, all compiles now. I think at least

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-04-04 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13959988#comment-13959988 ] Tim Allison commented on TIKA-1010: --- trunk svn co http://svn.apache.org/repos/asf/tika/t

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-04-04 Thread Chris Bamford (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13959978#comment-13959978 ] Chris Bamford commented on TIKA-1010: - Hi Tim Am about to play with the patch - which

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-04-01 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13956928#comment-13956928 ] Tim Allison commented on TIKA-1010: --- Absolutely, this is more of a question for the tika-

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-04-01 Thread Chris Bamford (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13956274#comment-13956274 ] Chris Bamford commented on TIKA-1010: - Tim A quick question - where do the extracted fi

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-03-31 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13955104#comment-13955104 ] Tim Allison commented on TIKA-1010: --- Thank you, Chris. The test doc that I posted late l

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-03-31 Thread Chris Bamford (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13955048#comment-13955048 ] Chris Bamford commented on TIKA-1010: - Hi Tim I have created an RTF with 5 embedded of

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-03-30 Thread Chris Bamford (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13954819#comment-13954819 ] Chris Bamford commented on TIKA-1010: - Hi Tim Are you saying you would like to test ag

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-03-28 Thread Chris Bamford (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13950835#comment-13950835 ] Chris Bamford commented on TIKA-1010: - Hi Tim I have found one - please see https://is

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-03-28 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13951116#comment-13951116 ] Tim Allison commented on TIKA-1010: --- As a side note, I can grab file names for: 1) image

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-03-28 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13951107#comment-13951107 ] Tim Allison commented on TIKA-1010: --- Y, thanks, I got that. I can add an "extract all" m

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-03-28 Thread Chris Bamford (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13951092#comment-13951092 ] Chris Bamford commented on TIKA-1010: - Ideally I'd like to be able to extract any file,

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-03-28 Thread Chris Bamford (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13951097#comment-13951097 ] Chris Bamford commented on TIKA-1010: - The binary actually looks like this: {noformat}

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-03-28 Thread Chris Bamford (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13951009#comment-13951009 ] Chris Bamford commented on TIKA-1010: - Hi again Tim Dunno if this helps, but there is

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-03-28 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13951004#comment-13951004 ] Tim Allison commented on TIKA-1010: --- Chris, Thanks for pointing that out. The objdata

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-03-28 Thread Chris Bamford (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13950714#comment-13950714 ] Chris Bamford commented on TIKA-1010: - Hi Tim Sorry about the confusion with the GIFs

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-03-28 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13950628#comment-13950628 ] Tim Allison commented on TIKA-1010: --- Chris, Thank you for digging into the spec and sha

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-03-21 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13943423#comment-13943423 ] Tim Allison commented on TIKA-1010: --- In {themedata, I'm seeing the magic 50 4B (PK)...th

[jira] [Commented] (TIKA-1010) Embedded documents in RTF are not extracted

2014-03-21 Thread Tim Allison (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-1010?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13943404#comment-13943404 ] Tim Allison commented on TIKA-1010: --- This might be of use: http://palashray.com/2006/10/