[jira] [Commented] (TIKA-817) (PPT/PPTX) Missing date/time in text content.

2011-12-19 Thread Albert L. (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13172371#comment-13172371 ] Albert L. commented on TIKA-817: I wonder if "update automatically" Date/Time objects don't

[jira] [Commented] (TIKA-816) (XLS/XLSX) Improperly formatted date/time in text content.

2011-12-19 Thread Albert L. (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13172469#comment-13172469 ] Albert L. commented on TIKA-816: XLS files seem to work when calling text extraction via HSS

[jira] [Commented] (TIKA-816) (XLS/XLSX) Improperly formatted date/time in text content.

2011-12-19 Thread Albert L. (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13172470#comment-13172470 ] Albert L. commented on TIKA-816: Bug 52369 - XLSX: text extraction malformed "=NOW()" and "=

[jira] [Commented] (TIKA-817) (PPT/PPTX) Missing date/time in text content.

2011-12-19 Thread Albert L. (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-817?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13172472#comment-13172472 ] Albert L. commented on TIKA-817: Reported the bug in POI v3.8 beta 5. Bug 52367 - PPT: text

[jira] [Commented] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content

2011-12-20 Thread Albert L. (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13173225#comment-13173225 ] Albert L. commented on TIKA-819: Oh, I see. Could this be a command-line option when using

[jira] [Commented] (TIKA-819) Make Option to Exclude Embedded Files' Text for Text Content

2011-12-21 Thread Albert L. (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-819?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13174121#comment-13174121 ] Albert L. commented on TIKA-819: I think that by default retrieving the text content should

[jira] [Commented] (TIKA-873) Tika --extract fails for DOC

2012-03-09 Thread Albert L. (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13226475#comment-13226475 ] Albert L. commented on TIKA-873: Hi Nick, In the case of my attached file to this bug, I ge

[jira] [Commented] (TIKA-873) Tika --extract fails for DOC

2012-03-09 Thread Albert L. (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13226476#comment-13226476 ] Albert L. commented on TIKA-873: Hi Nick, ps: I am getting this result with all DOC files I

[jira] [Commented] (TIKA-873) Tika --extract fails for DOC

2012-03-09 Thread Albert L. (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13226499#comment-13226499 ] Albert L. commented on TIKA-873: Hi Nick, "testWORD_embeded.doc" is working. I get the fol

[jira] [Commented] (TIKA-873) Tika --extract fails for DOC

2012-03-21 Thread Albert L. (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13234350#comment-13234350 ] Albert L. commented on TIKA-873: Thanks, Maxim. > Tika --extract fails for