[
https://issues.apache.org/jira/browse/TIKA-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13226461#comment-13226461
]
Nick Burch commented on TIKA-873:
-
Tika has a number of unit tests for the extraction of emb
[
https://issues.apache.org/jira/browse/TIKA-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13226475#comment-13226475
]
Albert L. commented on TIKA-873:
Hi Nick,
In the case of my attached file to this bug, I ge
[
https://issues.apache.org/jira/browse/TIKA-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13226476#comment-13226476
]
Albert L. commented on TIKA-873:
Hi Nick,
ps: I am getting this result with all DOC files I
[
https://issues.apache.org/jira/browse/TIKA-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13226489#comment-13226489
]
Nick Burch commented on TIKA-873:
-
What about with the test files that ship with Tika (eg te
[
https://issues.apache.org/jira/browse/TIKA-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13226499#comment-13226499
]
Albert L. commented on TIKA-873:
Hi Nick,
"testWORD_embeded.doc" is working. I get the fol
[
https://issues.apache.org/jira/browse/TIKA-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13234294#comment-13234294
]
Maxim Valyanskiy commented on TIKA-873:
---
Current trunk version extracts following file
[
https://issues.apache.org/jira/browse/TIKA-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13234310#comment-13234310
]
Maxim Valyanskiy commented on TIKA-873:
---
hm, 1.0 extracts something that is not valid.
[
https://issues.apache.org/jira/browse/TIKA-873?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13234350#comment-13234350
]
Albert L. commented on TIKA-873:
Thanks, Maxim.
> Tika --extract fails for