[
https://issues.apache.org/jira/browse/TIKA-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14371960#comment-14371960
]
Tomas Safarik commented on TIKA-1194:
-
Sorry but no.
1) I don't have the source code.
[
https://issues.apache.org/jira/browse/TIKA-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tomas Safarik updated TIKA-1194:
Attachment: apache-tika-1.5.patch
Just for information. Patch of our changes that workarounds the pro
[
https://issues.apache.org/jira/browse/TIKA-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14363062#comment-14363062
]
Tomas Safarik commented on TIKA-1194:
-
I was finally able to prepare version of documen
[
https://issues.apache.org/jira/browse/TIKA-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14363062#comment-14363062
]
Tomas Safarik edited comment on TIKA-1194 at 3/16/15 10:57 AM:
--
[
https://issues.apache.org/jira/browse/TIKA-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tomas Safarik updated TIKA-1194:
Attachment: OP-06-015.doc
> Missing text from MS Word (DOC) file
> --
[
https://issues.apache.org/jira/browse/TIKA-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tomas Safarik reopened TIKA-1194:
-
Sorry for my late response.
I still do not have file I can upload.
I did more testing and the bug is
[
https://issues.apache.org/jira/browse/TIKA-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13820139#comment-13820139
]
Tomas Safarik commented on TIKA-1194:
-
I can see the text missing in Apache POI WordToT
[
https://issues.apache.org/jira/browse/TIKA-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13820080#comment-13820080
]
Tomas Safarik commented on TIKA-1194:
-
Sorry I needed to remove the document because it
[
https://issues.apache.org/jira/browse/TIKA-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tomas Safarik updated TIKA-1194:
Attachment: (was: OP-06-015.doc)
> Missing text from MS Word (DOC) file
> --
[
https://issues.apache.org/jira/browse/TIKA-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tomas Safarik updated TIKA-1194:
Description:
Hello,
we noticed that filtered text from some MS Word DOC files is missing one line
[
https://issues.apache.org/jira/browse/TIKA-1194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tomas Safarik updated TIKA-1194:
Attachment: OP-06-015.doc
"mluvil s Milouškem: poslat nabídku" is the problematic line/cell
> Missi
Tomas Safarik created TIKA-1194:
---
Summary: Missing text from MS Word (DOC) file
Key: TIKA-1194
URL: https://issues.apache.org/jira/browse/TIKA-1194
Project: Tika
Issue Type: Bug
Compo
[
https://issues.apache.org/jira/browse/TIKA-431?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13421711#comment-13421711
]
Tomas Safarik commented on TIKA-431:
Hello,
it seems that I created duplicate issue TIK
[
https://issues.apache.org/jira/browse/TIKA-952?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tomas Safarik closed TIKA-952.
--
Resolution: Duplicate
> HTML meta tags ignored for encoding detection
> -
Tomas Safarik created TIKA-952:
--
Summary: HTML meta tags ignored for encoding detection
Key: TIKA-952
URL: https://issues.apache.org/jira/browse/TIKA-952
Project: Tika
Issue Type: Bug
15 matches
Mail list logo