Theodor Sjöstedt created TIKA-1428:
--
Summary: Microsoft Word 97 - 2003 (.doc) footnote references are
Unicode Replacement Character
Key: TIKA-1428
URL: https://issues.apache.org/jira/browse/TIKA-1428
[
https://issues.apache.org/jira/browse/TIKA-1428?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Theodor Sjöstedt updated TIKA-1428:
---
Attachment: TIKA-doc-footnotes-issue.png
Original document to the left.
Tika 1.4 in the center
[
https://issues.apache.org/jira/browse/TIKA-1428?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147880#comment-14147880
]
Hong-Thai Nguyen commented on TIKA-1428:
Thanks [~theoettheo], any chance to have a
[
https://issues.apache.org/jira/browse/TIKA-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Tim Allison updated TIKA-1330:
--
Attachment: TIKA-1330v1-patch.zip
This is the first version of tika-batch. Much cleanup remains.
This
[
https://issues.apache.org/jira/browse/TIKA-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14121454#comment-14121454
]
Tim Allison edited comment on TIKA-1330 at 9/25/14 4:18 PM:
[
https://issues.apache.org/jira/browse/TIKA-1330?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14147922#comment-14147922
]
Tim Allison commented on TIKA-1330:
---
[~tilman], I leave it as an exercise to implement a
Hey Nick,
On 22 Sep 2014, at 23:21, Nick Burch n...@apache.org wrote:
It's only 2 months to go until ApacheCon Europe in Budapest. I'm
simultaneously excited by all the great Tika stuff going on, and worried by
how many talks I need to finish writing...
As usual for an ApacheCon, we've
[
https://issues.apache.org/jira/browse/TIKA-1423?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148398#comment-14148398
]
Vineet Ghatge commented on TIKA-1423:
-
Pulling up the data and JAR file and trying to
[
https://issues.apache.org/jira/browse/TIKA-1415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148528#comment-14148528
]
sunxingzhe commented on TIKA-1415:
--
Attached are the correction results, please
[
https://issues.apache.org/jira/browse/TIKA-1415?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14148528#comment-14148528
]
sunxingzhe edited comment on TIKA-1415 at 9/26/14 2:44 AM:
---
Hello all,
I was wondering if there is any built-in parser to help with conversion from
XHTML to JSON.
My research showed that there is one named org.apache.io.json, which has just
one method implemented. Also, I tried the GJSON library to do this, but it does
not seem to work with Tika. Any suggestions