[
https://issues.apache.org/jira/browse/TIKA-423?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jukka Zitting updated TIKA-423:
---
Affects Version/s: 0.8
0.9
0.10
This is still a problem
[
https://issues.apache.org/jira/browse/TIKA-410?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jukka Zitting updated TIKA-410:
---
Affects Version/s: 0.10
This is still an issue with Tika 0.10 and the latest trunk.
[
https://issues.apache.org/jira/browse/TIKA-734?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13122614#comment-13122614
]
Anirban Mitra commented on TIKA-734:
Thanks. I will let you know soon.
-- Anirban
[
https://issues.apache.org/jira/browse/TIKA-272?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13122625#comment-13122625
]
Jukka Zitting commented on TIKA-272:
See PDFBOX-577 for some related work in PDFBox.
[
https://issues.apache.org/jira/browse/TIKA-123?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jukka Zitting resolved TIKA-123.
Resolution: Duplicate
Much of this was already implemented recently in other issues, so resolving as
[
https://issues.apache.org/jira/browse/TIKA-429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jukka Zitting resolved TIKA-429.
Resolution: Fixed
Fix Version/s: 1.0
Assignee: Jukka Zitting
Looks like there's no
[
https://issues.apache.org/jira/browse/TIKA-513?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13122639#comment-13122639
]
Jukka Zitting commented on TIKA-513:
Is there a DjVu parser we could use?
[
https://issues.apache.org/jira/browse/TIKA-554?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jukka Zitting resolved TIKA-554.
Resolution: Won't Fix
Assignee: Jukka Zitting
Resolving as Won't Fix since the ParseUtils
[
https://issues.apache.org/jira/browse/TIKA-581?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jukka Zitting resolved TIKA-581.
Resolution: Fixed
Fix Version/s: 1.0
Assignee: Jukka Zitting
This was already fixed.
[
https://issues.apache.org/jira/browse/TIKA-576?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jukka Zitting resolved TIKA-576.
Resolution: Won't Fix
Resolving as Won't Fix since this is a rare enough problem and the workaround
[
https://issues.apache.org/jira/browse/TIKA-509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jukka Zitting resolved TIKA-509.
Resolution: Fixed
Fix Version/s: 1.0
Resolving as fixed as discussed above.
[
https://issues.apache.org/jira/browse/TIKA-685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jukka Zitting resolved TIKA-685.
Resolution: Duplicate
Works with latest Tika, so resolving as a duplicate of some of the other
There is the one (GPL) I've been playing with:
http://javadjvu.foxtrottechnologies.com/
However, in order to extract text/context from images, we have to find
suitable implementation of OCR.
On Fri, Oct 7, 2011 at 11:02 AM, Jukka Zitting (Commented) (JIRA)
j...@apache.org wrote:
[
[
https://issues.apache.org/jira/browse/TIKA-682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13122934#comment-13122934
]
Nick Burch commented on TIKA-682:
-
ImageParser currently claims to support image/x-psd,
RTF parser fails to extract the body
Key: TIKA-748
URL: https://issues.apache.org/jira/browse/TIKA-748
Project: Tika
Issue Type: Bug
Components: parser
Affects Versions: 0.10
[
https://issues.apache.org/jira/browse/TIKA-748?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Andrzej Bialecki updated TIKA-748:
---
Attachment: test.rtf
RTF parser fails to extract the body
[
https://issues.apache.org/jira/browse/TIKA-541?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Jukka Zitting resolved TIKA-541.
Resolution: Won't Fix
I don't see much benefit to using commons-cli in our case, so resolving as
[
https://issues.apache.org/jira/browse/TIKA-682?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13123181#comment-13123181
]
Nick Burch commented on TIKA-682:
-
I've added a basic metadata extracting parser in
See
https://builds.apache.org/job/Tika-trunk/org.apache.tika$tika-parsers/674/changes
Changes:
[nick] TIKA-682 Add a basic PSD metadata extracting Parser
--
ignoring exception during new ExecutedMojo null
[PMD] Skipping maven reporter: there is already a
See https://builds.apache.org/job/Tika-trunk/674/changes
Changes:
[nick] TIKA-682 Add a basic PSD metadata extracting Parser
[nick] TIKA-749 Add EndianUtils, which provides a way to read small and big
endian numbers from streams, based on the version in POI
[nick] TIKA-682 Add mime magic
[
https://issues.apache.org/jira/browse/TIKA-749?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Nick Burch resolved TIKA-749.
-
Resolution: Fixed
Avoid using POI's LittleEndian in non-POI parsers
[
https://issues.apache.org/jira/browse/TIKA-749?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13123200#comment-13123200
]
Nick Burch commented on TIKA-749:
-
Done in r1180243.
Avoid using POI's
See
https://builds.apache.org/job/Tika-trunk/org.apache.tika$tika-parsers/675/changes
See https://builds.apache.org/job/Tika-trunk/675/changes
24 matches
Mail list logo