[
https://issues.apache.org/jira/browse/TIKA-504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12907561#action_12907561
]
Staffan Olsson commented on TIKA-504:
-
An alternative would be to have a secondary field
Use POI API for text extraction from XSLF shape
---
Key: TIKA-510
URL: https://issues.apache.org/jira/browse/TIKA-510
Project: Tika
Issue Type: Improvement
Components: parser
[
https://issues.apache.org/jira/browse/TIKA-511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Maxim Valyanskiy updated TIKA-511:
--
Attachment: event.patch
patch
NPE when POI is configured to prefer event extractors
[
https://issues.apache.org/jira/browse/TIKA-509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12907599#action_12907599
]
Nick Burch commented on TIKA-509:
-
Jukka - your patch looks good, just thought I'd check a
[
https://issues.apache.org/jira/browse/TIKA-509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12907607#action_12907607
]
Jukka Zitting commented on TIKA-509:
Yes, I think the ContainerExtractor and
[
https://issues.apache.org/jira/browse/TIKA-509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12907633#action_12907633
]
Nick Burch commented on TIKA-509:
-
Interface rename makes sense to me, I've done that
Not
See https://hudson.apache.org/hudson/job/Tika-trunk/366/
--
Failed to access build log
hudson.util.IOException2: remote file operation failed:
/home/hudson/hudson-slave/workspace/Tika-trunk at
hudson.remoting.chan...@2c88652b:ubuntu1
at
See
https://hudson.apache.org/hudson/job/Tika-trunk/org.apache.tika$tika-parent/367/
See https://hudson.apache.org/hudson/job/Tika-trunk/367/
Hi all,
In the past, we'd build our Hadoop job jars using a dependency on
Tika-parsers but excluding the supporting jars for types that we know we
don't need to process (e.g. Microsoft docs, PDFs, etc.). This
dramatically reduces the size of the resulting Hadoop job jar.
With