[jira] Commented: (TIKA-504) Support XMP metadata keys for more of the common EXIF tags

2010-09-09 Thread Staffan Olsson (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12907561#action_12907561 ] Staffan Olsson commented on TIKA-504: - An alternative would be to have a secondary field

[jira] Created: (TIKA-510) Use POI API for text extraction from XSLF shape

2010-09-09 Thread Maxim Valyanskiy (JIRA)
Use POI API for text extraction from XSLF shape --- Key: TIKA-510 URL: https://issues.apache.org/jira/browse/TIKA-510 Project: Tika Issue Type: Improvement Components: parser

[jira] Updated: (TIKA-511) NPE when POI is configured to prefer event extractors

2010-09-09 Thread Maxim Valyanskiy (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-511?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maxim Valyanskiy updated TIKA-511: -- Attachment: event.patch patch NPE when POI is configured to prefer event extractors

[jira] Commented: (TIKA-509) Container contents extraction

2010-09-09 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12907599#action_12907599 ] Nick Burch commented on TIKA-509: - Jukka - your patch looks good, just thought I'd check a

[jira] Commented: (TIKA-509) Container contents extraction

2010-09-09 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12907607#action_12907607 ] Jukka Zitting commented on TIKA-509: Yes, I think the ContainerExtractor and

[jira] Commented: (TIKA-509) Container contents extraction

2010-09-09 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-509?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=12907633#action_12907633 ] Nick Burch commented on TIKA-509: - Interface rename makes sense to me, I've done that Not

Build failed in Hudson: Tika-trunk #366

2010-09-09 Thread Apache Hudson Server
See https://hudson.apache.org/hudson/job/Tika-trunk/366/ -- Failed to access build log hudson.util.IOException2: remote file operation failed: /home/hudson/hudson-slave/workspace/Tika-trunk at hudson.remoting.chan...@2c88652b:ubuntu1 at

Hudson build is back to normal : Tika-trunk » Apache Tika parent #367

2010-09-09 Thread Apache Hudson Server
See https://hudson.apache.org/hudson/job/Tika-trunk/org.apache.tika$tika-parent/367/

Hudson build is back to normal : Tika-trunk #367

2010-09-09 Thread Apache Hudson Server
See https://hudson.apache.org/hudson/job/Tika-trunk/367/

Error thrown with TikaConfig() constructor

2010-09-09 Thread Ken Krugler
Hi all, In the past, we'd build our Hadoop job jars using a dependency on Tika-parsers but excluding the supporting jars for types that we know we don't need to process (e.g. Microsoft docs, PDFs, etc). This dramatically reduces the size of the resulting Hadoop job jar. With