[jira] [Commented] (TIKA-930) Consolidation of Some Tika Core Properties

2012-05-22 Thread JIRA
[ https://issues.apache.org/jira/browse/TIKA-930?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13281073#comment-13281073 ] Jörg Ehrlich commented on TIKA-930: --- Consolidation is a good idea. As a general comment up

RE: A plan to improve the metadata property definitions

2012-05-22 Thread Joerg Ehrlich
Hi Nick and Ray, +1 Thanks, this looks like a great step forward. It definitely helps to clean up the current metadata usage. But I still have no real idea how to represent structured properties with the current Property/Metadata setup going forward. I have done a quick review and have already a

[jira] [Updated] (TIKA-931) Tika's PDFParser fails to parse documents embedded in a PDF Package

2012-05-22 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting updated TIKA-931: --- Fix Version/s: 1.2 Assignee: Jukka Zitting I copied the changes to Tika in revision 1341463.

[jira] [Moved] (TIKA-931) Tika's PDFParser fails to parse documents embedded in a PDF Package

2012-05-22 Thread Jukka Zitting (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting moved PDFBOX-1303 to TIKA-931: Component/s: (was: Text extraction) parser Fix Ve