Re: A plan to improve the metadata property definitions

2012-05-16 Thread Nick Burch
On Thu, 17 May 2012, Mattmann, Chris A (388J) wrote:
> Thanks Nick, +1. I'll try and follow and see if I can help in places.
We've tried to keep all the issues and commits nice and small, so they're easy to review, but we did end up on an epic 10 hour coding spree today so apologies if it made

Re: A plan to improve the metadata property definitions

2012-05-16 Thread Mattmann, Chris A (388J)
Thanks Nick, +1. I'll try and follow and see if I can help in places.
Cheers,
Chris
On May 16, 2012, at 5:50 AM, Nick Burch wrote:
> Hi All
>
> I've just been brainstorming with Ray Gauss, and we think we've come up with
> a way to move towards cleaner and clearer metadata property definition

[jira] [Commented] (TIKA-926) Data Typed Metadata.set(...) Value Methods Should Call Metadata.set(Property...)

2012-05-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13277250#comment-13277250 ]
Nick Burch commented on TIKA-926:
Thanks, applied in r1339418.
> Data Type

[jira] [Updated] (TIKA-926) Data Typed Metadata.set(...) Value Methods Should Call Metadata.set(Property...)

2012-05-16 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ray Gauss II updated TIKA-926:
Attachment: tika-add-by-property.diff
Changes to allow for adding by Property and setting an array of val

[jira] [Commented] (TIKA-928) Separation of Tika Core Properties From Metadata Processing

2012-05-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-928?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13277215#comment-13277215 ]
Nick Burch commented on TIKA-928:
Thanks, applied (with a few extra JavaDoc bits) in r13394

[jira] [Updated] (TIKA-928) Separation of Tika Core Properties From Metadata Processing

2012-05-16 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ray Gauss II updated TIKA-928:
Attachment: tika-core-properties.diff
Apply to tika-core.
> Separation of Tika Core Prop

[jira] [Resolved] (TIKA-916) NullPointerException processing XPS file

2012-05-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-916?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nick Burch resolved TIKA-916.
Resolution: Fixed
Fix Version/s: 1.2
Should be fixed (with a unit test) in r1339390.

[jira] [Created] (TIKA-928) Separation of Tika Core Properties From Metadata Processing

2012-05-16 Thread Ray Gauss II (JIRA)
Ray Gauss II created TIKA-928:
Summary: Separation of Tika Core Properties From Metadata Processing
Key: TIKA-928
URL: https://issues.apache.org/jira/browse/TIKA-928
Project: Tika
Issue Type: I

[jira] [Resolved] (TIKA-360) Outstanding Improvements to Number/Date Formatting in ExcelParser

2012-05-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-360?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nick Burch resolved TIKA-360.
Resolution: Fixed
Fix Version/s: 1.1
I believe this was solved in Tika 1.1. If we identify any other

[jira] [Commented] (TIKA-925) Remove DublinCore From Metadata and Deprecate String Properties

2012-05-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13277177#comment-13277177 ]
Nick Burch commented on TIKA-925:
Once the changes associated with this are done, I think w

[jira] [Resolved] (TIKA-830) Tika.parseToString() causes ForkParser to try to serialize itself

2012-05-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-830?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nick Burch resolved TIKA-830.
Resolution: Fixed
Fix Version/s: 1.1
I believe this was fixed in Tika 1.1 so I'm marking it as fixed

[jira] [Resolved] (TIKA-927) Composite Properties

2012-05-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nick Burch resolved TIKA-927.
Resolution: Fixed
Fix Version/s: 1.2
> Composite Properties

[jira] [Commented] (TIKA-927) Composite Properties

2012-05-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-927?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13277174#comment-13277174 ]
Nick Burch commented on TIKA-927:
Thanks Ray, applied in r1339380.
> Compo

[jira] [Updated] (TIKA-927) Composite Properties

2012-05-16 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ray Gauss II updated TIKA-927:
Attachment: tika-composite-properties-core.diff
Apply to tika-core
> Composite Propertie

[jira] [Created] (TIKA-927) Composite Properties

2012-05-16 Thread Ray Gauss II (JIRA)
Ray Gauss II created TIKA-927:
Summary: Composite Properties
Key: TIKA-927
URL: https://issues.apache.org/jira/browse/TIKA-927
Project: Tika
Issue Type: Improvement
Components: metadat

[jira] [Commented] (TIKA-926) Data Typed Metadata.set(...) Value Methods Should Call Metadata.set(Property...)

2012-05-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-926?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13277101#comment-13277101 ]
Nick Burch commented on TIKA-926:
Thanks, good spot! Patch applied in r1339351.

[jira] [Resolved] (TIKA-926) Data Typed Metadata.set(...) Value Methods Should Call Metadata.set(Property...)

2012-05-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Nick Burch resolved TIKA-926.
Resolution: Fixed
Fix Version/s: 1.2
> Data Typed Metadata.set(...) Value Methods Should Call
>

[jira] [Created] (TIKA-926) Data Typed Metadata.set(...) Value Methods Should Call Metadata.set(Property...)

2012-05-16 Thread Ray Gauss II (JIRA)
Ray Gauss II created TIKA-926:
Summary: Data Typed Metadata.set(...) Value Methods Should Call Metadata.set(Property...)
Key: TIKA-926
URL: https://issues.apache.org/jira/browse/TIKA-926
Project: Tika

[jira] [Updated] (TIKA-926) Data Typed Metadata.set(...) Value Methods Should Call Metadata.set(Property...)

2012-05-16 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ray Gauss II updated TIKA-926:
Affects Version/s: 1.1
> Data Typed Metadata.set(...) Value Methods Should Call
> Metadata.set(Prope

[jira] [Updated] (TIKA-926) Data Typed Metadata.set(...) Value Methods Should Call Metadata.set(Property...)

2012-05-16 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-926?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ray Gauss II updated TIKA-926:
Attachment: tika-metadata-set-core.diff
> Data Typed Metadata.set(...) Value Methods Should Call
> M

[jira] [Updated] (TIKA-904) Pages documents created in Layout mode not supported

2012-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michael McCandless updated TIKA-904:
Attachment: TIKA-904.patch
Patch w/ test case & fix... it looks like we also have to output c

[jira] [Assigned] (TIKA-904) Pages documents created in Layout mode not supported

2012-05-16 Thread Michael McCandless (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-904?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Michael McCandless reassigned TIKA-904:
Assignee: Michael McCandless
> Pages documents created in Layout mode not supported

[jira] [Commented] (TIKA-925) Remove DublinCore From Metadata and Deprecate String Properties

2012-05-16 Thread Nick Burch (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13276909#comment-13276909 ]
Nick Burch commented on TIKA-925:
Thanks for this Ray, looks great, committed in r1339276.

[jira] [Updated] (TIKA-925) Remove DublinCore From Metadata and Deprecate String Properties

2012-05-16 Thread Ray Gauss II (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Ray Gauss II updated TIKA-925:
Attachment: tika-dublincore-changes-parsers.diff
            tika-dublincore-changes-core.diff
>

[jira] [Created] (TIKA-925) Remove DublinCore From Metadata and Deprecate String Properties

2012-05-16 Thread Ray Gauss II (JIRA)
Ray Gauss II created TIKA-925:
Summary: Remove DublinCore From Metadata and Deprecate String Properties
Key: TIKA-925
URL: https://issues.apache.org/jira/browse/TIKA-925
Project: Tika
Issue Typ

A plan to improve the metadata property definitions

2012-05-16 Thread Nick Burch
Hi All
I've just been brainstorming with Ray Gauss, and we think we've come up with a way to move towards cleaner and clearer metadata property definitions (prefixes, properties with types etc), whilst maintaining backwards compatibility and avoiding too much work for parsers during the migra