Re: [ANNOUNCEMENT][THANKS] Apache ODF Toolkit(Incubating) 0.5-incubating Release

2012-01-16 Thread Devin Han
Thank you, Chris! Without your help we couldn't have gotten here. 2012/1/17 Mattmann, Chris A (388J) > Congrats guys! > > Cheers, > Chris > > On Jan 16, 2012, at 4:59 AM, Devin Han wrote: > > > Hi all, > > > > Thanks to all of the voters from this list. Now there is a result ;) > > > > The Apache ODF Toolkit(

[jira] [Issue Comment Edited] (TIKA-846) Ability to Parse RDF Bag Elements in XML

2012-01-16 Thread Chris A. Mattmann (Issue Comment Edited) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187126#comment-13187126 ] Chris A. Mattmann edited comment on TIKA-846 at 1/17/12 12:54 AM:

[jira] [Commented] (TIKA-842) IPTC Properties Should be Defined Completely and Independently of the Drew Library

2012-01-16 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187190#comment-13187190 ] Nick Burch commented on TIKA-842: - LEGAL-122 created for this > IPTC Proper

[jira] [Commented] (TIKA-842) IPTC Properties Should be Defined Completely and Independently of the Drew Library

2012-01-16 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187183#comment-13187183 ] Nick Burch commented on TIKA-842: - I think we'll need the OK from Apache Legal for this, I'l

[jira] [Commented] (TIKA-842) IPTC Properties Should be Defined Completely and Independently of the Drew Library

2012-01-16 Thread Ray Gauss II (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187165#comment-13187165 ] Ray Gauss II commented on TIKA-842: --- It does, provided we include the license, which I did

[jira] [Commented] (TIKA-842) IPTC Properties Should be Defined Completely and Independently of the Drew Library

2012-01-16 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-842?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187153#comment-13187153 ] Nick Burch commented on TIKA-842: - Did you manage to confirm that the IPTC Spec license allo

[jira] [Commented] (TIKA-846) Ability to Parse RDF Bag Elements in XML

2012-01-16 Thread Chris A. Mattmann (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-846?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187126#comment-13187126 ] Chris A. Mattmann commented on TIKA-846: +1 to the feature request on this issue. I

[jira] [Updated] (TIKA-846) Ability to Parse RDF Bag Elements in XML

2012-01-16 Thread Ray Gauss II (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-846?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-846: -- Attachment: bag-element-metadata-handler.diff Patch to parse RDF bag elements. > Ability

[jira] [Created] (TIKA-846) Ability to Parse RDF Bag Elements in XML

2012-01-16 Thread Ray Gauss II (Created) (JIRA)
Ability to Parse RDF Bag Elements in XML Key: TIKA-846 URL: https://issues.apache.org/jira/browse/TIKA-846 Project: Tika Issue Type: Improvement Components: parser Affects Versions: 1.0

[jira] [Commented] (TIKA-844) Ability to Define an Internal Text Bag Property

2012-01-16 Thread Ray Gauss II (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187103#comment-13187103 ] Ray Gauss II commented on TIKA-844: --- It seems to be commonly used when expressing multi-va

[jira] [Updated] (TIKA-845) Check for Existing Value in Multi-Value Fields in XML Metadata Handler

2012-01-16 Thread Ray Gauss II (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-845: -- Attachment: xml-check-multi-value-existing.diff Patch to check for existing multi-value.

[jira] [Created] (TIKA-845) Check for Existing Value in Multi-Value Fields in XML Metadata Handler

2012-01-16 Thread Ray Gauss II (Created) (JIRA)
Check for Existing Value in Multi-Value Fields in XML Metadata Handler -- Key: TIKA-845 URL: https://issues.apache.org/jira/browse/TIKA-845 Project: Tika Issue Type: Improve

[jira] [Commented] (TIKA-844) Ability to Define an Internal Text Bag Property

2012-01-16 Thread Ken Krugler (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-844?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187098#comment-13187098 ] Ken Krugler commented on TIKA-844: -- Hi Ray - could you provide more details on when/why a t

[jira] [Updated] (TIKA-844) Ability to Define an Internal Text Bag Property

2012-01-16 Thread Ray Gauss II (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-844: -- Attachment: text-bag-property-patch.diff Patch to create an internal text bag Property.

[jira] [Created] (TIKA-844) Ability to Define an Internal Text Bag Property

2012-01-16 Thread Ray Gauss II (Created) (JIRA)
Ability to Define an Internal Text Bag Property --- Key: TIKA-844 URL: https://issues.apache.org/jira/browse/TIKA-844 Project: Tika Issue Type: Improvement Components: metadata Affect

[jira] [Updated] (TIKA-843) Support for Date without a Time Component

2012-01-16 Thread Ray Gauss II (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-843: -- Attachment: date-format-patch.diff Patch to add support for parsing of dates with no time component.

[jira] [Created] (TIKA-843) Support for Date without a Time Component

2012-01-16 Thread Ray Gauss II (Created) (JIRA)
Support for Date without a Time Component - Key: TIKA-843 URL: https://issues.apache.org/jira/browse/TIKA-843 Project: Tika Issue Type: Improvement Components: metadata Affects Versions:

[jira] [Updated] (TIKA-842) IPTC Properties Should be Defined Completely and Independently of the Drew Library

2012-01-16 Thread Ray Gauss II (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ray Gauss II updated TIKA-842: -- Attachment: IPTC-metadata-def-patch.diff This metadata interface follows the order, standards, and reprod

[jira] [Created] (TIKA-842) IPTC Properties Should be Defined Completely and Independently of the Drew Library

2012-01-16 Thread Ray Gauss II (Created) (JIRA)
IPTC Properties Should be Defined Completely and Independently of the Drew Library -- Key: TIKA-842 URL: https://issues.apache.org/jira/browse/TIKA-842 Project: Tika

[jira] [Commented] (TIKA-774) ExifTool Parser

2012-01-16 Thread Ray Gauss II (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-774?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187062#comment-13187062 ] Ray Gauss II commented on TIKA-774: --- I've refactored much of this and will be splitting so

[jira] [Commented] (TIKA-86) Support magic(5) files

2012-01-16 Thread Ken Krugler (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-86?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187049#comment-13187049 ] Ken Krugler commented on TIKA-86: - For regex magic, I'd recommend compiling into FSM - e.g. u

[jira] [Resolved] (TIKA-86) Support magic(5) files

2012-01-16 Thread Jukka Zitting (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-86?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jukka Zitting resolved TIKA-86. --- Resolution: Won't Fix Agreed with the points above, so resolving as Won't Fix. Let's follow up in separ

[jira] [Commented] (TIKA-86) Support magic(5) files

2012-01-16 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-86?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187026#comment-13187026 ] Nick Burch commented on TIKA-86: RegEx magic could be interesting, with a bit of care to ensu

[jira] [Commented] (TIKA-86) Support magic(5) files

2012-01-16 Thread Andrew Jackson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-86?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13187018#comment-13187018 ] Andrew Jackson commented on TIKA-86: We've done some work in this area, and noticed that

Re: [ANNOUNCEMENT][THANKS] Apache ODF Toolkit(Incubating) 0.5-incubating Release

2012-01-16 Thread Mattmann, Chris A (388J)
Congrats guys! Cheers, Chris On Jan 16, 2012, at 4:59 AM, Devin Han wrote: > Hi all, > > Thanks to all of the voters from this list. Now there is a result ;) > > The Apache ODF Toolkit(Incubating) team is pleased to announce the release > of 0.5-incubating. This is our first Apache release. > >

[jira] [Commented] (TIKA-86) Support magic(5) files

2012-01-16 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-86?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13186948#comment-13186948 ] Nick Burch commented on TIKA-86: Turning the file magic into a Tika xml match shouldn't be to

[jira] [Commented] (TIKA-86) Support magic(5) files

2012-01-16 Thread Andrew Jackson (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-86?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13186938#comment-13186938 ] Andrew Jackson commented on TIKA-86: The file command comes with signatures for a lot mor

[ANNOUNCEMENT][THANKS] Apache ODF Toolkit(Incubating) 0.5-incubating Release

2012-01-16 Thread Devin Han
Hi all, Thanks to all of the voters from this list. Now there is a result ;) The Apache ODF Toolkit(Incubating) team is pleased to announce the release of 0.5-incubating. This is our first Apache release. The Apache ODF Toolkit is a set of Java modules that allow programmatic creation, scanning and

[jira] [Commented] (TIKA-86) Support magic(5) files

2012-01-16 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-86?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13186898#comment-13186898 ] Nick Burch commented on TIKA-86: I'm not sure if we still need this, as the Tika mimetypes fi

[jira] [Resolved] (TIKA-87) MimeTypes should allow modification of MIME types

2012-01-16 Thread Nick Burch (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-87?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-87. Resolution: Fixed Fix Version/s: 1.1 TIKA-746 provides a clean way to do this, as documented in http:/

[jira] [Commented] (TIKA-87) MimeTypes should allow modification of MIME types

2012-01-16 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-87?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13186896#comment-13186896 ] Nick Burch commented on TIKA-87: I believe this is no longer an issue, because of the recent

[jira] [Commented] (TIKA-841) User supplied parsers should be preferred

2012-01-16 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-841?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13186894#comment-13186894 ] Nick Burch commented on TIKA-841: - I would propose to fix this by adding logic similar to th

[jira] [Created] (TIKA-841) User supplied parsers should be preferred

2012-01-16 Thread Nick Burch (Created) (JIRA)
User supplied parsers should be preferred - Key: TIKA-841 URL: https://issues.apache.org/jira/browse/TIKA-841 Project: Tika Issue Type: Improvement Components: parser Affects Versions: 1.

[jira] [Resolved] (TIKA-805) improvements in XSLFPowerPointExtractorDecorator

2012-01-16 Thread Nick Burch (Resolved) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-805?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nick Burch resolved TIKA-805. - Resolution: Fixed Fix Version/s: 1.1 > improvements in XSLFPowerPointExtractorDecorator > -

[jira] [Commented] (TIKA-805) improvements in XSLFPowerPointExtractorDecorator

2012-01-16 Thread Nick Burch (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-805?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13186861#comment-13186861 ] Nick Burch commented on TIKA-805: - Thanks, applied in r1231905. > improveme