RE: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Joerg Ehrlich
Hi Chris, Those are all valid points and I agree that you could do everything with a Hashmap. Having the parsers fill the Metadata class and its Hashmap with all needed information which is then consumed by an XMP component sitting on top of Tika-Core is definitely an interesting solution

Re: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Mattmann, Chris A (388J)
Hi Jörg, Thanks for your email, comments below: On Apr 26, 2012, at 3:35 AM, Joerg Ehrlich wrote: Hi Chris, Those are all valid points and I agree that you could do everything with a Hashmap. Having the parsers fill the Metadata class and its Hashmap with all needed information which

Re: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Ray Gauss II
I think that, besides the namespaces, one of the issues Jörg is trying to tackle is structured metadata, and the extra time and effort referred to is dealing with serialization of structured data to and from a hashmap. For example, I may have metadata similar to: Contact1 |-- First Name |-- Last
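
To make the serialization concern above concrete, here is a minimal sketch of flattening nested metadata into a flat string map and rebuilding it. Everything in it is an assumption for illustration: the class StructuredMetadataSketch, the dotted-key convention and the sample value are hypothetical, and none of it is Tika's Metadata API or the approach proposed on the wiki.

```java
import java.util.LinkedHashMap;
import java.util.Map;

// Hypothetical helper, not part of Tika: shows one possible way to move
// structured metadata in and out of a flat key/value map.
class StructuredMetadataSketch {

    // Flatten nested maps into a single map, joining path segments with '.'.
    static Map<String, String> flatten(Map<String, Object> nested) {
        Map<String, String> flat = new LinkedHashMap<>();
        flattenInto("", nested, flat);
        return flat;
    }

    private static void flattenInto(String prefix, Map<String, Object> node,
                                    Map<String, String> out) {
        for (Map.Entry<String, Object> e : node.entrySet()) {
            String key = prefix.isEmpty() ? e.getKey() : prefix + "." + e.getKey();
            Object value = e.getValue();
            if (value instanceof Map) {
                @SuppressWarnings("unchecked")
                Map<String, Object> child = (Map<String, Object>) value;
                flattenInto(key, child, out);
            } else {
                out.put(key, String.valueOf(value));
            }
        }
    }

    // Rebuild the nested structure by splitting each key on '.'.
    // Assumes no field name contains '.' and no key is both a leaf and a prefix.
    static Map<String, Object> unflatten(Map<String, String> flat) {
        Map<String, Object> root = new LinkedHashMap<>();
        for (Map.Entry<String, String> e : flat.entrySet()) {
            String[] parts = e.getKey().split("\\.");
            Map<String, Object> node = root;
            for (int i = 0; i < parts.length - 1; i++) {
                @SuppressWarnings("unchecked")
                Map<String, Object> child = (Map<String, Object>) node.computeIfAbsent(
                        parts[i], k -> new LinkedHashMap<String, Object>());
                node = child;
            }
            node.put(parts[parts.length - 1], e.getValue());
        }
        return root;
    }
}
```

With this convention, a Contact1 entry holding a First Name field would be stored under the single key "Contact1.First Name". The caveats in the comments (reserved separator, leaf/prefix collisions, everything coerced to strings) are the kind of extra effort the message refers to.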

Re: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Mattmann, Chris A (388J)
Hi Guys, One comment RE: the below too -- this is precisely where I see Any23 coming into play and why there is a strong relationship between it and Tika: http://incubator.apache.org/any23/ I'm the current Champion for the project and the Tika PMC is sponsoring the podling. Please check it out

Re: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Antoni Mylka
2012/04/25 Joerg Ehrlich wrote: Hi, I have put a proposal of a roadmap for the metadata features in Tika on the wiki: http://wiki.apache.org/tika/MetadataRoadmap The proposal is based on a discussion around this topic I have had with Jukka. Please review and feel free to edit the wiki

Re: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Antoni Mylka
2012/04/26 Mattmann, Chris A (388J) wrote: Hi Guys, One comment RE: the below too -- this is precisely where I see Any23 coming into play and why there is a strong relationship between it and Tika: http://incubator.apache.org/any23/ I'm the current Champion for the project and the

[jira] [Resolved] (TIKA-901) Provide version number in tika-server

2012-04-26 Thread Chris A. Mattmann (JIRA)
[ https://issues.apache.org/jira/browse/TIKA-901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Chris A. Mattmann resolved TIKA-901. Resolution: Fixed. Committed in r1331147, thanks Ingo, YOU ROCK! Provide

Re: [metadata] roadmap proposal available on the wiki

2012-04-26 Thread Mattmann, Chris A (388J)
Hi Antoni, Precisely! :) That would be awesome huh. And, my goal there too is to turn Any23 parsers into Tika parsers too as I think they could be one and the same (with an RDF or XMP or RSS ContentHandler transforming the Tika intermediate SAX output the same). Cheers, Chris On Apr 26, 2012,

Build failed in Jenkins: Tika-trunk #837

2012-04-26 Thread Apache Jenkins Server
See https://builds.apache.org/job/Tika-trunk/837/changes Changes: [mattmann] - apply patch from TIKA-901: Provide version number in tika-server contributed by Ingo Renner -- [...truncated 2255 lines...] [WARNING] We have a duplicate