Hi Jörg,

Thanks for your email, comments below:

On Apr 26, 2012, at 3:35 AM, Joerg Ehrlich wrote:

> Hi Chris,
> 
> Those are all valid points and I agree that you could do everything with a 
> Hashmap. 
> Having the parsers fill the Metadata class and its Hashmap with all needed 
> information which is then consumed by an XMP component sitting on top of 
> Tika-Core is definitely an interesting solution which would keep Tika-Core 
> clean of any dependencies and give the ability to introduce new XMP related 
> APIs in a least intrusive way.
> But from my point of view it is also about how much time and effort you would 
> like to spend implementing and testing code in the Metadata class when you 
> have something tested and stable that is already available for exactly that 
> purpose. 

Well I think our Metadata object is fairly well tested and implemented atm, so 
I'm not sure what
extra time and effort we're talking about here? The only extra time and effort 
I see is in adding
this XMP extension to it.

> Another thought that just comes to my mind is that a lot of file formats 
> already use XMP as one or even the only metadata container and you would then 
> end up filling the metadata map with the data from the file's XMP and 
> converting it back to XMP later on, compared to just being able to parse it 
> as is and having most of the metadata available right away. 

Yep in tika-xmp (new module) this might be less efficient, but it will maintain 
a lot of familiarity with folks who
are used to maintaining the existing Metadata object internals and models/etc.

Anyways, feel free to push forward, I am just letting you know I am against 
changing the
internals of the Metadata model, at least at the moment :) At the same time 
your enthusiasm
is great and all I can say is you are doing great and push forward and we'll 
see where we 
get...

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: chris.a.mattm...@nasa.gov
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Reply via email to