Hi Luca,

Thanks for your questions! Answers inline below:

>        I have a question about how to use the metadata extracted by an 
> implementation of "FilemgrMetExtractor" when versioning a file that is 
> ingested by the File Manager.
> 
> I have configured my File Manager to run a custom metadata extractor 
> ("NetCDFMetExtractor") and a custom versioner ("DRSVersioner"). I am running 
> the filemgr-client tool to ingest a netcdf file, please see the log below.
> 
> o First question: why is the versioner run twice ? It seems like the first 
> time it is run, it has access to all the metadata that has been previously 
> extracted by the NetCDFMetExtractor, but the second time it doesn't ?


What does your NetCDFMetExtractor do? Does it call the DRSVersioner? How are 
you wiring the 2 together? I see from your command line below you are attaching 
these to the GenericFile product type? How did you set that up? Can I see an 
example of your policy files? That will help to diagnose what you're seeing.

> 
> o Second question: what is the relation between the metadata extracted by the 
> NetCDFMetExtractor and the argument to the --metadataFile option ? Is there 
> any way to serialize the output of the metadata extractor to a file that is 
> then ingested by the file manager ?

The policy files will help with this. My guess is that you made the 
NetCDFExtractor a *server side* met extractor. You have the ability to do 
*client side* or *server side* extraction. From the FM client side, that's the 
extracted client side metadata, pre-baked in (via the --metadataFile param), or 
generated on the fly from a 
o.a.oodt.cas.metadata.extractors.CmdLineMetExtractor implementation, and piped 
in via the StdIngester, or via the crawler (AutoDetect, or MetExtractor).

Server side met is *derived after* the original client side met is sent along 
during the process of ingestion.

HTH,
CHris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Senior Computer Scientist
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 171-266B, Mailstop: 171-246
Email: [email protected]
WWW:   http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Assistant Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

Reply via email to