Hi Zichen,

Thanks for your mail. If you use MetExtractorProductCrawler, met
is generated, but it¹s never serialized to disk. I think that explains
it. Let me know if that makes sense.

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: [email protected]
WWW:  http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++






-----Original Message-----
From: Zichen Nie <[email protected]>
Date: Saturday, November 1, 2014 at 4:47 PM
To: Chris Mattmann <[email protected]>
Subject: TikaCmdLineMetExtractor does not generate .met file

>Dear Professor:
>
>I followed the instruction on how to use OODT cas-crawler, and tried to
>generate .met file using TikaCmdLineExtractor.
>I can see from the log that Tika is extracting my metadata but it does
>not generate .met file for my json file.
>
>Here is my command line:
>
>
>./crawler_launcher --operation --launchMetCrawler -filemgrUrl
>http://localhost:9000 <http://localhost:9000> --clientTransferer
>org.apache.oodt.cas.filemgr.datatransfer.LocalDataTransferFactory
>--productPath 
>/Users/threeears/Documents/572/Assignment2/oodt-deploy/cas-crawler-0.7/dat
>a/test/0.json --metExtractor
>org.apache.oodt.cas.metadata.extractors.TikaCmdLineMetExtractor
> --metExtractorConfig
>/Users/threeears/Documents/572/Assignment2/oodt-deploy/cas-crawler-0.7/ext
>ractors/tikaextractor/tikaextractor.config --metFileExtension met
>
>
>
>I thought MetCrawler should generate meta file before ingestion, it's
>weird that my ingestion is successful and met file is not shown.  Am I
>using the right extractor and crawler? Are there any necessary
>configurations that I missed?
>
>
>Best,
>Zichen
>

Reply via email to