Hi Zichen,

Thanks for your mail. If you use MetExtractorProductCrawler, met is generated, but it's never serialized to disk. I think that explains it. Let me know if that makes sense.

Cheers,
Chris

++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Chris Mattmann, Ph.D.
Chief Architect
Instrument Software and Science Data Systems Section (398)
NASA Jet Propulsion Laboratory Pasadena, CA 91109 USA
Office: 168-519, Mailstop: 168-527
Email: [email protected]
WWW: http://sunset.usc.edu/~mattmann/
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
Adjunct Associate Professor, Computer Science Department
University of Southern California, Los Angeles, CA 90089 USA
++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++++

-----Original Message-----
From: Zichen Nie <[email protected]>
Date: Saturday, November 1, 2014 at 4:47 PM
To: Chris Mattmann <[email protected]>
Subject: TikaCmdLineMetExtractor does not generate .met file

>Dear Professor:
>
>I followed the instruction on how to use OODT cas-crawler, and tried to
>generate .met file using TikaCmdLineExtractor.
>I can see from the log that Tika is extracting my metadata but it does
>not generate .met file for my json file.
>
>Here is my command line:
>
>./crawler_launcher --operation --launchMetCrawler -filemgrUrl
>http://localhost:9000 --clientTransferer
>org.apache.oodt.cas.filemgr.datatransfer.LocalDataTransferFactory
>--productPath
>/Users/threeears/Documents/572/Assignment2/oodt-deploy/cas-crawler-0.7/dat
>a/test/0.json --metExtractor
>org.apache.oodt.cas.metadata.extractors.TikaCmdLineMetExtractor
> --metExtractorConfig
>/Users/threeears/Documents/572/Assignment2/oodt-deploy/cas-crawler-0.7/ext
>ractors/tikaextractor/tikaextractor.config --metFileExtension met
>
>I thought MetCrawler should generate meta file before ingestion, it's
>weird that my ingestion is successful and met file is not shown. Am I
>using the right extractor and crawler? Are there any necessary
>configurations that I missed?
>
>Best,
>Zichen
