Hi Joe,

Looks like a bad file path reference to crawler-config.xml within the 
crawler_launcher script. You should have “/usr” instead of “usr” in the path at 
the very least.

Are you using 0.8-SNAPSHOT version of OODT? Are you using RADiX or just the 
Crawler component downloaded separately?

Can you try the following:
1. Open up crawler_launcher script, and find the line that has the string 
“crawler-config.xml”
2. Check to see if the path looks good (i.e. does it have a “file:” prefix like 
the other XML config files listed below it?) or even try printing the path to 
see what it resolves to.

By the way, I’m CCing your question to the dev mailing list in case others 
could help. Feel free to CC the mailing list on general questions in the future 
too!

Thanks,
Rishi

On Nov 19, 2014, at 4:42 PM, Roberts, Joe T (398H) 
<[email protected]<mailto:[email protected]>> wrote:

Hi Rishi,

I've finally gotten around to configuring Crawler for the first time on 
cmsun-dev (new dev instance of cmsun).  I'm trying to test out a simple ingest 
but am seeing this exception:

[cms@cmsun-dev bin]$ pwd
/usr/local/cms/deploy/crawler/bin

[cms@cmsun-dev bin]$ ./crawler_launcher -op --launchStdCrawler --productPath 
/usr/local/cms/test/ --metFileExtension met --filemgrUrl http://localhost:9000
Setting property 'StdProductCrawler.metFileExtension'
Setting property 'DeleteMetadataFile.fileExtension'
Setting property 'MoveMetadataFileToBackupDir.fileExtension'
Setting property 'MoveMetadataFileToFailureDir.fileExtension'
Setting property 'StdProductCrawler.productPath'
Setting property 'MetExtractorProductCrawler.productPath'
Setting property 'AutoDetectProductCrawler.productPath'
Setting property 'StdProductCrawler.filemgrUrl'
Setting property 'MetExtractorProductCrawler.filemgrUrl'
Setting property 'AutoDetectProductCrawler.filemgrUrl'
ERROR: IOException parsing XML document from file 
[/data/local/cms/deploy/crawler/bin/usr/local/cms/deploy/crawler/policy/crawler-config.xml];
 nested exception is java.io.FileNotFoundException: 
usr/local/cms/deploy/crawler/policy/crawler-config.xml (No such file or 
directory)


It looks like it can't find the proper location of crawler-config.xml.  I can't 
seem to figure out where it is configured to look—any suggestions?

Thanks,
Joe

---
Rishi Verma
NASA Jet Propulsion Laboratory
California Institute of Technology

Reply via email to