Hi all,
I'm using NutchWax (Version 0.7.0-200611082313) and Wera (Version
0.5.0-200611082313) to Index a collection of ARC files generated by a web
crawl using the Heritrix web crawler (Version 1.4.0).
When I check the metadata tag on the wera front-end the following list of
tags are displayed
Hi all,
I'm new to the list so not sure if you even discuss extensions to Nutch or
if the list is exclusively for discussions on Nutch itself.
Have any of you ever used NutchWax? I'm attempting to use NutchWax to index
a number of .arc files generated by a web crawl. I can get the indexing step