Chris Schneider wrote:
Gang,
I had a webmaster complain that our crawler was following his form action links.
Although he admits that his use of the GET method is a bit unorthodox, he feels strongly
that form submissions with input fields shouldn't be followed by crawlers. Would it make
Meta-data per URL/site/section
--
Key: NUTCH-271
URL: http://issues.apache.org/jira/browse/NUTCH-271
Project: Nutch
Type: New Feature
Versions: 0.7.2
Reporter: Stefan Neufeind
We have the need to index sites and attach
[
http://issues.apache.org/jira/browse/NUTCH-271?page=comments#action_12412435 ]
Gal Nitzan commented on NUTCH-271:
--
This functionality is already available in Nutch-0.8
Meta-data per URL/site/section
--
Key:
[
http://issues.apache.org/jira/browse/NUTCH-271?page=comments#action_12412436 ]
Gal Nitzan commented on NUTCH-271:
--
Sorry for the short comment.
Actually the meta tags functionality is already available in the 0.8 version
along with a CrawlDatum object.