Hi!

Yes, it is possible, 
http://nutch.apache.org/apidocs-1.2/org/apache/nutch/parse/HTMLMetaTags.html
HTMLMetaTags  (Nutch1.2 API) allows you to get data from <head> section.
You can write new Nutch plugin and write your own 
http://nutch.apache.org/apidocs-1.2/org/apache/nutch/parse/HtmlParseFilter.html
HtmlParseFilter  implementation - this interface provides access into
HTMLMetaTags object.

You can send crawled data into Solr using this command:
bin/nutch solrindex http://127.0.0.1:8983/solr/ crawl/crawldb crawl/linkdb
crawl/segments/*

-----
Regards,
Jotta

PS. Sorry for my English :)
--
View this message in context: 
http://lucene.472066.n3.nabble.com/Question-regarding-meta-tags-tp3471871p3473108.html
Sent from the Nutch - User mailing list archive at Nabble.com.

Reply via email to