Hello,

This is my first question to this list, and I equally new to Nutch. 
Sorry if this question might be too general. I'd be happy with good
links to documentation, if there are any. Google won't help me find
them.

I need to understand how facets can be extracted from a web site crawled
by Nutch then indexed by Solr. On the web site, pages have meta tags,
like <meta name="price" content="123.45"/> or <meta name="categories"
content="category1, category2"/>. Can I tell Nutch to extract those and
Solr to treat them as facets?

In the example above, I want to specify manually that the meta name
"categories" is to be treated as a facet, but the content should be
dynamically used as categories.

Does it make sense? Is it possible to do with Nutch and Solr, or should
I rethink my way of using it?

Best Regards,

Johan


Reply via email to