Hey Jeff,

Check out the urlmeta plugin. You can inject metadata in with your seed
list and propagate it to outlinks.

On Thu, Apr 2, 2015 at 10:09 AM, Jeff Cocking <[email protected]>
wrote:

> Environment:  Nutch 1.9, Solr 5.0
>
> I am trying to define a group (category) of websites. Each website will
> have assigned group (1 to many). The assignment is known before the
> creation of seed.txt file.  All pages within the website should inherit the
> assigned group(s). The assigned group(s) need to be passed to Solr for
> faceted search.
>
> For example:
> www.site1.com group1, group2 group3
> All pages within www.site1.com inherit group1, group2, group3
>
> www.site2.com group2, group4, group5
> All pages within www.site2.com inherit group2, group4, group5
>
> Thoughts on ways to accomplish this?
>
> Thank you in advance.
>
> jeff
>



-- 
Jonathan Cooper-Ellis
Field Enablement Engineer
<http://www.cloudera.com>

Reply via email to