Hey Jeff, Check out the urlmeta plugin. You can inject metadata in with your seed list and propagate it to outlinks.
On Thu, Apr 2, 2015 at 10:09 AM, Jeff Cocking <[email protected]> wrote: > Environment: Nutch 1.9, Solr 5.0 > > I am trying to define a group (category) of websites. Each website will > have assigned group (1 to many). The assignment is known before the > creation of seed.txt file. All pages within the website should inherit the > assigned group(s). The assigned group(s) need to be passed to Solr for > faceted search. > > For example: > www.site1.com group1, group2 group3 > All pages within www.site1.com inherit group1, group2, group3 > > www.site2.com group2, group4, group5 > All pages within www.site2.com inherit group2, group4, group5 > > Thoughts on ways to accomplish this? > > Thank you in advance. > > jeff > -- Jonathan Cooper-Ellis Field Enablement Engineer <http://www.cloudera.com>

