Hello,

The Creative Commons plugin looks for licenses this way:
// 1st choice: subject in RDF
// 2nd: anchor w/ rel=license
// 3rd: anchor w/ CC license
see CCParserFilter.java in "src/plugin/creativecommons/src/java/org/creativecommons"

and stores the attributes "nc, nd, sa" in the index in the field "cc".

The Creative Commons plugin is available in Nutch (0.9 to 1.1 and so on), to enable it just add "creativecommons" in conf/nutch-site.xml like this:

<property>
<name>plugin.includes</name>
<value>
      myplugins|protocol-http|urlfilter-regex|parse-(text|html|zip|swf\
      |js)|index-(basic|anchor)|query-(basic|site|url)|\
      response-(json|xml)|summary-basic|scoring-opic\
      |urlnormalizer-(pass|regex|basic)|creativecommons
</value>
</property>

Hope this helps,
André Ricardo


On 10/08/18 20:51, Israel wrote:
Hello n, I was reading about the creative commons ccNutch plugin ..... my
question is that if this is already installed in the plugin creative commons
of nutch ?.... or if q ccNutch download the plugin .... and if the process
being performed is transparaente user, I read that works with RDF?

Thank you very much

Reply via email to