Hello,

I'm using Nutch 2.2.1 with parse-plugins.xml configured to handle text/html
with parse-html that is org.apache.nutch.parse.html.HtmlParser and unless
i'm adding the full path to the nekohtml-0.9.5.jar in the plugins directory
to the classpath in the nutch.sh script i'm getting the error
"NoClassDefFoundError: org/cyberneko/html/parsers/DOMFragmentParser" when
the DOMFragmentParser instance is created at the HtmlParser.parseNeko
method.

I was wondering if there is a way to avoid the above error without editing
the nutch script, or perhaps what is the correct way to enable nutch to
find and load its plugin jars?

I already added the 'plugin.folders' property to the nutch-site.xml file
with the path to the plugins dir but it didn't help.

Reply via email to