I fixed it. The Nutch source ships with an outdated nekohtml.jar. I tried many NekoHTML versions by trial and error until this one worked for me:
nekohtml-1.9.12.tar.gz

mitch

--
View this message in context: http://lucene.472066.n3.nabble.com/Parsing-error-java-lang-NoClassDefFoundError-org-cyberneko-html-LostText-tp4029958p4038809.html
Sent from the Nutch - User mailing list archive at Nabble.com.

