> I've gotten a couple of questions offlist about stemming > so I thought I'd just post here with my changes. Sorry that > some of the changes are in the main code and not in a plugin. It > seemed that it's more efficient to put in the main analyzer. It > would be nice if later releases could add support for plugging > in a custom stemmer/analyzer.
I recently send a proposal on the Nutch Wiki for Multi-Lingual support in Nutch (add the ability to add language specific analyzers for both querying and analyzing). This document is available at http://wiki.apache.org/nutch/MultiLingualSupport It seems that your solution is very similar to mine (except mine uses the plugin framework). Could you please review my proposal regarding your experience. I will begin implementation in a few days (and will keep in mind the piece of code you send). Regards Jerome -- http://motrech.free.fr/ http://frutch.free.fr/
