On 28-Aug-07, at 1:08 PM, Wagner,Harry wrote:

There is a version of the KStem stemmer
(http://ciir.cs.umass.edu/cgi-bin/downloads/downloads.cgi) that has been adapted for Lucene. What would be the simplest way to implement this in
Solr?  As a plug-in?  Has anyone already done this?

You should be able to drop it in lib/ and use it via something like this in schema.xml:

    <!-- One can also specify an existing Analyzer class that has a
default constructor via the class attribute on the analyzer element
    <fieldType name="text_greek" class="solr.TextField">
      <analyzer class="org.apache.lucene.analysis.el.GreekAnalyzer"/>
    </fieldType>
    -->

If it is a tokenfilter (rather than an Analyzer), you can write a little wrapper Factory class (see examples in org/apache/solr/ analysis, then use it as follows:

<fieldType name="text_ws" class="solr.TextField" positionIncrementGap="100">
      <analyzer>
        <tokenizer class="solr.WhitespaceTokenizerFactory"/>
      </analyzer>
    </fieldType>

best,
-Mike

Reply via email to