Hi,

I have implemented this custom Analyzer:

public class DisambiguatorAnalyzer extends Analyzer {

   Version version = Version.LUCENE_46;
   List<String> stopWordList;

   public DisambiguatorAnalyzer(List<String> stopWordList) throws IOException {
      super();
      this.stopWordList = stopWordList;
   }

   @Override
   protected TokenStreamComponents createComponents(String fieldName,
         Reader reader) {
      Tokenizer source = new WhitespaceTokenizer(version, reader);
      // Flag constants are defined on WordDelimiterFilter.
      int flags = WordDelimiterFilter.GENERATE_WORD_PARTS
            | WordDelimiterFilter.CATENATE_WORDS;
      TokenStream filter = new WordDelimiterFilter(source,
            WordDelimiterIterator.DEFAULT_WORD_DELIM_TABLE, flags, null);
      filter = new StopFilter(version, filter,
            StopFilter.makeStopSet(version, stopWordList));
      filter = new TurkishLowerCaseFilter(filter);
      return new TokenStreamComponents(source, filter);
   }
}

However, it preserves the original tokens and does not remove stop words.
What might be wrong?
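
One thing that may be worth checking is the filter order: in the analyzer
above, the stop filter runs before the lowercase filter, and as far as I know
the two-argument StopFilter.makeStopSet builds a case-sensitive set. The
sketch below is plain Java, not Lucene; the class name, method names, and
sample tokens are all made up for illustration. It only shows how a
capitalized token can slip past a lowercase stop list when filtering happens
first, and it says nothing about the preserved originals.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.HashSet;
import java.util.List;
import java.util.Locale;
import java.util.Set;

// Hypothetical stand-in for a filter chain; not part of Lucene.
class FilterOrderSketch {

    // Same order as the analyzer above: case-sensitive stop removal, then lowercasing.
    static List<String> stopThenLower(List<String> tokens, Set<String> stopWords, Locale locale) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            if (!stopWords.contains(token)) {
                out.add(token.toLowerCase(locale));
            }
        }
        return out;
    }

    // Reversed order: lowercase first, then compare against the stop list.
    static List<String> lowerThenStop(List<String> tokens, Set<String> stopWords, Locale locale) {
        List<String> out = new ArrayList<>();
        for (String token : tokens) {
            String lower = token.toLowerCase(locale);
            if (!stopWords.contains(lower)) {
                out.add(lower);
            }
        }
        return out;
    }

    public static void main(String[] args) {
        Locale turkish = Locale.forLanguageTag("tr-TR");
        // Made-up sample data: a lowercase stop list and mixed-case tokens.
        Set<String> stopWords = new HashSet<>(Arrays.asList("ve", "bir"));
        List<String> tokens = Arrays.asList("Ve", "kitap", "Bir", "kalem");

        System.out.println(stopThenLower(tokens, stopWords, turkish)); // [ve, kitap, bir, kalem]
        System.out.println(lowerThenStop(tokens, stopWords, turkish)); // [kitap, kalem]
    }
}
```

If this is what is happening, either moving the lowercase filter ahead of the
stop filter or building the stop set with case-insensitive matching would be
the things to try.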

Thanks,
Furkan KAMACI
