: : Nutch has phrase pre-filtering which helps with this. It indexes the
: : phrase fragments as separate terms and uses that set of matches to
: : filter the set of matching documents.
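To make the pre-filtering idea above concrete, here is a minimal sketch in plain Java. It is not Nutch's actual code: the class name PhrasePrefilterSketch, the underscore-joined term format, and the choice to index every adjacent word pair (rather than only pairs involving common words) are all assumptions made for illustration. Each two-word fragment is indexed as its own term, and a phrase query intersects the posting sets of its fragments to get a candidate set.

```java
import java.util.ArrayList;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical illustration only; not taken from Nutch or Solr.
class PhrasePrefilterSketch {
    // Fragment term (e.g. "the_quick") -> ids of documents containing it.
    private final Map<String, Set<Integer>> postings = new HashMap<>();

    // Split text on whitespace and emit every adjacent word pair as one term.
    static List<String> fragments(String text) {
        String[] words = text.toLowerCase().trim().split("\\s+");
        List<String> grams = new ArrayList<>();
        for (int i = 0; i + 1 < words.length; i++) {
            grams.add(words[i] + "_" + words[i + 1]);
        }
        return grams;
    }

    // Index a document under each of its fragment terms.
    void add(int docId, String text) {
        for (String gram : fragments(text)) {
            postings.computeIfAbsent(gram, k -> new TreeSet<>()).add(docId);
        }
    }

    // Documents containing every fragment of the phrase. This is only a
    // candidate set: a real phrase match still needs a position check.
    Set<Integer> candidates(String phrase) {
        Set<Integer> result = null;
        for (String gram : fragments(phrase)) {
            Set<Integer> docs = postings.getOrDefault(gram, Collections.emptySet());
            if (result == null) {
                result = new TreeSet<>(docs);
            } else {
                result.retainAll(docs);
            }
        }
        // A one-word "phrase" has no fragments, so it yields no candidates here.
        return result == null ? Collections.emptySet() : result;
    }

    public static void main(String[] args) {
        PhrasePrefilterSketch index = new PhrasePrefilterSketch();
        index.add(1, "the quick fox jumps");
        index.add(2, "the quick dog saw a quick fox");
        index.add(3, "a slow fox");
        System.out.println(index.candidates("the quick fox"));
    }
}
```

Document 2 ends up in the candidate set even though it never contains the exact phrase, because it contains both fragments separately. That is why this works as a filter in front of the normal phrase match and not as a replacement for it.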
: That reminds me ... I seem to remember someone saying once that Nutch also
: builds word based n-grams
On Nov 14, 2006, at 2:00 PM, Chris Hostetter wrote:
CommonGrams itself seems to have some other dependencies on nutch because
of other utilities in the same class, but based on a quick skim, what you
really want is the nested private static class Filter extends TokenFilter
which doesn't
Hey folks, I just wanted to let you all know about a new feature just
committed yesterday (now available in the solr-2006-11-15 nightly build).
While Solr has always had some really handy hooks for loading your own
code to do analysis, request handlers, output writers, field types, cache