Re: Wildcards / Binary searches

Frédéric Glorieux Sat, 09 Jun 2007 07:41:54 -0700

Hi Chris,

The skills on this list are really very stimulating. I'm sad but I willprobably not be able to contribute. Solr may not be the choosentechnology of the project I'm working on, because of serveradministration issues (java). I know that there is no performancesarguments (lucene is incredible, and solr is nicely close to it), butthat's real life. So I will not find time for the idea below.


> : project, definitively not a good practice for portability of indexes. A
> : duplicate field with an analyser to produce a sortable ASCII version
> : would be better.
>
> exactly ... I think conceptually the methodology for solving the problem
> is very similar to the way the SpellChecker contrib works: use a very
> custom index designed for the application (not just look at the terms in
> the main corpus) and custom logic for using that index.

It could be a useful request handler ? Giving a field, with adisplayable stored value, and a sortable indexed one, you need theanalyser to parse the user entry, build a term with it, and get veryfastly a pointer to the internal lucene index, exactly at the bestplace, for w, wo, wor or word. From the iterator you can display asuggest list, it's also possible to get one or more docs directlyattached, for example to display a count. It seems interesting forthings like, a topic or an author of a doc ?

: Do you mean something like below ?
: <field name="autocomplete">w wo wor word</field>

yeah, but there are some Tokenizers that make this trivial
(EdgeNGramTokenizer i think is the name)





--
Frédéric Glorieux
École nationale des chartes
direction des nouvelles technologies et de l'informatique

Re: Wildcards / Binary searches

Reply via email to