On Thu, 18 May 2000, Randy Kobes wrote:

> Another thing that was configured in is that words have
> to be at least 3 characters long, which seems reasonable,
> and also there's some stopwords that don't get indexed,
> as they're too common. This list of stopwords is built
> by hand - so far it only includes 'perl' and 'modperl'.
> Also, the maximum number of hits is set at 30.

It should also index $/, etc. So limiting to >2char words is another
broken aspect...

But I'm not complaining! It's 100% better than it was. Maybe someone would
like my code for a db backed search engine and fix that up to something
that could work? It's all built in perl so you're free to add and remove
stopwords or change the min word length as you like.

-- 
<Matt/>

Fastnet Software Ltd. High Performance Web Specialists
Providing mod_perl, XML, Sybase and Oracle solutions
Email for training and consultancy availability.
http://sergeant.org http://xml.sergeant.org

Reply via email to