> > On Sunday 17 August 2003 06:48 am, Niklas Bergh wrote:
> > > particular word where 'weight' is defined as Number of
> > > "occurrances of 'wordX'"/"Total number of words in the index" (word
> > > rareness). This would allow the search engine to know that, for
>
> Or something along that line but possibly somewhat smarter (maybe a cap of
> how much a single hit could add to the final result) to avoid things like:
>
> > <font size=1 color=white>
> > movies download movies download movies download movies download movies
> > download movies download (... x 100)
> > </font>
>
> :)
>
> /N

But I see your point. It might be a mighty good idea to (at least
optionally) have some kind of additional indexer-generated fact to merge
into the score, something like Googles 'So and so many pages (or whatever
else that can link) links to this resource'. The problem with this is that
some of the things that I envision should use the search/indexing
functionallity might not have that kind of structure, words are probably
always present though.

/N

_______________________________________________
devl mailing list
[EMAIL PROTECTED]
http://hawk.freenetproject.org:8080/cgi-bin/mailman/listinfo/devl

Reply via email to