> > On Sunday 17 August 2003 06:48 am, Niklas Bergh wrote: > > > particular word where 'weight' is defined as Number of > > > "occurrances of 'wordX'"/"Total number of words in the index" (word > > > rareness). This would allow the search engine to know that, for > > Or something along that line but possibly somewhat smarter (maybe a cap of > how much a single hit could add to the final result) to avoid things like: > > > <font size=1 color=white> > > movies download movies download movies download movies download movies > > download movies download (... x 100) > > </font> > > :) > > /N
But I see your point. It might be a mighty good idea to (at least optionally) have some kind of additional indexer-generated fact to merge into the score, something like Googles 'So and so many pages (or whatever else that can link) links to this resource'. The problem with this is that some of the things that I envision should use the search/indexing functionallity might not have that kind of structure, words are probably always present though. /N _______________________________________________ devl mailing list [EMAIL PROTECTED] http://hawk.freenetproject.org:8080/cgi-bin/mailman/listinfo/devl
