Hi,
You are right about this, I should have thought about it. There is just
one problem: A word can occur multiple times in the same documents, so the
table which gives the position of a word in a doc. must give all the
positions of the word. If we keep your logic, we would have to make a tab
educating, articles here too.)
http://www.ping.be/~ping0658/avrank.html#zipf
http://www.swtech.com/server/websvr/wsindex/
Regards,
Sander
> -Original Message-
> From: Cedric Veilleux [mailto:[EMAIL PROTECTED]]
> Sent: 23 March 2001 04:30
> To: [EMAIL PROTECTED]
> Subject: Larg
st for example!)
btw: your email address looks like "postmaster", is this a bug in the lists
or are you subscribed as postmaster?
regards,
rene
- Original Message -
From: "Cedric Veilleux" <[EMAIL PROTECTED]>
To: <[EMAIL PROTECTED]>
Sent: Friday, Mar
Hi,
I am planning a very large search engine. I've spent some time reading
the archive and I found some suggestions on how to do this. The word
indexing method is a very interesting alternative to slow "...where like
'%foo%';" queries.
There is from 100k to 500k doc