What kind of speed can you expect to get from this after tbl_doc_index fills
up with tens of millions of rows? Is this scheme suitable for that magnitude
of data?

Also, if you wish to generate a query that generates all documents that
contain token x or token y, would mysql neglect to use the keys? I remember
reading that field1=x OR field2=x wont use the index because a general way
of optimizing that query hasnt been found. Correct me if im wrong. AND
queries on the other hand should work well.

thanks,
ryan

> You can do this yourself, pretty easily by building an index in
> advance. I've been using the following technique to implement full text
> searching since before it existed in MySQL in any form. Tokenize
> each unique word in a document and store the word & count in tables:
>
> tbl_tokens:
> token_id int
> token varchar
>
> tbl_doc_index:
> token_id int
> doc_id int
> word_count int
>
> Populate the tables from your document database either what documents
> are saved or in some other offline process. When someone searches on
> words, first convert to tokens using the first table and look up in
> the 2nd table using whatever search/join technique works best in
> your situation.
>
> Jamie
>
> At Monday, May 14, 2001 on 2:18:38 PM, you wrote:
>
> > I too am curious!  I think one feature that I'd really like to see is
the
> > ability to tell the number of times a string appears inside another
> > string.  This would help a lot when trying to do search results
weighting.
>
> > -Chris
>
>
>
> ---------------------------------------------------------------------
> Before posting, please check:
>    http://www.mysql.com/manual.php   (the manual)
>    http://lists.mysql.com/           (the list archive)
>
> To request this thread, e-mail <[EMAIL PROTECTED]>
> To unsubscribe, e-mail <[EMAIL PROTECTED]>
> Trouble unsubscribing? Try: http://lists.mysql.com/php/unsubscribe.php
>


---------------------------------------------------------------------
Before posting, please check:
   http://www.mysql.com/manual.php   (the manual)
   http://lists.mysql.com/           (the list archive)

To request this thread, e-mail <[EMAIL PROTECTED]>
To unsubscribe, e-mail <[EMAIL PROTECTED]>
Trouble unsubscribing? Try: http://lists.mysql.com/php/unsubscribe.php

Reply via email to