Dear All,

I wonder whether one can use the full text indexes in mysql to find out
what words in a document are likely to be relevant key words. 

I've several thousand text documents in a table with a full text index,
and I can search for documents corresponding to some given keywords
using the 'match against' functions. But what I'm realy looking for is
that mysql tells me about the say the top 10 relevant keywords of each
of the document satisfying some relevance cutoff. This should be
possible because mysql knows about all words of all my documents which
could be used to score the overall relevance of a word with repsect to
all documents. 

So if a document 'X' about databases mentiones 'mysql' three times and
the majority of all other documents does not contain the word 'mysql',
the word 'mysql' is likely to have some relevance in document 'X'.

I'd be nice to have a command like this:

select keywords(10.0) from MyDocs where DocId = 666;

'10.0' is some kind of relevance cutoff, and the output could someing
like this:

mysql
fulltext
database
fast
...

[ hm, don't know if it's a problem that function that works on a single
column and row produces a multiple row output ... ]

I'm happy for any suggestions on how to extract the ranked most relevant
keywords per document using the existing functionality from mysql 4.0.

        Thanks for help,

        Arne

-- 
Arne Mueller
Biomolecular Modelling Laboratory
Imperial Cancer Research Fund
44 Lincoln's Inn Fields
London WC2A 3PX, U.K.
phone : +44-(0)207 2693405      | fax :+44-(0)207-269-3534
email : [EMAIL PROTECTED] | http://www.bmm.icnet.uk

---------------------------------------------------------------------
Before posting, please check:
   http://www.mysql.com/manual.php   (the manual)
   http://lists.mysql.com/           (the list archive)

To request this thread, e-mail <[EMAIL PROTECTED]>
To unsubscribe, e-mail <[EMAIL PROTECTED]>
Trouble unsubscribing? Try: http://lists.mysql.com/php/unsubscribe.php

Reply via email to