Dear All,
I wonder whether one can use the full text indexes in mysql to find out what words in a document are likely to be relevant key words. I've several thousand text documents in a table with a full text index, and I can search for documents corresponding to some given keywords using the 'match against' functions. But what I'm realy looking for is that mysql tells me about the say the top 10 relevant keywords of each of the document satisfying some relevance cutoff. This should be possible because mysql knows about all words of all my documents which could be used to score the overall relevance of a word with repsect to all documents. So if a document 'X' about databases mentiones 'mysql' three times and the majority of all other documents does not contain the word 'mysql', the word 'mysql' is likely to have some relevance in document 'X'. I'd be nice to have a command like this: select keywords(10.0) from MyDocs where DocId = 666; '10.0' is some kind of relevance cutoff, and the output could someing like this: mysql fulltext database fast ... [ hm, don't know if it's a problem that function that works on a single column and row produces a multiple row output ... ] I'm happy for any suggestions on how to extract the ranked most relevant keywords per document using the existing functionality from mysql 4.0. Thanks for help, Arne -- Arne Mueller Biomolecular Modelling Laboratory Imperial Cancer Research Fund 44 Lincoln's Inn Fields London WC2A 3PX, U.K. phone : +44-(0)207 2693405 | fax :+44-(0)207-269-3534 email : [EMAIL PROTECTED] | http://www.bmm.icnet.uk --------------------------------------------------------------------- Before posting, please check: http://www.mysql.com/manual.php (the manual) http://lists.mysql.com/ (the list archive) To request this thread, e-mail <[EMAIL PROTECTED]> To unsubscribe, e-mail <[EMAIL PROTECTED]> Trouble unsubscribing? Try: http://lists.mysql.com/php/unsubscribe.php