Hi all:

Apologies in advance if this is completely off-topic!!

I'm building a site right now in PHP/MySQL. Parts of the site are 
database driven, other parts are not. http://www.tri-drs.org (racing 
and bios are db driven) for those of you who are interested.

I would like to build a Search Engine for this site. I've tried doing 
complex queries on the data to no avail. I'm using old enough 
versions of MySQL and PHP that I can't do things like TEMP tables 
(mysql) and proper array stuff (PHP). I've come to the realization 
that I need to create an index for my database. I've read a number of 
articles about how to create an index, strip out noise words (aka 
stop words), stemming, spiders for 'flat' content, calculating 
relevancy, etc. (Btw, http://citeseer.nj.nec.com is my new best 
friend.)

I don't really understand how the index works though. Conceptually. I 
do have one database book "Relational Database Design Clearly 
Explained" which is excellent, but has only 2 pages about indexes. I 
also have a PHP/MySQL book (but I can't remember the title of it). 
I've looked at Dubois' MySQL book but didn't feel I was ready for it.

Could anyone suggest some (specific) resources (books, articles, 
urls, whatevers) about indexing a database? I think what I'm looking 
for is what do you store in the index; how does it get in there and 
then how do you get the information out again?

For example:
How does hit highlighting (your keywords in bold on the search result 
page) work? I understand that you look up a word in the index and the 
index tells you that the word is found somewhere in some document, 
but how does the results page know how to make your word bold?

Again, my apologies if this is off-topic!! Thanks for all of your 
insight to date. I've learned a lot from reading other peoples' 
questions. :)

emma
PS All of my findings are going into a summary document that I'm 
going to be giving to my team at work. Let me know if you're 
interested in receiving a copy when I'm done. :)

Emma Jane Hogbin
Web producer, TVOntario
e. [EMAIL PROTECTED]

---------------------------------------------------------------------
Before posting, please check:
   http://www.mysql.com/manual.php   (the manual)
   http://lists.mysql.com/           (the list archive)

To request this thread, e-mail <[EMAIL PROTECTED]>
To unsubscribe, e-mail <[EMAIL PROTECTED]>
Trouble unsubscribing? Try: http://lists.mysql.com/php/unsubscribe.php

Reply via email to