On Tue, Jan 28, 2003 at 12:49:07PM -0800, Rob Hudson wrote:
> Just noticed the number on the bottom of the google page:
> Searching 3,083,324,652 web pages.
> 
> I'm curious about the algorythm that can churn through all that data and
> look for the specific keyword I entered and return a meaningful result
> in such a quick time.

I once wrote a (really simple) search engine in perl/mysql ...

I made a keyword ranking system ....

create table keywords
(
  keyword varchar(50) not null,
  score int(5) unsigned not null,
  url_id int(10) unsigned not null,
  index(keyword, url_id)
);

create table urls
(
  url_id int(10) unsigned not null auto_increment primary key,
  url text not null
);

select
  urls.url
from
  keywords, urls
where
  keywords.keyword = '$keyword'
  and
  keywords.url_id = urls.url_id
order by keywords.score desc;

So, the key in this system is parsing and ranking ...

Dunno know how google does it though :p

-- 
<[EMAIL PROTECTED]>
_______________________________________________
Eug-LUG mailing list
[EMAIL PROTECTED]
http://mailman.efn.org/cgi-bin/listinfo/eug-lug

Reply via email to