Hi,

I have been struggling to get full-text search working for very large databases. I can full-text index tens of gigabytes without any problem, but once I get into the hundreds of gigabytes it becomes slow. My current system is a quad core with 8GB of memory. I have the resources to throw more hardware at it, but realistically it is not cost effective to buy a system with 128GB of memory. Are there any solutions people have come up with for indexing very large text databases?

Essentially I have several terabytes of text that I need to index. Each record is about 5 paragraphs of text. I am currently using TSearch2 (stemming, etc.) and getting sub-optimal results: queries take more than a second to execute. Has anybody implemented such a database across multiple systems, or with some special add-on to TSearch2, to make things faster? I would like to do something like partitioning the data across multiple systems and merging the ranked results at a master node. Is something like this possible within PostgreSQL, or does it have to be handled at the application level?
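For what it's worth, the merge step you describe doesn't need anything database-specific: if each shard returns its own top-N hits already sorted by rank (as an ORDER BY rank DESC LIMIT n query would), the master node only has to do a k-way merge. A minimal sketch in Python, with made-up shard data (the ranks and doc ids are purely illustrative):

```python
import heapq
import itertools

# Hypothetical per-shard results as (rank, doc_id) pairs, each list
# already sorted by rank descending, as each shard would return them.
shard_a = [(0.91, 101), (0.55, 102), (0.20, 103)]
shard_b = [(0.80, 201), (0.60, 202)]
shard_c = [(0.95, 301), (0.10, 302)]

def merge_ranked(shards, limit):
    """K-way merge of descending-sorted shard result lists,
    keeping only the global top `limit` hits."""
    merged = heapq.merge(*shards, key=lambda pair: pair[0], reverse=True)
    return list(itertools.islice(merged, limit))

print(merge_ranked([shard_a, shard_b, shard_c], 3))
```

This only works cleanly if the rank values are comparable across shards, which is something to watch out for with per-shard statistics.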

Benjamin

