I am guessing your Enterprise system deletes/updates tables in the RDBMS, and your SOLR instance indexes that data. In addition, your front end interacts with both SOLR and the RDBMS. At the front-end level, when a search sent to SOLR returns primary keys, you can check your database for those primary keys before committing the output to end users.
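That front-end check can be sketched roughly as follows. This is an illustrative sketch, not real SOLR or database API code: `filter_stale_hits` and the PK collections are assumed names, standing in for the keys returned by a SOLR query and a lookup against the live table.

```python
def filter_stale_hits(solr_pks, live_db_pks):
    """Keep only the SOLR hits whose primary keys still exist in the RDBMS.

    solr_pks:     primary keys returned by the SOLR search, in rank order
    live_db_pks:  primary keys confirmed present in the database
    """
    live = set(live_db_pks)  # O(1) membership checks
    return [pk for pk in solr_pks if pk in live]

# Example: SOLR still returns PK 7, but that row was deleted from the table.
print(filter_stale_hits([3, 7, 12], [3, 12, 99]))
```

In practice the `live_db_pks` set would come from a single `SELECT ... WHERE pk IN (...)` against the page of keys SOLR returned, so the check costs one indexed query per search.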

To remove records from an index... the best-performing setup is Master-Slave SOLR instances: remove data from the Master SOLR, then commit/synchronize with the Slave nightly (when traffic is lowest). SOLR won't be in sync with the database between syncs, but you can always retrieve PKs from SOLR, check the database for those PKs, and 'filter' the output...
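Since no delete history is kept, the nightly job on the Master could compute deletions by diffing the two key sets. A minimal sketch, assuming you can enumerate both the table's primary keys and the IDs stored in the index (`pks_to_delete` is a hypothetical helper, not a SOLR API):

```python
def pks_to_delete(index_pks, db_pks):
    """Primary keys still present in the SOLR index but gone from the table."""
    return sorted(set(index_pks) - set(db_pks))

# Rows 2 and 4 were deleted from the table; the index still has them.
print(pks_to_delete([1, 2, 3, 4], [1, 3]))
```

Each returned PK would then be removed from the Master (one delete per ID, or a single delete-by-query), followed by a commit that the Slave picks up on the nightly sync.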

--
Thanks,

Fuad Efendi
416-993-2060(cell)
Tokenizer Inc.
==============
http://www.linkedin.com/in/liferay


Quoting sundar shankar <[EMAIL PROTECTED]>:

Hi,
We have an index of courses (about 4 million docs in prod) and a nightly job that picks up newly added courses and updates the index accordingly. There is another Enterprise system that shares the same table and could delete data from it too.

I just want to know what the best practice would be to find deleted records and remove them from my index. Unfortunately for us, we don't maintain a history of the deleted records, and that's a big bane.

Please do advise on what might be the best way to implement this.

-Sundar

