Nader Henein wrote:
Considerations that you may want to think about when sanitizing your clustered indices:

1) Number of documents available in the index vs. number of documents in the persistent store.

2) Are all the documents up to date? (This involves comparing the existence and the last-updated date of Lucene documents against the persistent store.)

3) Have all the documents that should have been deleted actually been deleted from the index? If you delete the documents from the persistent store, this is not trivial to check. We're using an "is_deleted" flag so we can query the index for deleted documents; if we get any results, then there's a problem.
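The three checks above can be sketched roughly as follows. This is only an illustration in Python, not the poster's actual code: `StoreRow`, `audit_index`, and the `indexed` mapping (document id to the last-updated date read back from the index) are hypothetical names, and it assumes the "is_deleted" flag lives on the rows of the persistent store.

```python
from dataclasses import dataclass
from datetime import datetime


@dataclass
class StoreRow:
    """One row of the persistent store (hypothetical shape)."""
    doc_id: str
    last_updated: datetime
    is_deleted: bool = False


def audit_index(store_rows, indexed):
    """Compare the store against `indexed`, a dict of doc_id -> last_updated
    as read back from the index. Returns a dict of problems found."""
    live = {r.doc_id: r for r in store_rows if not r.is_deleted}
    deleted = {r.doc_id for r in store_rows if r.is_deleted}
    return {
        # Check 1: document counts should agree.
        "count_mismatch": len(live) != len(indexed),
        # Check 2a: live documents that never made it into the index.
        "missing": sorted(set(live) - set(indexed)),
        # Check 2b: indexed copies older than the store's copy.
        "stale": sorted(
            d for d, r in live.items()
            if d in indexed and indexed[d] < r.last_updated
        ),
        # Check 3: documents flagged deleted that are still searchable.
        "undeleted": sorted(deleted & set(indexed)),
    }
```

Note that the counts can match while the other checks still fail (one missing document and one undeleted document cancel out), which is why the count alone is not enough.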

Sync issues will happen; welcome to the wonderful world of NDC. This is not through any fault of programming, but simply because it is the nature of network communication and of computing in demanding environments. Failing to provide for contingencies and fail-safes will give you some of the most intense headaches. Ultimately, you want the system to fix itself (assume failures from day one). I've been monitoring the system from a distance for the past year with little or no interference (4 years in total). There is nothing better than knowing that I could take a tire iron to a few of our machines and it would not affect the site's performance. I get to sleep at night.

No worries, I'm already banging my head because there are 2 'different' incoming sources of messages (the sources are redundant satellite connections on 2 different satellites). One is supposed to be a live backup; the 2 run in parallel, and the broker must ideally wait for each message to arrive from both sources.

If the wait goes beyond a certain threshold, trigger an alert, put the failed node, ...
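For what it's worth, the "wait for both sources, alert past a threshold" part might look something like the sketch below. It is Python for illustration only; `DualSourceMatcher`, the source names `sat_a`/`sat_b`, and the assumption that both feeds carry a shared message id are all made up here, and what to do with the failed node is left to the caller.

```python
import time


class DualSourceMatcher:
    """Pairs messages arriving from two redundant sources by message id."""

    def __init__(self, sources=("sat_a", "sat_b"), threshold_s=5.0,
                 clock=time.monotonic):
        self.sources = frozenset(sources)
        self.threshold_s = threshold_s
        self.clock = clock          # injectable so the logic can be tested
        self.pending = {}           # msg_id -> {source: arrival time}

    def receive(self, source, msg_id):
        """Record an arrival; return True once every source delivered msg_id."""
        if source not in self.sources:
            raise ValueError(f"unknown source: {source}")
        seen = self.pending.setdefault(msg_id, {})
        seen.setdefault(source, self.clock())   # ignore duplicate deliveries
        if set(seen) == self.sources:
            del self.pending[msg_id]
            return True
        return False

    def overdue(self):
        """Return (msg_id, missing sources) for messages past the threshold."""
        now = self.clock()
        late = []
        for msg_id, seen in self.pending.items():
            if now - min(seen.values()) > self.threshold_s:
                late.append((msg_id, sorted(self.sources - set(seen))))
        return late
```

The broker would call `receive()` for every incoming message and poll `overdue()` periodically; anything it returns is a candidate for raising the alert against the source that stayed silent.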

That's the easy part :)

