Re: How to reduce number of entries in memory

Josh Elser Tue, 29 Oct 2013 09:48:10 -0700

On 10/29/13, 12:28 PM, Terry P. wrote:


What are your thoughts on doing an hourly flush of the table in the
shell to ensure entries are flushed to disk more frequently to help
minimize the replay required if connectivity to a node is lost?

If you want to go the route of flushing more frequently, I wouldprobably suggest dropping the configuration for tserver.walog.max.sizefrom the default of 1G to something else (maybe 256M or 512M?).

My gut is telling me that this still isn't going to help you in the end.What does the distribution on your ingest look like?

Looking back at some old emails from you, if you're ingesting UUIDs asthe row key, most likely you're ingesting to a "small" amount of data tomany servers. If this is the case, it's more likely that you're justplaying the odds as to whether you happen to catch a flush the exactmoment before you lose the N servers that contained your WALs.

Increasing the WAL replication is likely the best solution you can getfor yourself. Hoping that your failures only occur after a flush butbefore you ingest more data seems unlikely to happen. If you still wantdata flushed more often, reducing the WAL size will be automatic overyour manual cron job to flush the table (one less thing to manage).

And, as you likely know, this would all be at the expense of ingestperformance.

Re: How to reduce number of entries in memory

Reply via email to