Another thing to consider is how many tablet servers the mutations are being sent to - if they're all going to a single split, that's going to reduce your throughput a lot.

On 07/15/2016 02:33 PM, [email protected] wrote:
The batch writer has several knobs (latency time, memory buffer, etc) that you can tune to meet your requirements. The values for those settings will depend on a lot of variables, to include:

  - number of tablet servers
  - size of mutations
  - desired latency
  - memory buffer
  - configuration settings on the table(s) and tablet servers.

 Suggest picking a starting point and see how it works for you, such as

threads - equal to the number of tablet servers (unless you have a really large number of tablet servers)
  buffer - 100MB
  latency - 10 seconds

If you are hitting a wall with those settings, you could increase the buffer and latency and/or change some settings on the server side that have to do with the write ahead logs.

------------------------------------------------------------------------
*From: *"Jamie Johnson" <[email protected]>
*To: *[email protected]
*Sent: *Friday, July 15, 2016 2:16:40 PM
*Subject: *Configuring batch writers

Is there any documentation that outlines reasonable settings for batch writers given a known ingest rate? For instance if I have a source that is producing in the neighborhood of 15MB of mutations per second, what would a reasonable configuration for the batch writer be to handle an ingest at this rate? What are reasonable rules of thumb to follow to ensure that the writers don't block, etc?


Reply via email to