The short answer is no, I don't know of anything like that sorry. The only documentation I found that is close is some tips in the user manual about physical memory: https://accumulo.apache.org/1.10/accumulo_user_manual.html#_physical_memory
The long answer is that it will depend on many factors in your cluster and you will probably just have to see what works for your data and usage. Some common factors that affect scaling will include data size, scan frequency, scan types (small range, vs bulk export), number of concurrent scans, number and size of tables/tablets, volume of live ingest, garbage collection, hardware, and of course amount of memory. On Wed, Oct 13, 2021 at 8:48 AM Ligade, Shailesh [USA] < [email protected]> wrote: > Hello, > > > > I saw various guidelines on how to set memory heap sizes etc. IUs there a > pluggable spreadheet like if the server has x memory, tserver heap should > be x/3 and datanode should dhave x/4 etc? > > Also, is there any recommendation on number of tables hosted per tserver > and tserver memory requirement? May be that will also provide when to > add a new tserver (when # of tablets per tserver goes above some threshold) > > > -S > >
