Also, when they say tablets, I assume this means before replication? So in
reality, it is (number of nodes x CPU cores) / replication factor? If that is
the case, it is not looking good...
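
To make the arithmetic concrete, here is a minimal sketch of that reading. It assumes every replica counts against the core budget, which the guidelines do not confirm, and the function name and cluster numbers are made up for illustration.

```python
def tablets_before_replication(nodes: int, cores_per_node: int,
                               replication_factor: int) -> int:
    """Tablet count for one table, ASSUMING replicas count toward the core budget.

    This is only my interpretation of the rule of thumb, not something the
    Kudu documentation states.
    """
    total_cores = nodes * cores_per_node
    # Each tablet is stored replication_factor times, so divide the core
    # budget by that factor (rounding down).
    return total_cores // replication_factor


# Hypothetical cluster: 10 tablet servers, 16 cores each, replication factor 3.
print(tablets_before_replication(10, 16, 3))  # 160 cores / 3 -> 53
```

Under the other reading (replicas do not count), the target would simply be nodes x cores, i.e. 160 tablets for the same hypothetical cluster.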

On Wed, Oct 10, 2018 at 5:02 PM Boris Tyukin <bo...@boristyukin.com> wrote:

> Hi all,
>
> Can someone clarify whether the recommendation below refers to physical
> or hyper-threaded CPU cores? That is quite a big difference...
> Thanks,
> Boris
>
> Partitioning Guidelines
> (https://kudu.apache.org/docs/kudu_impala_integration.html#partitioning_rules_of_thumb)
> - For large tables, such as fact tables, aim for as many tablets as you
> have cores in the cluster.
> - For small tables, such as dimension tables, aim for a large enough
> number of tablets that each tablet is at least 1 GB in size.
>
> In general, be mindful the number of tablets limits the parallelism of
> reads, in the current implementation. Increasing the number of tablets
> significantly beyond the number of cores is likely to have diminishing
> returns.
>
>
