[ 
https://issues.apache.org/jira/browse/KUDU-1967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Adar Dembo reassigned KUDU-1967:
--------------------------------

    Assignee:     (was: Adar Dembo)

I did a fair amount of work for this in past Kudu releases; the Jira tracks 
some of the remaining work to be done.

 

> Umbrella JIRA for node density improvements
> -------------------------------------------
>
>                 Key: KUDU-1967
>                 URL: https://issues.apache.org/jira/browse/KUDU-1967
>             Project: Kudu
>          Issue Type: Task
>          Components: fs, master, tablet, tserver
>    Affects Versions: 1.3.0
>            Reporter: Adar Dembo
>            Priority: Major
>              Labels: data-scalability, roadmap-candidate
>
> For the Kudu 1.4 release, I'll be working to improve node density.
> Here's a brief primer on Kudu's scalability targets today:
> # We recommend no more than 4 TB of total data per node. This is specific to 
> Kudu data blocks, so this data is post-encoding and post-compression.
> # We recommend no more than 1000 partitions (post-replication) per node.
> # We recommend no more than 100 nodes per cluster.
> # We recommend no more than 60 partitions per table per tserver.
> For 1.4, here's what we'd like to achieve:
> # Up to 16 TB of total data per node. Maybe even 48 TB, if possible.
> # Up to 100 "hot" partitions per node. In this context, "hot" means 
> partitions that are actively servicing writes.
> # Thousands of "cold" partitions per node. Put another way, it should be 
> drastically cheaper to serve "cold" partitions than it is today.
> # Maintain the "100 nodes per cluster" limit.
> # Remove the "no more than 60 partitions per table per node" limit.
> I'll be linking various interesting JIRAs into this one, and I'll document, 
> for each one, which aspect of data scalability it affects.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)

Reply via email to