[ 
https://issues.apache.org/jira/browse/KUDU-3447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Xixu Wang updated KUDU-3447:
----------------------------
    Attachment: image-2023-02-13-17-16-50-491.png

> Limit the usage of network bandwidth of tablet copying 
> -------------------------------------------------------
>
>                 Key: KUDU-3447
>                 URL: https://issues.apache.org/jira/browse/KUDU-3447
>             Project: Kudu
>          Issue Type: Improvement
>            Reporter: Xixu Wang
>            Priority: Minor
>         Attachments: image-2023-02-09-10-38-50-512.png, 
> image-2023-02-09-10-47-58-370.png, image-2023-02-13-17-08-37-256.png, 
> image-2023-02-13-17-16-50-491.png
>
>
> Copying tablets from an old cluster to another new cluster is a high resource 
> consumed operation using the command : kudu local_replica copy_from_remote. 
> As the follow picture shows: the usage of memory is as high as 75%. And the 
> network is almost occupied fully (the overall network bandwidth is 2Gb/s). 
> Disk reading is every high (the overall disk bandwidth is 200MB/s). 
> !image-2023-02-09-10-47-58-370.png|width=996,height=369!
> If the data size is very large, the copying process will last for a long 
> time. Other service maybe get impacted and become unavailable. Therefore it 
> is better to limit the tablets copying speed and make the system more stable. 
> The goal is to balance the tablets copying speed and the impact to other 
> services.
> As copy_from_remote is mainly downloading data from the remote cluster and 
> write the data to local file system, it is better to control the downloading 
> speed to control the resource consumption. There are some algorithms to 
> implement a rate limiter. This patch will use the token bucket algorithm 
> implemented by Facebook Folly library: 
> [https://github.com/facebook/folly/blob/main/folly/TokenBucket.h]



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to