[ 
https://issues.apache.org/jira/browse/AURORA-1211?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14371696#comment-14371696
 ] 

Maxim Khutornenko commented on AURORA-1211:
-------------------------------------------

Additional point towards separating snapshot and backup writes is easing up 
native log IO load. With the asynchronous backup processing, persisting 
snapshot in a native log will overlay with writing a backup file. In large 
clusters, this may create a significant IO hot spot and result in long (in 
seconds) pauses in IO processing leading to ripple effects across other 
components (including ZK session expiration and scheduler losing leadership).

> Spread snapshot creation and backup file write
> ----------------------------------------------
>
>                 Key: AURORA-1211
>                 URL: https://issues.apache.org/jira/browse/AURORA-1211
>             Project: Aurora
>          Issue Type: Task
>          Components: Scheduler
>            Reporter: Maxim Khutornenko
>
> Currently, backup files are written asynchronously and overlay with the peak 
> of GC cleaning up after persisting snapshot in native log. We should 
> generally avoid overlaying resource-heavy activities when possible to avoid 
> troubleshooting complicated performance related issues.
> Consider delaying backup file write (e.g. for 1 minute) to clearly separate 
> its impact from the snapshot GC.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to