[ https://issues.apache.org/jira/browse/AURORA-1940?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]

Kai Huang updated AURORA-1940:
------------------------------
    Description: 
There was a recent change to the Aurora client to provide "at most once" 
instead of "at least once" retries for non-idempotent operations. See:
https://github.com/apache/aurora/commit/f1e25375def5a047da97d8bdfb47a3a9101568f6

`aurora job restart` is a non-idempotent operation, so it is no longer retried. 
As a result, when a transport exception occurs, the operator has to babysit a 
simple operation like `aurora job restart`. Unlike the requests that were 
causing problems (admin tasks, job creation, updates, etc.), restarts should in 
general be retried rather than erring on the side of caution.

  was:
There was a recent change to the Aurora client to provide "at most once" 
instead of "at least once" retries for non-idempotent operations. See:
https://github.com/apache/aurora/commit/f1e25375def5a047da97d8bdfb47a3a9101568f6

`aurora job restart` is a non-idempotent operation, thus it was not retried. 
When there is a transport exception, the operator has to babysit simple 
operations like aurora job restart if it were not retried. Compared to the 
requests that were causing problems (admin tasks, job creating, updates, etc.), 
restarts in general should be retried rather than erring on the side of caution.


> aurora job restart request should be retryable
> ----------------------------------------------
>
>                 Key: AURORA-1940
>                 URL: https://issues.apache.org/jira/browse/AURORA-1940
>             Project: Aurora
>          Issue Type: Task
>            Reporter: Kai Huang
>            Assignee: Kai Huang
>            Priority: Minor
>
> There was a recent change to the Aurora client to provide "at most once" 
> instead of "at least once" retries for non-idempotent operations. See:
> https://github.com/apache/aurora/commit/f1e25375def5a047da97d8bdfb47a3a9101568f6
> `aurora job restart` is a non-idempotent operation, so it is no longer 
> retried. As a result, when a transport exception occurs, the operator has to 
> babysit a simple operation like `aurora job restart`. Unlike the requests 
> that were causing problems (admin tasks, job creation, updates, etc.), 
> restarts should in general be retried rather than erring on the side of 
> caution.
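
For illustration only, the policy requested above could be sketched as a
per-operation retry allowlist. This is not the Aurora client's actual code:
`TransportError`, `RETRYABLE`, `call_with_policy`, `make_flaky`, and the
operation names "restart" and "create" are all hypothetical stand-ins.

```python
import time


class TransportError(Exception):
    """Hypothetical stand-in for a connection-level failure."""


# Hypothetical policy table: operations listed here are retried after a
# transport error ("at least once"); all others get a single attempt
# ("at most once").
RETRYABLE = {"restart"}


def call_with_policy(name, rpc, max_attempts=3, delay_secs=0.0):
    """Invoke rpc() under the retry policy registered for `name`."""
    attempts = max_attempts if name in RETRYABLE else 1
    for attempt in range(1, attempts + 1):
        try:
            return rpc()
        except TransportError:
            # Out of attempts: surface the error to the operator.
            if attempt == attempts:
                raise
            time.sleep(delay_secs)


def make_flaky(failures):
    """Return (rpc, state); rpc raises TransportError `failures` times, then succeeds."""
    state = {"calls": 0}

    def rpc():
        state["calls"] += 1
        if state["calls"] <= failures:
            raise TransportError("connection reset")
        return "OK"

    return rpc, state
```

With this sketch, a "restart" call that hits two transport errors succeeds on
the third attempt, while a "create" call fails on the first transport error and
is never re-sent.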



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)
