[ 
https://issues.apache.org/jira/browse/AURORA-1722?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15343660#comment-15343660
 ] 

Igor Morozov commented on AURORA-1722:
--------------------------------------

That will work for instances that have active tasks assigned and running, for 
instances that don't (transient errors, health check failures) we need to get 
the latest task that was assigned to that instance and is in its terminal 
state(FAILED, KILLED, LOST).
Querying for terminal states statuses broadens the scope significantly as it 
shows all previously scheduled tasks that were killed during upgrade or failed 
for variety of different reasons. 

> Add new field to TaskQuery to allow querying latest statuses grouped by 
> instance id
> -----------------------------------------------------------------------------------
>
>                 Key: AURORA-1722
>                 URL: https://issues.apache.org/jira/browse/AURORA-1722
>             Project: Aurora
>          Issue Type: Task
>          Components: Scheduler
>    Affects Versions: 0.16.0
>            Reporter: Igor Morozov
>
> Currently in order to get the status of all job instances both failed and 
> running one needs to issue a query for all task statuses, then group them by 
> instance id and sort by timestamp to get the lastest statuses per instance. 
> For tasks with a lot of churn that may cause unnecessary transferring huge 
> blobs of thrifts. 
> The proposal is to include new member into TaskQuery struct
> struct TaskQuery {
> ...
>   14: i32 limit_per_instance
> }



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to