[jira] [Closed] (AURORA-1817) Implement ReservationStore

2017-07-05 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1817?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko closed AURORA-1817.
--
Resolution: Invalid

> Implement ReservationStore
> --
>
> Key: AURORA-1817
> URL: https://issues.apache.org/jira/browse/AURORA-1817
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>
> A new store that will keep state of tasks that need reserved resources.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Closed] (AURORA-1818) Allow jobs to request dynamically reserved resources

2017-07-05 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1818?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko closed AURORA-1818.
--
Resolution: Invalid

> Allow jobs to request dynamically reserved resources
> 
>
> Key: AURORA-1818
> URL: https://issues.apache.org/jira/browse/AURORA-1818
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>
> Wire up configuration changes that allow a task to be dynamically reserved 
> via the API.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Closed] (AURORA-1822) Change scheduling logic to have DR preempt jobs if timer waiting for reservation to come back has expired

2017-07-05 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko closed AURORA-1822.
--
Resolution: Invalid

> Change scheduling logic to have DR preempt jobs if timer waiting for 
> reservation to come back has expired
> -
>
> Key: AURORA-1822
> URL: https://issues.apache.org/jira/browse/AURORA-1822
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>
> AURORA-1819 will not preempt existing tasks if no offer is available. We need 
> ability to have jobs needing dynamic reservations ability to preempt.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (AURORA-1735) Enabling jobs to dynamically reserve cluster resources through Aurora

2017-07-05 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075497#comment-16075497
 ] 

Dmitriy Shirchenko commented on AURORA-1735:


Code is here (with a couple of orthogonal patches): reviews.apache.org/r/57487/

> Enabling jobs to dynamically reserve cluster resources through Aurora
> -
>
> Key: AURORA-1735
> URL: https://issues.apache.org/jira/browse/AURORA-1735
> Project: Aurora
>  Issue Type: Epic
>  Components: Scheduler
>Reporter: Mehrdad Nurolahzade
>Assignee: Dmitriy Shirchenko
>
> As a precursor to support persistent services through Apache Aurora we have 
> to enable dynamic reservation of resources through Aurora. A stateful service 
> needs the ability to create a persistent disk volume, and the ability to 
> reserve resources. A persistent volume is necessary to actually store the 
> state, and reservation of resources is necessary in order to reliably 
> re-offer the service: the state itself, along with the resources necessary to 
> retrieve the state (e.g. cpu and memory). Aurora needs to be extended to take 
> advantage of these new features that are now supported by Mesos.
> RFC document: 
> [https://docs.google.com/document/d/15n29HSQPXuFrnxZAgfVINTRP1Iv47_jfcstJNuMwr5A]
> Design Doc: 
> [https://docs.google.com/document/d/1L2EKEcKKBPmuxRviSUebyuqiNwaO-2hsITBjt3SgWvE]



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Commented] (AURORA-1735) Enabling jobs to dynamically reserve cluster resources through Aurora

2017-07-05 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1735?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=16075453#comment-16075453
 ] 

Dmitriy Shirchenko commented on AURORA-1735:


[~rdelvalle] This was successfully deployed internally. I can update the RB 
with the code if you are interested.

> Enabling jobs to dynamically reserve cluster resources through Aurora
> -
>
> Key: AURORA-1735
> URL: https://issues.apache.org/jira/browse/AURORA-1735
> Project: Aurora
>  Issue Type: Epic
>  Components: Scheduler
>Reporter: Mehrdad Nurolahzade
>Assignee: Dmitriy Shirchenko
>
> As a precursor to support persistent services through Apache Aurora we have 
> to enable dynamic reservation of resources through Aurora. A stateful service 
> needs the ability to create a persistent disk volume, and the ability to 
> reserve resources. A persistent volume is necessary to actually store the 
> state, and reservation of resources is necessary in order to reliably 
> re-offer the service: the state itself, along with the resources necessary to 
> retrieve the state (e.g. cpu and memory). Aurora needs to be extended to take 
> advantage of these new features that are now supported by Mesos.
> RFC document: 
> [https://docs.google.com/document/d/15n29HSQPXuFrnxZAgfVINTRP1Iv47_jfcstJNuMwr5A]
> Design Doc: 
> [https://docs.google.com/document/d/1L2EKEcKKBPmuxRviSUebyuqiNwaO-2hsITBjt3SgWvE]



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)


[jira] [Created] (AURORA-1881) Capture stderr from output for shell health check failures.

2017-01-24 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1881:
--

 Summary: Capture stderr from output for shell health check 
failures.
 Key: AURORA-1881
 URL: https://issues.apache.org/jira/browse/AURORA-1881
 Project: Aurora
  Issue Type: Task
Reporter: Dmitriy Shirchenko
Assignee: Dmitriy Shirchenko


existing `check_call` doesn't return errors output by the process [1], so we 
need to plumb that through back to the user via `check_output` [2]

[1] 
https://github.com/google/python-subprocess32/blob/master/subprocess32.py#L602

[2] 
https://github.com/google/python-subprocess32/blob/master/subprocess32.py#L638



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (AURORA-1819) Scheduling logic changes to support dynamic reservations

2017-01-05 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1819?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko reassigned AURORA-1819:
--

Assignee: Dmitriy Shirchenko

> Scheduling logic changes to support dynamic reservations
> 
>
> Key: AURORA-1819
> URL: https://issues.apache.org/jira/browse/AURORA-1819
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>Assignee: Dmitriy Shirchenko
>
> TaskAssigner and OfferManager will need to be modified to support dynamic 
> reservations.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (AURORA-1816) Implement Offer Reconciler

2017-01-05 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1816?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15802865#comment-15802865
 ] 

Dmitriy Shirchenko commented on AURORA-1816:


Draft here which will be released along with the rest of the subtasks of 
AURORA-1785
https://github.com/apache/aurora/compare/master...shirchen:AURORA-1816

> Implement Offer Reconciler 
> ---
>
> Key: AURORA-1816
> URL: https://issues.apache.org/jira/browse/AURORA-1816
> Project: Aurora
>  Issue Type: Task
>  Components: Scheduler
>Reporter: Dmitriy Shirchenko
>Assignee: Dmitriy Shirchenko
>
> A new service that will unreserve resources which have been previously 
> reserved but are no longer needed.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1822) Change scheduling logic to have DR preempt jobs if timer waiting for reservation to come back has expired

2016-11-17 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1822:
--

 Summary: Change scheduling logic to have DR preempt jobs if timer 
waiting for reservation to come back has expired
 Key: AURORA-1822
 URL: https://issues.apache.org/jira/browse/AURORA-1822
 Project: Aurora
  Issue Type: Task
Reporter: Dmitriy Shirchenko


AURORA-1819 will not preempt existing tasks if no offer is available. We need 
ability to have jobs needing dynamic reservations ability to preempt.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1818) Allow jobs to request dynamically reserved resources

2016-11-11 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1818:
--

 Summary: Allow jobs to request dynamically reserved resources
 Key: AURORA-1818
 URL: https://issues.apache.org/jira/browse/AURORA-1818
 Project: Aurora
  Issue Type: Task
Reporter: Dmitriy Shirchenko


Wire up configuration changes that allow a task to be dynamically reserved via 
the API.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1817) Implement ReservationStore

2016-11-11 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1817:
--

 Summary: Implement ReservationStore
 Key: AURORA-1817
 URL: https://issues.apache.org/jira/browse/AURORA-1817
 Project: Aurora
  Issue Type: Task
Reporter: Dmitriy Shirchenko


A new store that will keep state of tasks that need reserved resources.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1735) Enabling jobs to dynamically reserve cluster resources through Aurora

2016-11-11 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1735:
---
Description: 
As a precursor to support persistent services through Apache Aurora we have to 
enable dynamic reservation of resources through Aurora. A stateful service 
needs the ability to create a persistent disk volume, and the ability to 
reserve resources. A persistent volume is necessary to actually store the 
state, and reservation of resources is necessary in order to reliably re-offer 
the service: the state itself, along with the resources necessary to retrieve 
the state (e.g. cpu and memory). Aurora needs to be extended to take advantage 
of these new features that are now supported by Mesos.

RFC document: 
[https://docs.google.com/document/d/15n29HSQPXuFrnxZAgfVINTRP1Iv47_jfcstJNuMwr5A]

Design Doc: 
[https://docs.google.com/document/d/1L2EKEcKKBPmuxRviSUebyuqiNwaO-2hsITBjt3SgWvE/edit#heading=h.klg3urfbnq3v]

  was:
As a precursor to support persistent services through Apache Aurora we have to 
enable dynamic reservation of resources through Aurora. A stateful service 
needs the ability to create a persistent disk volume, and the ability to 
reserve resources. A persistent volume is necessary to actually store the 
state, and reservation of resources is necessary in order to reliably re-offer 
the service: the state itself, along with the resources necessary to retrieve 
the state (e.g. cpu and memory). Aurora needs to be extended to take advantage 
of these new features that are now supported by Mesos.

RFC document: 
[https://docs.google.com/document/d/15n29HSQPXuFrnxZAgfVINTRP1Iv47_jfcstJNuMwr5A]

Design Doc: 
[https://docs.google.com/document/d/19gV8Po6DIHO14tOC7Qouk8RnboY8UCfRTninwn_5-7c/edit]


> Enabling jobs to dynamically reserve cluster resources through Aurora
> -
>
> Key: AURORA-1735
> URL: https://issues.apache.org/jira/browse/AURORA-1735
> Project: Aurora
>  Issue Type: Epic
>  Components: Scheduler
>Reporter: Mehrdad Nurolahzade
>Assignee: Dmitriy Shirchenko
>
> As a precursor to support persistent services through Apache Aurora we have 
> to enable dynamic reservation of resources through Aurora. A stateful service 
> needs the ability to create a persistent disk volume, and the ability to 
> reserve resources. A persistent volume is necessary to actually store the 
> state, and reservation of resources is necessary in order to reliably 
> re-offer the service: the state itself, along with the resources necessary to 
> retrieve the state (e.g. cpu and memory). Aurora needs to be extended to take 
> advantage of these new features that are now supported by Mesos.
> RFC document: 
> [https://docs.google.com/document/d/15n29HSQPXuFrnxZAgfVINTRP1Iv47_jfcstJNuMwr5A]
> Design Doc: 
> [https://docs.google.com/document/d/1L2EKEcKKBPmuxRviSUebyuqiNwaO-2hsITBjt3SgWvE/edit#heading=h.klg3urfbnq3v]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (AURORA-1735) Enabling jobs to dynamically reserve cluster resources through Aurora

2016-11-11 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1735?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko reassigned AURORA-1735:
--

Assignee: Dmitriy Shirchenko

> Enabling jobs to dynamically reserve cluster resources through Aurora
> -
>
> Key: AURORA-1735
> URL: https://issues.apache.org/jira/browse/AURORA-1735
> Project: Aurora
>  Issue Type: Epic
>  Components: Scheduler
>Reporter: Mehrdad Nurolahzade
>Assignee: Dmitriy Shirchenko
>
> As a precursor to support persistent services through Apache Aurora we have 
> to enable dynamic reservation of resources through Aurora. A stateful service 
> needs the ability to create a persistent disk volume, and the ability to 
> reserve resources. A persistent volume is necessary to actually store the 
> state, and reservation of resources is necessary in order to reliably 
> re-offer the service: the state itself, along with the resources necessary to 
> retrieve the state (e.g. cpu and memory). Aurora needs to be extended to take 
> advantage of these new features that are now supported by Mesos.
> RFC document: 
> [https://docs.google.com/document/d/15n29HSQPXuFrnxZAgfVINTRP1Iv47_jfcstJNuMwr5A]
> Design Doc: 
> [https://docs.google.com/document/d/19gV8Po6DIHO14tOC7Qouk8RnboY8UCfRTninwn_5-7c/edit]



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1769) Enabling webhook is synchronous and could cause longer leader reelection cycle

2016-11-10 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1769?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1769:
---
Assignee: (was: Dmitriy Shirchenko)

> Enabling webhook is synchronous and could cause longer leader reelection cycle
> --
>
> Key: AURORA-1769
> URL: https://issues.apache.org/jira/browse/AURORA-1769
> Project: Aurora
>  Issue Type: Bug
>Reporter: Dmitriy Shirchenko
>
> We had an issue where on scheduler leader reelection EventBus was full of 
> TaskStateChange events and caused scheduler to not be able to post 
> DriverRegistered() message which caused Aurora scheduler to not register 
> within 1 minute. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1773) Asynchronously call webhook endpoint not to block EventBus

2016-11-10 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1773?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1773:
---
Assignee: (was: Dmitriy Shirchenko)

> Asynchronously call webhook endpoint not to block EventBus
> --
>
> Key: AURORA-1773
> URL: https://issues.apache.org/jira/browse/AURORA-1773
> Project: Aurora
>  Issue Type: Sub-task
>Reporter: Dmitriy Shirchenko
>  Labels: uber
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Closed] (AURORA-1776) Switch `launchTask` Mesos Driver call in favor of `acceptOffers` to support future dynamic reservation operations

2016-11-02 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1776?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko closed AURORA-1776.
--

> Switch `launchTask` Mesos Driver call in favor of `acceptOffers` to support 
> future dynamic reservation operations
> -
>
> Key: AURORA-1776
> URL: https://issues.apache.org/jira/browse/AURORA-1776
> Project: Aurora
>  Issue Type: Sub-task
>  Components: Scheduler
>Reporter: Dmitriy Shirchenko
>Assignee: Dmitriy Shirchenko
>  Labels: uber
> Fix For: 0.16.0
>
>
> Dynamic reservations cannot be launched via `launchTask` Mesos driver call so 
> we need to switch to using `acceptOffers`.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1783) Probable http connection leak

2016-11-02 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1783?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1783:
---
Fix Version/s: 0.17.0

> Probable http connection leak
> -
>
> Key: AURORA-1783
> URL: https://issues.apache.org/jira/browse/AURORA-1783
> Project: Aurora
>  Issue Type: Bug
>Reporter: Dmitriy Shirchenko
>Assignee: Dmitriy Shirchenko
> Fix For: 0.17.0
>
>
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> java.lang.Thread.run(Thread.java:745)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:617)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1142)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:293)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$201(ScheduledThreadPoolExecutor.java:180)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> java.util.concurrent.FutureTask.run(FutureTask.java:266)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> com.google.common.eventbus.Subscriber$1.run(Subscriber.java:80)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> com.google.common.eventbus.Subscriber$SynchronizedSubscriber.invokeSubscriberMethod(Subscriber.java:154)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> com.google.common.eventbus.Subscriber.invokeSubscriberMethod(Subscriber.java:95)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> java.lang.reflect.Method.invoke(Method.java:498)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> sun.reflect.GeneratedMethodAccessor16.invoke(Unknown Source)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> org.apache.aurora.scheduler.events.Webhook.taskChangedState(Webhook.java:77)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:55)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:107)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> org.apache.http.impl.client.CloseableHttpClient.execute(CloseableHttpClient.java:82)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> org.apache.http.impl.client.InternalHttpClient.doExecute(InternalHttpClient.java:184)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> org.apache.http.impl.execchain.RedirectExec.execute(RedirectExec.java:110)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> org.apache.http.impl.execchain.RetryExec.execute(RetryExec.java:88)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> org.apache.http.impl.execchain.ProtocolExec.execute(ProtocolExec.java:184)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> org.apache.http.impl.execchain.MainClientExec.execute(MainClientExec.java:190)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> org.apache.http.impl.conn.PoolingHttpClientConnectionManager$1.get(PoolingHttpClientConnectionManager.java:263)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: at 
> org.apache.http.impl.conn.PoolingHttpClientConnectionManager.leaseConnection(PoolingHttpClientConnectionManager.java:286)
> Sep 26 19:34:12 compute59-sjc1 aurora-scheduler[25557]: E0926 19:34:12.608 
> [AsyncProcessor-4, Webhook:79] Error sending a Webhook event 
> org.apache.http.conn.ConnectionPoolTimeoutException: Timeout waiting for 
> connection from pool



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1776) Switch `launchTask` Mesos Driver call in favor of `acceptOffers` to support future dynamic reservation operations

2016-09-16 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1776:
--

 Summary: Switch `launchTask` Mesos Driver call in favor of 
`acceptOffers` to support future dynamic reservation operations
 Key: AURORA-1776
 URL: https://issues.apache.org/jira/browse/AURORA-1776
 Project: Aurora
  Issue Type: Sub-task
Reporter: Dmitriy Shirchenko
Assignee: Dmitriy Shirchenko


Dynamic reservations cannot be launched via `launchTask` Mesos driver call so 
we need to switch to using `acceptOffers`.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (AURORA-1772) Do not resend TaskStateChange events every time a scheduler starts

2016-09-16 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1772?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15498106#comment-15498106
 ] 

Dmitriy Shirchenko commented on AURORA-1772:


https://reviews.apache.org/r/51980/

> Do not resend TaskStateChange events every time a scheduler starts
> --
>
> Key: AURORA-1772
> URL: https://issues.apache.org/jira/browse/AURORA-1772
> Project: Aurora
>  Issue Type: Sub-task
>Reporter: Dmitriy Shirchenko
>Assignee: Dmitriy Shirchenko
>  Labels: uber
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1773) Asynchronously call webhook endpoint not to block EventBus

2016-09-12 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1773:
--

 Summary: Asynchronously call webhook endpoint not to block EventBus
 Key: AURORA-1773
 URL: https://issues.apache.org/jira/browse/AURORA-1773
 Project: Aurora
  Issue Type: Sub-task
Reporter: Dmitriy Shirchenko
Assignee: Dmitriy Shirchenko






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1772) Do not resend TaskStateChange events every time a scheduler starts

2016-09-12 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1772:
--

 Summary: Do not resend TaskStateChange events every time a 
scheduler starts
 Key: AURORA-1772
 URL: https://issues.apache.org/jira/browse/AURORA-1772
 Project: Aurora
  Issue Type: Sub-task
Reporter: Dmitriy Shirchenko
Assignee: Dmitriy Shirchenko






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1769) Enabling webhook is synchronous and could cause longer leader reelection cycle

2016-09-09 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1769:
--

 Summary: Enabling webhook is synchronous and could cause longer 
leader reelection cycle
 Key: AURORA-1769
 URL: https://issues.apache.org/jira/browse/AURORA-1769
 Project: Aurora
  Issue Type: Bug
Reporter: Dmitriy Shirchenko
Assignee: Dmitriy Shirchenko


We had an issue where on scheduler leader reelection EventBus was full of 
TaskStateChange events and caused scheduler to not be able to post 
DriverRegistered() message which caused Aurora scheduler to not register within 
1 minute. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1733) Introduce a TaskHistory struct to TaskQuery to support limiting number of taskEvents per instance

2016-07-12 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1733:
--

 Summary: Introduce a TaskHistory struct to TaskQuery to support 
limiting number of taskEvents per instance
 Key: AURORA-1733
 URL: https://issues.apache.org/jira/browse/AURORA-1733
 Project: Aurora
  Issue Type: Story
Reporter: Dmitriy Shirchenko
Assignee: Maxim Khutornenko


Our use case is to pull back latest task state history but we only want to pull 
back a specific number of events per instance (usually 1). We have not been 
able to find a way to do that.

Right now we use TaskQuery with getTasksWithoutConfigs with a TaskQuery that 
pulls back everything in a healthy state (includes ASSIGNED, RUNNING) to 
display to our users on our internal UI. This has become too big of a blob to 
process.

Proposal is to introduce and implement the following:
{code}
struct TaskHistory {
0: set statuses
1: i32 limit
}
{code}

and add it to TaskQuery

{code}
struct TaskQuery {
...
15: TaskHistory taskHistory
}
{code}

This is an alternative iteration on proposal in AURORA-1722 and are related 
tasks. 

Comments?




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (AURORA-1721) Support user initiated rollback

2016-06-22 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1721?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15345644#comment-15345644
 ] 

Dmitriy Shirchenko commented on AURORA-1721:


Can someone please reassign this to Igor? He is interested in working on this 
task from our side! Thanks!

> Support user initiated rollback 
> 
>
> Key: AURORA-1721
> URL: https://issues.apache.org/jira/browse/AURORA-1721
> Project: Aurora
>  Issue Type: Task
>  Components: Scheduler
>Reporter: Igor Morozov
>Assignee: Dmitriy Shirchenko
>  Labels: Uber
> Fix For: 0.16.0
>
>
> The proposal to support user initiated rollback:
> 1. Create new thrift API:
>  /**Rollback job update. */
>   Response rollbackJobUpdate(
>   /** The update to rollback. */
>   1: JobUpdateKey key,
>   /** A user-specified message to include with the induced job update 
> state change. */
>   3: string message)
> 2.  Implement new API in a scheduler so the implementation would just undo 
> the latest JobUpdate effectively trying to apply initialState to the job. If 
> that is for some reason is impossible them rollback with fail with 
> appropriate error message.
> 3. Support new aurora client command 'rollback'



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1721) Support user initiated rollback

2016-06-22 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1721:
---
Labels: Uber  (was: )

> Support user initiated rollback 
> 
>
> Key: AURORA-1721
> URL: https://issues.apache.org/jira/browse/AURORA-1721
> Project: Aurora
>  Issue Type: Task
>  Components: Scheduler
>Reporter: Igor Morozov
>Assignee: Dmitriy Shirchenko
>  Labels: Uber
> Fix For: 0.16.0
>
>
> The proposal to support user initiated rollback:
> 1. Create new thrift API:
>  /**Rollback job update. */
>   Response rollbackJobUpdate(
>   /** The update to rollback. */
>   1: JobUpdateKey key,
>   /** A user-specified message to include with the induced job update 
> state change. */
>   3: string message)
> 2.  Implement new API in a scheduler so the implementation would just undo 
> the latest JobUpdate effectively trying to apply initialState to the job. If 
> that is for some reason is impossible them rollback with fail with 
> appropriate error message.
> 3. Support new aurora client command 'rollback'



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (AURORA-1721) Support user initiated rollback

2016-06-22 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1721?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko reassigned AURORA-1721:
--

Assignee: Dmitriy Shirchenko

> Support user initiated rollback 
> 
>
> Key: AURORA-1721
> URL: https://issues.apache.org/jira/browse/AURORA-1721
> Project: Aurora
>  Issue Type: Task
>  Components: Scheduler
>Reporter: Igor Morozov
>Assignee: Dmitriy Shirchenko
> Fix For: 0.16.0
>
>
> The proposal to support user initiated rollback:
> 1. Create new thrift API:
>  /**Rollback job update. */
>   Response rollbackJobUpdate(
>   /** The update to rollback. */
>   1: JobUpdateKey key,
>   /** A user-specified message to include with the induced job update 
> state change. */
>   3: string message)
> 2.  Implement new API in a scheduler so the implementation would just undo 
> the latest JobUpdate effectively trying to apply initialState to the job. If 
> that is for some reason is impossible them rollback with fail with 
> appropriate error message.
> 3. Support new aurora client command 'rollback'



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1709) After rb/46835 job key's role is required to exist as a user

2016-06-08 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1709:
--

 Summary: After rb/46835 job key's role is required to exist as a 
user
 Key: AURORA-1709
 URL: https://issues.apache.org/jira/browse/AURORA-1709
 Project: Aurora
  Issue Type: Bug
Reporter: Dmitriy Shirchenko
Assignee: Joshua Cohen


sandbox.py had the change eg
```
  return FileSystemImageSandbox(self.SANDBOX_NAME, 
self._get_sandbox_user(assigned_task))
```

which is a problem for us since 

```
pwent = pwd.getpwnam(self._user)
grent = grp.getgrgid(pwent.pw_gid)
```

throw an exception if the user does not exist. This is a change in behavior of 
how Aurora launched containers before the diff. Before, job key's role could be 
mangled liberally and arbitrarily w/out this restriction.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1683) Implement an optional webhook parameter to scheduler to POST TaskStateChange events

2016-06-06 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1683:
---
Fix Version/s: 0.14.0

> Implement an optional webhook parameter to scheduler to POST TaskStateChange 
> events
> ---
>
> Key: AURORA-1683
> URL: https://issues.apache.org/jira/browse/AURORA-1683
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>Assignee: Dmitriy Shirchenko
> Fix For: 0.14.0
>
>
> Per discussion on mailing list, I propose adding a webhook to publish 
> TaskStateChange events to an HTTP REST point. This will be an optional flag 
> passed into scheduler with an optional configuration file that will contain 
> optional headers that need to be set. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1683) Implement an optional webhook parameter to scheduler to POST TaskStateChange events

2016-04-28 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1683:
---
Description: Per discussion on mailing list, I propose adding a webhook to 
publish TaskStateChange events to an HTTP REST point. This will be an optional 
flag passed into scheduler with an optional configuration file that will 
contain optional headers that need to be set.   (was: Per discussion on mailing 
list, I propose adding a webhook to publish )

> Implement an optional webhook parameter to scheduler to POST TaskStateChange 
> events
> ---
>
> Key: AURORA-1683
> URL: https://issues.apache.org/jira/browse/AURORA-1683
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>Assignee: Dmitriy Shirchenko
>
> Per discussion on mailing list, I propose adding a webhook to publish 
> TaskStateChange events to an HTTP REST point. This will be an optional flag 
> passed into scheduler with an optional configuration file that will contain 
> optional headers that need to be set. 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1683) Implement an optional webhook parameter to scheduler to POST TaskStateChange events

2016-04-28 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1683:
---
Description: Per discussion on mailing list, I propose adding a webhook to 
publish 

> Implement an optional webhook parameter to scheduler to POST TaskStateChange 
> events
> ---
>
> Key: AURORA-1683
> URL: https://issues.apache.org/jira/browse/AURORA-1683
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>Assignee: Dmitriy Shirchenko
>
> Per discussion on mailing list, I propose adding a webhook to publish 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Assigned] (AURORA-1683) Implement an optional webhook parameter to scheduler to POST TaskStateChange events

2016-04-28 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko reassigned AURORA-1683:
--

Assignee: Dmitriy Shirchenko

> Implement an optional webhook parameter to scheduler to POST TaskStateChange 
> events
> ---
>
> Key: AURORA-1683
> URL: https://issues.apache.org/jira/browse/AURORA-1683
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>Assignee: Dmitriy Shirchenko
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1683) Implement an optional webhook parameter to scheduler to POST TaskStateChange events

2016-04-28 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1683?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1683:
---
Summary: Implement an optional webhook parameter to scheduler to POST 
TaskStateChange events  (was: Pointers on how to build a subscriber that could 
send TaskStateChange events to a Kafka topic)

> Implement an optional webhook parameter to scheduler to POST TaskStateChange 
> events
> ---
>
> Key: AURORA-1683
> URL: https://issues.apache.org/jira/browse/AURORA-1683
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1683) Pointers on how to build a subscriber that could send TaskStateChange events to a Kafka topic

2016-04-25 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1683:
--

 Summary: Pointers on how to build a subscriber that could send 
TaskStateChange events to a Kafka topic
 Key: AURORA-1683
 URL: https://issues.apache.org/jira/browse/AURORA-1683
 Project: Aurora
  Issue Type: Task
Reporter: Dmitriy Shirchenko






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Reopened] (AURORA-1666) Allow health check to not run as job's role `role`

2016-04-18 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1666?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko reopened AURORA-1666:


Previous RB was done incorrectly, as a boolean was not passed in.

```
thermos_executor.pex: error: --setuid-health-checks option does not take a value
```

> Allow health check to not run as job's role `role`
> --
>
> Key: AURORA-1666
> URL: https://issues.apache.org/jira/browse/AURORA-1666
> Project: Aurora
>  Issue Type: Story
>Reporter: Dmitriy Shirchenko
>Assignee: Dmitriy Shirchenko
>
> We have an issue with new permission demotion as our containers run as the 
> same user and we don't care what user they run as. 
> I propose a flag to disable this demotion and allow health check to keep 
> running as root if we want this. Maybe this can be done at thermos executor 
> level.
> {code}
> E0415 20:32:51.799864 6 aurora_executor.py:118] Traceback (most recent call 
> last):
>   File "apache/aurora/executor/aurora_executor.py", line 116, in _run
> self._start_status_manager(driver, assigned_task)
>   File "apache/aurora/executor/aurora_executor.py", line 161, in 
> _start_status_manager
> status_checker = status_provider.from_assigned_task(assigned_task, 
> self._sandbox)
>   File "apache/aurora/executor/common/health_checker.py", line 248, in 
> from_assigned_task
> pw_entry = pwd.getpwnam(assigned_task.task.job.role)
> KeyError: 'getpwnam(): name not found: meta-umonitor-worker'
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (AURORA-1666) Allow health check to not run as job's role `role`

2016-04-18 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15246974#comment-15246974
 ] 

Dmitriy Shirchenko edited comment on AURORA-1666 at 4/19/16 2:03 AM:
-

Previous RB was done incorrectly, as a boolean was not passed in.

{code}
thermos_executor.pex: error: --setuid-health-checks option does not take a value
{code}


was (Author: shirchen):
Previous RB was done incorrectly, as a boolean was not passed in.

```
thermos_executor.pex: error: --setuid-health-checks option does not take a value
```

> Allow health check to not run as job's role `role`
> --
>
> Key: AURORA-1666
> URL: https://issues.apache.org/jira/browse/AURORA-1666
> Project: Aurora
>  Issue Type: Story
>Reporter: Dmitriy Shirchenko
>Assignee: Dmitriy Shirchenko
>
> We have an issue with new permission demotion as our containers run as the 
> same user and we don't care what user they run as. 
> I propose a flag to disable this demotion and allow health check to keep 
> running as root if we want this. Maybe this can be done at thermos executor 
> level.
> {code}
> E0415 20:32:51.799864 6 aurora_executor.py:118] Traceback (most recent call 
> last):
>   File "apache/aurora/executor/aurora_executor.py", line 116, in _run
> self._start_status_manager(driver, assigned_task)
>   File "apache/aurora/executor/aurora_executor.py", line 161, in 
> _start_status_manager
> status_checker = status_provider.from_assigned_task(assigned_task, 
> self._sandbox)
>   File "apache/aurora/executor/common/health_checker.py", line 248, in 
> from_assigned_task
> pw_entry = pwd.getpwnam(assigned_task.task.job.role)
> KeyError: 'getpwnam(): name not found: meta-umonitor-worker'
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (AURORA-1666) Allow health check to not run as job's role `role`

2016-04-15 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243845#comment-15243845
 ] 

Dmitriy Shirchenko commented on AURORA-1666:


https://reviews.apache.org/r/46290/

> Allow health check to not run as job's role `role`
> --
>
> Key: AURORA-1666
> URL: https://issues.apache.org/jira/browse/AURORA-1666
> Project: Aurora
>  Issue Type: Story
>Reporter: Dmitriy Shirchenko
>Assignee: Dmitriy Shirchenko
>
> We have an issue with new permission demotion as our containers run as the 
> same user and we don't care what user they run as. 
> I propose a flag to disable this demotion and allow health check to keep 
> running as root if we want this. Maybe this can be done at thermos executor 
> level.
> {code}
> E0415 20:32:51.799864 6 aurora_executor.py:118] Traceback (most recent call 
> last):
>   File "apache/aurora/executor/aurora_executor.py", line 116, in _run
> self._start_status_manager(driver, assigned_task)
>   File "apache/aurora/executor/aurora_executor.py", line 161, in 
> _start_status_manager
> status_checker = status_provider.from_assigned_task(assigned_task, 
> self._sandbox)
>   File "apache/aurora/executor/common/health_checker.py", line 248, in 
> from_assigned_task
> pw_entry = pwd.getpwnam(assigned_task.task.job.role)
> KeyError: 'getpwnam(): name not found: meta-umonitor-worker'
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (AURORA-1666) Allow health check to not run as job's role `role`

2016-04-15 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1666?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15243736#comment-15243736
 ] 

Dmitriy Shirchenko commented on AURORA-1666:


Sounds [~zmanji] is onboard with suggestion of adding a thermos_executor flag.

> Allow health check to not run as job's role `role`
> --
>
> Key: AURORA-1666
> URL: https://issues.apache.org/jira/browse/AURORA-1666
> Project: Aurora
>  Issue Type: Story
>Reporter: Dmitriy Shirchenko
>Assignee: Dmitriy Shirchenko
>
> We have an issue with new permission demotion as our containers run as the 
> same user and we don't care what user they run as. 
> I propose a flag to disable this demotion and allow health check to keep 
> running as root if we want this. Maybe this can be done at thermos executor 
> level.
> {code}
> E0415 20:32:51.799864 6 aurora_executor.py:118] Traceback (most recent call 
> last):
>   File "apache/aurora/executor/aurora_executor.py", line 116, in _run
> self._start_status_manager(driver, assigned_task)
>   File "apache/aurora/executor/aurora_executor.py", line 161, in 
> _start_status_manager
> status_checker = status_provider.from_assigned_task(assigned_task, 
> self._sandbox)
>   File "apache/aurora/executor/common/health_checker.py", line 248, in 
> from_assigned_task
> pw_entry = pwd.getpwnam(assigned_task.task.job.role)
> KeyError: 'getpwnam(): name not found: meta-umonitor-worker'
> {code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1666) Allow health check to not run as job's role `role`

2016-04-15 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1666:
--

 Summary: Allow health check to not run as job's role `role`
 Key: AURORA-1666
 URL: https://issues.apache.org/jira/browse/AURORA-1666
 Project: Aurora
  Issue Type: Story
Reporter: Dmitriy Shirchenko


We have an issue with new permission demotion as our containers run as the same 
user and we don't care what user they run as. 

I propose a flag to disable this demotion and allow health check to keep 
running as root if we want this. Maybe this can be done at thermos executor 
level.

{code}
E0415 20:32:51.799864 6 aurora_executor.py:118] Traceback (most recent call 
last):
  File "apache/aurora/executor/aurora_executor.py", line 116, in _run
self._start_status_manager(driver, assigned_task)
  File "apache/aurora/executor/aurora_executor.py", line 161, in 
_start_status_manager
status_checker = status_provider.from_assigned_task(assigned_task, 
self._sandbox)
  File "apache/aurora/executor/common/health_checker.py", line 248, in 
from_assigned_task
pw_entry = pwd.getpwnam(assigned_task.task.job.role)
KeyError: 'getpwnam(): name not found: meta-umonitor-worker'
{code}



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1661) Scheduler leader failed to re-announce itself after ZK name was changed

2016-04-12 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1661:
--

 Summary: Scheduler leader failed to re-announce itself after ZK 
name was changed
 Key: AURORA-1661
 URL: https://issues.apache.org/jira/browse/AURORA-1661
 Project: Aurora
  Issue Type: Story
Reporter: Dmitriy Shirchenko
Assignee: John Sirois


After we renamed our ZK cluster we were in a situation where the Aurora leader 
did not write to the ZK endpoint. 

Sequence that we performed:
- change aurora cluster name in clusters.json
- restart all schedulers (all good)
- restart zk and move replication log (lose all state) during renaming: 
intentional 
- current Aurora leader fails to re-announce itself but election isn't 
retriggered so we lose our leader
- some time later: restart all schedulers, and a new leader is elected (good 
again)


Partial logs from the master (sorry, I'm not sure which parts are super 
relevant).


{code}
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: 2016-04-08 
22:59:10,723:30181(0x7f7586244700):ZOO_ERROR@handle_socket_error_msg@1697: 
Socket [10.162.18.25:2181] zk retcode=-4, errno=111(Connection refused): server 
refused to accept the client
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: 2016-04-08 
22:59:10,723:30181(0x7f7586a45700):ZOO_ERROR@handle_socket_error_msg@1697: 
Socket [10.162.28.31:2181] zk retcode=-4, errno=111(Connection refused): server 
refused to accept the client
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: 2016-04-08 
22:59:10,723:30181(0x7f7586244700):ZOO_INFO@check_events@1703: initiated 
connection to server [10.162.22.24:2181]
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: 2016-04-08 
22:59:10,723:30181(0x7f7586a45700):ZOO_INFO@check_events@1703: initiated 
connection to server [10.162.2.27:2181]
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: 2016-04-08 
22:59:10,733:30181(0x7f7586a45700):ZOO_INFO@check_events@1750: session 
establishment complete on server [10.162.2.27:2181], 
sessionId=0x28004d50ccb80001, negotiated timeout=4000
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.734082 
30370 group.cpp:349] Group process (group(1)@10.162.12.31:8083) connected to 
ZooKeeper
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.734127 
30370 group.cpp:831] Syncing group operations: queue size (joins, cancels, 
datas) = (0, 0, 0)
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.734136 
30370 group.cpp:427] Trying to create path '/aurora/replicated-log' in ZooKeeper
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.755173 
30373 network.hpp:413] ZooKeeper group memberships changed
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.755249 
30367 group.cpp:700] Trying to get '/aurora/replicated-log/00' in 
ZooKeeper
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.756778 
30364 network.hpp:461] ZooKeeper group PIDs: { log-replica(1)@10.162.14.30:8083 
}
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.763407 
30373 network.hpp:413] ZooKeeper group memberships changed
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.763478 
30362 group.cpp:700] Trying to get '/aurora/replicated-log/00' in 
ZooKeeper
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.764437 
30362 group.cpp:700] Trying to get '/aurora/replicated-log/01' in 
ZooKeeper
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.765494 
30376 network.hpp:461] ZooKeeper group PIDs: { 
log-replica(1)@10.160.41.62:8083, log-replica(1)@10.162.14.30:8083 }
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: 2016-04-08 
22:59:10,782:30181(0x7f7586244700):ZOO_INFO@check_events@1750: session 
establishment complete on server [10.162.22.24:2181], 
sessionId=0x26004d8049520002, negotiated timeout=4000
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.782429 
30376 group.cpp:349] Group process (group(2)@10.162.12.31:8083) connected to 
ZooKeeper
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.782444 
30376 group.cpp:831] Syncing group operations: queue size (joins, cancels, 
datas) = (1, 0, 0)
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.782447 
30376 group.cpp:427] Trying to create path '/aurora/replicated-log' in ZooKeeper
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.795754 
30377 network.hpp:413] ZooKeeper group memberships changed
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.795805 
30359 group.cpp:700] Trying to get '/aurora/replicated-log/00' in 
ZooKeeper
Apr 08 22:59:10 compute33-sjc1 aurora-scheduler[30176]: I0408 22:59:10.796530 
30359 group.cpp:700] Trying to get '/aurora/replicated-log/01' in 
ZooKeeper
Apr 08 22:59:10 

[jira] [Commented] (AURORA-1641) Shell health checker is running as root

2016-03-15 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1641?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15196520#comment-15196520
 ] 

Dmitriy Shirchenko commented on AURORA-1641:


I would love to help and feel responsible but I'm going on vacation on Sunday 
for a week so don't have time right now :/.

But in the meanwhile can someone give a rough outline of required work?
One proposal I saw was by [~zmanji] who mentioned that we may need to make the 
health check runner look more like: 
https://github.com/apache/aurora/blame/d752d466c550118f052d23519d071eb41b2e5bf6/src/main/python/apache/thermos/core/process.py#L327
 


> Shell health checker is running as root
> ---
>
> Key: AURORA-1641
> URL: https://issues.apache.org/jira/browse/AURORA-1641
> Project: Aurora
>  Issue Type: Bug
>  Components: Executor, Security
>Reporter: Stephan Erb
>Priority: Blocker
>
> As the operator of an Aurora cluster, I have to guarantee that users can run 
> commands only with the privileges of their {{role}}. The new health checker 
> feature is risky in that regard, as it runs all health check commands with 
> the privileges of the Thermos runner. In most common deployments this is root.
> The Thermos runner supports various means for setting the uid/user/role that 
> is used to run user processes. The same configuration should also apply to 
> the user-defined health checking command.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (AURORA-1622) Add ability to pass in environment variables into shell health checker

2016-02-23 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15160081#comment-15160081
 ] 

Dmitriy Shirchenko commented on AURORA-1622:


On second thought, maybe just passing in the assigned port via an environment 
variable would do it. 

Just need to switch check_call to Popen  here [1] and pass the port from here 
[2]

[1] 
https://github.com/apache/aurora/blob/master/src/main/python/apache/aurora/common/health_check/shell.py#L50
[2] 
https://github.com/apache/aurora/blob/master/src/main/python/apache/aurora/executor/common/health_checker.py#L224

> Add ability to pass in environment variables into shell health checker
> --
>
> Key: AURORA-1622
> URL: https://issues.apache.org/jira/browse/AURORA-1622
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>
> A use case is to execute the health check command based on the port 
> assignment. 
> One of the options is to change subprocess32 to use Popen funciton instead of 
> current check_call as Popen has an env= argument.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1622) Add ability to pass in environment variables into shell health checker

2016-02-23 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1622?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1622:
---
Description: 
A use case is to execute the health check command based on the port assignment. 

One of the options is to change subprocess32 to use Popen funciton instead of 
current check_call as Popen has an env= argument.

  was:A use case is to execute the health check command based on the port 
assignment.


> Add ability to pass in environment variables into shell health checker
> --
>
> Key: AURORA-1622
> URL: https://issues.apache.org/jira/browse/AURORA-1622
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>
> A use case is to execute the health check command based on the port 
> assignment. 
> One of the options is to change subprocess32 to use Popen funciton instead of 
> current check_call as Popen has an env= argument.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1622) Add ability to pass in environment variables into shell health checker

2016-02-23 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1622:
--

 Summary: Add ability to pass in environment variables into shell 
health checker
 Key: AURORA-1622
 URL: https://issues.apache.org/jira/browse/AURORA-1622
 Project: Aurora
  Issue Type: Task
Reporter: Dmitriy Shirchenko


A use case is to execute the health check command based on the port assignment.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (AURORA-1589) Dupe debs are showing up

2016-01-21 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1589?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15111759#comment-15111759
 ] 

Dmitriy Shirchenko commented on AURORA-1589:


cc [~jsirois]

> Dupe debs are showing up
> 
>
> Key: AURORA-1589
> URL: https://issues.apache.org/jira/browse/AURORA-1589
> Project: Aurora
>  Issue Type: Story
>Reporter: Dmitriy Shirchenko
>Assignee: Bill Farner
>
> Looks like jessie and trusty packages are colliding and trusty (last one to 
> build) is winning out:
> https://builds.apache.org/job/aurora-packaging-nightly/lastSuccessfulBuild/artifact/aurora-packaging/artifacts/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1589) Dupe debs are showing up

2016-01-21 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1589:
--

 Summary: Dupe debs are showing up
 Key: AURORA-1589
 URL: https://issues.apache.org/jira/browse/AURORA-1589
 Project: Aurora
  Issue Type: Story
Reporter: Dmitriy Shirchenko
Assignee: Bill Farner


Looks like jessie and trusty packages are colliding and trusty (last one to 
build) is winning out:

https://builds.apache.org/job/aurora-packaging-nightly/lastSuccessfulBuild/artifact/aurora-packaging/artifacts/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (AURORA-1562) Create separate HttpEndpointConfig and ShellEndpointConfig structs

2015-12-15 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1562?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=15059244#comment-15059244
 ] 

Dmitriy Shirchenko commented on AURORA-1562:


Review: https://reviews.apache.org/r/41428/

> Create separate HttpEndpointConfig and ShellEndpointConfig structs
> --
>
> Key: AURORA-1562
> URL: https://issues.apache.org/jira/browse/AURORA-1562
> Project: Aurora
>  Issue Type: Story
>Affects Versions: 0.11.0
>Reporter: Dmitriy Shirchenko
>
> Need to address feedback from https://reviews.apache.org/r/41154/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1563) Deprecate endpoint, expected_response and expected_response_code from HealthCheckConfig

2015-12-15 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1563:
--

 Summary: Deprecate endpoint, expected_response and 
expected_response_code from HealthCheckConfig
 Key: AURORA-1563
 URL: https://issues.apache.org/jira/browse/AURORA-1563
 Project: Aurora
  Issue Type: Story
Reporter: Dmitriy Shirchenko
 Fix For: 0.12.0


For example, remove deprecated code from health_checker.py and config.py which 
supports 2 ways of getting attributes listed in the title of this task.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1562) Create separate HttpEndpointConfig and ShellEndpointConfig structs

2015-12-14 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1562?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1562:
---
Description: Need to address feedback from 
https://reviews.apache.org/r/41154/

> Create separate HttpEndpointConfig and ShellEndpointConfig structs
> --
>
> Key: AURORA-1562
> URL: https://issues.apache.org/jira/browse/AURORA-1562
> Project: Aurora
>  Issue Type: Story
>Affects Versions: 0.11.0
>Reporter: Dmitriy Shirchenko
>
> Need to address feedback from https://reviews.apache.org/r/41154/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1562) Create separate HttpEndpointConfig and ShellEndpointConfig structs

2015-12-14 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1562:
--

 Summary: Create separate HttpEndpointConfig and 
ShellEndpointConfig structs
 Key: AURORA-1562
 URL: https://issues.apache.org/jira/browse/AURORA-1562
 Project: Aurora
  Issue Type: Story
Affects Versions: 0.11.0
Reporter: Dmitriy Shirchenko






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1552) Perform explicit config validation in the executor when configuring health checker

2015-12-10 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1552:
--

 Summary: Perform explicit config validation in the executor when 
configuring health checker
 Key: AURORA-1552
 URL: https://issues.apache.org/jira/browse/AURORA-1552
 Project: Aurora
  Issue Type: Story
Reporter: Dmitriy Shirchenko


Currently there is only client side config validation, so if someone is talking 
directly to scheduler jobs may not fail fast and/or it is unclear why the fail.



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1551) Add support for non-HTTP based health checks

2015-12-09 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1551:
--

 Summary: Add support for non-HTTP based health checks
 Key: AURORA-1551
 URL: https://issues.apache.org/jira/browse/AURORA-1551
 Project: Aurora
  Issue Type: Story
Reporter: Dmitriy Shirchenko
Assignee: Dmitriy Shirchenko


We would like to be able to simply define a command which will act as a health 
check with a non 0 return code signifying failure.

Patch attached:
https://reviews.apache.org/r/41154



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1551) Add support for non-HTTP based health checks

2015-12-09 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1551?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1551:
---
Assignee: Bill Farner  (was: Dmitriy Shirchenko)

> Add support for non-HTTP based health checks
> 
>
> Key: AURORA-1551
> URL: https://issues.apache.org/jira/browse/AURORA-1551
> Project: Aurora
>  Issue Type: Story
>Reporter: Dmitriy Shirchenko
>Assignee: Bill Farner
>  Labels: Uber
>
> We would like to be able to simply define a command which will act as a 
> health check with a non 0 return code signifying failure.
> Patch attached:
> https://reviews.apache.org/r/41154



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1536) Add test coverage to python/src/apache/*

2015-11-10 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1536:
---
Assignee: Bill Farner  (was: Zameer Manji)

> Add test coverage to python/src/apache/*
> 
>
> Key: AURORA-1536
> URL: https://issues.apache.org/jira/browse/AURORA-1536
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>Assignee: Bill Farner
>Priority: Trivial
>  Labels: Uber
>
> https://reviews.apache.org/r/39958/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (AURORA-1536) Add test coverage to python/src/apache/*

2015-11-10 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14998277#comment-14998277
 ] 

Dmitriy Shirchenko commented on AURORA-1536:


Reassigned to Bill so this can be closed properly. I'm not sure how :/

> Add test coverage to python/src/apache/*
> 
>
> Key: AURORA-1536
> URL: https://issues.apache.org/jira/browse/AURORA-1536
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>Assignee: Bill Farner
>Priority: Trivial
>  Labels: Uber
>
> https://reviews.apache.org/r/39958/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1536) Add test coverage to python/src/apache/*

2015-11-07 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1536:
---
Assignee: Zameer Manji  (was: Bill Farner)

> Add test coverage to python/src/apache/*
> 
>
> Key: AURORA-1536
> URL: https://issues.apache.org/jira/browse/AURORA-1536
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>Assignee: Zameer Manji
>Priority: Trivial
>  Labels: Uber
>
> https://reviews.apache.org/r/39958/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1536) Add test coverage to python/src/apache/*

2015-11-04 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1536?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1536:
---
Summary: Add test coverage to python/src/apache/*  (was: Add test coverage 
python/src/apache/*)

> Add test coverage to python/src/apache/*
> 
>
> Key: AURORA-1536
> URL: https://issues.apache.org/jira/browse/AURORA-1536
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>Assignee: Bill Farner
>Priority: Trivial
>  Labels: Uber
>
> https://reviews.apache.org/r/39958/



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1531) upgrade psutil to 3.2.2 from 2.1.3

2015-11-02 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1531:
---
Labels: Uber  (was: uber)

> upgrade psutil to 3.2.2 from 2.1.3
> --
>
> Key: AURORA-1531
> URL: https://issues.apache.org/jira/browse/AURORA-1531
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>Assignee: Bill Farner
>Priority: Minor
>  Labels: Uber
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1531) upgrade psutil to 3.2.2 from 2.1.3

2015-11-02 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1531?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1531:
---
Labels: uber  (was: )

> upgrade psutil to 3.2.2 from 2.1.3
> --
>
> Key: AURORA-1531
> URL: https://issues.apache.org/jira/browse/AURORA-1531
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>Assignee: Bill Farner
>Priority: Minor
>  Labels: uber
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Updated] (AURORA-1529) Wrong command to `Build a client executable`

2015-11-02 Thread Dmitriy Shirchenko (JIRA)

 [ 
https://issues.apache.org/jira/browse/AURORA-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dmitriy Shirchenko updated AURORA-1529:
---
Labels: Uber  (was: uber)

> Wrong command to `Build a client executable`
> 
>
> Key: AURORA-1529
> URL: https://issues.apache.org/jira/browse/AURORA-1529
> Project: Aurora
>  Issue Type: Bug
>Reporter: Dmitriy Shirchenko
>Assignee: Dmitriy Shirchenko
>Priority: Trivial
>  Labels: Uber
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (AURORA-1531) upgrade psutil to 3.2.2 from 2.1.3

2015-10-30 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1531?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14983006#comment-14983006
 ] 

Dmitriy Shirchenko commented on AURORA-1531:


https://reviews.apache.org/r/39822/

> upgrade psutil to 3.2.2 from 2.1.3
> --
>
> Key: AURORA-1531
> URL: https://issues.apache.org/jira/browse/AURORA-1531
> Project: Aurora
>  Issue Type: Task
>Reporter: Dmitriy Shirchenko
>Assignee: Dmitriy Shirchenko
>Priority: Minor
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1530) client tests fails when run via Test client code: ./pants test src/test/python/apache/aurora/client/cli:all

2015-10-29 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1530:
--

 Summary: client tests fails when run via Test client code: ./pants 
test src/test/python/apache/aurora/client/cli:all
 Key: AURORA-1530
 URL: https://issues.apache.org/jira/browse/AURORA-1530
 Project: Aurora
  Issue Type: Bug
Reporter: Dmitriy Shirchenko
Priority: Minor


Docs from 
https://aurora.apache.org/documentation/latest/developing-aurora-client/ state 
to run this to test client.

Test client code: ./pants test src/test/python/apache/aurora/client/cli:all

This however fails eg:

{code}
shirchen@Dmitriys-MacBook-Pro-7:~/Uber/sync/glitz.dev.uber.com/home/uber/aurora 
[master| …3]> ./pants test src/test/python/apache/aurora/client/cli:all
INFO] Detected git repository at 
/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora on branch master

13:56:43 00:00 [main]
   (To run a reporting server: ./pants server)
13:56:43 00:00   [bootstrap]
13:56:43 00:00   [setup]
13:56:43 00:00 [parse]
   Executing tasks in goals: bootstrap -> imports -> unpack-jars -> 
deferred-sources -> gen -> resolve -> compile -> resources -> test
13:56:43 00:00   [bootstrap]
13:56:43 00:00 [bootstrap-jvm-tools]
13:56:43 00:00   [imports]
13:56:43 00:00 [ivy-imports]
13:56:43 00:00   [unpack-jars]
13:56:43 00:00 [unpack-jars]
13:56:43 00:00   [deferred-sources]
13:56:43 00:00 [deferred-sources]
13:56:43 00:00   [gen]
13:56:43 00:00 [thrift]
13:56:44 00:01 [protoc]
13:56:44 00:01 [antlr]
13:56:44 00:01 [ragel]
13:56:44 00:01 [jaxb]
13:56:44 00:01 [wire]
13:56:44 00:01 [aapt]
13:56:44 00:01   [resolve]
13:56:44 00:01 [ivy]
13:56:44 00:01   [compile]
13:56:44 00:01 [compile]
13:56:44 00:01 [jvm]
13:56:44 00:01   [jvm-compilers]
13:56:44 00:01   [resources]
13:56:44 00:01 [prepare]
13:56:44 00:01   [test]
13:56:44 00:01 [run_prep_command]
13:56:44 00:01 [test]
13:56:44 00:01 [pytest]
13:56:44 00:01   [run]
 == test session starts ===
 platform darwin -- Python 2.7.5, pytest-2.8.2, py-1.4.30, 
pluggy-0.3.1
 rootdir: 
/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/src/test/python/apache/aurora/client/cli,
 inifile: 
 plugins: cov-2.2.0, timeout-0.5
 collected 157 items 
 
 
src/test/python/apache/aurora/client/cli/test_api_from_cli.py ..
 src/test/python/apache/aurora/client/cli/test_client.py ..
 
src/test/python/apache/aurora/client/cli/test_command_hooks.py ..
 src/test/python/apache/aurora/client/cli/test_context.py 

 src/test/python/apache/aurora/client/cli/test_cron.py 
..
 src/test/python/apache/aurora/client/cli/test_inspect.py 
...
 
src/test/python/apache/aurora/client/cli/test_cancel_update.py ..
 src/test/python/apache/aurora/client/cli/test_create.py 
..
 src/test/python/apache/aurora/client/cli/test_diff.py ...
 src/test/python/apache/aurora/client/cli/test_kill.py 
.
 src/test/python/apache/aurora/client/cli/test_open.py .
 src/test/python/apache/aurora/client/cli/test_restart.py 
...
 src/test/python/apache/aurora/client/cli/test_status.py 
.
 
src/test/python/apache/aurora/client/cli/test_config_noun.py ...
 src/test/python/apache/aurora/client/cli/test_options.py 
...
 src/test/python/apache/aurora/client/cli/test_plugins.py .
 src/test/python/apache/aurora/client/cli/test_quota.py 
.
 src/test/python/apache/aurora/client/cli/test_sla.py .
 src/test/python/apache/aurora/client/cli/test_task.py .
 src/test/python/apache/aurora/client/cli/test_supdate.py 
.
 src/test/python/apache/aurora/client/cli/test_update.py 
.FF...F..
 src/test/python/apache/aurora/client/cli/test_version.py .
 
  FAILURES 
  TestUpdateCommand.test_large_with_instances_doesnt_warn 
 
 self = 
 
 def test_large_with_instances_doesnt_warn(self):
   (mock_api, mock_scheduler_proxy) = 
self.create_mock_api()
   mock_health_check = 
self.setup_health_checks(mock_api)
   mock_quota_check = self.setup_quota_check()
   

[jira] [Comment Edited] (AURORA-1530) client tests fails when run via Test client code: ./pants test src/test/python/apache/aurora/client/cli:all

2015-10-29 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14981288#comment-14981288
 ] 

Dmitriy Shirchenko edited comment on AURORA-1530 at 10/29/15 9:20 PM:
--

a solution suggested by [~maximk] and [~zmanji] is to instead use 
{code}
./pants test.pytest --no-fast src/test/python/apache/aurora/client/cli:all
{code}

Also, 
{code} ./pants test.pytest --no-fast 
src/test/python/apache/aurora/client/cli:all {code} takes about 58-60s 

versus 

{code} ./pants test src/test/python/apache/aurora/client/cli:all {code}
  3 failed, 154 passed, 1 pytest-warnings in 106.37 seconds 



was (Author: shirchen):
a solution suggested by [~maximk] and [~zmanji] is to instead use 
{code}
./pants test.pytest --no-fast src/test/python/apache/aurora/client/cli:all
{code}

Also, 
{code} ./pants test.pytest --no-fast 
src/test/python/apache/aurora/client/cli:all {code} takes about 58-60s 

versus 

{./pants test src/test/python/apache/aurora/client/cli:all}
  3 failed, 154 passed, 1 pytest-warnings in 106.37 seconds 


> client tests fails when run via Test client code: ./pants test 
> src/test/python/apache/aurora/client/cli:all
> ---
>
> Key: AURORA-1530
> URL: https://issues.apache.org/jira/browse/AURORA-1530
> Project: Aurora
>  Issue Type: Bug
>Reporter: Dmitriy Shirchenko
>Priority: Minor
>
> Docs from 
> https://aurora.apache.org/documentation/latest/developing-aurora-client/ 
> state to run this to test client.
> Test client code: ./pants test src/test/python/apache/aurora/client/cli:all
> This however fails eg:
> {code}
> shirchen@Dmitriys-MacBook-Pro-7:~/Uber/sync/glitz.dev.uber.com/home/uber/aurora
>  [master| …3]> ./pants test src/test/python/apache/aurora/client/cli:all
> INFO] Detected git repository at 
> /Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora on branch master
> 13:56:43 00:00 [main]
>(To run a reporting server: ./pants server)
> 13:56:43 00:00   [bootstrap]
> 13:56:43 00:00   [setup]
> 13:56:43 00:00 [parse]
>Executing tasks in goals: bootstrap -> imports -> unpack-jars 
> -> deferred-sources -> gen -> resolve -> compile -> resources -> test
> 13:56:43 00:00   [bootstrap]
> 13:56:43 00:00 [bootstrap-jvm-tools]
> 13:56:43 00:00   [imports]
> 13:56:43 00:00 [ivy-imports]
> 13:56:43 00:00   [unpack-jars]
> 13:56:43 00:00 [unpack-jars]
> 13:56:43 00:00   [deferred-sources]
> 13:56:43 00:00 [deferred-sources]
> 13:56:43 00:00   [gen]
> 13:56:43 00:00 [thrift]
> 13:56:44 00:01 [protoc]
> 13:56:44 00:01 [antlr]
> 13:56:44 00:01 [ragel]
> 13:56:44 00:01 [jaxb]
> 13:56:44 00:01 [wire]
> 13:56:44 00:01 [aapt]
> 13:56:44 00:01   [resolve]
> 13:56:44 00:01 [ivy]
> 13:56:44 00:01   [compile]
> 13:56:44 00:01 [compile]
> 13:56:44 00:01 [jvm]
> 13:56:44 00:01   [jvm-compilers]
> 13:56:44 00:01   [resources]
> 13:56:44 00:01 [prepare]
> 13:56:44 00:01   [test]
> 13:56:44 00:01 [run_prep_command]
> 13:56:44 00:01 [test]
> 13:56:44 00:01 [pytest]
> 13:56:44 00:01   [run]
>  == test session starts ===
>  platform darwin -- Python 2.7.5, pytest-2.8.2, 
> py-1.4.30, pluggy-0.3.1
>  rootdir: 
> /Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/src/test/python/apache/aurora/client/cli,
>  inifile: 
>  plugins: cov-2.2.0, timeout-0.5
>  collected 157 items 
>  
>  
> src/test/python/apache/aurora/client/cli/test_api_from_cli.py ..
>  src/test/python/apache/aurora/client/cli/test_client.py 
> ..
>  
> src/test/python/apache/aurora/client/cli/test_command_hooks.py ..
>  src/test/python/apache/aurora/client/cli/test_context.py 
> 
>  src/test/python/apache/aurora/client/cli/test_cron.py 
> ..
>  src/test/python/apache/aurora/client/cli/test_inspect.py 
> ...
>  
> src/test/python/apache/aurora/client/cli/test_cancel_update.py ..
>  src/test/python/apache/aurora/client/cli/test_create.py 
> ..
>  src/test/python/apache/aurora/client/cli/test_diff.py ...
>  src/test/python/apache/aurora/client/cli/test_kill.py 
> .
>  src/test/python/apache/aurora/client/cli/test_open.py 
> .
>  src/test/python/apache/aurora/client/cli/test_restart.py 
> ...
>  

[jira] [Comment Edited] (AURORA-1530) client tests fails when run via Test client code: ./pants test src/test/python/apache/aurora/client/cli:all

2015-10-29 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14981288#comment-14981288
 ] 

Dmitriy Shirchenko edited comment on AURORA-1530 at 10/29/15 9:20 PM:
--

a solution suggested by [~maximk] and [~zmanji] is to instead use 
{code}
./pants test.pytest --no-fast src/test/python/apache/aurora/client/cli:all
{code}

Also, 
{code} ./pants test.pytest --no-fast 
src/test/python/apache/aurora/client/cli:all {code} takes about 58-60s 

versus 

{./pants test src/test/python/apache/aurora/client/cli:all}
  3 failed, 154 passed, 1 pytest-warnings in 106.37 seconds 



was (Author: shirchen):
a solution suggested by [~maximk]and [~zmanji] is to instead use 
{code}
./pants test.pytest --no-fast src/test/python/apache/aurora/client/cli:all
{code}


> client tests fails when run via Test client code: ./pants test 
> src/test/python/apache/aurora/client/cli:all
> ---
>
> Key: AURORA-1530
> URL: https://issues.apache.org/jira/browse/AURORA-1530
> Project: Aurora
>  Issue Type: Bug
>Reporter: Dmitriy Shirchenko
>Priority: Minor
>
> Docs from 
> https://aurora.apache.org/documentation/latest/developing-aurora-client/ 
> state to run this to test client.
> Test client code: ./pants test src/test/python/apache/aurora/client/cli:all
> This however fails eg:
> {code}
> shirchen@Dmitriys-MacBook-Pro-7:~/Uber/sync/glitz.dev.uber.com/home/uber/aurora
>  [master| …3]> ./pants test src/test/python/apache/aurora/client/cli:all
> INFO] Detected git repository at 
> /Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora on branch master
> 13:56:43 00:00 [main]
>(To run a reporting server: ./pants server)
> 13:56:43 00:00   [bootstrap]
> 13:56:43 00:00   [setup]
> 13:56:43 00:00 [parse]
>Executing tasks in goals: bootstrap -> imports -> unpack-jars 
> -> deferred-sources -> gen -> resolve -> compile -> resources -> test
> 13:56:43 00:00   [bootstrap]
> 13:56:43 00:00 [bootstrap-jvm-tools]
> 13:56:43 00:00   [imports]
> 13:56:43 00:00 [ivy-imports]
> 13:56:43 00:00   [unpack-jars]
> 13:56:43 00:00 [unpack-jars]
> 13:56:43 00:00   [deferred-sources]
> 13:56:43 00:00 [deferred-sources]
> 13:56:43 00:00   [gen]
> 13:56:43 00:00 [thrift]
> 13:56:44 00:01 [protoc]
> 13:56:44 00:01 [antlr]
> 13:56:44 00:01 [ragel]
> 13:56:44 00:01 [jaxb]
> 13:56:44 00:01 [wire]
> 13:56:44 00:01 [aapt]
> 13:56:44 00:01   [resolve]
> 13:56:44 00:01 [ivy]
> 13:56:44 00:01   [compile]
> 13:56:44 00:01 [compile]
> 13:56:44 00:01 [jvm]
> 13:56:44 00:01   [jvm-compilers]
> 13:56:44 00:01   [resources]
> 13:56:44 00:01 [prepare]
> 13:56:44 00:01   [test]
> 13:56:44 00:01 [run_prep_command]
> 13:56:44 00:01 [test]
> 13:56:44 00:01 [pytest]
> 13:56:44 00:01   [run]
>  == test session starts ===
>  platform darwin -- Python 2.7.5, pytest-2.8.2, 
> py-1.4.30, pluggy-0.3.1
>  rootdir: 
> /Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/src/test/python/apache/aurora/client/cli,
>  inifile: 
>  plugins: cov-2.2.0, timeout-0.5
>  collected 157 items 
>  
>  
> src/test/python/apache/aurora/client/cli/test_api_from_cli.py ..
>  src/test/python/apache/aurora/client/cli/test_client.py 
> ..
>  
> src/test/python/apache/aurora/client/cli/test_command_hooks.py ..
>  src/test/python/apache/aurora/client/cli/test_context.py 
> 
>  src/test/python/apache/aurora/client/cli/test_cron.py 
> ..
>  src/test/python/apache/aurora/client/cli/test_inspect.py 
> ...
>  
> src/test/python/apache/aurora/client/cli/test_cancel_update.py ..
>  src/test/python/apache/aurora/client/cli/test_create.py 
> ..
>  src/test/python/apache/aurora/client/cli/test_diff.py ...
>  src/test/python/apache/aurora/client/cli/test_kill.py 
> .
>  src/test/python/apache/aurora/client/cli/test_open.py 
> .
>  src/test/python/apache/aurora/client/cli/test_restart.py 
> ...
>  src/test/python/apache/aurora/client/cli/test_status.py 
> .
>  
> src/test/python/apache/aurora/client/cli/test_config_noun.py ...
>  src/test/python/apache/aurora/client/cli/test_options.py 
> ...
>  

[jira] [Comment Edited] (AURORA-1530) client tests fails when run via Test client code: ./pants test src/test/python/apache/aurora/client/cli:all

2015-10-29 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1530?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14981288#comment-14981288
 ] 

Dmitriy Shirchenko edited comment on AURORA-1530 at 10/29/15 9:13 PM:
--

a solution suggested by [~maximk]and [~zmanji] is to instead use 
{code}
./pants test.pytest --no-fast src/test/python/apache/aurora/client/cli:all
{code}



was (Author: shirchen):
a solution suggested by [~maximk]and [~zmanji]is to instead use 
{code}
./pants test.pytest --no-fast src/test/python/apache/aurora/client/cli:all
{code}


> client tests fails when run via Test client code: ./pants test 
> src/test/python/apache/aurora/client/cli:all
> ---
>
> Key: AURORA-1530
> URL: https://issues.apache.org/jira/browse/AURORA-1530
> Project: Aurora
>  Issue Type: Bug
>Reporter: Dmitriy Shirchenko
>Priority: Minor
>
> Docs from 
> https://aurora.apache.org/documentation/latest/developing-aurora-client/ 
> state to run this to test client.
> Test client code: ./pants test src/test/python/apache/aurora/client/cli:all
> This however fails eg:
> {code}
> shirchen@Dmitriys-MacBook-Pro-7:~/Uber/sync/glitz.dev.uber.com/home/uber/aurora
>  [master| …3]> ./pants test src/test/python/apache/aurora/client/cli:all
> INFO] Detected git repository at 
> /Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora on branch master
> 13:56:43 00:00 [main]
>(To run a reporting server: ./pants server)
> 13:56:43 00:00   [bootstrap]
> 13:56:43 00:00   [setup]
> 13:56:43 00:00 [parse]
>Executing tasks in goals: bootstrap -> imports -> unpack-jars 
> -> deferred-sources -> gen -> resolve -> compile -> resources -> test
> 13:56:43 00:00   [bootstrap]
> 13:56:43 00:00 [bootstrap-jvm-tools]
> 13:56:43 00:00   [imports]
> 13:56:43 00:00 [ivy-imports]
> 13:56:43 00:00   [unpack-jars]
> 13:56:43 00:00 [unpack-jars]
> 13:56:43 00:00   [deferred-sources]
> 13:56:43 00:00 [deferred-sources]
> 13:56:43 00:00   [gen]
> 13:56:43 00:00 [thrift]
> 13:56:44 00:01 [protoc]
> 13:56:44 00:01 [antlr]
> 13:56:44 00:01 [ragel]
> 13:56:44 00:01 [jaxb]
> 13:56:44 00:01 [wire]
> 13:56:44 00:01 [aapt]
> 13:56:44 00:01   [resolve]
> 13:56:44 00:01 [ivy]
> 13:56:44 00:01   [compile]
> 13:56:44 00:01 [compile]
> 13:56:44 00:01 [jvm]
> 13:56:44 00:01   [jvm-compilers]
> 13:56:44 00:01   [resources]
> 13:56:44 00:01 [prepare]
> 13:56:44 00:01   [test]
> 13:56:44 00:01 [run_prep_command]
> 13:56:44 00:01 [test]
> 13:56:44 00:01 [pytest]
> 13:56:44 00:01   [run]
>  == test session starts ===
>  platform darwin -- Python 2.7.5, pytest-2.8.2, 
> py-1.4.30, pluggy-0.3.1
>  rootdir: 
> /Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/src/test/python/apache/aurora/client/cli,
>  inifile: 
>  plugins: cov-2.2.0, timeout-0.5
>  collected 157 items 
>  
>  
> src/test/python/apache/aurora/client/cli/test_api_from_cli.py ..
>  src/test/python/apache/aurora/client/cli/test_client.py 
> ..
>  
> src/test/python/apache/aurora/client/cli/test_command_hooks.py ..
>  src/test/python/apache/aurora/client/cli/test_context.py 
> 
>  src/test/python/apache/aurora/client/cli/test_cron.py 
> ..
>  src/test/python/apache/aurora/client/cli/test_inspect.py 
> ...
>  
> src/test/python/apache/aurora/client/cli/test_cancel_update.py ..
>  src/test/python/apache/aurora/client/cli/test_create.py 
> ..
>  src/test/python/apache/aurora/client/cli/test_diff.py ...
>  src/test/python/apache/aurora/client/cli/test_kill.py 
> .
>  src/test/python/apache/aurora/client/cli/test_open.py 
> .
>  src/test/python/apache/aurora/client/cli/test_restart.py 
> ...
>  src/test/python/apache/aurora/client/cli/test_status.py 
> .
>  
> src/test/python/apache/aurora/client/cli/test_config_noun.py ...
>  src/test/python/apache/aurora/client/cli/test_options.py 
> ...
>  src/test/python/apache/aurora/client/cli/test_plugins.py 
> .
>  src/test/python/apache/aurora/client/cli/test_quota.py 
> .
>  src/test/python/apache/aurora/client/cli/test_sla.py 
> .
>  src/test/python/apache/aurora/client/cli/test_task.py 
> .
>   

[jira] [Commented] (AURORA-1529) Wrong command to `Build a client executable`

2015-10-28 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979787#comment-14979787
 ] 

Dmitriy Shirchenko commented on AURORA-1529:


oh, and i think one of the committers needs to land the diff for me. thanks!

> Wrong command to `Build a client executable`
> 
>
> Key: AURORA-1529
> URL: https://issues.apache.org/jira/browse/AURORA-1529
> Project: Aurora
>  Issue Type: Bug
>Reporter: Dmitriy Shirchenko
>Assignee: Bill Farner
>Priority: Trivial
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Commented] (AURORA-1529) Wrong command to `Build a client executable`

2015-10-28 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979791#comment-14979791
 ] 

Dmitriy Shirchenko commented on AURORA-1529:


oh sorry, i thought i addressed it. the second command had to be changed, 
right? https://reviews.apache.org/r/39742/diff/3#index_header

> Wrong command to `Build a client executable`
> 
>
> Key: AURORA-1529
> URL: https://issues.apache.org/jira/browse/AURORA-1529
> Project: Aurora
>  Issue Type: Bug
>Reporter: Dmitriy Shirchenko
>Assignee: Bill Farner
>Priority: Trivial
>




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Created] (AURORA-1529) Wrong command to `Build a client executable`

2015-10-28 Thread Dmitriy Shirchenko (JIRA)
Dmitriy Shirchenko created AURORA-1529:
--

 Summary: Wrong command to `Build a client executable`
 Key: AURORA-1529
 URL: https://issues.apache.org/jira/browse/AURORA-1529
 Project: Aurora
  Issue Type: Bug
Reporter: Dmitriy Shirchenko
Assignee: Dmitriy Shirchenko
Priority: Trivial






--
This message was sent by Atlassian JIRA
(v6.3.4#6332)


[jira] [Comment Edited] (AURORA-1529) Wrong command to `Build a client executable`

2015-10-28 Thread Dmitriy Shirchenko (JIRA)

[ 
https://issues.apache.org/jira/browse/AURORA-1529?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=14979487#comment-14979487
 ] 

Dmitriy Shirchenko edited comment on AURORA-1529 at 10/28/15 11:40 PM:
---

pants docs here [1] and [2] to build a client executable : 
`Build a client executable: ./pants binary 
src/main/python/apache/aurora/client/cli:aurora`
`./pants binary src/main/python/apache/aurora/client/cli:kaurora`

leads to 
`
{code}
(env) 
shirchen@Dmitriys-MacBook-Pro-7:~/Uber/sync/glitz.dev.uber.com/home/uber/aurora 
[master| …443]> ./pants binary src/main/python/apache/aurora/client/cli:aurora

Exception caught:
  File 
"/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/build-support/pants.venv/bin/pants",
 line 9, in 
load_entry_point('pantsbuild.pants==0.0.32', 'console_scripts', 'pants')()
  File 
"/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/build-support/pants.venv/lib/python2.7/site-packages/pants/bin/pants_exe.py",
 line 81, in main
_run(exiter)
  File 
"/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/build-support/pants.venv/lib/python2.7/site-packages/pants/bin/pants_exe.py",
 line 72, in _run
goal_runner.setup()
  File 
"/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/build-support/pants.venv/lib/python2.7/site-packages/pants/bin/goal_runner.py",
 line 109, in setup
self._expand_goals_and_specs()
  File 
"/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/build-support/pants.venv/lib/python2.7/site-packages/pants/bin/goal_runner.py",
 line 163, in _expand_goals_and_specs
for address in spec_parser.parse_addresses(spec, fail_fast):
  File 
"/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/build-support/pants.venv/lib/python2.7/site-packages/pants/base/cmd_line_spec_parser.py",
 line 85, in parse_addresses
for address in self._parse_spec(spec, fail_fast):
  File 
"/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/build-support/pants.venv/lib/python2.7/site-packages/pants/base/cmd_line_spec_parser.py",
 line 172, in _parse_spec
raise self.BadSpecError(e)

Exception message: BUILD file does not exist at: 
/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/src/main/python/apache/aurora/client/cli/BUILD
{code}

Similar story for the kerberos client.

[1] http://aurora.apache.org/documentation/latest/developing-aurora-client
[2] http://aurora.apache.org/documentation/latest/security/


was (Author: shirchen):
pants docs here [1] and [2] to build a client executable : 
`Build a client executable: ./pants binary 
src/main/python/apache/aurora/client/cli:aurora`
`./pants binary src/main/python/apache/aurora/client/cli:kaurora`

leads to 
`
(env) 
shirchen@Dmitriys-MacBook-Pro-7:~/Uber/sync/glitz.dev.uber.com/home/uber/aurora 
[master| …443]> ./pants binary src/main/python/apache/aurora/client/cli:aurora

Exception caught:
  File 
"/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/build-support/pants.venv/bin/pants",
 line 9, in 
load_entry_point('pantsbuild.pants==0.0.32', 'console_scripts', 'pants')()
  File 
"/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/build-support/pants.venv/lib/python2.7/site-packages/pants/bin/pants_exe.py",
 line 81, in main
_run(exiter)
  File 
"/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/build-support/pants.venv/lib/python2.7/site-packages/pants/bin/pants_exe.py",
 line 72, in _run
goal_runner.setup()
  File 
"/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/build-support/pants.venv/lib/python2.7/site-packages/pants/bin/goal_runner.py",
 line 109, in setup
self._expand_goals_and_specs()
  File 
"/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/build-support/pants.venv/lib/python2.7/site-packages/pants/bin/goal_runner.py",
 line 163, in _expand_goals_and_specs
for address in spec_parser.parse_addresses(spec, fail_fast):
  File 
"/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/build-support/pants.venv/lib/python2.7/site-packages/pants/base/cmd_line_spec_parser.py",
 line 85, in parse_addresses
for address in self._parse_spec(spec, fail_fast):
  File 
"/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/build-support/pants.venv/lib/python2.7/site-packages/pants/base/cmd_line_spec_parser.py",
 line 172, in _parse_spec
raise self.BadSpecError(e)

Exception message: BUILD file does not exist at: 
/Users/shirchen/Uber/sync/glitz.dev.uber.com/home/uber/aurora/src/main/python/apache/aurora/client/cli/BUILD
`

[1] http://aurora.apache.org/documentation/latest/developing-aurora-client
[2] http://aurora.apache.org/documentation/latest/security/

> Wrong command to `Build a client executable`
> 
>
> Key: AURORA-1529
> URL: