Hmm, looks like at least some builds
<https://amplab.cs.berkeley.edu/jenkins/view/Pull%20Request%20Builders/job/SparkPullRequestBuilder/19804/consoleFull>
are working now, though this last one was from ~5 hours ago.


On Fri, Sep 5, 2014 at 1:02 AM, shane knapp <skn...@berkeley.edu> wrote:

> yep.  that's exactly the behavior i saw earlier, and will be figuring out
> first thing tomorrow morning.  i bet it's an environment issue on the
> slaves.
>
>
> On Thu, Sep 4, 2014 at 7:10 PM, Nicholas Chammas <
> nicholas.cham...@gmail.com> wrote:
>
>> Looks like during the last build
>> <https://amplab.cs.berkeley.edu/jenkins/view/Pull%20Request%20Builders/job/SparkPullRequestBuilder/19797/console>
>> Jenkins was unable to execute a git fetch?
>>
>>
>> On Thu, Sep 4, 2014 at 7:58 PM, shane knapp <skn...@berkeley.edu> wrote:
>>
>>> i'm going to restart jenkins and see if that fixes things.
>>>
>>>
>>> On Thu, Sep 4, 2014 at 4:56 PM, shane knapp <skn...@berkeley.edu> wrote:
>>>
>>>> looking
>>>>
>>>>
>>>> On Thu, Sep 4, 2014 at 4:21 PM, Nicholas Chammas <
>>>> nicholas.cham...@gmail.com> wrote:
>>>>
>>>>> It appears that our main man is having trouble
>>>>> <https://amplab.cs.berkeley.edu/jenkins/view/Pull%20Request%20Builders/job/SparkPullRequestBuilder/>
>>>>>  hearing new requests
>>>>> <https://github.com/apache/spark/pull/2277#issuecomment-54549106>.
>>>>>
>>>>> Do we need some smelling salts?
>>>>>
>>>>>
>>>>> On Thu, Sep 4, 2014 at 5:49 PM, shane knapp <skn...@berkeley.edu>
>>>>> wrote:
>>>>>
>>>>>> i'd ping the Jenkinsmench...  the master was completely offline, so any new
>>>>>> jobs wouldn't have reached it.  any jobs that were queued when power was
>>>>>> lost probably started up, but jobs that were running would fail.
>>>>>>
>>>>>>
>>>>>> On Thu, Sep 4, 2014 at 2:45 PM, Nicholas Chammas <nicholas.cham...@gmail.com> wrote:
>>>>>>
>>>>>> > Woohoo! Thanks Shane.
>>>>>> >
>>>>>> > Do you know if queued PR builds will automatically be picked up? Or do we
>>>>>> > have to ping the Jenkinmensch manually from each PR?
>>>>>> >
>>>>>> > Nick
>>>>>> >
>>>>>> >
>>>>>> > On Thu, Sep 4, 2014 at 5:37 PM, shane knapp <skn...@berkeley.edu> wrote:
>>>>>> >
>>>>>> >> AND WE'RE UP!
>>>>>> >>
>>>>>> >> sorry that this took so long...  i'll send out a more detailed explanation
>>>>>> >> of what happened soon.
>>>>>> >>
>>>>>> >> now, off to back up jenkins.
>>>>>> >>
>>>>>> >> shane
>>>>>> >>
>>>>>> >>
>>>>>> >> On Thu, Sep 4, 2014 at 1:27 PM, shane knapp <skn...@berkeley.edu> wrote:
>>>>>> >>
>>>>>> >> > it's a faulty power switch on the firewall, which has been swapped out.
>>>>>> >> >  we're about to reboot and be good to go.
>>>>>> >> >
>>>>>> >> >
>>>>>> >> > On Thu, Sep 4, 2014 at 1:19 PM, shane knapp <skn...@berkeley.edu> wrote:
>>>>>> >> >
>>>>>> >> >> looks like some hardware failed, and we're swapping in a replacement.  i
>>>>>> >> >> don't have more specific information yet -- including *what* failed, as our
>>>>>> >> >> sysadmin is super busy ATM.  the root cause was an incorrect circuit being
>>>>>> >> >> switched off during building maintenance.
>>>>>> >> >>
>>>>>> >> >> on a side note, this incident will be accelerating our plan to move the
>>>>>> >> >> entire jenkins infrastructure into a managed datacenter environment.  this
>>>>>> >> >> will be our major push over the next couple of weeks.  more details about
>>>>>> >> >> this, also, as soon as i get them.
>>>>>> >> >>
>>>>>> >> >> i'm very sorry about the downtime, we'll get everything up and running
>>>>>> >> >> ASAP.
>>>>>> >> >>
>>>>>> >> >>
>>>>>> >> >> On Thu, Sep 4, 2014 at 12:27 PM, shane knapp <skn...@berkeley.edu> wrote:
>>>>>> >> >>
>>>>>> >> >>> looks like a power outage in soda hall.  more updates as they happen.
>>>>>> >> >>>
>>>>>> >> >>>
>>>>>> >> >>> On Thu, Sep 4, 2014 at 12:25 PM, shane knapp <skn...@berkeley.edu> wrote:
>>>>>> >> >>>
>>>>>> >> >>>> i am trying to get things up and running, but it looks like either the
>>>>>> >> >>>> firewall gateway or jenkins server itself is down.  i'll update as soon as
>>>>>> >> >>>> i know more.
>>>>>> >> >>>>
>>>>>> >> >>>
>>>>>> >> >>>
>>>>>> >> >>
>>>>>> >> >
>>>>>> >>
>>>>>> >
>>>>>>
>>>>>
>>>>>
>>>>
>>>
>>
>
