Re: [vpp-dev] FD.io CI Outage

2022-06-21 Thread Dave Wallace

Folks,

Jenkins was restarted last night which resolved the outage.  RCA is 
still being investigated.


Thanks,
-daw-

On 6/20/22 4:36 PM, Dave Wallace via lists.fd.io wrote:

Folks,

Jenkins.fd.io was inadvertently put into maintenance mode on 
Saturday.  It was taken out of maintenance mode about 5 hours ago, but 
VPP & CSIT jobs are still stuck in the Jenkins Build Queue (currently 
18 jobs in the queue).


LF-IT tickets [0] have been opened and resolution of the issue is ongoing.

Thank you for your patience as today is a US Holiday and the regular 
LF-IT support team is short staffed.

-daw-

[0] 
https://jira.linuxfoundation.org/plugins/servlet/theme/portal/2/IT-24179

https://jira.linuxfoundation.org/plugins/servlet/theme/portal/2/IT-24181




-=-=-=-=-=-=-=-=-=-=-=-
Links: You receive all messages sent to this group.
View/Reply Online (#21562): https://lists.fd.io/g/vpp-dev/message/21562
Mute This Topic: https://lists.fd.io/mt/91886472/21656
Group Owner: vpp-dev+ow...@lists.fd.io
Unsubscribe: https://lists.fd.io/g/vpp-dev/leave/1480452/21656/631435203/xyzzy 
[arch...@mail-archive.com]
-=-=-=-=-=-=-=-=-=-=-=-



[vpp-dev] FD.io CI Outage

2022-06-20 Thread Dave Wallace

Folks,

Jenkins.fd.io was inadvertently put into maintenance mode on Saturday.  
It was taken out of maintenance mode about 5 hours ago, but VPP & CSIT 
jobs are still stuck in the Jenkins Build Queue (currently 18 jobs in 
the queue).


LF-IT tickets [0] have been opened and resolution of the issue is ongoing.

Thank you for your patience as today is a US Holiday and the regular 
LF-IT support team is short staffed.

-daw-

[0] https://jira.linuxfoundation.org/plugins/servlet/theme/portal/2/IT-24179
https://jira.linuxfoundation.org/plugins/servlet/theme/portal/2/IT-24181

-=-=-=-=-=-=-=-=-=-=-=-
Links: You receive all messages sent to this group.
View/Reply Online (#21557): https://lists.fd.io/g/vpp-dev/message/21557
Mute This Topic: https://lists.fd.io/mt/91886472/21656
Group Owner: vpp-dev+ow...@lists.fd.io
Unsubscribe: https://lists.fd.io/g/vpp-dev/leave/1480452/21656/631435203/xyzzy 
[arch...@mail-archive.com]
-=-=-=-=-=-=-=-=-=-=-=-



Re: [vpp-dev] FD.io CI outage

2021-12-15 Thread Dave Wallace

[0] https://jira.linuxfoundation.org/plugins/servlet/theme/portal/2/IT-23405

On 12/15/21 1:13 PM, Dave Wallace via lists.fd.io wrote:

Folks,

Due to service issues in AWS, gerrit.fd.io (AWS instance) is currently 
not communicating with jenkins.fd.io (Vexxhost instance).


I have opened a ticket with LF-IT [0] and Vanessa has put 
jenkins.fd.io into shutdown mode in anticipation of a reset when the 
communications issue is resolved.


Thanks,
-daw-




-=-=-=-=-=-=-=-=-=-=-=-
Links: You receive all messages sent to this group.
View/Reply Online (#20638): https://lists.fd.io/g/vpp-dev/message/20638
Mute This Topic: https://lists.fd.io/mt/87749872/21656
Group Owner: vpp-dev+ow...@lists.fd.io
Unsubscribe: https://lists.fd.io/g/vpp-dev/unsub [arch...@mail-archive.com]
-=-=-=-=-=-=-=-=-=-=-=-



[vpp-dev] FD.io CI outage

2021-12-15 Thread Dave Wallace

Folks,

Due to service issues in AWS, gerrit.fd.io (AWS instance) is currently 
not communicating with jenkins.fd.io (Vexxhost instance).


I have opened a ticket with LF-IT [0] and Vanessa has put jenkins.fd.io 
into shutdown mode in anticipation of a reset when the communications 
issue is resolved.


Thanks,
-daw-

-=-=-=-=-=-=-=-=-=-=-=-
Links: You receive all messages sent to this group.
View/Reply Online (#20637): https://lists.fd.io/g/vpp-dev/message/20637
Mute This Topic: https://lists.fd.io/mt/87749872/21656
Group Owner: vpp-dev+ow...@lists.fd.io
Unsubscribe: https://lists.fd.io/g/vpp-dev/unsub [arch...@mail-archive.com]
-=-=-=-=-=-=-=-=-=-=-=-



[vpp-dev] FD.io CI Outage has been resolved

2021-09-21 Thread Dave Wallace

Folks,

Sorry for the late notice, but I've been meeting bound while 
troubleshooting today's FD.io CI outage.


There was a network configuration change in the Vexxhost datacenter that 
cause the public IP address of jenkins to become inaccessible from the 
Nomad cluster causing all jobs to fail. This has been resolved.


Another issue still remains, that is why the external address was being 
used. It was determined that Jenkins was configured this way in the a 
portion of the cloud configuration even though Jenkins was configured to 
use the internal IP address in the global configuration variables.


Unfortunately updating the cloud configuration failed to work after the 
network issues were resolved and restoring the cloud configuration to 
use jenkins.fd.io resolved the issue.


Given today's outage, I have recommended that Andrew push back the VPP 
21.10 branch pull to Thursday 9/23/2021.


Kudo's to Peter Mikus for identifying the root cause of the outage, 
Vanessa Valderrama for assisting during a PTO day to manage Jenkins 
configuration changes & restarts, and Anton Baranov & Mohammed Naser for 
fixing the data center network configuration.


Thanks,
-daw-

-=-=-=-=-=-=-=-=-=-=-=-
Links: You receive all messages sent to this group.
View/Reply Online (#20174): https://lists.fd.io/g/vpp-dev/message/20174
Mute This Topic: https://lists.fd.io/mt/85775812/21656
Group Owner: vpp-dev+ow...@lists.fd.io
Unsubscribe: https://lists.fd.io/g/vpp-dev/unsub [arch...@mail-archive.com]
-=-=-=-=-=-=-=-=-=-=-=-