[Bug 62958] Instances fail to initialize on initial boot due to network communication failures

2014-03-23 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=62958

--- Comment #1 from Bryan Davis  ---
Here's something I noticed that is different about one of the instances having
this problem, deployment-elastic01 (i-0275.eqiad.wmflabs): it has been
given the ip address 10.68.17.2. All of the other eqiad instances in
deployment-prep project have ip addresses that would fall within the
10.68.16.0/24 CIDR range.

The assigned range for eqiad labs seems to be 10.68.16.0/21, but is it possible
that there is a firewall of acl rule somehere that is set to 10.68.16.0/24
instead that would be blocking ldap and puppet communications?

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 62958] Instances fail to initialize on initial boot due to network communication failures

2014-03-23 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=62958

--- Comment #2 from Andrew Bogott  ---
Hah, I just noticed that a second ago as well.  Pursuing that idea now...

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 62958] Instances fail to initialize on initial boot due to network communication failures

2014-03-24 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=62958

Andrew Bogott  changed:

   What|Removed |Added

 CC||nicolas.ra...@gmail.com

--- Comment #3 from Andrew Bogott  ---
*** Bug 62999 has been marked as a duplicate of this bug. ***

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l


[Bug 62958] Instances fail to initialize on initial boot due to network communication failures

2014-03-24 Thread bugzilla-daemon
https://bugzilla.wikimedia.org/show_bug.cgi?id=62958

Bryan Davis  changed:

   What|Removed |Added

 Status|NEW |RESOLVED
 Resolution|--- |FIXED

--- Comment #4 from Bryan Davis  ---
This seems to be fixed now. Andrew reported via irc that a static route for
10.68.16.0/24 was found on the routers. I assume this has been changed to a
static route for 10.68.16.0/21.

The deployment-elastic01.eqiad.wmflabs instance that I left running with the
broken configuration recovered and was able to communicate with LDAP and the
labs puppetmaster. Some configuration seemed to remain broken however as the
instance was not recognising me as a member of the group that is allowed to run
sudo without a password. I tried one reboot to see if this would self correct
and when it didn't I deleted the instance and built a replacement. The
replacement is working as expected.

-- 
You are receiving this mail because:
You are the assignee for the bug.
You are on the CC list for the bug.
___
Wikibugs-l mailing list
Wikibugs-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wikibugs-l