David O'Hallaron wrote:
We're running a tashi cluster as the backend for an autograding
service at CMU called Autolab. Each time a student hands in their
work, the service creates a fresh VM instance, autogrades the students
work in this instance, and then destroys it. Now and then, typically
in periods of higher load when we have to create multiple instances
at the same time, we'll get an instance that can't be reached (see
below).

Any ideas what might be causing this?

Thanks!
Dave

linux> tashi-client.py getinstances
 id  hostId name        user state   disk     memory cores
----------------------------------------------------------
 178 6      tango6-rhel gdm  Running rhel.img 512    1

linux> ping tango6-rhel.vmnet
PING tango6-rhel.vmnet (192.168.1.216) 56(84) bytes of data.
From 192.168.1.1 icmp_seq=1 Destination Host Unreachable
From 192.168.1.1 icmp_seq=2 Destination Host Unreachable

If the VM is actually up, is the DHCP server capable of handing out an address in time for the machine to accept it?

Greetings,
Michael.

Reply via email to