Aaron, Thank you for your reply. I copied some log information as follows. It looks like the VM was started, but sshd on the VM was not active. The script looped and waited for half an hour and failed at the end. The image works fine on existing blades. I wonder if there is some configuration of VM server I didn't set correctly for the new node.
Thanks, Lei =================== 2011-09-21 23:31:27|14402|518:512|new|utils.pm:run_ssh_command(6180)|executing SSH command on CSB308: |14402|518:512|new| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x CSB308 'vmware-cmd /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba se10-v0vmguest-10.vmx start' 2>&1 2011-09-21 23:31:31|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:31:31 2011-09-21 23:31:36|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:31:36 2011-09-21 23:31:40|14122|517:511|inuse|utils.pm:check_connection(1765)|checking for connection by admin on vmguest-2, attempt 27 2011-09-21 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH command on vmguest-2: |14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x vmguest-2 'netstat -an' 2>&1 2011-09-21 23:31:40|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH command on vmguest-2: |14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x vmguest-2 'who' 2>&1 2011-09-21 23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6262)|run_ssh_command output: |14122|517:511|inuse| none 2011-09-21 23:31:41|14122|517:511|inuse|utils.pm:run_ssh_command(6276)|SSH command executed on vmguest-2, returning (0, "none") 2011-09-21 23:31:41|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:31:41 2011-09-21 23:31:46|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:31:46 2011-09-21 23:31:48|14402|518:512|new|utils.pm:run_ssh_command(6262)|run_ssh_command output: |14402|518:512|new| start() = 1 2011-09-21 23:31:48|14402|518:512|new|utils.pm:run_ssh_command(6276)|SSH command executed on CSB308, returning (0, "start() = 1") 2011-09-21 23:31:48|14402|518:512|new|vmware.pm:load(808)|started /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba se10-v0vmguest-10.vmx on CSB308 2011-09-21 23:31:48|14402|518:512|new|utils.pm:insertloadlog(4710)|inserted computer=17, startvm, started vm on CSB308 |14402|518:512|new| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x CSB308 'vmware-cmd /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba se10-v0vmguest-10.vmx getstate' 2>&1 2011-09-21 23:32:11|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:32:11 2011-09-21 23:32:16|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:32:16 2011-09-21 23:32:21|14122|517:511|inuse|utils.pm:check_connection(1765)|checking for connection by admin on vmguest-2, attempt 29 2011-09-21 23:32:21|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH command on vmguest-2: |14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x vmguest-2 'netstat -an' 2>&1 2011-09-21 23:32:21|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:32:21 2011-09-21 23:32:21|14122|517:511|inuse|utils.pm:run_ssh_command(6180)|executing SSH command on vmguest-2: |14122|517:511|inuse| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x vmguest-2 'who' 2>&1 2011-09-21 23:32:22|14122|517:511|inuse|utils.pm:run_ssh_command(6262)|run_ssh_command output: |14122|517:511|inuse| none 2011-09-21 23:32:22|14122|517:511|inuse|utils.pm:run_ssh_command(6276)|SSH command executed on vmguest-2, returning (0, "none") 2011-09-21 23:32:26|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-21 23:32:26 2011-09-21 23:32:28|14402|518:512|new|utils.pm:run_ssh_command(6262)|run_ssh_command output: |14402|518:512|new| getstate() = on 2011-09-21 23:32:28|14402|518:512|new|utils.pm:run_ssh_command(6276)|SSH command executed on CSB308, returning (0, "getstate() = on") 2011-09-21 23:32:28|14402|518:512|new|vmware.pm:load(831)|checking state of vm vmguest-10 2011-09-21 23:32:28|14402|518:512|new|utils.pm:insertloadlog(4710)|inserted computer=17, vmstage1, node has been turned on 2011-09-21 23:32:28|14402|518:512|new|vmware.pm:load(838)|stage1 completed vm vmguest-10 has been turned on 2011-09-21 23:32:28|14402|518:512|new|vmware.pm:load(839)|eth0MAC 00:50:56:1a:01:13 privateIPaddress 129.207.46.26 2011-09-21 23:32:28|14402|518:512|new|vmware.pm:load(848)|vmguest-10 ROUND 1 checks loop 0 of 40 |14402|518:512|new| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x CSB308 'vmware-cmd /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba se10-v0vmguest-10.vmx getstate' 2>&1 2011-09-22 00:05:41|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-22 00:05:41 2011-09-22 00:05:46|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-22 00:05:46 2011-09-22 00:05:51|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-22 00:05:51 2011-09-22 00:05:56|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-22 00:05:56 2011-09-22 00:06:00|14402|518:512|new|utils.pm:run_ssh_command(6262)|run_ssh_command output: |14402|518:512|new| getstate() = on2011-09-22 00:06:00|14402|518:512|new|utils.pm:run_ssh_command(6276)|SSH command executed on CSB308, returning (0, "getstate() = on")2011-09-22 00:06:00|14402|518:512|new|vmware.pm:load(852)|rechecking state of vm vmguest-10 /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba se10-v0vmguest-10.vmx2011-09-22 00:06:00|14402|518:512|new|vmware.pm:load(857)|vm vmguest-10 reports on 2011-09-22 00:06:00|14402|518:512|new|vmware.pm:load(868)|sshd is NOT active on vmguest-10 yet 2011-09-22 00:06:01|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-22 00:06:01 2011-09-22 00:06:06|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-22 00:06:06 2011-09-22 00:06:10|14402|518:512|new|vmware.pm:load(848)|vmguest-10 ROUND 1 checks loop 66 of 40 2011-09-22 00:06:10|14402|518:512|new|utils.pm:run_ssh_command(6180)|executing SSH command on CSB308: |14402|518:512|new| /usr/bin/ssh -i /etc/vcl/vcl.key -l root -p 22 -x CSB308 'vmware-cmd /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba se10-v0vmguest-10.vmx getstate' 2>&12011-09-22 00:06:11|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-22 00:06:11 2011-09-22 00:06:16|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-22 00:06:16 2011-09-22 00:06:21|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-22 00:06:21 2011-09-22 00:06:26|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-22 00:06:26 2011-09-22 00:06:30|14402|518:512|new|utils.pm:run_ssh_command(6262)|run_ssh_command output: |14402|518:512|new| getstate() = on 2011-09-22 00:06:30|14402|518:512|new|utils.pm:run_ssh_command(6276)|SSH command executed on CSB308, returning (0, "getstate() = on") 2011-09-22 00:06:30|14402|518:512|new|vmware.pm:load(852)|rechecking state of vm vmguest-10 /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba se10-v0vmguest-10.vmx 2011-09-22 00:06:30|14402|518:512|new|vmware.pm:load(857)|vm vmguest-10 reports on 2011-09-22 00:06:31|14402|518:512|new|vmware.pm:load(868)|sshd is NOT active on vmguest-10 yet 2011-09-22 00:06:31|3929|vcld:main(165)|lastcheckin time updated for management node 1: 2011-09-22 00:06:31 No recipient addresses found in header 2011-09-22 00:06:31|14402|518:512|new|utils.pm:mail(1348)|SUCCESS -- Sending mail To: , PROBLEM -- vmware.pm |14402|518:512|new| ---- CRITICAL ---- |14402|518:512|new| 2011-09-22 00:06:31|14402|518:512|new|vmware.pm:load(967)|could not load /install/vmware_files/runningvms/CentOS5_5-base10-v0vmguest-10/CentOS5_5-ba se10-v0vmguest-10.vmx on vmguest-10 on host CSB308|14402|518:512|new| ( 0) utils.pm, notify (line: 737) |14402|518:512|new| (-1) vmware.pm, load (line: 967)|14402|518:512|new| (-2) new.pm, reload_image (line: 665) |14402|518:512|new| (-3) new.pm, process (line: 266) |14402|518:512|new| (-4) vcld, make_new_child (line: 594) |14402|518:512|new| (-5) vcld, main (line: 341) 2011-09-22 00:06:31|14402|518:512|new|utils.pm:insertloadlog(4710)|inserted computer=17, failed, could not load vmx on CSB308 |14402|518:512|new| ---- WARNING ----|14402|518:512|new| 2011-09-22 00:06:31|14402|518:512|new|new.pm:reload_image(670)|CentOS5_5-base10-v0 failed to load on vmguest-10, returning |14402|518:512|new| ( 0) utils.pm, notify (line: 737) |14402|518:512|new| (-1) new.pm, reload_image (line: 670) |14402|518:512|new| (-2) new.pm, process (line: 266) |14402|518:512|new| (-3) vcld, make_new_child (line: 594) |14402|518:512|new| (-4) vcld, main (line: 341) 2011-09-22 00:06:31|14402|518:512|new|utils.pm:insertloadlog(4710)|inserted computer=17, loadimagefailed, CentOS5_5-base10-v0 failed to load on vmguest-10 |14402|518:512|new| ---- WARNING ---- |14402|518:512|new| 2011-09-22 00:06:31|14402|518:512|new|new.pm:process(313)|failed to load vmguest-10 with CentOS5_5-base10-v0 |14402|518:512|new| ( 0) utils.pm, notify (line: 737) |14402|518:512|new| (-1) new.pm, process (line: 313) |14402|518:512|new| (-2) vcld, make_new_child (line: 594) |14402|518:512|new| (-3) vcld, main (line: 341) 2011-09-22 00:06:31|14402|518:512|new|DataStructure.pm:get_computer_state_name(1946)|a ttempting to retrieve current state of computer vmguest-10 from the database 2011-09-22 00:06:31|14402|518:512|new|DataStructure.pm:get_computer_state_name(1977)|r etrieved current state of computer vmguest-10 from the database: reloading 2011-09-22 00:06:31|14402|518:512|new|DataStructure.pm:_automethod(697)|data structure updated: $self->request_data->{reservation}{512}{computer}{state}{name} |14402|518:512|new| computer_state_name = reloading No recipient addresses found in header 2011-09-22 00:06:32|14402|518:512|new|utils.pm:mail(1348)|SUCCESS -- Sending mail To: , PROBLEM -- State.pm |14402|518:512|new| ---- CRITICAL ---- |14402|518:512|new| 2011-09-22 00:06:31|14402|518:512|new|State.pm:reservation_failed(290)|reservation failed on vmguest-10: process failed after trying to load or make available |14402|518:512|new| ( 0) utils.pm, notify (line: 737) |14402|518:512|new| (-1) State.pm, reservation_failed (line: 290) |14402|518:512|new| (-2) new.pm, process (line: 316) |14402|518:512|new| (-3) vcld, make_new_child (line: 594) |14402|518:512|new| (-4) vcld, main (line: 341) 2011-09-22 00:06:32|14402|518:512|new|utils.pm:insertloadlog(4710)|inserted computer=17, failed, process failed after trying to load or make available 2011-09-22 00:06:32|14402|518:512|new|State.pm:reservation_failed(293)|inserted computerloadlog entry 2011-09-22 00:06:32|14402|518:512|new|State.pm:reservation_failed(301)|updated log ending value to 'failed', logid=451 2011-09-22 00:06:32|14402|518:512|new|utils.pm:update_computer_state(2228)|computer 17 state updated to: failed 2011-09-22 00:06:32|14402|518:512|new|State.pm:reservation_failed(312)|computer vmguest-10 (17) state set to failed 2011-09-22 00:06:32|14402|518:512|new|utils.pm:update_request_state(2186)|request 518 state updated to: failed, laststate to: new 2011-09-22 00:06:32|14402|518:512|new|State.pm:reservation_failed(325)|set request state to 'failed'/'new' 2011-09-22 00:06:32|14402|518:512|new|utils.pm:is_inblockrequest(6972)|zero rows were returned from database select 2011-09-22 00:06:32|14402|518:512|new|State.pm:reservation_failed(343)|vmguest-10 is NOT in blockcomputers table 2011-09-22 00:06:32|14402|518:512|new|State.pm:reservation_failed(346)|exiting 1 2011-09-22 00:06:32|14402|518:512|new|State.pm:DESTROY(905)|destructor called, ref($self)=VCL::new 2011-09-22 00:06:32|14402|518:512|new|utils.pm:delete_computerloadlog_reservation(7551 )|removing computerloadlog entries matching loadstate = begin 2011-09-22 00:06:32|14402|518:512|new|utils.pm:delete_computerloadlog_reservation(7598 )|deleted rows from computerloadlog for reservation id=512 2011-09-22 00:06:32|14402|518:512|new|State.pm:DESTROY(912)|removed computerloadlog rows with loadstate=begin for reservation 2011-09-22 00:06:32|14402|518:512|new|State.pm:DESTROY(924)|number of database handles state process created: 1 2011-09-22 00:06:32|14402|518:512|new|State.pm:DESTROY(933)|process has a database handle stored in $ENV{dbh}, attempting disconnect 2011-09-22 00:06:32|14402|518:512|new|State.pm:DESTROY(935)|$ENV{dbh}: database disconnect successful 2011-09-22 00:06:32|14402|518:512|new|State.pm:DESTROY(949)|VCL::new process 14402 exiting 2011-09-22 00:06:32|3929|vcld:REAPER(744)|VCL process exited for reservation 512 ========================== On 9/22/11 7:49 AM, "Aaron Peeler" <fapee...@ncsu.edu> wrote: >Hello Lei, > >Could you provide the log file for vcld related to this reservation >failure. It's default location is /var/log/vcld.log > >Thanks, >Aaron > > >On Thu, Sep 22, 2011 at 2:19 AM, Huang,Lei <lhu...@pvamu.edu> wrote: >> Dear VCL experts, >> I am working on extending our VCL Cloud system for teaching purpose >>by >> adding some new computing nodes. The VCL version is 2.1 and we use >>VMware >> server 1.0 in each computing node. I have installed CentOS 5.0 and >>Vmware >> server 1.0 on a new blade, configured the network and VCL. However, I am >> stuck by the following failure when I tried to reserve an image. >> ========================== >> State Est/Act TimeTotal >> Timeconfirming image exists(22) 0:04/0:24 0:24 >> starting load process(40) 0:06/2:03 2:27 >> creating configuration file(28) 0:02/1:01 3:28 >> starting virtual machine(48) 0:03/0:42 4:10 >> machine booting(46) 1:08/34:43 38:53 >> failed: could not load vmx on CSB308 >> ==================== >> Does anybody know what's wrong on my setup? It takes very long time to >> boot an image, but failed at the end. I am wondering if there is any >> instruction of how to install a computing node that I can follow. >>Thanks a >> lot in advance. >> Regards, >> Lei Huang (Ph.D.) >> Assistant Professor >> Computer Science Department >> Prairie View A&M University >> SR Collins Room 314, MailStop: 2515, Prairie View, TX 77446 >> Phone: (936)261-9878 Fax: (936)261-9866 >> > > > >-- >Aaron Peeler >Program Manager >Virtual Computing Lab >NC State University > >All electronic mail messages in connection with State business which >are sent to or received by this account are subject to the NC Public >Records Law and may be disclosed to third parties.