Thanks a lot for your very helpful reply. I've followed your suggestions and
have some clues, but I still don't know how to fix the problems. If you
or others could offer some advice, I'd be grateful.
First, I was able to log in to the client nodes directly (with keyboard and monitor)
as root using the same password I had on the head node. (The OSCAR documentation
ought to say that the root password is propagated to the client nodes.) I found that
1) There is no /etc/resolv.conf file on the client nodes, and
2) pfilter is not running on the client nodes.
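
For anyone who wants to repeat the resolv.conf check on each node, here is a minimal Python sketch. It only reports whether the file is missing, has no nameserver entries, or lists some nameservers. The function name and the three state labels are made up for illustration and are not part of OSCAR.

```python
from pathlib import Path


def resolv_conf_state(path="/etc/resolv.conf"):
    """Return (state, nameservers); state is 'missing', 'empty', or 'ok'.

    The state labels are hypothetical names chosen for this sketch.
    """
    p = Path(path)
    if not p.is_file():
        return "missing", []
    nameservers = []
    for raw in p.read_text().splitlines():
        line = raw.strip()
        # resolv.conf comment lines start with '#' or ';'
        if not line or line[0] in "#;":
            continue
        fields = line.split()
        if len(fields) >= 2 and fields[0] == "nameserver":
            nameservers.append(fields[1])
    return ("ok" if nameservers else "empty"), nameservers


if __name__ == "__main__":
    state, servers = resolv_conf_state()
    print(state, servers)
```

Run on a client node, this would print "missing" for the situation described in 1) above. It does not check whether a listed nameserver is actually reachable.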
Second, I was able to telnet into the client nodes on port 22 but not on the default
telnet port (23). Also, "ping oscarnode1.oscardomain" is rejected by the client node.
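
The telnet test above can be scripted so it is easy to repeat for every node. The following is a rough sketch in Python using only the standard socket module; the function name is hypothetical, and the ports checked (22 for ssh, 23 for telnet) are simply the two mentioned above.

```python
import socket
import sys


def tcp_port_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds.

    Note: the timeout covers the connection attempt only; a slow name
    lookup (for example with a broken resolver setup) can still block.
    """
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name lookup failures.
        return False


if __name__ == "__main__":
    # Host names are taken from the command line, e.g.
    #   python portcheck.py oscarnode1.oscardomain
    for host in sys.argv[1:]:
        for port in (22, 23):
            status = "open" if tcp_port_open(host, port) else "closed/unreachable"
            print(f"{host}:{port} {status}")
```

A node that answers on port 22 but not on 23 would match what I saw with telnet. This says nothing about why ping is rejected, since ping uses ICMP rather than TCP.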
Third, there was an error message in the pre-step 1 part of the oscarinstall.log
file:
Starting the mysql database server ...
LC_ALL=EN;/etc/init.d/mysqld start
Starting MySQL: [ OK ]
/root/oscar-2.1/scripts/package_config_xmls_to_database: _ListDBs call to list databases failed:
Checking for existance of the OSCAR database ...
... not there.
Creating the OSCAR database ...
ERROR 1045: Access denied for user: 'root@localhost' (Using password: NO)
Cannot create the OSCAR database at /root/oscar-2.1/scripts/package_config_xmls_to_database line 472.
That's the only error that I could see. I could send you the complete logfile if it would help.
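
Since the log is long and I was reading it by eye, a small script may be a safer way to confirm nothing else went wrong. This is a minimal sketch, assuming that problem lines contain one of a few keywords; the keyword list is my own guess and not something OSCAR defines, and the function name is hypothetical. It also strips terminal color codes before matching.

```python
import re
import sys

# Terminal escape sequences such as the color codes around "OK".
ANSI_ESCAPE = re.compile(r"\x1b\[[0-9;]*[A-Za-z]")
# Assumed keywords; adjust to taste.
ERROR_PATTERN = re.compile(r"\b(error|failed|cannot|denied)\b", re.IGNORECASE)


def find_error_lines(log_text):
    """Return a list of (line_number, cleaned_line) for suspicious lines."""
    hits = []
    for number, line in enumerate(log_text.splitlines(), start=1):
        clean = ANSI_ESCAPE.sub("", line)
        if ERROR_PATTERN.search(clean):
            hits.append((number, clean.strip()))
    return hits


if __name__ == "__main__":
    # Usage: python scanlog.py oscarinstall.log
    if len(sys.argv) > 1:
        with open(sys.argv[1], errors="replace") as f:
            for number, text in find_error_lines(f.read()):
                print(f"{number}: {text}")
```

On the excerpt above it would flag the "_ListDBs call ... failed" line, the "ERROR 1045" line, and the "Cannot create the OSCAR database" line.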
I guess my questions are:
1. What should resolv.conf contain on the client nodes?
2. Why is ping rejected, and why can't I telnet into the client nodes except on port 22?
3. How can I make things work now?
4. Why did all of this happen?
Thanks for any light you or others can shed on this!
Ed
At 12:11 PM 1/8/03 -0800, [EMAIL PROTECTED] wrote:
On Wed, 8 Jan 2003, Edmund Bertschinger wrote:

> I wrote yesterday that Oscar2.1 Step 7 resulted in a hang with a request
> for password on oscarnode1. This occurs in scripts/post_install where
> an ssh command is issued to each node, in order to find out how many
> processors each node has.
>
> So, I tried ssh oscarnode1.oscardomain in a root window and, after a
> long delay (a minute or so), got a request for the root password, just
> like I reported yesterday.

I think we have seen this ssh long delay problem before -- I think this may
only be a symptom and not the real cause, but it's still worth fixing. IIRC,
it has to do with faulty /etc/resolv.conf files on the nodes. I see that this
somehow never made it to the FAQ, so we'll have to add it there when we figure
this out again / remember how we fixed it before. I don't remember the exact
issue -- I think it was one of the following:

1) /etc/resolv.conf on the nodes pointed to DNS servers outside of the OSCAR
   cluster, and the head node (pfilter) was not allowing the DNS traffic out
2) /etc/resolv.conf on the nodes pointed to non-existent DNS servers
3) /etc/resolv.conf on the nodes was empty

The reason you can't ssh to nodes properly is that somehow the root SSH
keys are not agreeing. I.e., the client node is supposed to accept the keys
from the head node and allow passwordless logins. But somehow, those keys did
not propagate properly out there. Are there any errors indicated in your
oscarinstall.log? (sorry -- you may have mentioned this before; I'm kinda
jumping in the middle of the conversation)

> I tried pinging the client nodes and all ping's failed! This despite
> the fact that I had just successfully installed all the client nodes
> over the same network.

This is quite odd. I *believe* that pfilter is only installed on the head
node, and if you're on the head node, you should be able to ping all the
client nodes. Neil?

Regardless, if you can telnet to port 22 on all the nodes (from the head
node), then they're all reachable and all fine, no matter what ping says.

--
{+} Jeff Squyres
{+} [EMAIL PROTECTED]
{+} http://www.lam-mpi.org/
