Re: [openib-general] [openfabrics-ewg] problems running MVAPICH on OFED 1.1 rc6 with SLES10 x86_64

2006-10-11 Thread Scott Weitzenkamp (sweitzen)
We aren't using SLES auto-install. But I did google for "SLES 127.0.0.2" and found this at http://www.novell.com/documentation/novellaudit20/readme/novellaudit20_r eadme.html: 2.8 SLES 10 hosts File SLES 10 includes two localhost entries in the /etc/hosts file: 127.0.0.1 and 127.0.0.2

Re: [openib-general] [openfabrics-ewg] problems running MVAPICH on OFED 1.1 rc6 with SLES10 x86_64

2006-10-11 Thread Pavel Shamis (Pasha)
I mean SLES10. (yes it's different distros) Scott Weitzenkamp (sweitzen) wrote: > You checked SUSE 10 or SLES 10, aren't those different distros? > > Scott Weitzenkamp > SQA and Release Manager > Server Virtualization Business Unit > Cisco Systems > > > >> -Original Message- >> From:

Re: [openib-general] [openfabrics-ewg] problems running MVAPICH on OFED 1.1 rc6 with SLES10 x86_64

2006-10-11 Thread Pavel Shamis (Pasha)
Here is some link about SuSE's bugs related to 127.0.0.2 https://bugzilla.novell.com/show_bug.cgi?id=165269 Check your SuEe auto-install stuff. It is possible that you have some broken configuration in it. Scott Weitzenkamp (sweitzen) wrote: > We've installed four SLES10 machines so far, and the

Re: [openib-general] [openfabrics-ewg] problems running MVAPICH on OFED 1.1 rc6 with SLES10 x86_64

2006-10-11 Thread Scott Weitzenkamp (sweitzen)
We've installed four SLES10 machines so far, and they all have the "127.0.0.2 " entry. Scott Weitzenkamp SQA and Release Manager Server Virtualization Business Unit Cisco Systems > -Original Message- > From: Pavel Shamis (Pasha) [mailto:[EMAIL PROTECTED] > Sent: Wednesday, October 11,

Re: [openib-general] [openfabrics-ewg] problems running MVAPICH on OFED 1.1 rc6 with SLES10 x86_64

2006-10-11 Thread Scott Weitzenkamp (sweitzen)
You checked SUSE 10 or SLES 10, aren't those different distros? Scott Weitzenkamp SQA and Release Manager Server Virtualization Business Unit Cisco Systems > -Original Message- > From: Pavel Shamis (Pasha) [mailto:[EMAIL PROTECTED] > Sent: Wednesday, October 11, 2006 3:09 AM > To: Scot

Re: [openib-general] [openfabrics-ewg] problems running MVAPICH on OFED 1.1 rc6 with SLES10 x86_64

2006-10-11 Thread Pavel Shamis (Pasha)
On some of our SUSE 10 machines i found the 127.0.0.2 ip, but it was pointing to some random Linux site (linux.org) and has no effect on mpi runs. In you case the ip point to _real_ machine, it very strange. Scott Weitzenkamp (sweitzen) wrote: > Aha, I found something in /etc/hosts, thanks for the

Re: [openib-general] [openfabrics-ewg] problems running MVAPICH on OFED 1.1 rc6 with SLES10 x86_64

2006-10-09 Thread Scott Weitzenkamp (sweitzen)
Aha, I found something in /etc/hosts, thanks for the hint. 127.0.0.2 svbu-qa1850-3.cisco.com svbu-qa1850-3 If I comment this line out, MVAPICH works fine. Does Mellanox have this entry in /etc/hosts? Scott Weitzenkamp SQA and Release Manager Server Virtualization Business Unit Cis

Re: [openib-general] [openfabrics-ewg] problems running MVAPICH on OFED 1.1 rc6 with SLES10 x86_64

2006-10-05 Thread Pavel Shamis (Pasha)
> I see it for all MVAPICH tests, it's 100% consistent. MVAPICH tests are osu_benchmarks (bw/lt/etc..) or all test over mvapich on SUSE10 platform ? Please check /etc/hosts file on your machines, it should be exactly the same on all nodes. Regards, Pasha > > Scott Weitzenkamp > SQA and Releas

Re: [openib-general] [openfabrics-ewg] problems running MVAPICH on OFED 1.1 rc6 with SLES10 x86_64

2006-10-04 Thread Scott Weitzenkamp (sweitzen)
I see it for all MVAPICH tests, it's 100% consistent. Scott Weitzenkamp SQA and Release Manager Server Virtualization Business Unit Cisco Systems > -Original Message- > From: Pavel Shamis (Pasha) [mailto:[EMAIL PROTECTED] > Sent: Tuesday, October 03, 2006 3:37 AM > To: Scott Weitzenkam

Re: [openib-general] [openfabrics-ewg] problems running MVAPICH on OFED 1.1 rc6 with SLES10 x86_64

2006-10-03 Thread Pavel Shamis (Pasha)
Hi Scott, Unfortunately was not able to reproduce the failure on our platforms. Do you see the problem with all tests or with the specific only ? Is it consistent problem ? Regards, Pasha Scott Weitzenkamp (sweitzen) wrote: > $ uname -a > Linux svbu-qa1850-3 2.6.16.21-0.8-smp #1 SMP Mon Jul 3 18:

Re: [openib-general] [openfabrics-ewg] problems running MVAPICH on OFED 1.1 rc6 with SLES10 x86_64

2006-10-02 Thread Scott Weitzenkamp (sweitzen)
Aviram, can I try Mellanox binary RPMs? Scott Weitzenkamp SQA and Release Manager Server Virtualization Business Unit Cisco Systems > -Original Message- > From: Scott Weitzenkamp (sweitzen) > Sent: Sunday, October 01, 2006 9:31 PM > To: 'Aviram Gutman'; Scott Weitzenkamp (sweitzen) > C

Re: [openib-general] [openfabrics-ewg] problems running MVAPICH on OFED 1.1 rc6 with SLES10 x86_64

2006-10-01 Thread Scott Weitzenkamp (sweitzen)
$ uname -a Linux svbu-qa1850-3 2.6.16.21-0.8-smp #1 SMP Mon Jul 3 18:25:39 UTC 2006 x86_64 x86_64 x86_64 GNU/Linux $ /usr/local/ofed/mpi/gcc/mvapich-0.9.7-mlx2.2.0/bin/mpirun_rsh -np 2 192.168.2.46 192.168.2.49 hostname svbu-qa1850-4 svbu-qa1850-3 $ /usr/local/ofed/mpi/gcc/mvapich-0.9.7-mlx2.2.0/bi

Re: [openib-general] [openfabrics-ewg] problems running MVAPICH on OFED 1.1 rc6 with SLES10 x86_64

2006-10-01 Thread Aviram Gutman
Can you please elaborate on MVAPICH issues, can you send command line? We ran it here on 32 Opteron nodes each quad core and also rigorous tests on the many other nodes? Scott Weitzenkamp (sweitzen) wrote: > We are just getting started with OFED testing on SLES10, first > platform is x86_64.