Re: ibstat stuck in state initialized after reboot

2010-03-24 Thread Ira Weiny
On Wed, 24 Mar 2010 13:42:55 -0600 Michael Robbert wrote: > I've got good news. I was able to get opensm to take control. I gave it a > priority of 15 and rebooted the 7000D. Unfortunately I'm not sure I can > leave it like this forever. The only host I had with opensm installed is my > test fron

RE: ibstat stuck in state initialized after reboot

2010-03-24 Thread Meyer, Donald J
I can't speak for all IB networks, but I do know on our network, the SM on our switch wouldn't last longer than a week. That network had 288 nodes. We cured all the SM problems by switching to a dedicated server running OpenSM only. We tried running OpenSM on a server running other tasks too,

Re: ibstat stuck in state initialized after reboot

2010-03-24 Thread Michael Robbert
I've got good news. I was able to get opensm to take control. I gave it a priority of 15 and rebooted the 7000D. Unfortunately I'm not sure I can leave it like this forever. The only host I had with opensm installed is my test front end for an OS upgrade I'm testing. We're moving from Rocks 4.3

Re: ibstat stuck in state initialized after reboot

2010-03-24 Thread Chuck Hartley
On Wed, Mar 24, 2010 at 2:25 PM, Ira Weiny wrote: > On Wed, 24 Mar 2010 11:34:02 -0600 > Michael Robbert wrote: > > I will second this.  OpenSM has come a long way since the time Cisco was > selling IB switches.  If I understand your situation you don't even need the > 7000D you could just remove

Re: ibstat stuck in state initialized after reboot

2010-03-24 Thread Ira Weiny
On Wed, 24 Mar 2010 11:34:02 -0600 Michael Robbert wrote: > Interesting note! The 7024 is our large switch where all the hosts are > connected, but I was told that we were sold the 7000D because the 7024 > didn't have a subnet manager. Unfortunately the 7000D has a different CLI > and that comman

Re: ibstat stuck in state initialized after reboot

2010-03-24 Thread Michael Robbert
I just discovered another interesting point. I tried to start opensm on one of my hosts and it went into STANDBY state. Here is the log of it trying to start up: Mar 24 12:23:25 117170 [66DAC170] 0x80 -> OpenSM 3.3.5 Entering DISCOVERING state Mar 24 12:23:25 117863 [66DAC170] 0x02 -> osm_vendo

Re: Ummunotify: progress at last!

2010-03-24 Thread Jason Gunthorpe
On Tue, Mar 23, 2010 at 10:59:42PM -0700, Roland Dreier wrote: > That is all definitely doable. I wonder if it's better to get rid of > the dedicated fd though. After all, having the fd means a fancy app can > do poll() or sigio or whatever internally. Being able to integrate into > an fd-drive

Re: [PATCH] IB core: Fix locking on device numbers allocation

2010-03-24 Thread Roland Dreier
> When the driver needs to dynamically allocate char device numbers in systems > with more than IB_UVERBS_MAX_DEVICES, it releases map lock, allocates a new > range and a new device number from that range, and only then re-acquires the > lock. This must be protected for the same reasoning that

Re: ibstat stuck in state initialized after reboot

2010-03-24 Thread Michael Robbert
Interesting note! The 7024 is our large switch where all the hosts are connected, but I was told that we were sold the 7000D because the 7024 didn't have a subnet manager. Unfortunately the 7000D has a different CLI and that command is not available and I don't have the password for our 7024 so

RE: ibstat stuck in state initialized after reboot

2010-03-24 Thread Meyer, Donald J
http://www.cisco.com/en/US/docs/server_nw_virtual/7024/release_4.1/hardware/installation/guide/7024hig.pdf smControl Starts and stops the embedded subnet manager. Syntax: smControl start | stop | restart | status Thanks, Don Meyer Senior Network/System Engineer/Programmer US+ (253) 371-9532 iNet

Re: ibstat stuck in state initialized after reboot

2010-03-24 Thread Michael Robbert
Ira, Thanks for the quick response. That is what I was afraid of. I've been looking through the switch documentation, but it doesn't cover starting, stopping, or even checking the status of the SM service. I'll look into opening a TAC case, but since Cisco has gotten out of the IB business I'm n

Re: ibstat stuck in state initialized after reboot

2010-03-24 Thread Ira Weiny
On Wed, 24 Mar 2010 10:26:02 -0600 Michael Robbert wrote: > I hope this is the correct place to get help with the problem I have. I have > an IB fabric running on a Cisco SFS switch with a 7000D as the subnet > manager and the whole thing has been running great for well over a year now, > but tod

ibstat stuck in state initialized after reboot

2010-03-24 Thread Michael Robbert
I hope this is the correct place to get help with the problem I have. I have an IB fabric running on a Cisco SFS switch with a 7000D as the subnet manager and the whole thing has been running great for well over a year now, but today I noticed that after any node gets rebooted its IB link doesn'

[PATCH] opensm: fixing compilation issues in some header files

2010-03-24 Thread Yevgeny Kliteynik
All the compilation issues refer to implicit casting from "void*" to "some_struct_t*" Signed-off-by: Yevgeny Kliteynik --- opensm/include/opensm/osm_pkey.h |8 +--- opensm/include/opensm/osm_port.h |4 ++-- opensm/include/opensm/osm_subnet.h |2 +- 3 files changed, 8 insertio

Re: [PATCH] Dimension port order file support

2010-03-24 Thread Sasha Khapyorsky
Hi Dale, On 18:06 Wed 03 Mar , Dale Purdy wrote: > > Provide a means to specify on a per switch basis the mapping (order) > between switch ports and dimensions for Dimension Order Routing. This > allows the DOR routing engine to be used when the cabling is not > properly aligned for DOR, eit

Re: opensm/main.c: foce stdout to be line-buffered

2010-03-24 Thread Yevgeny Kliteynik
On 23/Mar/10 23:36, Yevgeny Kliteynik wrote: On 23/Mar/10 18:37, Sasha Khapyorsky wrote: On 14:25 Tue 23 Mar , Yevgeny Kliteynik wrote: I'm running "opensm> somefile", and I don't see SM's stdout (such as "SUBNET UP" message, or new cached options after SIGHUP), because when stdout is assigned