2011/1/10 Peter Kjellström <c...@nsc.liu.se>: > On Monday, January 10, 2011 03:06:06 pm Michael Di Domenico wrote: >> I'm not sure if these are being reported from OpenMPI or through >> OpenMPI from OpenFabrics, but i figured this would be a good place to >> start >> >> On one node we received the below errors, i'm not sure i under the >> error sequence, hopefully someone can shed some light on what >> happened. >> >> [[5691,1],49][btl_openib_component.c:3294:handle_wc] from node27 to: > ... >> network is qlogic qdr end to end, openmpi 1.5 and ofed 1.5.2 (q stack) > > Not really addressing your problem, but, with qlogic you should be using psm, > not verbs (btl_openib). > > That said, openib should work (slowly).
Yes, you are correct. We're running via verbs at the moment because of a slurm interop issue. I have a patch from ralph but have not tested it yet. So far the only noticeable to effect to running non-psm is a 5usec hit on each packet. otherwise functionally we seem okay.