Re: [opensm] RFC: new routing options

2010-10-12 Thread Yevgeny Kliteynik
Hi Al, This looks really great! One question: have you tried benchmarking the BW with up/down routing using the guid_routing_order_file option w/o your new features? -- YK On 08-Oct-10 7:40 PM, Albert Chu wrote: Hey Sasha, We recently got a new cluster and I've been experimenting with some

Re: [PATCH] mlx4: Limit num of fast reg WRs

2010-10-12 Thread Eli Cohen
On Tue, Oct 12, 2010 at 12:13:26AM +0200, Or Gerlitz wrote: Guys, can you clarify if the hardware limitation is 511 entries or its (PAGE_SIZE / sizeof(pointer)) - 1 which is 4096 / 8 - 1 = 511 but can change if the page size gets bigger or smaller? The limit is 511 entries. After I posted
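
The arithmetic in Or's question can be checked with a small user-space sketch. The helper name `page_list_capacity` is hypothetical and does not come from the mlx4 driver; it only evaluates the formula (PAGE_SIZE / sizeof(pointer)) - 1 to show how that value would move with the page size, in contrast to the fixed limit of 511 entries stated in the reply.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper (not part of mlx4): evaluates the formula from the
 * question, (page_size / entry_size) - 1. Returns 0 if entry_size is 0 or
 * if fewer than one entry fits in the page. */
static size_t page_list_capacity(size_t page_size, size_t entry_size)
{
    if (entry_size == 0 || page_size < entry_size)
        return 0;
    return page_size / entry_size - 1;
}
```

With a 4096-byte page and 8-byte entries this yields 511, matching the figure in the thread; with a 65536-byte page the same formula would yield 8191, which is why it matters whether the limit is a fixed hardware constant or derived from the page size.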

[patch v3] infiniband: uverbs: handle large number of entries

2010-10-12 Thread Dan Carpenter
In the original code there was a potential integer overflow if you passed in a large cmd.ne. The calls to kmalloc() would allocate smaller buffers than intended, leading to memory corruption. There was also an information leak. Documentation/infiniband/user_verbs.txt suggests this function is
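
The class of bug described here, a count multiplied by an element size wrapping around before allocation, can be illustrated with a minimal user-space sketch. This is not the code from the patch, which is not shown in this snippet; `checked_array_size` is a hypothetical helper, and the kernel has its own facilities for this.

```c
#include <assert.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical helper: computes count * elem_size without wrapping.
 * Returns 0 and stores the product in *out on success, or -1 if the
 * product would not fit in size_t (the caller must then refuse the
 * request instead of allocating a too-small buffer). */
static int checked_array_size(size_t count, size_t elem_size, size_t *out)
{
    if (elem_size != 0 && count > SIZE_MAX / elem_size)
        return -1;
    *out = count * elem_size;
    return 0;
}
```

A caller would run this check on the user-supplied count first and only allocate when it returns 0, so an oversized count is rejected rather than silently producing a short buffer.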

Trying to link with DAT 2.0 function

2010-10-12 Thread Young, Eric R.
My motivation for using the dat_cno_fd_create() is that I am able to register a file descriptor with a reactor (all events go through a reactor which has multiple I/O including I/O which is not at all tied to uDAPL). An application is able to work on other tasks while waiting for the reactor to call

Re: [opensm] RFC: new routing options

2010-10-12 Thread Albert Chu
Hey Yevgeny, Yes, I tried that and it didn't have much of an effect. Ever since Sasha put in his routing sorted by switch load (sort_ports_by_switch_load() in osm_ucast_mgr.c), guid_routing_order isn't really necessary (as long as most of the cluster is up). Al On Tue, 2010-10-12 at 00:59

Work completions generated after a queue pair has made the transition to an error state

2010-10-12 Thread Bart Van Assche
Hello, Has anyone already tried to process the work completions generated by a HCA after the state of a queue pair has been changed to IB_QPS_ERR ? With the hardware/firmware/driver combination I have tested I have observed the following: * Multiple completions with the same wr_id and nonzero

Re: Work completions generated after a queue pair has made the transition to an error state

2010-10-12 Thread Or Gerlitz
Bart Van Assche bvanass...@acm.org wrote: Has anyone been looking into this before ? nope, never ever, what hca is that? Or.

Re: Work completions generated after a queue pair has made the transition to an error state

2010-10-12 Thread Bart Van Assche
On Tue, Oct 12, 2010 at 8:50 PM, Ralph Campbell ralph.campb...@qlogic.com wrote: On Tue, 2010-10-12 at 11:38 -0700, Bart Van Assche wrote: Hello, Has anyone already tried to process the work completions generated by a HCA after the state of a queue pair has been changed to IB_QPS_ERR ? With

[RFC 0/2] IB/umad: Export mad snooping to userspace

2010-10-12 Thread Hefty, Sean
The kernel mad interface allows a client to view all sent and received MADs. This has proven to be a useful debugging technique when paired with the external kernel module, madeye. However, madeye was never intended to be submitted upstream. A couple of alternatives have been proposed for

[RFC 1/2] IB/mad: Simplify snooping interface

2010-10-12 Thread Hefty, Sean
In preparation for exporting the kernel mad snooping capability to user space, remove all code originally inserted as place holders and simplify the mad snooping interface. For performance reasons, we want to filter which mads are reported to clients of the snooping interface at the lowest level,

[RFC 2/2] IB/umad: Export mad snooping capability to userspace

2010-10-12 Thread Hefty, Sean
Export the mad snooping capability to user space clients through the existing umad interface. This will allow users to capture MAD data for debugging, plus it allows for services to act on MAD traffic that occurs. For example, a daemon could snoop SA queries and CM messages as part of providing

Re: Work completions generated after a queue pair has made the transition to an error state

2010-10-12 Thread Eli Cohen
On Tue, Oct 12, 2010 at 08:58:59PM +0200, Bart Van Assche wrote: On Tue, Oct 12, 2010 at 8:50 PM, Ralph Campbell ralph.campb...@qlogic.com wrote: On Tue, 2010-10-12 at 11:38 -0700, Bart Van Assche wrote: Hello, Has anyone already tried to process the work completions generated by a HCA

[PATCH 0/2] svcrdma: NFSRDMA Server fixes for 2.6.37

2010-10-12 Thread Tom Tucker
Hi Bruce, These fixes are ready for 2.6.37. They fix two bugs in the server-side NFSRDMA transport. Thanks, Tom --- Tom Tucker (2): svcrdma: Cleanup DMA unmapping in error paths. svcrdma: Change DMA mapping logic to avoid the page_address kernel API

[PATCH 1/2] svcrdma: Change DMA mapping logic to avoid the page_address kernel API

2010-10-12 Thread Tom Tucker
There was logic in the send path that assumed that a page containing data to send to the client has a KVA. This is not always the case and can result in data corruption when page_address returns zero and we end up DMA mapping zero. This patch changes the bus mapping logic to avoid page_address()

[PATCH 2/2] svcrdma: Cleanup DMA unmapping in error paths.

2010-10-12 Thread Tom Tucker
There are several error paths in the code that do not unmap DMA. This patch adds calls to svc_rdma_unmap_dma to free these DMA contexts. Signed-off-by: Tom Tucker t...@opengridcomputing.com --- net/sunrpc/xprtrdma/svc_rdma_recvfrom.c |1 + net/sunrpc/xprtrdma/svc_rdma_sendto.c|2 ++

Re: [RFC 0/2] IB/umad: Export mad snooping to userspace

2010-10-12 Thread Jason Gunthorpe
On Tue, Oct 12, 2010 at 12:10:37PM -0700, Hefty, Sean wrote: The kernel mad interface allows a client to view all sent and received MADs. This has proven to be a useful debugging technique when paired with the external kernel module, madeye. However, madeye was never intended to be

Re: [PATCH] mlx4: Limit num of fast reg WRs

2010-10-12 Thread Roland Dreier
After I posted this patch, I was told that there is yet another constraint on the page list: The buffer containing the list must not cross a page boundary. So I was thinking what is the best way to deal with this. One way is to always allocate a whole page and map it using
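
The constraint mentioned here, that the buffer holding the page list must not cross a page boundary, can be expressed as a simple address check. The sketch below is an assumption-labeled illustration in user-space C; `crosses_page_boundary` is a hypothetical name, and the addresses are plain integers rather than DMA handles.

```c
#include <assert.h>
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical helper: reports whether the byte range [addr, addr + len)
 * spans more than one page of the given size. A zero-length range, or a
 * page_size of 0, is treated as not crossing. */
static bool crosses_page_boundary(uint64_t addr, uint64_t len, uint64_t page_size)
{
    if (len == 0 || page_size == 0)
        return false;
    /* Compare the page index of the first and last byte of the range. */
    return (addr / page_size) != ((addr + len - 1) / page_size);
}
```

A buffer that starts on a page boundary and is no larger than one page always passes this check, which is the property the "allocate a whole page" approach relies on.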

RE: [RFC 0/2] IB/umad: Export mad snooping to userspace

2010-10-12 Thread Hefty, Sean
TBH, I think this would be much better off integrating with the existing paths tcpdump/setc uses rather than yet again something new This ties in with the existing MAD interface, which isn't going away anytime soon, if ever.

Re: [patch v3] infiniband: uverbs: handle large number of entries

2010-10-12 Thread Jason Gunthorpe
On Tue, Oct 12, 2010 at 01:31:17PM +0200, Dan Carpenter wrote: In the original code there was a potential integer overflow if you passed in a large cmd.ne. The calls to kmalloc() would allocate smaller buffers than intended, leading to memory corruption. Keep in mind these are probably

Re: [RFC 0/2] IB/umad: Export mad snooping to userspace

2010-10-12 Thread Jason Gunthorpe
On Tue, Oct 12, 2010 at 01:54:54PM -0700, Hefty, Sean wrote: TBH, I think this would be much better off integrating with the existing paths tcpdump/setc uses rather than yet again something new This ties in with the existing MAD interface, which isn't going away anytime soon, if ever. I

Re: [PATCH] Make multicast and path record queue flexible.

2010-10-12 Thread Jason Gunthorpe
On Tue, Oct 12, 2010 at 06:29:53PM +0200, Alekseys Senin wrote: On Tue, 2010-10-05 at 14:12 -0500, Christoph Lameter wrote: On Tue, 5 Oct 2010, Jason Gunthorpe wrote: On Tue, Oct 05, 2010 at 06:07:37PM +0200, Aleksey Senin wrote: When using slow SM allow more packets to be

Re: [PATCH] mlx4: Limit num of fast reg WRs

2010-10-12 Thread Eli Cohen
On Tue, Oct 12, 2010 at 01:37:37PM -0700, Roland Dreier wrote: Is there any chance of the dma_alloc_coherent() in the current code allocating memory that crosses a page boundary? You mean that the allocation is aligned at least to its size? I could not find any commitment to this anywhere.

RE: [RFC 0/2] IB/umad: Export mad snooping to userspace

2010-10-12 Thread Hefty, Sean
TBH, I think this would be much better off integrating with the existing paths tcpdump/setc uses rather than yet again something new This ties in with the existing MAD interface, which isn't going away anytime soon, if ever. I didn't say the MAD interface was going away, I said it

Opensm crash with OFED 1.5

2010-10-12 Thread Suresh Shelvapille
Folks: I have a multi-processor machine, running FedoraCore 12. I have installed OFED 1.5. Everything seems to come up ok, I can look at the ibstat and it shows that the Mellanox card stats etc... As soon as I start opensm, I get the following kernel oops and the machine locks up. Any

linux-next: manual merge of the bkl-llseek tree with the infiniband tree

2010-10-12 Thread Stephen Rothwell
Hi Arnd, Today's linux-next merge of the bkl-llseek tree got a conflict in drivers/infiniband/hw/cxgb4/device.c between commit 8bbac892fb75d20fa274ca026e24faf00afbf9dd (RDMA/cxgb4: Add default_llseek to debugfs files) from the infiniband tree and commit 9711569d06e7df5f02a943fc4138fb152526e719