[openib-general] DAT API for OpenIB stack

2005-05-03 Thread James Lentini
An open source implementation of the DAT API for the OpenIB stack (gen2) is available in the OpenIB subversion repository at https://openib.org/svn/gen2/users/jlentini/ If you are interested in learning more about the DAT API, please see http://www.datcollaborative.org/ or http://sourceforge.net/

[openib-general] Re: [DAPL] when to cleanup code?

2005-05-03 Thread James Lentini
We are ready to start the cleanup now. I've just checked in some changes that I think simplify the interface. james On Mon, 2 May 2005, Tom Duffy wrote: James, When do you plan on cleaning up the dapl code? I am referring to the following things: - remove opaque types - remove use of typedef - f

Re: [openib-general] RFD: uverbs and hotplug

2005-05-03 Thread Caitlin Bestler
On 5/3/05, Tom Duffy <[EMAIL PROTECTED]> wrote: > On Tue, 2005-05-03 at 09:42 -0700, Caitlin Bestler wrote: > > Most applications are not written to be able to > > survice a sudden loss of virtually all of its connections, and then resume > > all of the suspended sessions with new connections using

Re: [openib-general] RFD: uverbs and hotplug

2005-05-03 Thread Tom Duffy
On Tue, 2005-05-03 at 09:42 -0700, Caitlin Bestler wrote: > Most applications are not written to be able to > survice a sudden loss of virtually all of its connections, and then resume > all of the suspended sessions with new connections using new memory > regions. Most applications *should* be wr

RE: [openib-general] [PATCH] [RMPP] ib_coalesce_recv_mad

2005-05-03 Thread Sean Hefty
>I'm not sure I like this "usermode" parameter that selects between >memcpy() and copy_to_user(). For one thing, it means that it is >impossible to fix the sparse warnings without defeating the sparse >address space testing: you have to add a casts to and from __user >pointers all over the place.

Re: [openib-general] Re: user-mode verbs on Itanium

2005-05-03 Thread Grant Grundler
On Fri, Apr 29, 2005 at 11:38:46AM -0700, Grant Grundler wrote: ... > ionize:~# ibv_pingpong -s 64 -n 10 10.0.0.51 > local address: LID 0x000d, QPN 0x070406, PSN 0x972e9e > remote address: LID 0x000b, QPN 0x030406, PSN 0x3c7543 > 1280 bytes in 1.65 seconds = 62.10 Mbit/sec > 10 ite

Re: [openib-general] [PATCH] [RMPP] ib_coalesce_recv_mad

2005-05-03 Thread Roland Dreier
I'm not sure I like this "usermode" parameter that selects between memcpy() and copy_to_user(). For one thing, it means that it is impossible to fix the sparse warnings without defeating the sparse address space testing: you have to add a casts to and from __user pointers all over the place. Also

[openib-general] [PATCH] [RMPP] ib_coalesce_recv_mad

2005-05-03 Thread Sean Hefty
The following patch implements ib_coalesce_recv_mad and fixes an issue where the mad_len was set incorrectly on the receive side. Note, you will need to use the attached file for the patch to apply correctly, but provided inline for review. Signed-off-by: Sean Hefty <[EMAIL PROTECTED]> Index: i

[openib-general] Re: Some Initial RMPP Wire Observations

2005-05-03 Thread Hal Rosenstock
On Tue, 2005-05-03 at 18:01, Sean Hefty wrote: > Hal Rosenstock wrote: > > Thanks for pointing that out. I am using ib_create_mad_send for this. > > Not sure why they are not cleared. Maybe it's related to the other issue > > I am having. > > This may or may not be relevant to what you're doing, b

[openib-general] Re: Some Initial RMPP Wire Observations

2005-05-03 Thread Sean Hefty
Hal Rosenstock wrote: Thanks for pointing that out. I am using ib_create_mad_send for this. Not sure why they are not cleared. Maybe it's related to the other issue I am having. This may or may not be relevant to what you're doing, but I should mention that there's a bug setting the mad_len on the

[openib-general] Re: Some Initial RMPP Wire Observations

2005-05-03 Thread Hal Rosenstock
On Tue, 2005-05-03 at 17:39, Sean Hefty wrote: > I think what's necessary is for the DATA MAD to be rejected. This check > is missing, but easily added. (I was deferring some of this until > STOP/ABORT are implemented.) OK. > So... I think that the ACK code may be > correct, but checks don

[openib-general] Re: Some Initial RMPP Wire Observations

2005-05-03 Thread Sean Hefty
Hal Rosenstock wrote: OK that answers the ACK formatting but not whether an ACK is correct in this case. o13-21.1.15 states that if a status code is received with an RMPPType value other than those it applies to, the Receiver shall discard the packet, terminate the protocol,and issue an RMPP ABORT

[openib-general] Re: Some Initial RMPP Wire Observations

2005-05-03 Thread Hal Rosenstock
On Tue, 2005-05-03 at 16:55, Sean Hefty wrote: > Hal Rosenstock wrote: > > Short RMPP messages (less than 1 MAD's worth) have first bit set but not > > also last in RMPPFlags. > > Can you re-check this? I'm able to successfully receive short segments, > and the receive code looks for the last bi

[openib-general] Re: Some Initial RMPP Wire Observations

2005-05-03 Thread Sean Hefty
Hal Rosenstock wrote: Short RMPP messages (less than 1 MAD's worth) have first bit set but not also last in RMPPFlags. Can you re-check this? I'm able to successfully receive short segments, and the receive code looks for the last bit to check for completion. Also, if the initial RMPP send had b

[openib-general] Some Initial RMPP Wire Observations

2005-05-03 Thread Hal Rosenstock
Hi Sean, Here are some wire based observations so far on RMPP running between a toy SA client (making some GetTable requests) and OpenSM with RMPP enabled: Short RMPP messages (less than 1 MAD's worth) have first bit set but not also last in RMPPFlags. Also, if the initial RMPP send had bad stat

[openib-general] Re: SDP: unexpected conn state

2005-05-03 Thread Libor Michalek
On Mon, May 02, 2005 at 05:21:48PM +0300, Michael S. Tsirkin wrote: > Quoting r. Michael S. Tsirkin <[EMAIL PROTECTED]>: > > Subject: SDP: unexpected conn state > > > > With ttcp, I sometimes see the following messages from sdp_rcvd_disconnect: > > > > WARN: : Unexpected conn state. conn <1> stat

Re: [openib-general] [PATCH][SDP] AIO buffer corruption

2005-05-03 Thread Libor Michalek
On Mon, May 02, 2005 at 03:48:51PM +0100, Steven Wooding wrote: > Hello Libor, > > I've tried your patch, but unfortunately it made no difference to > the -32/-104 errors I get. I have observed the following features, > which may help you diagnose my problem: > > Platform: 64 bit > OS: RHEL 4 >

[openib-general] Re: RFD: uverbs and hotplug

2005-05-03 Thread Roland Dreier
Michael> The application will get a completion with error, and it Michael> just needs a way to know that its because of the HCA Michael> going away, so it seems to me you have to signal the Michael> event before closing the QPs, not after that. It won't get any completion at all: w

Re: [openib-general] Re: RFD: uverbs and hotplug

2005-05-03 Thread Roland Dreier
Michael> What about ib_umad? Can you pull the HCA while opensm is Michael> running? Don't know... I never thought about it. - R. ___ openib-general mailing list openib-general@openib.org http://openib.org/mailman/listinfo/openib-general To uns

[openib-general] Re: slab error in kmem_cache_destroy(): cache `ib_mad': Can't free all objects

2005-05-03 Thread Hal Rosenstock
On Tue, 2005-05-03 at 11:08, Roland Dreier wrote: > I don't think so, since user_mad's recv_handler() function always > calls ib_free_recv_mad(). If this only occurs on machines which are running OpenSM, it could be a problem with the local MAD handling where ib_free_recv_mad is not called for so

[openib-general] Re: [PATCH][DAPL] fix kdapltest Makefile

2005-05-03 Thread Tom Duffy
On Tue, 2005-05-03 at 15:07 -0400, James Lentini wrote: > Tom, > > Do you also want to remove the first EXTRA_CLAGS value: > >-Idrivers/dat \ > > or did you leave that there on purpose? > > You shouldn't need the > >-I$(src)/../../../dat-provider \ > > li

[openib-general] Re: Advice about adapting ibv_pingpong to use UC

2005-05-03 Thread Michael S. Tsirkin
Quoting r. Roland Dreier <[EMAIL PROTECTED]>: > Subject: Re: Advice about adapting ibv_pingpong to use UC > > Michael> Roland, would you like me to implement this? > > It would be great if you wanted to implement UC support. I think the > main thing that needs to be done is to sit down and g

[openib-general] Re: RFD: uverbs and hotplug

2005-05-03 Thread Michael S. Tsirkin
Quoting r. Roland Dreier <[EMAIL PROTECTED]>: > Subject: Re: RFD: uverbs and hotplug > Michael> No, but I know how NIC hotplug works on my laptop: push a > Michael> button, wait till a little light flashes, pull it out. > > Sure, that's easy, because the network layer just throws packets

Re: [openib-general] Re: RFD: uverbs and hotplug

2005-05-03 Thread Michael S. Tsirkin
Quoting r. Roland Dreier <[EMAIL PROTECTED]>: > Subject: Re: [openib-general] Re: RFD: uverbs and hotplug > > Grant> The rx4640 I'm using supports hotplug. What do you want > Grant> tested? I think you have access to "iowa" via gsyprf10, > Grant> right? > > Yeah, thanks. I'm not sur

[openib-general] Re: RFD: uverbs and hotplug

2005-05-03 Thread Michael S. Tsirkin
Quoting r. Roland Dreier <[EMAIL PROTECTED]>: > Subject: Re: RFD: uverbs and hotplug > > Michael> Hello, Roland, all! How should hotplug work with uverbs? > > ... > > Another possibility is as you described: free all the resources > allocated by userspace processes (including remapping any do

[openib-general] Re: [PATCH][DAPL] fix kdapltest Makefile

2005-05-03 Thread James Lentini
Tom, Do you also want to remove the first EXTRA_CLAGS value: -Idrivers/dat \ or did you leave that there on purpose? You shouldn't need the -I$(src)/../../../dat-provider \ line. DAT consumers only need to be aware of the registry (the code in the dat directory).

Re: [openib-general] Re: RFD: uverbs and hotplug

2005-05-03 Thread Roland Dreier
Grant> The rx4640 I'm using supports hotplug. What do you want Grant> tested? I think you have access to "iowa" via gsyprf10, Grant> right? Yeah, thanks. I'm not sure there's anything worth testing: mthca is a well-behaved PCI driver, so hotplug should work just like removing the mod

Re: [openib-general] Re: [PATCH][RFC][0/4] InfiniBand userspace verbs implementation

2005-05-03 Thread Greg Lindahl
On Tue, May 03, 2005 at 11:43:25AM -0700, Andy Isaacson wrote: > [1] You might want to allow the child to start a completely new RDMA > context, but I don't see that as necessary. Some people use a hybrid OpenMP+MPI model, so some MPI implementations want to fork and have the child open a new

Re: [openib-general] Re: [PATCH][RFC][0/4] InfiniBand userspace verbs implementation

2005-05-03 Thread Caitlin Bestler
On 5/3/05, Andy Isaacson <[EMAIL PROTECTED]> wrote: > > A consistent statement would be > > After fork(2), any regions which were registered are UNDEFINED. > Region boundaries are byte-accurate; a registration can cover just > part of a page, in which case the non-registered part of

Re: [openib-general] Re: RFD: uverbs and hotplug

2005-05-03 Thread Grant Grundler
On Tue, May 03, 2005 at 11:32:18AM -0700, Roland Dreier wrote: > Sure, that's easy, because the network layer just throws packets away > when a device disappears. Mass storage is better example - transactions have "state". The term "nexus" is used alot in SCSI specs. > We already have that workin

Re: [openib-general] Re: [PATCH][RFC][0/4] InfiniBand userspace verbs implementation

2005-05-03 Thread Andy Isaacson
On Fri, Apr 29, 2005 at 05:31:44PM -0700, Caitlin Bestler wrote: > Attempting to provide *any* support for applications that fork children > after doing RDMA registrations is a ratshole best avoided. The general > rule that application developers should follow is to do RDMA *only* > in the child pr

[openib-general] Re: RFD: uverbs and hotplug

2005-05-03 Thread Roland Dreier
Michael> Okay, but you still must blocks the applications from Michael> accessing the hardware - you cant trust them not to. Right, that's why I said "including remapping any doorbell pages." Michael> No, but I know how NIC hotplug works on my laptop: push a Michael> button, wait

[openib-general] Re: RFD: uverbs and hotplug

2005-05-03 Thread Michael S. Tsirkin
Quoting r. Roland Dreier <[EMAIL PROTECTED]>: > Subject: Re: RFD: uverbs and hotplug > > Michael> Hello, Roland, all! How should hotplug work with uverbs? > > It's complicated. Discussion now is useful but I would definitely > prefer to hold off on any implementation until we have the basic

Re: [openib-general] Re: RDMA memory registration

2005-05-03 Thread Caitlin Bestler
On 5/3/05, David Addison <[EMAIL PROTECTED]> wrote: > We believe the IOPROC patch is generic and powerful and would allow other > RDMA NICs to solve the page registration problems in a different manner. > For NICs which require page registration, new VM hooks can be used to avoid > pages being unl

Re: [openib-general] FMR and how they work

2005-05-03 Thread Caitlin Bestler
On 5/3/05, Libor Michalek <[EMAIL PROTECTED]> wrote: > On Mon, May 02, 2005 at 01:37:00PM -0700, Caitlin Bestler wrote: > > What advantages is this style of fast memory register supposed to > > have over the work request style found in IB 1.2 and iWARP? > > It's faster since all you're doing is

[openib-general] Re: RFD: uverbs and hotplug

2005-05-03 Thread Roland Dreier
Michael> Hello, Roland, all! How should hotplug work with uverbs? It's complicated. Discussion now is useful but I would definitely prefer to hold off on any implementation until we have the basic userspace verbs support merged upstream. One possibility is that it is the administrator's res

[openib-general] Re: Advice about adapting ibv_pingpong to use UC

2005-05-03 Thread Roland Dreier
Michael> Roland, would you like me to implement this? It would be great if you wanted to implement UC support. I think the main thing that needs to be done is to sit down and go over the description of the modify QP verb in the IB spec and update all the transitions in state_table[][] in mthc

Re: [openib-general] FMR and how they work

2005-05-03 Thread David M. Brean
Hello, I thought that IB implementations of iSER and NFS over RDMA were using kDAPL. -David Libor Michalek wrote: On Mon, May 02, 2005 at 01:37:00PM -0700, Caitlin Bestler wrote: What advantages is this style of fast memory register supposed to have over the work request style found in IB 1.2

Re: [openib-general] Re: RDMA memory registration

2005-05-03 Thread Ronald G. Minnich
> On 5/3/05, David Addison <[EMAIL PROTECTED]> wrote: > > as our recent IOPROC patch on lkml shows, it's not that invasive. There > > are just 24 hooks added to the Linux VM code paths - which we have been > > able to > > maintain outside the mainline tree for many years now. > > As these hooks

Re: [openib-general] FMR and how they work

2005-05-03 Thread Libor Michalek
On Mon, May 02, 2005 at 02:59:26PM -0700, Fab Tillier wrote: > > From: Dror Goldenberg [mailto:[EMAIL PROTECTED] > > > > You're welcomed to measure. I think that FMRs will be the fastest way > > to create mapping. Faster than MWs. > > The reason I asked is SDP went down the FMR path, but that can

Re: [openib-general] FMR and how they work

2005-05-03 Thread Libor Michalek
On Mon, May 02, 2005 at 01:37:00PM -0700, Caitlin Bestler wrote: > What advantages is this style of fast memory register supposed to > have over the work request style found in IB 1.2 and iWARP? It's faster since all you're doing is rewriting the device's page table mapping. It would be interest

Re: [openib-general] Re: RDMA memory registration

2005-05-03 Thread Caitlin Bestler
An ex post facto notification of a PTE change would enable the RDMA Device driver to know when a Memory Region had been invalidated so that it could probably declare an access violation and tear all the connections using it down. But if the intent is to allow it to migrate the memory region to th

Re: [openib-general] RFD: uverbs and hotplug

2005-05-03 Thread Caitlin Bestler
The fundamental question here is what degree of support is desired? Is it to ensure that the OS can force a memory region to be destroyed on demand, and it is then up to the application to figure out how to respond? Or is it to allow the OS to change the meaning of a Memory Region so that it can

Re: [openib-general] Re: RDMA memory registration

2005-05-03 Thread Grant Grundler
On Tue, May 03, 2005 at 09:42:12AM +0100, David Addison wrote: > >This doesn't scale well as more cards are added to the box. > >I think I understand why it's good for single cards though. > > With the IOPROC patch the device driver hooks are registered on a per > process or perhaps better still,

[openib-general] Re: slab error in kmem_cache_destroy(): cache `ib_mad': Can't free all objects

2005-05-03 Thread Roland Dreier
Michael> Right, but this wont be related to the kmem_cache leak I Michael> see, will it? I don't think so, since user_mad's recv_handler() function always calls ib_free_recv_mad(). - R. ___ openib-general mailing list openib-general@openib.org

RE: [openib-general] FMR and how they work

2005-05-03 Thread Dror Goldenberg
Title: RE: [openib-general] FMR and how they work > -Original Message- > From: Fab Tillier [mailto:[EMAIL PROTECTED]] > Sent: Tuesday, May 03, 2005 12:59 AM > > Does the HW2SW_MPT command flush the MTTs referenced by the > MPT (if any)? That is, if an MPT has mtt_seg_adr_h and

[openib-general] Re: slab error in kmem_cache_destroy(): cache `ib_mad': Can't free all objects

2005-05-03 Thread Hal Rosenstock
On Tue, 2005-05-03 at 03:27, Michael S. Tsirkin wrote: > Right, but this wont be related to the kmem_cache leak I see, will it? Right )it doesn't solve it). This is what the original mail said: "I don't see a MAD leak but perhaps see a couple of other possible memory leaks on shutdown." This was o

Re: [openib-general] Re: RDMA memory registration

2005-05-03 Thread David Addison
Grant Grundler wrote: On Fri, Apr 29, 2005 at 08:22:24PM +0200, Brice Goglin wrote: For instance, instead of adding PROT_DONT/ALWAYSCOPY, you may use an ioproc hook in the fork path. This hook (a function in your driver) would be called for each registered page. It will decide whether the page shou

Re: [openib-general] Re: RDMA memory registration

2005-05-03 Thread David Addison
Ronald G. Minnich wrote: On Fri, 29 Apr 2005, Greg Lindahl wrote: It doesn't imply that there's an MMU, either. I know that Myricom uses a little lookup routine in software on their nic, which most people wouldn't call an MMU. I don't know what Mellanox does for this, they don't talk much about wha

[openib-general] RFD: uverbs and hotplug

2005-05-03 Thread Michael S. Tsirkin
Hello, Roland, all! How should hotplug work with uverbs? Currently ib_uverbs_remove_one simply does class_device_unregister. What will if an application has the device open? Ideally, once the device is removed, applications get an error code and so can close the device freeing up any resources, a

[openib-general] Re: Advice about adapting ibv_pingpong to use UC

2005-05-03 Thread Michael S. Tsirkin
Quoting r. Roland Dreier <[EMAIL PROTECTED]>: > Subject: Re: Advice about adapting ibv_pingpong to use UC > > Steven> Hi, I wonder if someone working on the gen2 uverbs would > Steven> be so kind as to give me some advice on adapting the > Steven> ibv_pingpong program to use a UC QP ty

Re: [openib-general] Re: RDMA memory registration

2005-05-03 Thread David Addison
Greg Lindahl wrote: On Fri, Apr 29, 2005 at 12:33:54PM -0700, Grant Grundler wrote: Being mostly clueless about Quadrics implementation, I'm probably missing something that makes Quadrics a MMU but not the IB variants. Can someone clue me in please? As far as I can tell it's mostly a marketing dist

[openib-general] Re: slab error in kmem_cache_destroy(): cache `ib_mad': Can't free all objects

2005-05-03 Thread Michael S. Tsirkin
Quoting r. Roland Dreier <[EMAIL PROTECTED]>: > Subject: Re: slab error in kmem_cache_destroy(): cache `ib_mad': Can't free > all objects > > Hal> Also, does user_mad.c leak memory on the receive side if it > Hal> is shutdown and the application has not drained the receive > Hal> queu