Hi Sasha, On Tue, Jan 12, 2010 at 4:31 AM, Sasha Khapyorsky <[email protected]> wrote: > Hi Hal, > > On 08:56 Mon 11 Jan , Hal Rosenstock wrote: >> > >> > diff --git a/tests/subnet_discover.c b/tests/subnet_discover.c >> > index 7f8a85c..42e7aee 100644 >> > --- a/tests/subnet_discover.c >> > +++ b/tests/subnet_discover.c >> > @@ -40,6 +40,7 @@ static struct node *node_array[32 * 1024]; >> > static unsigned node_count = 0; >> > static unsigned trid_cnt = 0; >> > static unsigned outstanding = 0; >> > +static unsigned max_outstanding = 8; >> >> Any reason why this default is different from the one which OpenSM >> uses ? Seems to me it should be the same (or less). > > In my tests I found that '8' is more optimal number (the tool works > faster and without drops) than '4' used in OpenSM. > > Of course it would be helpful to run this over bigger cluster than > what I have to see that the results are consistent.
This is exactly my concern. Not only cluster size but use cases including concurrent diag discover and SM operation where SMPs are heavily in use. There already have been a number of reports of dropped SMPs on this list with the current diags and this change will only make things worse IMO. Also, the OpenSM default should be at least as large as the diags for this. -- Hal > Sasha > -- To unsubscribe from this list: send the line "unsubscribe linux-rdma" in the body of a message to [email protected] More majordomo info at http://vger.kernel.org/majordomo-info.html
