On Wed, 25 Jul 2012 15:57:58 +0300
Alex Netes <ale...@mellanox.com> wrote:

> Hi Ira,
> 
> On 16:53 Tue 03 Jul     , Ira Weiny wrote:
> > 
> > 
> > When a node drops from the fabric the default behavior was to leave the node
> > data in the PerfMgr DB.  Add the option (with default set to TRUE) to delete
> > these "inactive" or missing nodes from the DB.
> > 
> > Signed-off-by: Ira Weiny <wei...@llnl.gov>
> > ---

[snip]

> > +}
> > +
> >  /**********************************************************************
> >   * Dump a reading vs the previous reading to stdout
> >   **********************************************************************/
> > diff --git a/opensm/osm_subnet.c b/opensm/osm_subnet.c
> > index 662e591..06efa9c 100644
> > --- a/opensm/osm_subnet.c
> > +++ b/opensm/osm_subnet.c
> > @@ -493,6 +493,7 @@ static const opt_rec_t opt_tbl[] = {
> >     { "perfmgr_max_outstanding_queries", 
> > OPT_OFFSET(perfmgr_max_outstanding_queries), opts_parse_uint32, NULL, 0 },
> >     { "perfmgr_ignore_cas", OPT_OFFSET(perfmgr_ignore_cas), 
> > opts_parse_boolean, NULL, 0 },
> >     { "event_db_dump_file", OPT_OFFSET(event_db_dump_file), 
> > opts_parse_charp, NULL, 0 },
> > +   { "perfmgr_rm_nodes", OPT_OFFSET(perfmgr_rm_nodes), opts_parse_boolean, 
> > NULL, 0 },
> 
> I guess that this option can be enabled to be changed on the fly, as it might
> be changed also via the console.

yes.

> 
> >  #endif                             /* ENABLE_OSM_PERF_MGR */
> >     { "event_plugin_name", OPT_OFFSET(event_plugin_name), opts_parse_charp, 
> > NULL, 0 },
> >     { "event_plugin_options", OPT_OFFSET(event_plugin_options), 
> > opts_parse_charp, NULL, 0 },
> > @@ -985,6 +986,7 @@ void osm_subn_set_default_opt(IN osm_subn_opt_t * p_opt)
> >         OSM_PERFMGR_DEFAULT_MAX_OUTSTANDING_QUERIES;
> >     p_opt->perfmgr_ignore_cas = FALSE;
> >     p_opt->event_db_dump_file = NULL; /* use default */
> > +   p_opt->perfmgr_rm_nodes = TRUE;
> 
> Here you change the default behavior. Any thoughts what is the added value to
> leave disconnected nodes in the DB?

Originally I was thinking that leaving nodes in the DB would reduce the load
on the perfmgr when nodes disappears/reappears for "normal" reasons.  For
example a full reboot of all compute nodes.

As we start to finally ramp up to use this I don't think the load is that
great and we are having issues with hardware replacements leaving nodes in the
DB and causing user confusion (for example they have 2 nodes of the same name
being reported).  So for users I think the default is better set at TRUE.

Ira

> 
> >  #endif                             /* ENABLE_OSM_PERF_MGR */
> >  
> >     p_opt->event_plugin_name = NULL;
> > @@ -2009,12 +2011,15 @@ int osm_subn_output_conf(FILE *out, IN 
> > osm_subn_opt_t * p_opts)
> >             "perfmgr_sweep_time_s %u\n\n"
> >             "# Max outstanding queries\n"
> >             "perfmgr_max_outstanding_queries %u\n"
> > -           "perfmgr_ignore_cas %s\n\n",
> > +           "perfmgr_ignore_cas %s\n\n"
> > +           "# Remove missing nodes from DB\n"
> > +           "perfmgr_rm_nodes %s\n",
> >             p_opts->perfmgr ? "TRUE" : "FALSE",
> >             p_opts->perfmgr_redir ? "TRUE" : "FALSE",
> >             p_opts->perfmgr_sweep_time_s,
> >             p_opts->perfmgr_max_outstanding_queries,
> > -           p_opts->perfmgr_ignore_cas ? "TRUE" : "FALSE");
> > +           p_opts->perfmgr_ignore_cas ? "TRUE" : "FALSE",
> > +           p_opts->perfmgr_rm_nodes ? "TRUE" : "FALSE");
> >  
> >     fprintf(out,
> >             "#\n# Event DB Options\n#\n"
> > -- 
> > 1.7.1
> > 
> > --
> > To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
> > the body of a message to majord...@vger.kernel.org
> > More majordomo info at  http://vger.kernel.org/majordomo-info.html


-- 
Ira Weiny
Member of Technical Staff
Lawrence Livermore National Lab
925-423-8008
wei...@llnl.gov
--
To unsubscribe from this list: send the line "unsubscribe linux-rdma" in
the body of a message to majord...@vger.kernel.org
More majordomo info at  http://vger.kernel.org/majordomo-info.html

Reply via email to