Re: [OMPI devel] [OMPI svn-full] svn:open-mpi r25015

2011-08-08 Thread Wesley Bland
The reason is because valgrind was complaining about uninitialized values that were passed into proc_get_epoch. I saw the same warnings from valgrind when I ran it. I added the code to initialize the values to what really should be the default value and the warnings went away. Since the

Re: [OMPI devel] Uninitialized ORTE epoch values

2011-08-05 Thread Wesley Bland
I don't think these are anything to worry about since they're all print statements, but I will work on these tonight. On Fri, Aug 5, 2011 at 3:03 PM, Jeff Squyres wrote: > Ralph and I are trying to track down the mysterious ORTE error. > > In doing so, I have found at least

Re: [OMPI devel] RFC: Resilient ORTE

2011-06-23 Thread Wesley Bland
Committed in r24815. On Thursday, June 23, 2011 at 4:19 PM, Ralph Castain wrote: > > On Jun 23, 2011, at 2:14 PM, Wesley Bland wrote: > > Maybe before the ORTED saw the signal, it detected a communication failure > > and reacted to that. > > Quite possible. However, r

Re: [OMPI devel] RFC: Resilient ORTE

2011-06-23 Thread Wesley Bland
e been equally okay to simply call > "opal_event_dispatch" while waiting for the callback. > > All applications have to cycle the progress engine. > > > On Jun 23, 2011, at 1:18 PM, Wesley Bland wrote: > > Josh, > > > > There were a couple of bugs th

Re: [OMPI devel] RFC: Resilient ORTE

2011-06-23 Thread Wesley Bland
nitalized > orte_abort: Name [[60292,1],3,0] Host: smoky14 Pid 3843 -- Calling Abort > [jjhursey@smoky14 system] echo $? > 3 > ---- > > Any ideas on what I might be doing wrong? > > I tried with both calling 'orte_errmgr.abort(ORTE_PROC_MY_NAME->vpid

Re: [OMPI devel] RFC: Resilient ORTE

2011-06-23 Thread Wesley Bland
Last reminder (I hope). RFC goes in a COB today. Wesley

Re: [OMPI devel] RFC: Resilient ORTE

2011-06-18 Thread Wesley Bland
e since we just got the new patch on Friday COB (the RFC gave us 2 weeks to review the original patch). Would waiting until next Thursday/Friday COB be too disruptive? That should give me and maybe Ralph enough time to test and send any further feedback. > > Thanks, > Josh > > On Jun 17, 20

Re: [OMPI devel] RFC: Resilient ORTE

2011-06-17 Thread Wesley Bland
es some code refactoring, which should > probably be done once in the trunk instead of two possibly disruptive > commits. > > -- Josh > > On Fri, Jun 17, 2011 at 5:18 PM, Wesley Bland <wbl...@eecs.utk.edu> wrote: >> This is a reminder that the Resilient ORTE RFC is

Re: [OMPI devel] RFC: Resilient ORTE

2011-06-08 Thread Wesley Bland
On Tuesday, June 7, 2011 at 4:55 PM, Josh Hursey wrote: - orte_errmgr.post_startup() start the persistent RML message. There does not seem to be a shutdown version of this (to deregister the RML message at orte_finalize time). Was this intentional, or just missed? I just missed that one. I've

Re: [OMPI devel] RFC: Resilient ORTE

2011-06-07 Thread Wesley Bland
Definitely we are targeting ORTED failures here. If an ORTED fails than any other ORTEDs connected to it will notice and report the failure. Of course if the failure is an application than the ORTED on that node will be the only one to detect it. Also, if an ORTED is lost, all of the

Re: [OMPI devel] RFC: Resilient ORTE

2011-06-07 Thread Wesley Bland
> > Perhaps it would help if you folks could provide a little explanation about > how you use epoch? While the value sounds similar, your explanations are > beginning to sound very different from what we are doing and/or had > envisioned. > > I'm not sure how you can talk about an epoch

Re: [OMPI devel] RFC: Resilient ORTE

2011-06-07 Thread Wesley Bland
On Tuesday, June 7, 2011 at 12:14 PM, Ralph Castain wrote: > > > On Tue, Jun 7, 2011 at 9:45 AM, Wesley Bland <wbl...@eecs.utk.edu > (mailto:wbl...@eecs.utk.edu)> wrote: > > To adress your concerns about putting the epoch in the process name > > structure, p

Re: [OMPI devel] RFC: Resilient ORTE

2011-06-07 Thread Wesley Bland
merge conflicts with my state machine branch, which would be > ready for commit in the same time frame, but I'll hold off on that one and > deal with the merge issues on my side. > > > > On Tue, Jun 7, 2011 at 8:46 AM, Wesley Bland <wbl...@eecs.utk.edu > (mailto:wbl.

Re: [OMPI devel] Open MPI Developers Meeting Agenda

2011-05-02 Thread Wesley Bland
Josh, Do you have a time that the meetings will be starting tomorrow for the Open MPI meeting? I'm sorry if I've missed it on the list. Thanks, Wesley Bland On Wed, Apr 6, 2011 at 11:09 AM, Joshua Hursey <jjhur...@open-mpi.org>wrote: > Reminder: > If you are interested in attendi

Re: [OMPI devel] [OMPI svn] svn:open-mpi r23628

2010-08-19 Thread Wesley Bland
So just to clarify, this means that we don't need to worry about having more than one errmgr module handling a single failure and therefore don't have to set the stack_state (which is now gone anyway). Am I reading this correctly? Thanks, Wesley On Thu, Aug 19, 2010 at 9:09 AM,

Re: [OMPI devel] RFC: Merge tmp fault recovery branch into trunk

2010-03-10 Thread Wesley Bland
Josh, You mentioned some MCA parameters that you would include in the email, but I don't see those parameters anywhere. Could you please put those in here to make testing easier for people. Wesley On Wed, Mar 10, 2010 at 1:26 PM, Josh Hursey wrote: > Yesterday evening

Problem with Snow Leopard and gnuplot

2009-10-08 Thread Wesley Bland
I'm having a problem installing gnuplot on my Snow Leopard box from macports. I've already done a port upgrade outdated to make sure everything is up to date and I've installed p5-libintl-perl, but I'm still getting this error: $ sudo port install gnuplot --- Computing dependencies for

[Oscar-devel] OSCAR trunk/branch-5.1 audit

2008-04-18 Thread Wesley Bland
since that's not going to be useful information. Wesley Bland Differences between trunk and branch 5-1 Asterisks should be noted. Asterisked files should probably be reviewed again to get a closer look. scripts/create_and_populate_basic_node_info Changed calls to oscar_log to normal print

Re: [Oscar-devel] Problems with install fc7

2007-08-03 Thread Wesley Bland
Sorry about that. That bug was mine. I didn't do enough error checking before I checked that bit of code in. The problem should be fixed now. I didn't do a full OSCAR install, but I'm pretty sure that it's ok. Wesley Geoffroy Vallée wrote: Wesley, Can you check if the problem is caused

[Oscar-devel] ODA Changes

2007-08-01 Thread Wesley Bland
anyway, it seems like a good time to check it in. For more information about OPM see the wiki (http://svn.oscar.openclustergroup.org/trac/oscar/wiki/OPM) or email me. Wesley -- Wesley Bland ORNL Summer Intern v3.0 [EMAIL PROTECTED

Re: [Oscar-devel] About Package Sets

2007-07-20 Thread Wesley Bland
/ ___ Oscar-devel mailing list Oscar-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/oscar-devel -- Wesley Bland ORNL Summer Intern v3.0 [EMAIL PROTECTED

[Oscar-devel] OPKG Version Comparisons

2007-06-15 Thread Wesley Bland
. Any thoughts? -- Wesley Bland ORNL Summer Intern v3.0 [EMAIL PROTECTED] - This SF.net email is sponsored by DB2 Express Download DB2 Express C - the FREE version of DB2 express and take control of your XML. No limits. Just

[Oscar-devel] Package Set Manager

2007-03-11 Thread Wesley Bland
to see if this can be a simple addition to the existing library. Feel free to send me feedback on the lists or by posting something in the wiki. Thanks in advance for any assistance you guys give me this semester and into the Summer. Wesley Bland [EMAIL PROTECTED

Re: [Oscar-devel] Package Set Manager

2007-03-11 Thread Wesley Bland
Yeah. I've been working on OSCAR with the ORNL guys for a couple of years now. Usually I'm on site there, although I won't be until May this year. Michael Edwards wrote: I assume this is separate from the Google summer of code? On 3/11/07, Wesley Bland [EMAIL PROTECTED] wrote: Hi, Last

[Oscar-devel] PBS

2006-07-31 Thread Wesley Bland
. Anyone have any insight on this? Can I take PBS out of the requires list for torque? BTW, this problem isn't just in the CLI. I dumped the data in the GUI as well and it's there too. The only difference is the output was suppressed somewhere (never did find that part). Wesley Bland [EMAIL

Re: [Oscar-devel] [Oscar-checkins] r5238 - trunk/lib/OSCAR

2006-07-28 Thread Wesley Bland
The issue here was emulating a check box in a console, not representing true and false in the code. That would be confusing if I was trying to do that in the code, but this is just to print. Wesley Erich Focht wrote: On Thursday 27 July 2006 19:24, Bernard Li wrote: Hey Wesley:

Re: [Oscar-devel] [Oscar-checkins] r5081 - trunk/lib/OSCAR

2006-06-30 Thread Wesley Bland
Sorry about the blank. Thanks for the fix. It works fine now. Wesley [EMAIL PROTECTED] wrote: Author: efocht Date: 2006-06-29 17:46:11 -0400 (Thu, 29 Jun 2006) New Revision: 5081 Modified: trunk/lib/OSCAR/Configbox.pm Log: Added {} around the $values{$key} variable to avoid nasty

Re: [Oscar-devel] [Oscar-checkins] r5081 - trunk/lib/OSCAR

2006-06-30 Thread Wesley Bland
[EMAIL PROTECTED] wrote: Author: efocht Date: 2006-06-29 17:46:11 -0400 (Thu, 29 Jun 2006) New Revision: 5081 Modified: trunk/lib/OSCAR/Configbox.pm Log: Added {} around the $values{$key} variable to avoid nasty error message under fc5. Defined $context as if undefined to reduce

Re: [Oscar-devel] [Oscar-checkins] r5066 - in trunk: lib/OSCAR packages/ganglia/scripts packages/kernel_picker/scripts packages/ntpconfig/scripts packages/oda/scripts packages/sge/scripts packages/swi

2006-06-29 Thread Wesley Bland
I've been having problems installing since this update. I'm using FC4 with make test from the most current trunk. I also tried last night's nightly just to confirm. Here's the output I'm getting in oscar_wizard: =

[Oscar-devel] oscar_wizard

2006-06-14 Thread Wesley Bland
it anywhere. Is this just stale code that hasn't been removed or am I just missing something? Thanks, Wesley -- Wesley Bland 865-576-5508 [EMAIL PROTECTED] ___ Oscar-devel mailing list Oscar-devel@lists.sourceforge.net https://lists.sourceforge.net/lists

Re: [Oscar-devel] Configurator

2006-06-14 Thread Wesley Bland
So this approach sounds good to me, but there are a few things that I want to know before I dive into this thing simply because I don't have enough time here this summer to be able to address all these issues: Would this still work for the current GUI or are there going to be some major

[Oscar-devel] Configurator

2006-06-08 Thread Wesley Bland
input you have and keep an eye on the wiki page to keep up with progress on the CLI. Thanks, Wesley -- Wesley Bland 576-5508 [EMAIL PROTECTED] ___ Oscar-devel mailing list Oscar-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo

Re: [Oscar-devel] Configurator

2006-06-08 Thread Wesley Bland
on the command line in the fashion that I am looking for (not the same as what lynx would do). Feel free to take a look at the wiki to see more details of the CLI project. Wesley On Thursday 08 June 2006 10:54 am, Ted Powell wrote: On Thu, Jun 08, 2006 at 09:02:36AM -0400, Wesley Bland wrote

Re: [Oscar-devel] Configurator

2006-06-08 Thread Wesley Bland
Sorry about that, here's the URL: https://svn.oscar.openclustergroup.org/trac/oscar/wiki/DevelDocs My stuff is under Command Line Interface. Wesley On Thursday 08 June 2006 12:07 pm, Ted Powell wrote: On Thu, Jun 08, 2006 at 09:02:36AM -0400, Wesley Bland wrote: [...] Please send me any

Re: [Oscar-devel] Configurator

2006-06-08 Thread Wesley Bland
PROTECTED] On Behalf Of Wesley Bland Sent: Thursday, June 08, 2006 6:03 To: oscar-devel@lists.sourceforge.net Subject: [Oscar-devel] Configurator As you may have seen this summer I'm working on making a CLI for the OSCAR Installer.  As I have moved into the configurator step (Step 2

[Oscar-devel] Configurator schema

2006-06-08 Thread Wesley Bland
go ahead and begin writing the configurator with the specifications I've put on the wiki. -- Wesley Bland 576-5508 [EMAIL PROTECTED] ___ Oscar-devel mailing list Oscar-devel@lists.sourceforge.net https://lists.sourceforge.net/lists/listinfo/oscar-devel

<    1   2