Re: [OMPI users] mpirun (signal 15 Termination)

2009-01-16 Thread Jeff Squyres
Is there a local system administrator that you can talk to about this? It may be a lot easier to talk to someone face-to-face about this stuff rather than talking to a bunch of people that you don't know on a mailing list, and none of us are familiar with your local environment, what tools

Re: [OMPI users] mpirun (signal 15 Termination)

2009-01-15 Thread Hana Milani
Dear all, 1. I have not run it with debugger, could you tell me how to do it? 2. How can I make sure that it is or it is not killing my job. siorry if my questions seems wierd. But I have to solve the problem immediately. Thanks for helping me

Re: [OMPI users] mpirun (signal 15 Termination)

2009-01-15 Thread Jeff Squyres
Have you checked to ensure that the job manager is not killing your job? As I mentioned yesterday, SIGTERM is usually when some external agent kills your job. On Jan 15, 2009, at 3:39 AM, Hana Milani wrote: please tell me how to get rid of the message and how to run the parallel job? I

Re: [OMPI users] mpirun (signal 15 Termination)

2009-01-15 Thread jody
Without any details it's difficult to make a diagnosis, but it looks like one of your processes crashes, perhaps from a segmentation fault . Have you run it with a debugger? Jody On Thu, Jan 15, 2009 at 9:39 AM, Hana Milani wrote: > please tell me how to get rid of the message and how to run th

Re: [OMPI users] mpirun (signal 15 Termination)

2009-01-15 Thread Hana Milani
please tell me how to get rid of the message and how to run the parallel job? I have another code running directly by mpirun without a problem, but this one that needed blacs and scalapack is palying with me. please if there is any solution let me have it. Regards, hana

Re: [OMPI users] mpirun (signal 15 Termination) urgent

2009-01-15 Thread Hana Milani
Hello Simon, For running the program in parallel, I write: mpirun -np 4 ~/program output It takes a second that I receive the message: mpirun noticed that job rank 0 with PID 9477 on node linux-4pel exited on signal 15 (Terminated). and at the end of the output file, I receive: "3 additiona

Re: [OMPI users] mpirun (signal 15 Termination)

2009-01-14 Thread Jeff Squyres
Signal 15 (i.e., SIGTERM) can also mean that a job scheduler killed you (e.g., if your job ran out of time). On Jan 14, 2009, at 11:46 AM, Simon Hammond wrote: Do you get anything else in the output? We sometimes get this but it means the execution has proceeded sucessfully. Si Hammond Hi

Re: [OMPI users] mpirun (signal 15 Termination)

2009-01-14 Thread Hana Milani
Hello Simon, For running the program in parallel, I write: mpirun -np 4 ~/program output It takes a second that I receive the message: mpirun noticed that job rank 0 with PID 9477 on node linux-4pel exited on signal 15 (Terminated). and at the end of the output file, I receive: "3 addition

Re: [OMPI users] mpirun (signal 15 Termination)

2009-01-14 Thread Simon Hammond
Do you get anything else in the output? We sometimes get this but it means the execution has proceeded sucessfully. Si Hammond High Performance Systems Group Department of Computer Science University of Warwick 2009/1/14 Hana Milani : > Dear all, > > I have to run my code in parallel, therefore

[OMPI users] mpirun (signal 15 Termination)

2009-01-14 Thread Hana Milani
Dear all, I have to run my code in parallel, therefore, I have installed openmpi-1.2.8 on a core2quad system with suse 11.0 linux and gfortran compiler. I have also downloaded blacs and scalapack from: http://www.netlib.org/scalapack/scalapack_installer.tgz. Everything has gone smoothly in ins