Re: [gmx-users] GPU-gromacs
On Oct 25, 2013, at 4:07 PM, aixintiankong aixintiank...@126.com wrote:
> Dear prof., I want to install gromacs on a multi-core workstation with a GPU (Tesla C2075). Should I install openmpi or mpich2?

If you want to run Gromacs on just one workstation with a single GPU, you do not need to install an MPI library at all!

Carsten

--
Dr. Carsten Kutzner
Max Planck Institute for Biophysical Chemistry
Theoretical and Computational Biophysics
Am Fassberg 11, 37077 Goettingen, Germany
Tel. +49-551-2012313, Fax: +49-551-2012302
http://www.mpibpc.mpg.de/grubmueller/kutzner
http://www.mpibpc.mpg.de/grubmueller/sppexa
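For a setup like this, a minimal sketch of the build and launch (assuming GROMACS 4.6 with CUDA already installed; paths and file names are placeholders):

  cmake .. -DGMX_GPU=ON            # the built-in thread-MPI is enabled by default, so no OpenMPI/MPICH is required
  make && make install
  mdrun -s topol.tpr -deffnm md    # on one node, mdrun detects the single GPU and spreads work over all CPU cores itself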
Re: [gmx-users] GPU version of Gromacs
On 8/19/13 5:38 AM, grita wrote:
> Hey guys,
> Is it possible to run an SD simulation using the pull code in the GPU version of Gromacs?

Have you tried it?

-Justin

--
Justin A. Lemkul, Ph.D.
Postdoctoral Fellow
Department of Pharmaceutical Sciences
School of Pharmacy
Health Sciences Facility II, Room 601
University of Maryland, Baltimore
20 Penn St.
Baltimore, MD 21201
jalem...@outerbanks.umaryland.edu | (410) 706-7441
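For what it is worth, the pull code runs on the CPU while only the non-bonded kernels are offloaded, so nothing obvious forbids combining it with the sd integrator on a GPU build. A minimal, untested sketch of the relevant mdp lines (group names are made up for illustration):

  integrator    = sd
  cutoff-scheme = Verlet      ; required for native GPU acceleration in 4.6
  pull          = umbrella
  pull_geometry = distance
  pull_ngroups  = 1
  pull_group0   = reference   ; hypothetical index group
  pull_group1   = pulled      ; hypothetical index group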
Re: [gmx-users] GPU metadynamics
On 08/15/2013 11:21 AM, Jacopo Sgrignani wrote:
> Dear Albert
> to run parallel jobs on multiple GPUs you should use something like this:
> mpirun -np (number of parallel MPI processes) mdrun_mpi ... -gpu_id
> so you will have 4 processes for the 4 GPUs.
> Jacopo

Thanks a lot for the reply, but there is some problem with the following command:

mpirun -np 4 mdrun_mpi -s md.tpr -v -g md.log -o md.trr -x md.xtc -plumed plumed2.dat -e md.edr -gpu_id 0123

---log---
4 GPUs detected on host node3:
  #0: NVIDIA GeForce GTX 690, compute cap.: 3.0, ECC: no, stat: compatible
  #1: NVIDIA GeForce GTX 690, compute cap.: 3.0, ECC: no, stat: compatible
  #2: NVIDIA GeForce GTX 690, compute cap.: 3.0, ECC: no, stat: compatible
  #3: NVIDIA GeForce GTX 690, compute cap.: 3.0, ECC: no, stat: compatible

---
Program mdrun_mpi, VERSION 4.6.3
Source code file: /home/albert/install/source/gromacs-4.6.3/src/gmxlib/gmx_detect_hardware.c, line: 349

Fatal error:
Incorrect launch configuration: mismatching number of PP MPI processes and GPUs per node.
mdrun_mpi was started with 1 PP MPI process per node, but you provided 4 GPUs.

For more information and tips for troubleshooting, please check the GROMACS website at http://www.gromacs.org/Documentation/Errors
---
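The error says that only one PP rank was started while four GPU ids were supplied, so the two counts have to match. A hedged sketch of two ways to fix it, assuming the node has enough cores (the -deffnm form simply abbreviates the explicit output flags above):

  # make sure four MPI ranks are really launched, one per GPU:
  mpirun -np 4 mdrun_mpi -s md.tpr -deffnm md -plumed plumed2.dat -gpu_id 0123

  # or, with a thread-MPI (non-MPI) mdrun binary on the same node:
  mdrun -ntmpi 4 -gpu_id 0123 -s md.tpr -deffnm md -plumed plumed2.dat

If the first form still reports a single PP rank per node, the mpirun on the PATH probably belongs to a different MPI library than the one mdrun_mpi was built against.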
Re: [gmx-users] GPU + surface
Hello. Have you removed periodicity? You may only be seeing traversal of water molecules among copies of the periodic system.

Lucio Montero
Ph.D. student, Instituto de Biotecnologia, UNAM, Mexico

On 08/08/13 07:39, Ondrej Kroutil wrote:
> Dear GMX users.
> I have run a simulation of ions and water near a quartz surface (ClayFF) using a GPU (GTX 580) and Gromacs (4.6.1, single precision, 64 bit, SSE4.1, fftw-3.3.3) and have observed strange behavior of the water and ions. It is an NVT simulation with frozen surface atoms (see the .mdp below) and a negative charge on the surface (deprotonated silanols); the system is overall neutral. I used the same mdp for the normal CPU simulation and the GPU simulation, and just added the -testverlet option for the GPU run.
> In the CPU simulation ions and water behaved as expected (see http://i1315.photobucket.com/albums/t587/Andrew_Twister/cpu-simul_zpscf784b46.png), but in the GPU simulation there was a visible flow of ions toward the image of the lower surface, and all water molecules were oriented with hydrogens facing downward and oxygens facing upward (see http://i1315.photobucket.com/albums/t587/Andrew_Twister/gpu-simul_zps2c160ea6.png). It looks as if an electric field were applied, but there is none.
> Do you think there is a problem in the initial setup of parameters in the mdp file? Or maybe a problem with the freeze groups? With no freezing the situation is better, but there is still a visible flow and pairing of like ions (see http://i1315.photobucket.com/albums/t587/Andrew_Twister/gpu-no_freeze_zps72ef3938.png). It looks like an electrostatics problem. Do you have any hints, please? And sorry if I missed a similar topic on the mailing list, but I couldn't find anything similar.
> Ondrej Kroutil
>
> integrator           = md
> dt                   = 0.001
> nsteps               = 10
> comm_mode            = linear
> nstcomm              = 1000
> nstxout              = 0
> nstxtcout            = 1000
> nstvout              = 0
> nstfout              = 0
> nstlog               = 1000
> xtc_precision        = 1
> nstlist              = 10
> ns_type              = grid
> rlist                = 1.2
> coulombtype          = PME
> rcoulomb             = 1.2
> rvdw                 = 1.2
> constraints          = hbonds
> constraint_algorithm = lincs
> lincs_iter           = 1
> fourierspacing       = 0.1
> pme_order            = 4
> ewald_rtol           = 1e-5
> ewald_geometry       = 3dc
> optimize_fft         = yes
> ; Nose-Hoover temperature coupling
> Tcoupl               = nose-hoover
> tau_t                = 1
> tc_grps              = system
> ref_t                = 298.15
> ; No pressure coupling
> ; Pcoupl             = Parrinello-Rahman
> pcoupltype           = semiisotropic
> tau_p                = 1.0
> compressibility      = 0 4.6e-5
> ref_p                = 0 1.0
> ; OTHER
> periodic_molecules   = no
> pbc                  = xyz
> ;energygrps          = SOL SOH
> freezegrps           = BULK
> freezedim            = Y Y Y
> gen_vel              = yes
> gen_temp             = 298.15
> gen_seed             = -1
RE: [gmx-users] GPU + surface
Hi,

The -testverlet option is only for testing (as the name implies). Please set the mdp option cutoff-scheme = Verlet. Also please update to 4.6.3, as this potential issue might already have been resolved. With the Verlet scheme the CPU and GPU should give the same result, correct or incorrect.

Could it be that your system is located partially above and partially below z=0? This will cause problems with ewald-geometry = 3dc. To use this option you need to ensure your whole system is in the same periodic image.

Cheers,
Berk
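For reference, a minimal sketch of the mdp change Berk asks for (option names as in the 4.6 manual; the cut-off values themselves stay whatever the force field prescribes):

  cutoff-scheme = Verlet    ; native GPU path in 4.6, replacing the -testverlet hack
  nstlist       = 10        ; mdrun may raise this automatically for GPU runs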
Re: Re: Re: [gmx-users] GPU-based workstation
I may be late with the reply, but here are my 2 cents.

If you need a single very fast machine (i.e. maximum single-simulation performance), you should get:
- either a very fast desktop CPU: an i7-3930, or for 2x more the 3970, which, BTW, I think is not worth it ($600-1000);
- or 1-2 fast Xeon E5s; depending on how many and which, these will be $1k-2k each.

For a single-CPU setup two Titans may be overkill, and (at least with the current code) you may get very little extra performance from using two instead of one GPU. With a dual-socket machine (and decently fast CPUs), if you have a large enough input system, two GPUs will work nicely.

However, if you care about total simulation throughput and you have multiple simulations to run, I'd suggest that you buy 2-3 machines with the components that give the best ns/day/$: something like an i7-4670 or 4770 with a GTX 680/770 (or 780).

--
Szilárd

On Thu, Jun 27, 2013 at 1:01 PM, James Starlight jmsstarli...@gmail.com wrote:
> Back to my question: I want to build a GPU-based workstation with two GeForce Titans. My current budget only allows a high-end 6-core i7-3930 and a motherboard with 5 PCI-E slots (like the Asus Rampage IV series). Would this system be balanced with two GPUs? Should I use two 6-8 core Xeons instead of the i7?
> James
Re: [gmx-users] gpu cluster explanation
Hi Richard,

Thank you for the help, and sorry for the delay in my reply. I tried some test runs changing some parameters (e.g. removing PME) and was able to reach 20 ns/day, so I think that 9-11 ns/day is about the maximum I can obtain for my setup.

Thank you again for your help.

Cheers,
Fra
Re: [gmx-users] gpu cluster explanation
On 12/07/13 13:26, Francesco wrote:
> Hi all,
> I'm working with a 200K-atom system (protein + explicit water), and after a while using a CPU cluster I had to switch to a GPU cluster. I read both the "Acceleration and parallelization" and the Gromacs-GPU documentation pages (http://www.gromacs.org/Documentation/Acceleration_and_parallelization and http://www.gromacs.org/Documentation/Installation_Instructions_4.5/GROMACS-OpenMM), but it's a bit confusing and I need help to check whether I have really understood correctly. :)
> I have 2 types of nodes:
> - 3 GPUs (NVIDIA Tesla M2090) and 2 CPUs with 6 cores each (Intel Xeon E5649 @ 2.53 GHz)
> - 8 GPUs and 2 CPUs (6 cores each)
> 1) I can only have 1 MPI rank per GPU, meaning that with 3 GPUs I can have at most 3 MPI ranks.
> 2) Because I have 12 cores I can open 4 OpenMP threads per MPI rank, because 4x3 = 12.
> Now, if I have a node with 8 GPUs, I can use 4 GPUs: 4 MPI ranks and 3 OpenMP threads. Is that right? Is it possible to use 8 GPUs and only 8 cores?

You could set -ntomp 0 and set up MPI/thread-MPI to use 8 cores. However, a system that unbalanced (a huge amount of GPU power to comparatively little CPU power) is unlikely to get great performance.

> Using gromacs 4.6.2 and 144 CPU cores I reach 35 ns/day, while with 3 GPUs and 12 cores I get 9-11 ns/day.

That slowdown is in line with what I got when I tried a similar CPU-GPU setup. That said, others might have some advice that will improve your performance.

> The command that I use is:
>   mdrun -dlb yes -s input_50.tpr -deffnm 306s_50 -v
> with the number of GPUs set via the queueing script: #BSUB -n 3
> I also tried to set -npme / -nt / -ntmpi / -ntomp, but nothing changes.
> The mdp file and some statistics follow:
>
> START MDP
> title       = G6PD wt molecular dynamics (2bhl.pdb) - NPT MD
> ; Run parameters
> integrator  = md        ; Algorithm options
> nsteps      = 2500      ; maximum number of steps to perform [50 ns]
> dt          = 0.002     ; 2 fs = 0.002 ps
> ; Output control
> nstxout     = 1         ; [steps] freq to write coordinates to trajectory; the last coordinates are always written
> nstvout     = 1         ; [steps] freq to write velocities to trajectory; the last velocities are always written
> nstlog      = 1         ; [steps] freq to write energies to log file; the last energies are always written
> nstenergy   = 1         ; [steps] write energies to disk every nstenergy steps
> nstxtcout   = 1         ; [steps] freq to write coordinates to xtc trajectory
> xtc_precision = 1000    ; precision to write to xtc trajectory (1000 = default)
> xtc_grps    = system    ; which coordinate group(s) to write to disk
> energygrps  = system    ; which energy group(s) to write
> ; Bond parameters
> continuation = yes      ; restarting from npt
> constraints  = all-bonds ; bond types to replace by constraints
> constraint_algorithm = lincs ; holonomic constraints
> lincs_iter   = 1        ; accuracy of LINCS
> lincs_order  = 4        ; also related to accuracy
> lincs_warnangle = 30    ; [degrees] maximum angle that a bond can rotate before LINCS will complain

That seems a little loose for constraints, but setting that up and checking that it conserves energy and preserves bond lengths is something you'll have to do yourself.

Richard

> ; Neighborsearching
> ns_type       = grid    ; method of updating neighbor list
> cutoff-scheme = Verlet
> nstlist       = 10      ; [steps] frequency to update neighbor list (10)
> rlist         = 1.0     ; [nm] cut-off distance for the short-range neighbor list (1 default)
> rcoulomb      = 1.0     ; [nm] long range electrostatic cut-off
> rvdw          = 1.0     ; [nm] long range Van der Waals cut-off
> ; Electrostatics
> coulombtype   = PME     ; treatment of long range electrostatic interactions
> vdwtype       = cut-off ; treatment of Van der Waals interactions
> ; Periodic boundary conditions
> pbc           = xyz
> ; Dispersion correction
> DispCorr      = EnerPres ; applying long range dispersion corrections
> ; Ewald
> fourierspacing = 0.12   ; grid spacing for FFT - controls the highest magnitude of wave vectors (0.12)
> pme_order     = 4       ; interpolation order for PME, 4 = cubic
> ewald_rtol    = 1e-5    ; relative strength of Ewald-shifted potential at rcoulomb
> ; Temperature coupling
> tcoupl        = nose-hoover ; temperature coupling with Nose-Hoover ensemble
> tc_grps       = Protein Non-Protein
> tau_t         = 0.4 0.4 ; [ps] time constant
> ref_t         = 310 310 ; [K] reference temperature for coupling
> ; Pressure coupling
> pcoupl        = parrinello-rahman
> pcoupltype    =
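To make the rank/thread mapping explicit instead of relying on defaults, a hedged sketch for the 3-GPU/12-core nodes described above (this assumes the thread-MPI build; with an external-MPI build the three ranks would come from mpirun -np 3 and -ntomp stays the same):

  mdrun -ntmpi 3 -ntomp 4 -gpu_id 012 -dlb yes -s input_50.tpr -deffnm 306s_50 -v
  # 3 PP ranks, one per Tesla M2090; 3 ranks x 4 OpenMP threads = all 12 cores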
Re: Re: Re: [gmx-users] GPU-based workstation
Back to my question: I want to build a GPU-based workstation with two GeForce Titans. My current budget only allows a high-end 6-core i7-3930 and a motherboard with 5 PCI-E slots (like the Asus Rampage IV series). Would this system be balanced with two GPUs? Should I use two 6-8 core Xeons instead of the i7?

James
Re: [gmx-users] GPU / CPU load imbalance
On 6/25/13 6:33 PM, Dwey wrote:
> Hi gmx-users,
> I used an 8-core AMD CPU with a GTX 680 GPU (1536 CUDA cores) to run an example of umbrella sampling provided by Justin. I am happy that GPU acceleration indeed reduces the computation time significantly in this example (from 34 hours to 7 hours). However, I found a NOTE on the screen like:
>
>   The GPU has 20% more load than the CPU. This imbalance causes
>   performance loss, consider using a shorter cut-off and a finer PME grid
>
> Given a 20% load imbalance, I wonder if someone can give suggestions as to how to avoid performance loss, in terms of hardware (GPU/CPU) improvement or modification of the mdp file (see below).

I would avoid tweaking the .mdp settings. There have been several reports where people hacked at nonbonded cutoffs to get better performance, and it resulted in totally useless output. These settings are part of the force field. Avoid changing them.

> In terms of hardware, does this NOTE suggest that I should use a higher-capacity GPU like a GTX 780 (2304 CUDA cores) to balance the load or catch up in speed? If so, would it help to add another GTX 680 card in the same box? Or will that cause a GPU/CPU load imbalance again, with two GPUs waiting for the 8-core CPU?

There has been a lot of discussion on hardware, GPU/CPU balancing, etc. in recent days. Please check the archive. Some of the threads are quite detailed.

-Justin

> Second:
>
>   Force evaluation time GPU/CPU: 4.006 ms/2.578 ms = 1.554
>   For optimal performance this ratio should be close to 1
>
> I have no idea how this is evaluated, with 4.006 ms and 2.578 ms for GPU and CPU time, respectively. It would be very helpful to modify the attached mdp for a better load balance between GPU and CPU. I appreciate kind advice and hints to improve this mdp file.
> Thanks,
> Dwey
>
> ### courtesy of Justin ###
> title       = Umbrella pulling simulation
> define      = -DPOSRES_B
> ; Run parameters
> integrator  = md
> dt          = 0.002
> tinit       = 0
> nsteps      = 500       ; 10 ns
> nstcomm     = 10
> ; Output parameters
> nstxout     = 5         ; every 100 ps
> nstvout     = 5
> nstfout     = 5000
> nstxtcout   = 5000      ; every 10 ps
> nstenergy   = 5000
> ; Bond parameters
> constraint_algorithm = lincs
> constraints = all-bonds
> continuation = yes
> ; Single-range cutoff scheme
> nstlist     = 5
> ns_type     = grid
> rlist       = 1.4
> rcoulomb    = 1.4
> rvdw        = 1.4
> ; PME electrostatics parameters
> coulombtype = PME
> fourierspacing = 0.12
> fourier_nx  = 0
> fourier_ny  = 0
> fourier_nz  = 0
> pme_order   = 4
> ewald_rtol  = 1e-5
> optimize_fft = yes
> ; Temperature coupling is on in two groups
> Tcoupl      = Nose-Hoover
> tc_grps     = Protein Non-Protein
> tau_t       = 0.5 0.5
> ref_t       = 310 310
> ; Pressure coupling is on
> Pcoupl      = Parrinello-Rahman
> pcoupltype  = isotropic
> tau_p       = 1.0
> compressibility = 4.5e-5
> ref_p       = 1.0
> refcoord_scaling = com
> ; Generate velocities is off
> gen_vel     = no
> ; Periodic boundary conditions are on in all directions
> pbc         = xyz
> ; Long-range dispersion correction
> DispCorr    = EnerPres
> cutoff-scheme = Verlet
> ; Pull code
> pull        = umbrella
> pull_geometry = distance
> pull_dim    = N N Y
> pull_start  = yes
> pull_ngroups = 1
> pull_group0 = Chain_B
> pull_group1 = Chain_A
> pull_init1  = 0
> pull_rate1  = 0.0
> pull_k1     = 1000      ; kJ mol^-1 nm^-2
> pull_nstxout = 1000     ; every 2 ps
> pull_nstfout = 1000     ; every 2 ps

--
Justin A. Lemkul, Ph.D.
Research Scientist
Department of Biochemistry
Virginia Tech
Blacksburg, VA
jalemkul[at]vt.edu | (540) 231-9080
http://www.bevanlab.biochem.vt.edu/Pages/Personal/justin
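One knob that does not touch the force-field settings is mdrun's own PME tuning, which scales rcoulomb and the PME grid together at constant accuracy to shift short-range work between GPU and CPU. It is on by default in 4.6; a hedged sketch (file name hypothetical):

  mdrun -s umbrella.tpr -deffnm umbrella -gpu_id 0 -tunepme   # -notunepme would disable the automatic balancing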
Re: [gmx-users] GPU ECC question
On Sat, Jun 8, 2013 at 9:21 PM, Albert mailmd2...@gmail.com wrote:
> Hello:
> Recently I found a strange issue with Gromacs-4.6.2 on a GPU workstation. On my GTX 690 machine, when I run an MD production I found that ECC is on. However, on another GTX 590 machine, ECC was off:
>
> 4 GPUs detected:
>   #0: NVIDIA GeForce GTX 590, compute cap.: 2.0, ECC: no, stat: compatible
>   #1: NVIDIA GeForce GTX 590, compute cap.: 2.0, ECC: no, stat: compatible
>   #2: NVIDIA GeForce GTX 590, compute cap.: 2.0, ECC: no, stat: compatible
>   #3: NVIDIA GeForce GTX 590, compute cap.: 2.0, ECC: no, stat: compatible
>
> Moreover, there are only two GTX 590s in the machine; I don't know why Gromacs claimed 4 GPUs detected. However, on another Linux machine which also has two GTX 590s, Gromacs-4.6.2 only finds 2 GPUs, and ECC is still off. I am just wondering:
>
> (1) Why can ECC be on with the GTX 690 while it is off on my GTX 590? I compiled Gromacs with the same options and the same version of the Intel compiler.

Unless your 690 is in fact a Tesla K10, it surely does not support ECC! Note that ECC is not something I personally think you really need.

> (2) Why, on two machines each physically containing two GTX 590 cards, is one detected with 4 GPUs while the other is claimed to contain only two?

Both the GTX 590 and the 690 are dual-chip boards, which means two independent processing units with their own memory are mounted on the same card and connected by a PCI switch (NVIDIA NF200). Hence, the two GPUs on these dual-chip boards will be enumerated as separate devices. You can double-check this in nvidia-smi, which should show the same devices as what mdrun reports. I suspect that the machine which is shown to have only two GPUs suffers from some hardware or software issue.

Regards,
Szilard

> Thank you very much.
> Best,
> Albert
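To see the same enumeration outside GROMACS, nvidia-smi can be queried (a sketch; output details vary with driver version):

  nvidia-smi -L           # each chip of a dual-GPU GTX 590/690 board appears as its own device
  nvidia-smi -q -d ECC    # ECC state; consumer GeForce boards report it as unsupported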
Re: [gmx-users] GPU problem
Hi Albert,

I think using the -nt flag (e.g. -nt 16) with mdrun would solve your problem.

Chandan

--
Chandan Kumar Choudhury
NCL, Pune, INDIA

On Tue, Jun 4, 2013 at 12:56 PM, Albert mailmd2...@gmail.com wrote:
> Dear:
> I've got four GPUs in one workstation. I am trying to run two GPU jobs with the commands:
>
>   mdrun -s md.tpr -gpu_id 01
>   mdrun -s md.tpr -gpu_id 23
>
> There are 32 CPU cores in this workstation. I found that each job tries to use the whole CPU, and there were 64 threads when these two GPU mdrun jobs were submitted. Moreover, one of the jobs stopped after running for a short time, probably because of the CPU issue.
> I am just wondering, how can we distribute the CPU cores when we run two GPU jobs on a single workstation?
> Thank you very much.
> Best,
> Albert
Re: [gmx-users] GPU problem
On 06/04/2013 11:22 AM, Chandan Choudhury wrote:
> Hi Albert,
> I think using the -nt flag (e.g. -nt 16) with mdrun would solve your problem.
> Chandan

Thank you so much. It works well now.

ALBERT
Re: [gmx-users] GPU problem
-nt is mostly a backward-compatibility option and sets the total number of threads (per rank). Instead, you should set both -ntmpi (or -np with MPI) and -ntomp.

However, note that unless a single mdrun uses *all* cores/hardware threads on a node, it won't pin the threads to cores. Failing to pin threads will lead to considerable performance degradation; I just tried, and depending on how (un)lucky the thread placement and migration is, I get a 1.5-2x performance degradation when running two mdruns on a single dual-socket node without pinning threads.

My advice is (yet again) that you should check the http://www.gromacs.org/Documentation/Acceleration_and_parallelization wiki page, in particular the section on how to run simulations. If things are not clear, please ask for clarification - input and constructive criticism should help us improve the wiki. We have been patiently pointing everyone to the wiki, so asking without reading up first is neither productive nor really fair.

Cheers,
--
Szilárd
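As a concrete, hedged illustration of the above for Albert's case (two jobs on a 32-core, 4-GPU box; the tpr names are placeholders, and the offsets assume 32 real cores without hyperthreading):

  mdrun -ntmpi 2 -ntomp 8 -gpu_id 01 -pin on -pinoffset 0  -s md1.tpr -deffnm md1 &
  mdrun -ntmpi 2 -ntomp 8 -gpu_id 23 -pin on -pinoffset 16 -s md2.tpr -deffnm md2 &
  # each job uses 2 PP ranks x 8 OpenMP threads = 16 cores; -pinoffset keeps the two jobs on disjoint cores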
RE: [gmx-users] GPU problem
Dear All,

A stupid question: does anyone know of a script to convert a 53a6 force-field .top that only references the gromacs/top directory into something like a ligand .itp with explicit parameters? This would be useful at the moment. Example:

  [ bonds ]
  6 7 2 gb_5

to

  [ bonds ]
  ; ai aj fu c0, c1, ...
  6 7 2 0.139 1080.0 0.139 1080.0 ; C CH

for everything (a protein/DNA complex), inclusive of angles and dihedrals? I've been playing with some of the gromacs user-supplied files, but nothing yet.

Stephan Watkins
Re: [gmx-users] GPU problem
On 6/4/13 3:52 PM, lloyd riggs wrote:
> Dear All,
> A stupid question: does anyone know of a script to convert a 53a6 force-field .top that only references the gromacs/top directory into something like a ligand .itp with explicit parameters (e.g. gb_5 expanded to its bond length and force constant), for everything in a protein/DNA complex, inclusive of angles and dihedrals?

Sounds like something grompp -pp should take care of.

-Justin

--
Justin A. Lemkul, Ph.D.
Research Scientist
Department of Biochemistry
Virginia Tech
Blacksburg, VA
jalemkul[at]vt.edu | (540) 231-9080
http://www.bevanlab.biochem.vt.edu/Pages/Personal/justin
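For concreteness, a hedged sketch of what that looks like (file names are placeholders): grompp -pp writes out the fully pre-processed topology, with all #includes pulled in and the gb_*/ga_* macros expanded to numeric parameters, from which the ligand's [ moleculetype ] section can be copied into an .itp:

  grompp -f md.mdp -c conf.gro -p topol.top -pp processed.top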
Aw: Re: [gmx-users] GPU problem
Thanks, that's exactly what I was looking for.

Stephan

Sent: Tuesday, 4 June 2013, 22:28
From: Justin Lemkul jalem...@vt.edu
To: Discussion list for GROMACS users gmx-users@gromacs.org
Subject: Re: [gmx-users] GPU problem
> Sounds like something grompp -pp should take care of.
> -Justin
Re: Re: [gmx-users] GPU-based workstation
Dear all,

As far as I understand, the OP is interested in hardware for *running* GROMACS 4.6 rather than developing code or running LINPACK.

To get the best performance it is important to use a machine with hardware balanced for GROMACS' workloads. Too little GPU resources will result in CPU idling; too much GPU resources will lead to the runs being CPU or multi-GPU-scaling bound, and above a certain level GROMACS won't be able to make use of additional GPUs. Of course, the balance will depend both on the hardware and on the simulation settings (mostly the LJ cut-off used).

An additional factor to consider is typical system size. To reach near-peak pair-force throughput on GPUs you typically need 20k-40k particles/GPU (depending on the architecture), and throughput drops below these values. Hence, in most cases it is preferable to use fewer and faster GPUs rather than more.

Without knowing the budget and intended use of the machine it is hard to make suggestions, but I would say that for a budget desktop box a quad-core Intel Ivy Bridge or the top-end AMD Piledriver CPU with a fast Kepler GTX card (e.g. GTX 680 or GTX 770/780) should work well. If you're considering dual-socket workstations, I suggest you go with the higher core-count and higher-frequency Intel CPUs (6+ cores, 2.2+ GHz), otherwise you may not see as much benefit as you would expect based on the insane price tag (especially if you compare to an i7 3930K or its IVB successor).

Cheers,
--
Szilárd

On Sat, May 25, 2013 at 1:02 PM, lloyd riggs lloyd.ri...@gmx.ch wrote:
> The more RAM the better, and the best I have seen is a 4-GPU workstation. I can use/have used 4. A GPU takes 2 slots though, so a 7-8 slot PCIe board really holds 3-4 GPUs, except the Tyan mentioned (those are designed as blades, so an 8- or 10-slot board really holds 8 or 10 GPUs). There are cooling problems with GPUs, as they are packed together on the board, so extra cooling may help not to blow a GPU, but I would look for good ones (ask around), as it's a video-game market and they go for looks even though everything sits inside a case.
> The external RAM (not onboard GPU RAM) helps if you do a larger sim, but I don't know about performance; for the onboard GPU RAM, the more the merrier. So yes, on normal workstations you can get 4 GPUs on a 300 US$ board, but then the price goes way up (3-4000 US$ for an 8-10 GPU board). RAM ordered abroad is also cheap, 8 or 16 GB, compared to shop prices. I have used 4 GPUs, but only with test software, not Gromacs, so it would be nice to see performance. For a small 100-atom molecule and 500 solvent, using just the CPU I get a 1 ns sim to run in 5-10 minutes real time, but simple large runs (800 amino acids, 25,000 solvent) at NVT or NPT equilibration clock in at around 1 hour real time for, say, 50 ps.
> Stephan
>
> Sent: Saturday, 25 May 2013, 07:54
> From: James Starlight jmsstarli...@gmail.com
> To: Discussion list for GROMACS users gmx-users@gromacs.org
> Subject: Re: [gmx-users] GPU-based workstation
>
> Dear Dr. Watkins!
> Thank you for the suggestions!
> In the local shops I've found only Core i7 with 6 cores (like the Core i7-39xx) and with 4 cores. Should I obtain much better performance with 6 cores than with 4 cores in the case of an i7 CPU (assuming that I run the simulation in cpu+gpu mode)?
> Also, you've mentioned motherboards with 4 PCIe slots. Does this mean that a modern workstation could have 4 GPUs in one home-like desktop? According to my current task I suppose that 2 GPUs would be suitable for my simulations (assuming that I use a typical ASUS MB and a 650 W power unit). Has someone tried to use several GPUs in one workstation? What attributes of the MB should be taken into account for best performance on such a multi-GPU station?
> James
>
> 2013/5/25 lloyd riggs lloyd.ri...@gmx.ch
> There's also these, but 1 chip runs 6K US$; they can get performance up to 2.3 teraflops per chip, though double precision, but I have no clue about integration with GPUs. Intel also sells their chips on PCIe cards, but those get only about 350 Gflops and run 1K US$.
> http://en.wikipedia.org/wiki/Field-programmable_gate_array and vendor http://www.xilinx.com/
> They can design them to fit a PCIe slot and run about the same, but you still need the board, RAM, etc. Mostly just to dream about; they say you can order them with radiation shielding as well... so...
> Stephan Watkins
>
> Sent: Friday, 24 May 2013, 13:17
> From: James Starlight jmsstarli...@gmail.com
> To: Discussion list for GROMACS users gmx-users@gromacs.org
> Subject: [gmx-users] GPU-based workstation
>
> Dear Gromacs Users!
> I'd like to build a new workstation for performing simulations on GPU with Gromacs 4.6 native CUDA support. Recently I've used such a setup with a Core i5 CPU and an nvidia GTX 670 video card and obtained good performance (~20 ns/day for a typical 60,000-atom system with the SD integrator).
> Now I'd like to build a multi-GPU workstation. My question: how many GPUs would give me the best performance on a typical home-like workstation. What
Re: Aw: Re: [gmx-users] GPU-based workstation
On Sat, May 25, 2013 at 2:16 PM, Broadbent, Richard richard.broadben...@imperial.ac.uk wrote:
> I've been running on my university's GPU nodes; these have one Xeon E5 (6 cores, 12 threads) and 4 NVIDIA GTX 690s. My system is 93,000 atoms of DMF under NVE. The performance has been a little disappointing,

That sounds like a very imbalanced system for GROMACS: you have essentially 8 GPUs with rather poor PCI-E performance (the two chips on a board share a single PCI-E bus) and only 12 CPU cores to drive the simulation.

> ~10 ns/day. On my home system using a Core i5-2500 and an nvidia 560 Ti I get 5.4 ns/day for the same system. On our HPC system, using 32 nodes each with 2 quad-core Xeon processors, I get 30-40 ns/day.

That sounds somewhat low if these are all moderately fast CPUs and GPUs.

> I think that to achieve reasonable performance the system has to be balanced between CPUs and GPUs; probably getting 2 high-end GPUs and a top-end Xeon E5 or Core i7 would be a good choice.
> Richard

Indeed. Even two GPUs may be too much, unless the CPU in question is a very high-end i7 or E5.

Cheers,
--
Szilárd
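On such an unbalanced node, one pragmatic option is simply not to use all eight GPU chips. A hedged sketch with the 4.6 thread-MPI build, driving two chips with all twelve cores (whether ids 0 and 2 really sit on different boards should first be checked with nvidia-smi, since enumeration order is not guaranteed; the tpr name is a placeholder):

  mdrun -ntmpi 2 -ntomp 6 -gpu_id 02 -s dmf.tpr -deffnm dmf
  # 2 PP ranks x 6 OpenMP threads = 12 cores, one rank per selected GPU chip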
Aw: Re: Re: [gmx-users] GPU-based workstation
Dear Dr. Páll,

Thank you,

Stephan Watkins

Sent: Tuesday, 28 May 2013, 19:50
From: Szilárd Páll szilard.p...@cbr.su.se
To: Discussion list for GROMACS users gmx-users@gromacs.org
Subject: Re: Re: [gmx-users] GPU-based workstation
Re: Re: Re: [gmx-users] GPU-based workstation
Dear Dr. Pall! Thank you for your suggestions! Assuming that I have a budget of 5000 $ and want to build a GPU-based desktop with that money. Previously I've used a single 4-core i5 with a GTX 670 and obtained on average 10 ns/day for 70k-atom systems (1.0 nm cutoffs, no virtual sites, sd integrator). Now I'd like to build a system based on 2 high-end GeForces (e.g. like the TITAN). Should that system include 2 CPUs for good balancing? (e.g. two 6-core Xeons with faster clocks could, for instance, be better for simulations than an i7, couldn't they?) What additional properties of the MB should I consider for such a system? James

2013/5/28 lloyd riggs lloyd.ri...@gmx.ch Dear Dr. Pali, Thank you, Stephan Watkins *Gesendet:* Dienstag, 28. Mai 2013 um 19:50 Uhr *Von:* Szilárd Páll szilard.p...@cbr.su.se *An:* Discussion list for GROMACS users gmx-users@gromacs.org *Betreff:* Re: Re: [gmx-users] GPU-based workstation

Dear all, As far as I understand, the OP is interested in hardware for *running* GROMACS 4.6 rather than developing code or running LINPACK. To get the best performance it is important to use a machine with hardware balanced for GROMACS' workloads. Too little GPU capacity will result in CPU idling; too much GPU capacity will lead to the runs being CPU or multi-GPU scaling bound, and above a certain level GROMACS won't be able to make use of additional GPUs. Of course, the balance will depend both on hardware and simulation settings (mostly the LJ cut-off used). An additional factor to consider is typical system size. To reach near-peak pair-force throughput on GPUs you typically need 20k-40k particles/GPU (depending on the architecture), and throughput drops below these values. Hence, in most cases it is preferable to use fewer and faster GPUs rather than more. Without knowing the budget and intended use of the machine it is hard to make suggestions, but I would say for a budget desktop box a quad-core Intel Ivy Bridge or the top-end AMD Piledriver CPU with a fast Kepler GTX card (e.g. GTX 680 or GTX 770/780) should work well. If you're considering dual-socket workstations, I suggest you go with the higher core-count and higher frequency Intel CPUs (6+ cores, 2.2 GHz), otherwise you may not see as much benefit as you would expect based on the insane price tag (especially if you compare to an i7 3930K or its IVB successor). Cheers, -- Szilárd
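For a concrete picture of how such a two-GPU, dual-socket box would typically be driven with the GROMACS 4.6 thread-MPI build, one PP rank per GPU, a minimal sketch (the core count and file names are placeholders, not taken from this thread):

    mdrun -ntmpi 2 -ntomp 6 -gpu_id 01 -s topol.tpr -deffnm md

Each of the two thread-MPI ranks gets one GPU (ids 0 and 1) plus six OpenMP threads, which is roughly the CPU/GPU balance discussed above.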
Re: Re: Re: [gmx-users] GPU-based workstation
On Nvidia benchmark pages I've found suggestions to use two 6-core CPUs for systems with 2 GPUs. Assuming that I'll be using two GTX 680 cards with a 256-bit bus and 4 GB RAM (not professional NVIDIA cards like Tesla), which CPUs would give me the best performance: one i7 with 8 cores, or two Xeon E5s with 6 cores each? Is it meaningful to use 2 separate CPUs, each with several cores, for the 2 GPUs? James

2013/5/26 lloyd riggs lloyd.ri...@gmx.ch You can also look at profiling on various web sites; the high-end Nvidia cards run only slightly better than the 2-year-old ones, so from an individual's point of view not worth the money yet, but if you have the money? as I've been browsing. Also, the sim I did on the cluster was 180-190,000 atoms, so exactly the same performance the other person had. Stephan
Re: [gmx-users] GPU-based workstation
Dear Dr. Watkins! Thank you for the suggestions! In the local shops I've found only Core i7 CPUs with 6 cores (like the Core i7-39xx) and with 4 cores. Should I get much better performance with 6 cores than with 4 cores in the case of an i7 CPU (assuming that I run the simulation in CPU+GPU mode)? Also, you've mentioned a 4-PCIe MB. Does it mean that a modern workstation could have 4 GPUs in one home-like desktop? According to my current task I suppose that 2 GPUs would be suitable for my simulations (assuming that I use a typical ASUS MB and a 650 W power unit). Has anyone tried to use several GPUs in one workstation? What attributes of the MB should be taken into account for best performance on such a multi-GPU station? James

2013/5/25 lloyd riggs lloyd.ri...@gmx.ch There's also these, but 1 chip runs 6K US$; they can get performance up to 2.3 teraflops per chip, though double precision...but I have no clue about integration with GPUs...Intel also sells their chips on PCIe cards...but those get only about 350 Gflops and run 1K US$. http://en.wikipedia.org/wiki/Field-programmable_gate_array and vendor http://www.xilinx.com/ They can design them to fit a PCIe slot and run about the same, but you still need the board, RAM etc... Mostly just to dream about; they say you can order them with radiation shielding as well...so... Stephan Watkins

*Gesendet:* Freitag, 24. Mai 2013 um 13:17 Uhr *Von:* James Starlight jmsstarli...@gmail.com *An:* Discussion list for GROMACS users gmx-users@gromacs.org *Betreff:* [gmx-users] GPU-based workstation Dear Gromacs Users! I'd like to build a new workstation for performing simulations on GPU with the GROMACS 4.6 native CUDA support. Recently I've used such a setup with a Core i5 CPU and an NVIDIA GTX 670 card and obtained good performance (~20 ns/day for a typical 60,000-atom system with the SD integrator). Now I'd like to build a multi-GPU workstation. My question: how many GPUs would give me the best performance on a typical home-like workstation, and what kind of NVIDIA multi-GPU setup should I use (e.g. SLI etc.)? Thanks for help, James
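As an aside on the SLI question: GROMACS does not use SLI at all; each GPU is addressed individually and assigned to a PP rank through mdrun's -gpu_id string. A minimal sketch with the 4.6 thread-MPI build, mapping four ranks onto two GPUs (file names are placeholders):

    mdrun -ntmpi 4 -gpu_id 0011 -s topol.tpr -deffnm md

Here ranks 0 and 1 share GPU 0 and ranks 2 and 3 share GPU 1; with one rank per GPU the string would simply be 01.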
Aw: Re: [gmx-users] GPU-based workstation
The more RAM the better, and the best I have seen is a 4-GPU workstation. I can use/have used 4. The GPU takes 2 slots though, so a 7-8 slot PCIe board is really 3-4 GPUs, except the Tyan mentioned (those are designed as blades, so an 8- or 10-slot board really holds 8 or 10 GPUs). There are cooling problems though with GPUs, as on a board they're packed, so extra cooling may help not blow a GPU, but I would look for good ones (ask around), as it's a video game market and they go for looks even though it's in a casing. The external RAM (not onboard GPU RAM) helps if you do a larger sim, but I don't know performance-wise; for the onboard GPU RAM, the more the merrier...so yes, on normal workstations you can get 4 GPUs on a 300 US$ board, but then the price goes way up (3-4000 US$ for an 8-10 GPU board). RAM ordered abroad is also cheap, 8 or 16 GB, vs. the shop...I have used 4 GPUs but only with test software, not Gromacs, so it would be nice to see performance...for a small 100-atom molecule and 500 solvent, using just the CPU I get it to run in 5-10 minutes real time for a 1 ns sim, but I tried a simple large 800-amino-acid, 25,000-solvent eq (NVT or NPT) run and it clocks at around 1 hour real time for, say, 50 ps of eq. Stephan

Gesendet: Samstag, 25. Mai 2013 um 07:54 Uhr Von: James Starlight jmsstarli...@gmail.com An: Discussion list for GROMACS users gmx-users@gromacs.org Betreff: Re: [gmx-users] GPU-based workstation
Re: Aw: Re: [gmx-users] GPU-based workstation
I've been running on my University's GPU nodes; these each have one Xeon E5 (6 cores, 12 threads) and 4 NVIDIA GTX 690s. My system is 93,000 atoms of DMF under NVE. The performance has been a little disappointing, ~10 ns/day. On my home system, using a Core i5-2500 and an NVIDIA 560 Ti, I get 5.4 ns/day for the same system. On our HPC system, using 32 nodes each with 2 quad-core Xeon processors, I get 30-40 ns/day. I think that to achieve reasonable performance the system has to be balanced between CPUs and GPUs; probably getting 2 high-end GPUs and a top-end Xeon E5 or Core i7 would be a good choice. Richard

From: lloyd riggs lloyd.ri...@gmx.ch Reply-To: Discussion users gmx-users@gromacs.org Date: Saturday, 25 May 2013 12:02 To: Discussion users gmx-users@gromacs.org Subject: Aw: Re: [gmx-users] GPU-based workstation
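One practical way to judge whether a given CPU/GPU combination is balanced, along the lines Richard describes, is the timing report mdrun writes to its log; a quick way to pull it out (file name is a placeholder, and the exact wording of the line may differ between 4.6 minor versions):

    grep -i "gpu/cpu" md.log

In GROMACS 4.6 this reports the force-evaluation time on the GPU versus the CPU, and mdrun also prints a note when one side is left idling.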
Re: Aw: Re: [gmx-users] GPU-based workstation
Richard, thanks for the suggestion! Assuming that I'm using 2 high-end GeForces, which would give better performance: 1) one i7 (4 or 6 cores), or 2) an 8-core Xeon like the Intel Xeon E5-2650, 2.0 GHz / 8 cores? What properties of the MB should I take into account primarily for such a Xeon-based system? Do such MBs support multi-GPU (I noticed that many of them seem to lack enough PCIe slots)? James

2013/5/25 Broadbent, Richard richard.broadben...@imperial.ac.uk
Aw: Re: Re: [gmx-users] GPU-based workstation
I'd go for the i7 6-core. To the other message, funny. I bought ATIs as they clock faster and cost 1/3 the price of Nvidias, but then the software all went to Nvidia. The new ATI with twice the shaders runs at the same speed (around 1-1.3 teraflops) due to the same I/O problems the Nvidias ran into (or maybe onboard RAM does solve the problem if they went up to 16 or 32 MB). Gromacs, etc. doesn't run on ATIs, and I've been hoping they, AMD, catch up, but all I ever see is the constant "in 6 months" and then nothing. I ran around 40 4-ns simulations on University blades with 8 AMD quad cores; using 3 blades I was only able to get 1 ns/day, but never pressed it as far as why so slow, as I needed to finish. With the Nvidia at even 5 ns/day I, or a lot of people, could do some really nice work as far as publishing, with raw data in 2 weeks' time, so now I feel a bit saddened... I also just found OpenCL profiling with CUDA 5 that will take any C or C++ software and mark all the sections you need to convert to OpenCL, but the trial software is 30 days, then 250 US$... Stephan

Gesendet: Samstag, 25. Mai 2013 um 15:19 Uhr Von: James Starlight jmsstarli...@gmail.com An: Discussion list for GROMACS users gmx-users@gromacs.org Betreff: Re: Aw: Re: [gmx-users] GPU-based workstation
Aw: Re: Re: [gmx-users] GPU-based workstation
You can also look at profiling on various web sites; the high-end Nvidia cards run only slightly better than the 2-year-old ones, so from an individual's point of view not worth the money yet, but if you have the money? as I've been browsing. Also, the sim I did on the cluster was 180-190,000 atoms, so exactly the same performance the other person had. Stephan

Gesendet: Samstag, 25. Mai 2013 um 15:19 Uhr Von: James Starlight jmsstarli...@gmail.com An: Discussion list for GROMACS users gmx-users@gromacs.org Betreff: Re: Aw: Re: [gmx-users] GPU-based workstation
Re: [gmx-users] GPU job often stopped
the problem is still there... :-( On 04/29/2013 06:06 PM, Szilárd Páll wrote: On Mon, Apr 29, 2013 at 3:51 PM, Albertmailmd2...@gmail.com wrote: On 04/29/2013 03:47 PM, Szilárd Páll wrote: In that case, while it isn't very likely, the issue could be caused by some implementation detail which aims to avoid performance loss caused by an issue in the NVIDIA drivers. Try running with the GMX_CUDA_STREAMSYNC environment variable set. Btw, were there any other processes using the GPU while mdrun was running? Cheers, -- Szilárd thanks for kind reply. There is no any other process when I am running Gromacs. do you mean I should set GMX_CUDA_STREAMSYNC in the job script like: export GMX_CUDA_STREAMSYNC=/opt/cuda-5.0 Sort of, but the value does not matter. So if your shell is bash, the above as well as simply export GMX_CUDA_STREAMSYNC= will work fine. Let us know if this avoided the crash - when you have simulated long enough to be able to judge. Cheers, -- Szilárd -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU job often stopped
Have you tried running on CPUs only just to see if the issue persists? Unless the issue does not occur with the same binary on the same hardware running on CPUs only, I doubt it's a problem in the code. Do you have ECC on? -- Szilárd On Sun, Apr 28, 2013 at 5:27 PM, Albert mailmd2...@gmail.com wrote: Dear: I am running MD jobs in a workstation with 4 K20 GPU and I found that the job always failed with following messages from time to time: [tesla:03432] *** Process received signal *** [tesla:03432] Signal: Segmentation fault (11) [tesla:03432] Signal code: Address not mapped (1) [tesla:03432] Failing at address: 0xfffe02de67e0 [tesla:03432] [ 0] /lib/x86_64-linux-gnu/libpthread.so.0(+0xfcb0) [0x7f4666da1cb0] [tesla:03432] [ 1] mdrun_mpi() [0x47dd61] [tesla:03432] [ 2] mdrun_mpi() [0x47d8ae] [tesla:03432] [ 3] /opt/intel/lib/intel64/libiomp5.so(__kmp_invoke_microtask+0x93) [0x7f46667904f3] [tesla:03432] *** End of error message *** -- mpirun noticed that process rank 0 with PID 3432 on node tesla exited on signal 11 (Segmentation fault). -- I can continue the jobs with mdrun option -append -cpi, but it still stopped from time to time. I am just wondering what's the problem? thank you very much Albert -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU job often stopped
Hello: yes, I tried the CPU only version, it goes well and didn't stop. I am not sure whether I have ECC on or not. There are 4 Tesla K20 and one GTX650 in the workstation, after compilation, I simple submit the jobs with command: mdrun -s md.tpr -gpu_id 0234 I submit the same system in another GTX690 machine, it also goes well. I compiled Gromacs with the same options in that machine. thank you very much best Albert On 04/29/2013 01:19 PM, Szilárd Páll wrote: Have you tried running on CPUs only just to see if the issue persists? Unless the issue does not occur with the same binary on the same hardware running on CPUs only, I doubt it's a problem in the code. Do you have ECC on? -- Szilárd -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
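One quick way to check the ECC state outside of GROMACS is nvidia-smi; for example (a sketch, and the output layout varies with driver version):

    nvidia-smi -q -d ECC

which prints the current and pending ECC mode for each board. The GPU-detection lines that mdrun writes to the log and stderr report the same per-GPU "ECC: yes/no" flag.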
Re: [gmx-users] GPU job often stopped
On 04/28/2013 05:45 PM, Justin Lemkul wrote: Frequent failures suggest instability in the simulated system. Check your .log file or stderr for informative Gromacs diagnostic information. -Justin my log file didn't have any errors; the end of the stopped log file looks something like:

DD step 2259 vol min/aver 0.967 load imb.: force 0.8%

           Step           Time         Lambda
           2260        45200.0        0.0

   Energies (kJ/mol)
          Angle            U-B    Proper Dih.  Improper Dih.          LJ-14
    9.86437e+03    4.02406e+04    3.52809e+04    6.13542e+02    8.61815e+03
     Coulomb-14        LJ (SR)  Disper. corr.   Coulomb (SR)   Coul. recip.
    1.25055e+04    3.05477e+04   -9.05956e+03   -6.02400e+05    1.58357e+03
 Position Rest.      Potential    Kinetic En.   Total Energy    Temperature
    1.39149e+02   -4.72066e+05    1.37165e+05   -3.34901e+05    3.11958e+02
 Pres. DC (bar) Pressure (bar)   Constr. rmsd
   -2.94092e+02   -7.91535e+01    1.79812e-05

also in the output file I only got: step 13300, will finish Tue Apr 30 14:41 NOTE: Turning on dynamic load balancing

Probably the machine was restarted from time to time? best Albert -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU job often stopped
On Mon, Apr 29, 2013 at 2:41 PM, Albert mailmd2...@gmail.com wrote: Probably the machine was restarted from time to time? The segv indicates that mdrun crashed and not that the machine was restarted. The GPU detection output (both on stderr and log) should show whether ECC is on (and so does the nvidia-smi tool). Cheers, -- Szilárd -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU job often stopped
On 04/29/2013 03:31 PM, Szilárd Páll wrote: The segv indicates that mdrun crashed and not that the machine was restarted. The GPU detection output (both on stderr and log) should show whether ECC is on (and so does the nvidia-smi tool). Cheers, -- Szilárd yes it was on: Reading file heavy.tpr, VERSION 4.6.1 (single precision) Using 4 MPI threads Using 8 OpenMP threads per tMPI thread 5 GPUs detected: #0: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible #1: NVIDIA GeForce GTX 650, compute cap.: 3.0, ECC: no, stat: compatible #2: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible #3: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible #4: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible 4 GPUs user-selected for this run: #0, #2, #3, #4 -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU job often stopped
In that case, while it isn't very likely, the issue could be caused by some implementation detail which aims to avoid performance loss caused by an issue in the NVIDIA drivers. Try running with the GMX_CUDA_STREAMSYNC environment variable set. Btw, were there any other processes using the GPU while mdrun was running? Cheers, -- Szilárd On Mon, Apr 29, 2013 at 3:32 PM, Albert mailmd2...@gmail.com wrote: On 04/29/2013 03:31 PM, Szilárd Páll wrote: The segv indicates that mdrun crashed and not that the machine was restarted. The GPU detection output (both on stderr and log) should show whether ECC is on (and so does the nvidia-smi tool). Cheers, -- Szilárd yes it was on: Reading file heavy.tpr, VERSION 4.6.1 (single precision) Using 4 MPI threads Using 8 OpenMP threads per tMPI thread 5 GPUs detected: #0: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible #1: NVIDIA GeForce GTX 650, compute cap.: 3.0, ECC: no, stat: compatible #2: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible #3: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible #4: NVIDIA Tesla K20m, compute cap.: 3.5, ECC: yes, stat: compatible 4 GPUs user-selected for this run: #0, #2, #3, #4 -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
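On the question of other processes using the GPUs: running plain

    nvidia-smi

while mdrun is active is usually enough to check; the "Compute processes" table at the bottom of its output lists every process holding a context on each GPU (the same table appears in the nvidia-smi output quoted in the GPU performance thread later in this digest).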
Re: [gmx-users] GPU job often stopped
On 04/29/2013 03:47 PM, Szilárd Páll wrote: In that case, while it isn't very likely, the issue could be caused by some implementation detail which aims to avoid performance loss caused by an issue in the NVIDIA drivers. Try running with the GMX_CUDA_STREAMSYNC environment variable set. Btw, were there any other processes using the GPU while mdrun was running? Cheers, -- Szilárd thanks for kind reply. There is no any other process when I am running Gromacs. do you mean I should set GMX_CUDA_STREAMSYNC in the job script like: export GMX_CUDA_STREAMSYNC=/opt/cuda-5.0 ? THX Albert -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU job often stopped
On Mon, Apr 29, 2013 at 3:51 PM, Albert mailmd2...@gmail.com wrote: On 04/29/2013 03:47 PM, Szilárd Páll wrote: In that case, while it isn't very likely, the issue could be caused by some implementation detail which aims to avoid performance loss caused by an issue in the NVIDIA drivers. Try running with the GMX_CUDA_STREAMSYNC environment variable set. Btw, were there any other processes using the GPU while mdrun was running? Cheers, -- Szilárd thanks for kind reply. There is no any other process when I am running Gromacs. do you mean I should set GMX_CUDA_STREAMSYNC in the job script like: export GMX_CUDA_STREAMSYNC=/opt/cuda-5.0 Sort of, but the value does not matter. So if your shell is bash, the above as well as simply export GMX_CUDA_STREAMSYNC= will work fine. Let us know if this avoided the crash - when you have simulated long enough to be able to judge. Cheers, -- Szilárd ? THX Albert -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
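For reference, a minimal sketch of the relevant part of a bash job script (the value assigned is arbitrary, as noted above; the mdrun arguments are simply the ones used earlier in this thread):

    export GMX_CUDA_STREAMSYNC=1
    mdrun -s md.tpr -gpu_id 0234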
Re: [gmx-users] GPU job often stopped
On 4/28/13 11:27 AM, Albert wrote: Dear: I am running MD jobs in a workstation with 4 K20 GPU and I found that the job always failed with following messages from time to time: [tesla:03432] *** Process received signal *** [tesla:03432] Signal: Segmentation fault (11) [tesla:03432] Signal code: Address not mapped (1) [tesla:03432] Failing at address: 0xfffe02de67e0 [tesla:03432] [ 0] /lib/x86_64-linux-gnu/libpthread.so.0(+0xfcb0) [0x7f4666da1cb0] [tesla:03432] [ 1] mdrun_mpi() [0x47dd61] [tesla:03432] [ 2] mdrun_mpi() [0x47d8ae] [tesla:03432] [ 3] /opt/intel/lib/intel64/libiomp5.so(__kmp_invoke_microtask+0x93) [0x7f46667904f3] [tesla:03432] *** End of error message *** -- mpirun noticed that process rank 0 with PID 3432 on node tesla exited on signal 11 (Segmentation fault). -- I can continue the jobs with mdrun option -append -cpi, but it still stopped from time to time. I am just wondering what's the problem? Frequent failures suggest instability in the simulated system. Check your .log file or stderr for informative Gromacs diagnostic information. -Justin -- Justin A. Lemkul, Ph.D. Research Scientist Department of Biochemistry Virginia Tech Blacksburg, VA jalemkul[at]vt.edu | (540) 231-9080 http://www.bevanlab.biochem.vt.edu/Pages/Personal/justin -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU efficiency question
Probably the part of the calculation done on the GPU is not rate limiting. There's no point having four chefs to make one dish... Look at the beginning and end of your .log files for diagnostic information. If this is a single node, you should be using threadMPI, not real MPI. Generally four CPU cores vs four GPU cores will require an extremely large PP load for the GPUs to all be effective. Mark On Fri, Apr 26, 2013 at 8:35 PM, Albert mailmd2...@gmail.com wrote: Dear: I've got two GTX690 in a a workstation and I found that when I run the md production with following two command: mpirun -np 4 md_run_mpi or mpirun -np 2 md_run_mpi the efficiency are the same. I notice that gromacs can detect 4 GPU (probably because GTX690 have two core..): 4 GPUs detected on host node4: #0: NVIDIA GeForce GTX 690, compute cap.: 3.0, ECC: no, stat: compatible #1: NVIDIA GeForce GTX 690, compute cap.: 3.0, ECC: no, stat: compatible #2: NVIDIA GeForce GTX 690, compute cap.: 3.0, ECC: no, stat: compatible #3: NVIDIA GeForce GTX 690, compute cap.: 3.0, ECC: no, stat: compatible why the -np 2 and -np 4 are the same efficiency? shouldn't it be faster for -np 4 ? thank you very much Albert -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/**mailman/listinfo/gmx-usershttp://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/** Support/Mailing_Lists/Searchhttp://www.gromacs.org/Support/Mailing_Lists/Searchbefore posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/**Support/Mailing_Listshttp://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
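To make Mark's point about thread-MPI concrete: on a single node the thread-MPI build starts its own ranks, so instead of "mpirun -np 4 mdrun_mpi ..." one would launch something like (a sketch; file names are placeholders):

    mdrun -ntmpi 4 -gpu_id 0123 -s md.tpr -deffnm md

with -ntmpi 2 -gpu_id 01 as the two-rank comparison. Whether four ranks beat two then depends on whether the CPU cores, not the GPUs, are the bottleneck.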
Re: [gmx-users] GPU performance
On Wed, Apr 10, 2013 at 3:34 AM, Benjamin Bobay bgbo...@ncsu.edu wrote: Szilárd - First, many thanks for the reply. Second, I am glad that I am not crazy. Ok so based on your suggestions, I think I know what the problem is/was. There was a sander process running on 1 of the CPUs. Clearly GROMACS was trying to use 4 with Using 4 OpenMP thread. I just did not catch that. Sorry! Rookie mistake. Which I guess leads me to my next question (sorry if its too naive): (1) When running GROMACS (or a I guess any other CUDA based programs), its best to have all the CPUs free, right? I guess based on my results I have pretty much answered that question. Although I thought that as long as I have one CPU available to run the GPU it would be good: would setting -ntmpi 1 -ntomp 1 help or would I take a major hit in ns/day as well? Such a behavior is not specific to GROMACS or CUDA-accelerated codes, but all compute-intensive codes that expect to be running alone on the set of CPU cores they are started on. As you could see on the output, mdrun automatically detected that you have 4 CPU cores and as Mark saied, it tries to use all of them along the GPU. As one of the cores was busy, you ended up in a situation in which four threads of mdrun plus the (presumably) one thread of sander are competing for four cores. This is made even worse by the fact that when using a full machine, mdrun locks its threads to physical cores to prevent the OS from moving them around (which can cause performance loss). Secondly, using a single core with a GPU will not result in a very good performance in GROMACS. The current GROMACS acceleration expects to run on a couple of CPU cores together with a GPU - which is the typical balance of CPU-GPU hardware most clusters (1 GPU/socket) as well as many home users would have (1-2 GPUs for 4-8 CPU cores). If I try the benchmarks again just to see (for fun) with Using 4 OpenMP thread, under top I have - so I think the CPU is fine : PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 24791 bobayb20 0 48.3g 51m 7576 R 299.1 0.2 11:32.90 mdrun Nope, that just means, roughly speaking, that sander is probably fully using one core and the four thread of mdrun are crammed on the remaining three cores - which is bad. However, you can simply run mdrun using three threads which will run fine along sander. Whether this will be efficient or not, you'll have to see. Note that if some other program is using the GPU as well, don't expect full performance - but the difference will be much less than in the case of oversubscribed CPU cores. Cheers, -- Szilárd When I have a chance (after this sander run is done - hopefully soon) I can try the benchmarks again. Thanks again for the help! Ben -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
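As a sketch of the "three cores next to sander" idea mentioned above (all values illustrative, not a tested recipe):

    mdrun -ntmpi 1 -ntomp 3 -pin off -s topol.tpr -deffnm md

i.e. one thread-MPI rank with three OpenMP threads and pinning disabled, leaving the fourth core to the other job; how well this performs has to be checked case by case, as Szilárd notes.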
Re: [gmx-users] GPU performance
Hi Ben, That performance is not reasonable at all - neither for CPU only run on your quad-core Sandy Bridge, nor for the CPU+GPU run. For the latter you should be getting more like 50 ns/day or so. What's strange about your run is that the CPU-GPU load balancing is picking a *very* long cut-off which means that your CPU is for some reason performing very badly. Check how is mdrun behaving while running in top/htop nad if you are not seeing ~400% CPU utilization, there is something wrong - perhaps threads getting locked to the same core (to check that try -pin off). Secondly, note that you are using OpenMM-specific settings from the old GROMACS-OpenMM comparison benchmarks in which the grid spacing is overly coarse (you could use something like a fourier-spacing=0.125 or even larger with rc=1.0). Cheers, -- Szilárd On Tue, Apr 9, 2013 at 10:27 PM, Benjamin Bobay bgbo...@ncsu.edu wrote: Good afternoon - I recently installed gromacs-4.6 on CentOS6.3 and the installation went just fine. I have a Tesla C2075 GPU. I then downloaded the benchmark directories and ran a bench mark on the GPU/ dhfr-solv-PME.bench This is what I got: Using 1 MPI thread Using 4 OpenMP threads 1 GPU detected: #0: NVIDIA Tesla C2075, compute cap.: 2.0, ECC: yes, stat: compatible 1 GPU user-selected for this run: #0 Back Off! I just backed up ener.edr to ./#ener.edr.1# starting mdrun 'Protein in water' -1 steps, infinite ps. step 40: timed with pme grid 64 64 64, coulomb cutoff 1.000: 4122.9 M-cycles step 80: timed with pme grid 56 56 56, coulomb cutoff 1.143: 3685.9 M-cycles step 120: timed with pme grid 48 48 48, coulomb cutoff 1.333: 3110.8 M-cycles step 160: timed with pme grid 44 44 44, coulomb cutoff 1.455: 3365.1 M-cycles step 200: timed with pme grid 40 40 40, coulomb cutoff 1.600: 3499.0 M-cycles step 240: timed with pme grid 52 52 52, coulomb cutoff 1.231: 3982.2 M-cycles step 280: timed with pme grid 48 48 48, coulomb cutoff 1.333: 3129.2 M-cycles step 320: timed with pme grid 44 44 44, coulomb cutoff 1.455: 3425.4 M-cycles step 360: timed with pme grid 42 42 42, coulomb cutoff 1.524: 2979.1 M-cycles optimal pme grid 42 42 42, coulomb cutoff 1.524 step 4300 performance: 1.8 ns/day and from the nvidia-smi output: Tue Apr 9 10:13:46 2013 +--+ | NVIDIA-SMI 4.304.37 Driver Version: 304.37 | |---+--+--+ | GPU Name | Bus-IdDisp. | Volatile Uncorr. ECC | | Fan Temp Perf Pwr:Usage/Cap| Memory-Usage | GPU-Util Compute M. | |===+==+==| | 0 Tesla C2075 | :03:00.0 On | 0 | | 30% 67CP080W / 225W | 4% 200MB / 5375MB | 4% Default | +---+--+--+ +-+ | Compute processes: GPU Memory | | GPU PID Process name Usage | |=| |0 22568 mdrun 59MB | +-+ So I am only getting 1.8ns/day ! Is that right? It seems very very small compared to the CPU test where I am getting the same: step 200 performance: 1.8 ns/dayvol 0.79 imb F 14% From the md.log of the GPU test: Detecting CPU-specific acceleration. Present hardware specification: Vendor: GenuineIntel Brand: Intel(R) Xeon(R) CPU E5-2603 0 @ 1.80GHz Family: 6 Model: 45 Stepping: 7 Features: aes apic avx clfsh cmov cx8 cx16 htt lahf_lm mmx msr nonstop_tsc pcid pclmuldq pdcm pdpe1gb popcnt pse rdtscp sse2 sse3 sse4.1 sse4.2 ssse3 tdt x2a pic Acceleration most likely to fit this hardware: AVX_256 Acceleration selected at GROMACS compile time: AVX_256 1 GPU detected: #0: NVIDIA Tesla C2075, compute cap.: 2.0, ECC: yes, stat: compatible 1 GPU user-selected for this run: #0 Will do PME sum in reciprocal space. Any thoughts as to why it is so slow? many thanks! 
Ben
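The .mdp change Szilárd recommends above (moving away from the old GROMACS-OpenMM benchmark settings) would look roughly like the following; the numbers are assumed starting values in the range he mentions, to be re-balanced per system:

; PME settings for a CPU+GPU run with the Verlet scheme (example values, not from the thread)
cutoff-scheme   = Verlet
coulombtype     = PME
fourier-spacing = 0.125   ; coarser reciprocal-space grid than the OpenMM-era benchmark settings
rcoulomb        = 1.0     ; short-range cut-off; mdrun's PME tuning may scale this up at run time
rvdw            = 1.0

With a sane grid spacing, and all four cores actually free, the automatic CPU-GPU load balancing should settle on a much shorter cut-off than the 1.524 nm seen in the log above.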
Re: [gmx-users] GPU performance
Szilárd - First, many thanks for the reply. Second, I am glad that I am not crazy. Ok so based on your suggestions, I think I know what the problem is/was. There was a sander process running on 1 of the CPUs. Clearly GROMACS was trying to use 4 with Using 4 OpenMP thread. I just did not catch that. Sorry! Rookie mistake. Which I guess leads me to my next question (sorry if its too naive): (1) When running GROMACS (or a I guess any other CUDA based programs), its best to have all the CPUs free, right? I guess based on my results I have pretty much answered that question. Although I thought that as long as I have one CPU available to run the GPU it would be good: would setting -ntmpi 1 -ntomp 1 help or would I take a major hit in ns/day as well? If I try the benchmarks again just to see (for fun) with Using 4 OpenMP thread, under top I have - so I think the CPU is fine : PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 24791 bobayb20 0 48.3g 51m 7576 R 299.1 0.2 11:32.90 mdrun When I have a chance (after this sander run is done - hopefully soon) I can try the benchmarks again. Thanks again for the help! Ben -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU performance
On Apr 10, 2013 3:34 AM, Benjamin Bobay bgbo...@ncsu.edu wrote: Szilárd - First, many thanks for the reply. Second, I am glad that I am not crazy. Ok so based on your suggestions, I think I know what the problem is/was. There was a sander process running on 1 of the CPUs. Clearly GROMACS was trying to use 4 with Using 4 OpenMP thread. I just did not catch that. Sorry! Rookie mistake. Which I guess leads me to my next question (sorry if its too naive): (1) When running GROMACS (or a I guess any other CUDA based programs), its best to have all the CPUs free, right? I guess based on my results I have pretty much answered that question. Although I thought that as long as I have one CPU available to run the GPU it would be good: would setting -ntmpi 1 -ntomp 1 help or would I take a major hit in ns/day as well? Some codes might treat the CPU as a I/O, MPI and memory-serving co-processor of the GPU; those codes will tend to be insensitive to the CPU config. GROMACS goes to great lengths to use all the hardware in a dynamically load-balanced way, so CPU load and config tend to affect the bottom line immediately. Mark If I try the benchmarks again just to see (for fun) with Using 4 OpenMP thread, under top I have - so I think the CPU is fine : PID USER PR NI VIRT RES SHR S %CPU %MEMTIME+ COMMAND 24791 bobayb20 0 48.3g 51m 7576 R 299.1 0.2 11:32.90 mdrun When I have a chance (after this sander run is done - hopefully soon) I can try the benchmarks again. Thanks again for the help! Ben -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU version of GROMACS 4.6 in MacOS cluster
Hi Szilard Thanks for this tip; it was extremely useful. The problem was indeed the incompatibility between the installed NVIDIA driver and the CUDA 5.0 runtime library. Installation of an older driver solved the problem. The programs devideQuery etc can now detect the GPU. GROMACS can also detect now the card but unfortunately aborts with the following error Fatal error: Incorrect launch configuration: mismatching number of PP MPI processes and GPUs per node. mdrun_mpi was started with 12 PP MPI processes per node, but only 1 GPU were detected. Here is my command line mpirun -np 12 mdrun_mpi -s test.tpr -deffnm test_out -nb gpu What can be the problem? Thanks again Hi George, As I said before, that just means that most probably the GPU driver is not compatible with the CUDA runtime (libcudart) that you installed with the CUDA toolkit. I've no clue about the Mac OS installers and releases, you'll have to do the research on that. Let us know if you have further (GROMACS-related) issues. Cheers, -- Szil?rd On Fri, Mar 1, 2013 at 2:48 PM, George Patargias g...@bioacademy.gr wrote: Hi Szilαrd Thanks for your reply. I have run the deviceQuery utility and what I got back is /deviceQuery Starting... CUDA Device Query (Runtime API) version (CUDART static linking) cudaGetDeviceCount returned 38 - no CUDA-capable device is detected Should I understand from this that the CUDA driver was not installed from the MAC OS X CUDA 5.0 Production Release? George HI, That looks like the driver does not work or is incompatible with the runtime. Please get the SDK, compile a simple program, e.g. deviceQuery and see if that works (I suspect that it won't). Regarding your machines, just FYI, the Quadro 4000 is a pretty slow card (somewhat slower than a GTX 460) so you'll hava a quite strong resource imbalance: a lot of CPU compute power (2x Xeon 5xxx, right?) and little GPU compute power which will lead to the CPU idling while waiting for the GPU. Cheers, -- Szilαrd On Thu, Feb 28, 2013 at 4:52 PM, George Patargias g...@bioacademy.grwrote: Hello We are trying to install the GPU version of GROMACS 4.6 on our own MacOS cluster. So for the cluster nodes that have the NVIDIA Quadro 4000 cards: - We have downloaded and install the MAC OS X CUDA 5.0 Production Release from here: https://developer.nvidia.com/cuda-downloads placing the libraries contained in this download in /usr/local/cuda/lib - We have managed to compile GROMACS 4.6 linking it statically with these CUDA libraries and the MPI libraries (with BUILD_SHARED_LIBS=OFF and GMX_PREFER_STATIC_LIBS=ON) Unfortunately, when we tried to run a test job with the generated mdrun_mpi, GROMACS reported that it cannot detect any CUDA-enabled devices. It also reports 0.0 version for CUDA driver and runtime. Is the actual CUDA driver missing from the MAC OS X CUDA 5.0 Production Release that we installed? Do we need to install it from here: http://www.nvidia.com/object/cuda-mac-driver.html Or is something else that we need to do? Many thanks in advance. George Dr. George Patargias Postdoctoral Researcher Biomedical Research Foundation Academy of Athens 4, Soranou Ephessiou 115 27 Athens Greece Office: +302106597568 -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? 
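Two checks come up repeatedly in this thread: Szilárd's driver/runtime sanity test with the CUDA samples, and matching the number of PP MPI ranks to the number of detected GPUs. A sketch of both, with the samples path an assumption for a typical CUDA 5.0 install:

cd /usr/local/cuda/samples/1_Utilities/deviceQuery && make && ./deviceQuery
# if deviceQuery now reports the Quadro 4000, launch one PP rank per GPU instead of 12:
mpirun -np 1 mdrun_mpi -ntomp 12 -s test.tpr -deffnm test_out -nb gpu -gpu_id 0

One MPI rank with 12 OpenMP threads avoids the "mismatching number of PP MPI processes and GPUs per node" error; whether 12 threads per rank is efficient on a dual-socket Xeon node is a separate tuning question.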
Re: [gmx-users] GPU version of GROMACS 4.6 in MacOS cluster
Hi Szilαrd Thanks for your reply. I have run the deviceQuery utility and what I got back is /deviceQuery Starting... CUDA Device Query (Runtime API) version (CUDART static linking) cudaGetDeviceCount returned 38 - no CUDA-capable device is detected Should I understand from this that the CUDA driver was not installed from the MAC OS X CUDA 5.0 Production Release? George HI, That looks like the driver does not work or is incompatible with the runtime. Please get the SDK, compile a simple program, e.g. deviceQuery and see if that works (I suspect that it won't). Regarding your machines, just FYI, the Quadro 4000 is a pretty slow card (somewhat slower than a GTX 460) so you'll hava a quite strong resource imbalance: a lot of CPU compute power (2x Xeon 5xxx, right?) and little GPU compute power which will lead to the CPU idling while waiting for the GPU. Cheers, -- Szilαrd On Thu, Feb 28, 2013 at 4:52 PM, George Patargias g...@bioacademy.grwrote: Hello We are trying to install the GPU version of GROMACS 4.6 on our own MacOS cluster. So for the cluster nodes that have the NVIDIA Quadro 4000 cards: - We have downloaded and install the MAC OS X CUDA 5.0 Production Release from here: https://developer.nvidia.com/cuda-downloads placing the libraries contained in this download in /usr/local/cuda/lib - We have managed to compile GROMACS 4.6 linking it statically with these CUDA libraries and the MPI libraries (with BUILD_SHARED_LIBS=OFF and GMX_PREFER_STATIC_LIBS=ON) Unfortunately, when we tried to run a test job with the generated mdrun_mpi, GROMACS reported that it cannot detect any CUDA-enabled devices. It also reports 0.0 version for CUDA driver and runtime. Is the actual CUDA driver missing from the MAC OS X CUDA 5.0 Production Release that we installed? Do we need to install it from here: http://www.nvidia.com/object/cuda-mac-driver.html Or is something else that we need to do? Many thanks in advance. George Dr. George Patargias Postdoctoral Researcher Biomedical Research Foundation Academy of Athens 4, Soranou Ephessiou 115 27 Athens Greece Office: +302106597568 -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists Dr. George Patargias Postdoctoral Researcher Biomedical Research Foundation Academy of Athens 4, Soranou Ephessiou 115 27 Athens Greece Office: +302106597568 -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU version of GROMACS 4.6 in MacOS cluster
Hi George, As I said before, that just means that most probably the GPU driver is not compatible with the CUDA runtime (libcudart) that you installed with the CUDA toolkit. I've no clue about the Mac OS installers and releases, you'll have to do the research on that. Let us know if you have further (GROMACS-related) issues. Cheers, -- Szilárd On Fri, Mar 1, 2013 at 2:48 PM, George Patargias g...@bioacademy.gr wrote: Hi Szilαrd Thanks for your reply. I have run the deviceQuery utility and what I got back is /deviceQuery Starting... CUDA Device Query (Runtime API) version (CUDART static linking) cudaGetDeviceCount returned 38 - no CUDA-capable device is detected Should I understand from this that the CUDA driver was not installed from the MAC OS X CUDA 5.0 Production Release? George HI, That looks like the driver does not work or is incompatible with the runtime. Please get the SDK, compile a simple program, e.g. deviceQuery and see if that works (I suspect that it won't). Regarding your machines, just FYI, the Quadro 4000 is a pretty slow card (somewhat slower than a GTX 460) so you'll hava a quite strong resource imbalance: a lot of CPU compute power (2x Xeon 5xxx, right?) and little GPU compute power which will lead to the CPU idling while waiting for the GPU. Cheers, -- Szilαrd On Thu, Feb 28, 2013 at 4:52 PM, George Patargias g...@bioacademy.grwrote: Hello We are trying to install the GPU version of GROMACS 4.6 on our own MacOS cluster. So for the cluster nodes that have the NVIDIA Quadro 4000 cards: - We have downloaded and install the MAC OS X CUDA 5.0 Production Release from here: https://developer.nvidia.com/cuda-downloads placing the libraries contained in this download in /usr/local/cuda/lib - We have managed to compile GROMACS 4.6 linking it statically with these CUDA libraries and the MPI libraries (with BUILD_SHARED_LIBS=OFF and GMX_PREFER_STATIC_LIBS=ON) Unfortunately, when we tried to run a test job with the generated mdrun_mpi, GROMACS reported that it cannot detect any CUDA-enabled devices. It also reports 0.0 version for CUDA driver and runtime. Is the actual CUDA driver missing from the MAC OS X CUDA 5.0 Production Release that we installed? Do we need to install it from here: http://www.nvidia.com/object/cuda-mac-driver.html Or is something else that we need to do? Many thanks in advance. George Dr. George Patargias Postdoctoral Researcher Biomedical Research Foundation Academy of Athens 4, Soranou Ephessiou 115 27 Athens Greece Office: +302106597568 -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists Dr. 
George Patargias, Postdoctoral Researcher, Biomedical Research Foundation, Academy of Athens
Re: [gmx-users] GPU version of GROMACS 4.6 in MacOS cluster
The easiest solution is to kill MacOS and switch to Linux. ;-) Albert On 03/01/2013 06:03 PM, Szilárd Páll wrote: Hi George, As I said before, that just means that most probably the GPU driver is not compatible with the CUDA runtime (libcudart) that you installed with the CUDA toolkit. I've no clue about the Mac OS installers and releases; you'll have to do the research on that. Let us know if you have further (GROMACS-related) issues. Cheers, -- Szilárd
Re: [gmx-users] GPU running problem with GMX-4.6 beta2
On 12/17/2012 08:06 PM, Justin Lemkul wrote: It seems to me that the system is simply crashing like any other that becomes unstable. Does the simulation run at all on plain CPU? -Justin Thank you very much Justin, it's really helpful. I've checked that the structure after minization and found that there is some problem with my ligand. I regenerated the ligand toplogy with acpype, and resubmit for mimization and NVT. Now it goes well. So probably the problems comes from the incorrect ligand topolgy which make the system very unstable. best Albert -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU running problem with GMX-4.6 beta2
Hi, That unfortunately tell exactly about the reason why mdrun is stuck. Can you reproduce the issue on another machines or with different launch configurations? At which step does it get stuck (-stepout 1 can help)? Please try the following: - try running on a single GPU; - try running on CPUs only (-nb cpu and to match closer the GPU setup with -ntomp 12); - try running in GPU emulation mode with the GMX_EMULATE_GPU=1 env. var set (and to match closer the GPU setup with -ntomp 12) - provide a backtrace (using gdb). Cheers, -- Szilárd On Mon, Dec 17, 2012 at 5:37 PM, Albert mailmd2...@gmail.com wrote: hello: I am running GMX-4.6 beta2 GPU work in a 24 CPU core workstation with two GTX590, it stacked there without any output i.e the .xtc file size is always 0 after hours of running. Here is the md.log file I found: Using CUDA 8x8x8 non-bonded kernels Potential shift: LJ r^-12: 0.112 r^-6 0.335, Ewald 1.000e-05 Initialized non-bonded Ewald correction tables, spacing: 7.82e-04 size: 1536 Removing pbc first time Pinning to Hyper-Threading cores with 12 physical cores in a compute node There are 1 flexible constraints WARNING: step size for flexible constraining = 0 All flexible constraints will be rigid. Will try to keep all flexible constraints at their original length, but the lengths may exhibit some drift. Initializing Parallel LINear Constraint Solver Linking all bonded interactions to atoms There are 161872 inter charge-group exclusions, will use an extra communication step for exclusion forces for PME The initial number of communication pulses is: X 1 The initial domain decomposition cell size is: X 1.83 nm The maximum allowed distance for charge groups involved in interactions is: non-bonded interactions 1.200 nm (the following are initial values, they could change due to box deformation) two-body bonded interactions (-rdd) 1.200 nm multi-body bonded interactions (-rdd) 1.200 nm atoms separated by up to 5 constraints (-rcon) 1.826 nm When dynamic load balancing gets turned on, these settings will change to: The maximum number of communication pulses is: X 1 The minimum size for domain decomposition cells is 1.200 nm The requested allowed shrink of DD cells (option -dds) is: 0.80 The allowed shrink of domain decomposition cells is: X 0.66 The maximum allowed distance for charge groups involved in interactions is: non-bonded interactions 1.200 nm two-body bonded interactions (-rdd) 1.200 nm multi-body bonded interactions (-rdd) 1.200 nm atoms separated by up to 5 constraints (-rcon) 1.200 nm Making 1D domain decomposition grid 4 x 1 x 1, home cell index 0 0 0 Center of mass motion removal mode is Linear We have the following groups for center of mass motion removal: 0: Protein_LIG_POPC 1: Water_and_ions PLEASE READ AND CITE THE FOLLOWING REFERENCE G. Bussi, D. Donadio and M. Parrinello Canonical sampling through velocity rescaling J. Chem. Phys. 126 (2007) pp. 014101 --- Thank You --- THX -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/**mailman/listinfo/gmx-usershttp://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/** Support/Mailing_Lists/Searchhttp://www.gromacs.org/Support/Mailing_Lists/Searchbefore posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? 
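Szilárd's checklist translates into commands along these lines; a sketch for the 12-core node described above, with the file names taken from the thread:

mdrun_mpi -s nvt.tpr -deffnm nvt -gpu_id 0 -stepout 1            # single GPU, report every step
mdrun_mpi -s nvt.tpr -deffnm nvt -nb cpu -ntomp 12 -stepout 1    # CPU-only with a matching thread count
GMX_EMULATE_GPU=1 mdrun_mpi -s nvt.tpr -deffnm nvt -ntomp 12     # GPU emulation path
gdb --args mdrun_mpi -s nvt.tpr -deffnm nvt                      # then 'run' and 'bt' after the crash

The first variant that still hangs or segfaults narrows down whether the problem is in the GPU kernels, the CPU path, or the input itself (as turned out to be the case here, where a bad ligand topology was the real culprit).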
Re: [gmx-users] GPU running problem with GMX-4.6 beta2
hello: I reduced the GPU to two, and it said: Back Off! I just backed up nvt.log to ./#nvt.log.1# Reading file nvt.tpr, VERSION 4.6-dev-20121004-5d6c49d (single precision) NOTE: GPU(s) found, but the current simulation can not use GPUs To use a GPU, set the mdp option: cutoff-scheme = Verlet (for quick performance testing you can use the -testverlet option) Using 2 MPI processes 4 GPUs detected on host CUDANodeA: #0: NVIDIA GeForce GTX 590, compute cap.: 2.0, ECC: no, stat: compatible #1: NVIDIA GeForce GTX 590, compute cap.: 2.0, ECC: no, stat: compatible #2: NVIDIA GeForce GTX 590, compute cap.: 2.0, ECC: no, stat: compatible #3: NVIDIA GeForce GTX 590, compute cap.: 2.0, ECC: no, stat: compatible Making 1D domain decomposition 2 x 1 x 1 * WARNING * WARNING * WARNING * WARNING * WARNING * WARNING * We have just committed the new CPU detection code in this branch, and will commit new SSE/AVX kernels in a few days. However, this means that currently only the NxN kernels are accelerated! In the mean time, you might want to avoid production runs in 4.6. when I run it with single GPU, it produced lots of pdb file with prefix step, and then it crashed with messages: Wrote pdb files with previous and current coordinates Warning: 1-4 interaction between 4674 and 4706 at distance 434.986 which is larger than the 1-4 table size 2.200 nm These are ignored for the rest of the simulation This usually means your system is exploding, if not, you should increase table-extension in your mdp file or with user tables increase the table size [CUDANodeA:20659] *** Process received signal *** [CUDANodeA:20659] Signal: Segmentation fault (11) [CUDANodeA:20659] Signal code: Address not mapped (1) [CUDANodeA:20659] Failing at address: 0xc7aa00dc [CUDANodeA:20659] [ 0] /lib64/libpthread.so.0(+0xf2d0) [0x2ab25c76d2d0] [CUDANodeA:20659] [ 1] /opt/gromacs-4.6/lib/libmd_mpi.so.6(+0x11020f) [0x2ab259e0720f] [CUDANodeA:20659] [ 2] /opt/gromacs-4.6/lib/libmd_mpi.so.6(+0x111c94) [0x2ab259e08c94] [CUDANodeA:20659] [ 3] /opt/gromacs-4.6/lib/libmd_mpi.so.6(gmx_pme_do+0x1d2e) [0x2ab259e0cbae] [CUDANodeA:20659] [ 4] /opt/gromacs-4.6/lib/libmd_mpi.so.6(do_force_lowlevel+0x1eef) [0x2ab259ddd62f] [CUDANodeA:20659] [ 5] /opt/gromacs-4.6/lib/libmd_mpi.so.6(do_force_cutsGROUP+0x1495) [0x2ab259e72a45] [CUDANodeA:20659] [ 6] mdrun_mpi(do_md+0x8133) [0x4334c3] [CUDANodeA:20659] [ 7] mdrun_mpi(mdrunner+0x19e9) [0x411639] [CUDANodeA:20659] [ 8] mdrun_mpi(main+0x17db) [0x4373db] [CUDANodeA:20659] [ 9] /lib64/libc.so.6(__libc_start_main+0xfd) [0x2ab25c999bfd] [CUDANodeA:20659] [10] mdrun_mpi() [0x407f09] [CUDANodeA:20659] *** End of error message *** [1]Segmentation faultmdrun_mpi -v -s nvt.tpr -c nvt.gro -g nvt.log -x nvt.xtc here is the .mdp file I used: title = NVT equilibration for OR-POPC system define = -DPOSRES -DPOSRES_LIG ; Protein is position restrained (uses the posres.itp file information) ; Parameters describing the details of the NVT simulation protocol integrator = md; Algorithm (md = molecular dynamics [leap-frog integrator]; md-vv = md using velocity verlet; sd = stochastic dynamics) dt = 0.002 ; Time-step (ps) nsteps = 25; Number of steps to run (0.002 * 25 = 500 ps) ; Parameters controlling output writing nstxout = 0 ; Write coordinates to output .trr file every 2 ps nstvout = 0 ; Write velocities to output .trr file every 2 ps nstfout = 0 nstxtcout = 1000 nstenergy = 1000 ; Write energies to output .edr file every 2 ps nstlog = 1000 ; Write output to .log file every 2 ps ; Parameters describing neighbors searching and 
details about interaction calculations ns_type = grid ; Neighbor list search method (simple, grid) nstlist = 50; Neighbor list update frequency (after every given number of steps) rlist = 1.2 ; Neighbor list search cut-off distance (nm) rlistlong = 1.4 rcoulomb= 1.2 ; Short-range Coulombic interactions cut-off distance (nm) rvdw= 1.2 ; Short-range van der Waals cutoff distance (nm) pbc = xyz ; Direction in which to use Perodic Boundary Conditions (xyz, xy, no) cutoff-scheme =Verlet ; GPU running ; Parameters for treating bonded interactions continuation= no; Whether a fresh start or a continuation from a previous run (yes/no) constraint_algorithm = LINCS; Constraint algorithm (LINCS / SHAKE) constraints = all-bonds ; Which bonds/angles to constrain (all-bonds / hbonds / none / all-angles / h-angles) lincs_iter = 1 ; Number of iterations to correct for rotational lengthening in LINCS (related to accuracy) lincs_order = 4 ; Highest order in the expansion of the constraint
Re: [gmx-users] GPU running problem with GMX-4.6 beta2
Hi, How about GPU emulation or CPU-only runs? Also, please try setting the number of therads to 1 (-ntomp 1). -- Szilárd On Mon, Dec 17, 2012 at 6:01 PM, Albert mailmd2...@gmail.com wrote: hello: I reduced the GPU to two, and it said: Back Off! I just backed up nvt.log to ./#nvt.log.1# Reading file nvt.tpr, VERSION 4.6-dev-20121004-5d6c49d (single precision) NOTE: GPU(s) found, but the current simulation can not use GPUs To use a GPU, set the mdp option: cutoff-scheme = Verlet (for quick performance testing you can use the -testverlet option) Using 2 MPI processes 4 GPUs detected on host CUDANodeA: #0: NVIDIA GeForce GTX 590, compute cap.: 2.0, ECC: no, stat: compatible #1: NVIDIA GeForce GTX 590, compute cap.: 2.0, ECC: no, stat: compatible #2: NVIDIA GeForce GTX 590, compute cap.: 2.0, ECC: no, stat: compatible #3: NVIDIA GeForce GTX 590, compute cap.: 2.0, ECC: no, stat: compatible Making 1D domain decomposition 2 x 1 x 1 * WARNING * WARNING * WARNING * WARNING * WARNING * WARNING * We have just committed the new CPU detection code in this branch, and will commit new SSE/AVX kernels in a few days. However, this means that currently only the NxN kernels are accelerated! In the mean time, you might want to avoid production runs in 4.6. when I run it with single GPU, it produced lots of pdb file with prefix step, and then it crashed with messages: Wrote pdb files with previous and current coordinates Warning: 1-4 interaction between 4674 and 4706 at distance 434.986 which is larger than the 1-4 table size 2.200 nm These are ignored for the rest of the simulation This usually means your system is exploding, if not, you should increase table-extension in your mdp file or with user tables increase the table size [CUDANodeA:20659] *** Process received signal *** [CUDANodeA:20659] Signal: Segmentation fault (11) [CUDANodeA:20659] Signal code: Address not mapped (1) [CUDANodeA:20659] Failing at address: 0xc7aa00dc [CUDANodeA:20659] [ 0] /lib64/libpthread.so.0(+**0xf2d0) [0x2ab25c76d2d0] [CUDANodeA:20659] [ 1] /opt/gromacs-4.6/lib/libmd_**mpi.so.6(+0x11020f) [0x2ab259e0720f] [CUDANodeA:20659] [ 2] /opt/gromacs-4.6/lib/libmd_**mpi.so.6(+0x111c94) [0x2ab259e08c94] [CUDANodeA:20659] [ 3] /opt/gromacs-4.6/lib/libmd_**mpi.so.6(gmx_pme_do+0x1d2e) [0x2ab259e0cbae] [CUDANodeA:20659] [ 4] /opt/gromacs-4.6/lib/libmd_** mpi.so.6(do_force_lowlevel+**0x1eef) [0x2ab259ddd62f] [CUDANodeA:20659] [ 5] /opt/gromacs-4.6/lib/libmd_** mpi.so.6(do_force_cutsGROUP+**0x1495) [0x2ab259e72a45] [CUDANodeA:20659] [ 6] mdrun_mpi(do_md+0x8133) [0x4334c3] [CUDANodeA:20659] [ 7] mdrun_mpi(mdrunner+0x19e9) [0x411639] [CUDANodeA:20659] [ 8] mdrun_mpi(main+0x17db) [0x4373db] [CUDANodeA:20659] [ 9] /lib64/libc.so.6(__libc_start_**main+0xfd) [0x2ab25c999bfd] [CUDANodeA:20659] [10] mdrun_mpi() [0x407f09] [CUDANodeA:20659] *** End of error message *** [1]Segmentation faultmdrun_mpi -v -s nvt.tpr -c nvt.gro -g nvt.log -x nvt.xtc here is the .mdp file I used: title = NVT equilibration for OR-POPC system define = -DPOSRES -DPOSRES_LIG ; Protein is position restrained (uses the posres.itp file information) ; Parameters describing the details of the NVT simulation protocol integrator = md; Algorithm (md = molecular dynamics [leap-frog integrator]; md-vv = md using velocity verlet; sd = stochastic dynamics) dt = 0.002 ; Time-step (ps) nsteps = 25; Number of steps to run (0.002 * 25 = 500 ps) ; Parameters controlling output writing nstxout = 0 ; Write coordinates to output .trr file every 2 ps nstvout = 0 ; Write velocities to output .trr 
file every 2 ps nstfout = 0 nstxtcout = 1000 nstenergy = 1000 ; Write energies to output .edr file every 2 ps nstlog = 1000 ; Write output to .log file every 2 ps ; Parameters describing neighbors searching and details about interaction calculations ns_type = grid ; Neighbor list search method (simple, grid) nstlist = 50; Neighbor list update frequency (after every given number of steps) rlist = 1.2 ; Neighbor list search cut-off distance (nm) rlistlong = 1.4 rcoulomb= 1.2 ; Short-range Coulombic interactions cut-off distance (nm) rvdw= 1.2 ; Short-range van der Waals cutoff distance (nm) pbc = xyz ; Direction in which to use Perodic Boundary Conditions (xyz, xy, no) cutoff-scheme =Verlet ; GPU running ; Parameters for treating bonded interactions continuation= no; Whether a fresh start or a continuation from a previous run (yes/no) constraint_algorithm = LINCS; Constraint algorithm (LINCS / SHAKE) constraints = all-bonds ; Which bonds/angles
Re: [gmx-users] GPU running problem with GMX-4.6 beta2
On 12/17/2012 06:08 PM, Szilárd Páll wrote: Hi, How about GPU emulation or CPU-only runs? Also, please try setting the number of therads to 1 (-ntomp 1). -- Szilárd hello: I am running in GPU emulation mode with the GMX_EMULATE_GPU=1 env. var set (and to match closer the GPU setup with -ntomp 12), it failed with log: Back Off! I just backed up step33b.pdb to ./#step33b.pdb.2# Back Off! I just backed up step33c.pdb to ./#step33c.pdb.2# Wrote pdb files with previous and current coordinates [CUDANodeA:20753] *** Process received signal *** [CUDANodeA:20753] Signal: Segmentation fault (11) [CUDANodeA:20753] Signal code: Address not mapped (1) [CUDANodeA:20753] Failing at address: 0x106ae6a00 [1]Segmentation faultmdrun_mpi -v -s nvt.tpr -c nvt.gro -g nvt.log -x nvt.xtc -ntomp 12 I also tried , number of therads to 1 (-ntomp 1), it failed with following messages: Back Off! I just backed up step33c.pdb to ./#step33c.pdb.1# Wrote pdb files with previous and current coordinates [CUDANodeA:20740] *** Process received signal *** [CUDANodeA:20740] Signal: Segmentation fault (11) [CUDANodeA:20740] Signal code: Address not mapped (1) [CUDANodeA:20740] Failing at address: 0x1f74a96ec [CUDANodeA:20740] [ 0] /lib64/libpthread.so.0(+0xf2d0) [0x2b351d3022d0] [CUDANodeA:20740] [ 1] /opt/gromacs-4.6/lib/libmd_mpi.so.6(+0x11020f) [0x2b351a99c20f] [CUDANodeA:20740] [ 2] /opt/gromacs-4.6/lib/libmd_mpi.so.6(+0x111c94) [0x2b351a99dc94] [CUDANodeA:20740] [ 3] /opt/gromacs-4.6/lib/libmd_mpi.so.6(gmx_pme_do+0x1d2e) [0x2b351a9a1bae] [CUDANodeA:20740] [ 4] /opt/gromacs-4.6/lib/libmd_mpi.so.6(do_force_lowlevel+0x1eef) [0x2b351a97262f] [CUDANodeA:20740] [ 5] /opt/gromacs-4.6/lib/libmd_mpi.so.6(do_force_cutsVERLET+0x1756) [0x2b351aa04736] [CUDANodeA:20740] [ 6] /opt/gromacs-4.6/lib/libmd_mpi.so.6(do_force+0x3bf) [0x2b351aa0a0df] [CUDANodeA:20740] [ 7] mdrun_mpi(do_md+0x8133) [0x4334c3] [CUDANodeA:20740] [ 8] mdrun_mpi(mdrunner+0x19e9) [0x411639] [CUDANodeA:20740] [ 9] mdrun_mpi(main+0x17db) [0x4373db] [CUDANodeA:20740] [10] /lib64/libc.so.6(__libc_start_main+0xfd) [0x2b351d52ebfd] [CUDANodeA:20740] [11] mdrun_mpi() [0x407f09] [CUDANodeA:20740] *** End of error message *** [1]Segmentation faultmdrun_mpi -v -s nvt.tpr -c nvt.gro -g nvt.log -x nvt.xtc -ntomp 1 -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU running problem with GMX-4.6 beta2
Hi Albert, Thanks for the testing. Last questions. - What version are you using? Is it beta2 release or latest git? if it's the former, getting the latest git might help if... - (do) you happen to be using GMX_GPU_ACCELERATION=None (you shouldn't!)? A bug triggered only with this setting has been fixed recently. If the above doesn't help, please file a bug report and attach a tpr so we can reproduce. Cheers, -- Szilárd On Mon, Dec 17, 2012 at 6:21 PM, Albert mailmd2...@gmail.com wrote: On 12/17/2012 06:08 PM, Szilárd Páll wrote: Hi, How about GPU emulation or CPU-only runs? Also, please try setting the number of therads to 1 (-ntomp 1). -- Szilárd hello: I am running in GPU emulation mode with the GMX_EMULATE_GPU=1 env. var set (and to match closer the GPU setup with -ntomp 12), it failed with log: Back Off! I just backed up step33b.pdb to ./#step33b.pdb.2# Back Off! I just backed up step33c.pdb to ./#step33c.pdb.2# Wrote pdb files with previous and current coordinates [CUDANodeA:20753] *** Process received signal *** [CUDANodeA:20753] Signal: Segmentation fault (11) [CUDANodeA:20753] Signal code: Address not mapped (1) [CUDANodeA:20753] Failing at address: 0x106ae6a00 [1]Segmentation faultmdrun_mpi -v -s nvt.tpr -c nvt.gro -g nvt.log -x nvt.xtc -ntomp 12 I also tried , number of therads to 1 (-ntomp 1), it failed with following messages: Back Off! I just backed up step33c.pdb to ./#step33c.pdb.1# Wrote pdb files with previous and current coordinates [CUDANodeA:20740] *** Process received signal *** [CUDANodeA:20740] Signal: Segmentation fault (11) [CUDANodeA:20740] Signal code: Address not mapped (1) [CUDANodeA:20740] Failing at address: 0x1f74a96ec [CUDANodeA:20740] [ 0] /lib64/libpthread.so.0(+**0xf2d0) [0x2b351d3022d0] [CUDANodeA:20740] [ 1] /opt/gromacs-4.6/lib/libmd_**mpi.so.6(+0x11020f) [0x2b351a99c20f] [CUDANodeA:20740] [ 2] /opt/gromacs-4.6/lib/libmd_**mpi.so.6(+0x111c94) [0x2b351a99dc94] [CUDANodeA:20740] [ 3] /opt/gromacs-4.6/lib/libmd_**mpi.so.6(gmx_pme_do+0x1d2e) [0x2b351a9a1bae] [CUDANodeA:20740] [ 4] /opt/gromacs-4.6/lib/libmd_** mpi.so.6(do_force_lowlevel+**0x1eef) [0x2b351a97262f] [CUDANodeA:20740] [ 5] /opt/gromacs-4.6/lib/libmd_** mpi.so.6(do_force_cutsVERLET+**0x1756) [0x2b351aa04736] [CUDANodeA:20740] [ 6] /opt/gromacs-4.6/lib/libmd_**mpi.so.6(do_force+0x3bf) [0x2b351aa0a0df] [CUDANodeA:20740] [ 7] mdrun_mpi(do_md+0x8133) [0x4334c3] [CUDANodeA:20740] [ 8] mdrun_mpi(mdrunner+0x19e9) [0x411639] [CUDANodeA:20740] [ 9] mdrun_mpi(main+0x17db) [0x4373db] [CUDANodeA:20740] [10] /lib64/libc.so.6(__libc_start_**main+0xfd) [0x2b351d52ebfd] [CUDANodeA:20740] [11] mdrun_mpi() [0x407f09] [CUDANodeA:20740] *** End of error message *** [1]Segmentation faultmdrun_mpi -v -s nvt.tpr -c nvt.gro -g nvt.log -x nvt.xtc -ntomp 1 -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/**mailman/listinfo/gmx-usershttp://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/** Support/Mailing_Lists/Searchhttp://www.gromacs.org/Support/Mailing_Lists/Searchbefore posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/**Support/Mailing_Listshttp://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! 
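Szilárd's two follow-ups (use the current 4.6 code, and do not build with GMX_GPU_ACCELERATION=None) amount to a clean rebuild before filing a bug. A sketch, with the repository URL and branch name as assumptions about the 2012-era layout:

git clone git://git.gromacs.org/gromacs.git && cd gromacs && git checkout release-4-6
mkdir build && cd build
cmake .. -DGMX_GPU=ON -DGMX_MPI=ON    # leave the GPU acceleration setting at its default
make -j 12 && make install

The rebuild simply rules out the already-fixed bug he mentions; if the segfault survives it, the next step is the bug report with the .tpr attached.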
Re: [gmx-users] GPU running problem with GMX-4.6 beta2
On Mon, Dec 17, 2012 at 6:01 PM, Albert mailmd2...@gmail.com wrote: hello: I reduced the GPU to two, and it said: Back Off! I just backed up nvt.log to ./#nvt.log.1# Reading file nvt.tpr, VERSION 4.6-dev-20121004-5d6c49d (single precision) This is a development version from October 1. Please use the mdrun version you think you're using :-) Mark -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU running problem with GMX-4.6 beta2
On Mon, Dec 17, 2012 at 7:56 PM, Mark Abraham mark.j.abra...@gmail.comwrote: On Mon, Dec 17, 2012 at 6:01 PM, Albert mailmd2...@gmail.com wrote: hello: I reduced the GPU to two, and it said: Back Off! I just backed up nvt.log to ./#nvt.log.1# Reading file nvt.tpr, VERSION 4.6-dev-20121004-5d6c49d (single precision) This is a development version from October 1. Please use the mdrun version you think you're using :-) Thanks Mark, good catch! -- Szilárd Mark -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU running problem with GMX-4.6 beta2
Well, that's one of the log files. I've tried VERSION 4.6-dev-20121004-5d6c49d, VERSION 4.6-beta1, VERSION 4.6-beta2, and the latest 5.0 from git; the problems are the same. :-( On 12/17/2012 07:56 PM, Mark Abraham wrote: On Mon, Dec 17, 2012 at 6:01 PM, Albert mailmd2...@gmail.com wrote: hello: I reduced the GPU to two, and it said: Back Off! I just backed up nvt.log to ./#nvt.log.1# Reading file nvt.tpr, VERSION 4.6-dev-20121004-5d6c49d (single precision) This is a development version from October 1. Please use the mdrun version you think you're using :-) Mark
Re: [gmx-users] GPU running problem with GMX-4.6 beta2
On 12/17/12 2:03 PM, Albert wrote: well, that's one of the log files. I've tried both VERSION 4.6-dev-20121004-5d6c49d VERSION 4.6-beta1 VERSION 4.6-beta2 and the latest 5.0 by git. the problems are the same.:-( It seems to me that the system is simply crashing like any other that becomes unstable. Does the simulation run at all on plain CPU? -Justin On 12/17/2012 07:56 PM, Mark Abraham wrote: On Mon, Dec 17, 2012 at 6:01 PM, Albertmailmd2...@gmail.com wrote: hello: I reduced the GPU to two, and it said: Back Off! I just backed up nvt.log to ./#nvt.log.1# Reading file nvt.tpr, VERSION 4.6-dev-20121004-5d6c49d (single precision) This is a development version from October 1. Please use the mdrun version you think you're using:-) Mark -- -- Justin A. Lemkul, Ph.D. Research Scientist Department of Biochemistry Virginia Tech Blacksburg, VA jalemkul[at]vt.edu | (540) 231-9080 http://www.bevanlab.biochem.vt.edu/Pages/Personal/justin -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU warnings
Hi Thomas, It looks like some gcc 4.7-s don't work with CUDA, although I've been using various Ubuntu/Linaro versions, most recently 4.7.2 and had no issues whatsoever. Some people seem to have bumped into the same problem (see http://goo.gl/1onBz or http://goo.gl/JEnuk) and the suggested fix is to put #undef _GLIBCXX_ATOMIC_BUILTINS #undef _GLIBCXX_USE_INT128 in a header and pre-include it for nvcc by calling it like this: nvcc --pre-include undef_atomics_int128.h Cheers, -- Szilárd On Sun, Dec 9, 2012 at 12:18 PM, Thomas Evangelidis teva...@gmail.comwrote: gcc 4.7.2 is not supported by any CUDA version. I suggest that you just fix it by editing the include/host_config.h and changing the version check macro (line 82 AFAIK). I've never had real problems with using new and officially not supported gcc-s, the version check is more of a promise from NVIDIA that we've tested thoroughly internally and we more or less vouch for thins combination. Cheers, -- Szilárd PS: Disclamer: I don't take responsibility if your machine goes up in flames! ;) Hi Szilárd,, I tried to compile gromacs-4.6beta1, is this the version you suggested? If not, please indicate how to download the source cause I am confused with all these development versions. Anyway, this is the error I get with 4.6beta1, gcc 4.7.2 and cuda 5: [ 0%] Building NVCC (Device) object src/gromacs/gmxlib/cuda_tools/CMakeFiles/cuda_tools.dir//./cuda_tools_generated_cudautils.cu.o /usr/lib/gcc/x86_64-redhat-linux/4.7.2/../../../../include/c++/4.7.2/ext/atomicity.h(48): error: identifier __atomic_fetch_add is undefined /usr/lib/gcc/x86_64-redhat-linux/4.7.2/../../../../include/c++/4.7.2/ext/atomicity.h(52): error: identifier __atomic_fetch_add is undefined 2 errors detected in the compilation of /tmp/tmpxft_2394_-9_cudautils.compute_30.cpp1.ii. CMake Error at cuda_tools_generated_cudautils.cu.o.cmake:252 (message): Error generating file /home/thomas/Programs/gromacs-4.6-beta1_gnu_cuda5_build/src/gromacs/gmxlib/cuda_tools/CMakeFiles/cuda_tools.dir//./cuda_tools_generated_cudautils.cu.o gmake[3]: *** [src/gromacs/gmxlib/cuda_tools/CMakeFiles/cuda_tools.dir/./cuda_tools_generated_cudautils.cu.o] Error 1 gmake[2]: *** [src/gromacs/gmxlib/cuda_tools/CMakeFiles/cuda_tools.dir/all] Error 2 gmake[1]: *** [src/programs/mdrun/CMakeFiles/mdrun.dir/rule] Error 2 gmake: *** [mdrun] Error 2 Unless I am missing something, cuda 5 does not support gcc 4.7.2. Thomas -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
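Concretely, the workaround is a two-line header plus one extra nvcc flag. A sketch of how it might be wired into a GROMACS 4.6 CMake build; the file location and the use of CUDA_NVCC_FLAGS are assumptions, not from the thread:

// undef_atomics_int128.h -- pre-included to work around the gcc 4.7 / CUDA header clash
#undef _GLIBCXX_ATOMIC_BUILTINS
#undef _GLIBCXX_USE_INT128

and at configure time:

cmake .. -DGMX_GPU=ON -DCUDA_NVCC_FLAGS="--pre-include /path/to/undef_atomics_int128.h"

CUDA_NVCC_FLAGS goes onto every nvcc command line via FindCUDA, so the header is pre-included only for device code while the host gcc build is untouched.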
Re: [gmx-users] GPU warnings
Am 11.12.2012 16:04, schrieb Szilárd Páll: It looks like some gcc 4.7-s don't work with CUDA, although I've been using various Ubuntu/Linaro versions, most recently 4.7.2 and had no issues whatsoever. Some people seem to have bumped into the same problem (see http://goo.gl/1onBz or http://goo.gl/JEnuk) and the suggested fix is to put #undef _GLIBCXX_ATOMIC_BUILTINS #undef _GLIBCXX_USE_INT128 in a header and pre-include it for nvcc by calling it like this: nvcc --pre-include undef_atomics_int128.h The same problem occurs in SuSE 12.2/x64 with it's default 4.7.2 (20120920). Another possible fix on SuSE 12.2: install the (older) gcc repository from 12.1/x64 (with lower priority), install the gcc/g++ 4.6 from there as an alternative compiler and select the active gcc through the update-alternatives --config gcc mechanism. This works very well. Regards M. -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
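The same can be done per build instead of system-wide by pointing CMake at the older compiler; a sketch, assuming the SuSE 12.1 packages install the binaries as gcc-4.6 and g++-4.6 and that the FindCUDA version in use already knows CUDA_HOST_COMPILER:

CC=gcc-4.6 CXX=g++-4.6 cmake .. -DGMX_GPU=ON -DCUDA_HOST_COMPILER=$(which gcc-4.6)

That keeps CUDA 5.0 away from gcc 4.7 for one build tree without touching update-alternatives.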
Re: [gmx-users] GPU warnings
On Tue, Dec 11, 2012 at 6:49 PM, Mirco Wahab mirco.wa...@chemie.tu-freiberg.de wrote: Am 11.12.2012 16:04, schrieb Szilárd Páll: It looks like some gcc 4.7-s don't work with CUDA, although I've been using various Ubuntu/Linaro versions, most recently 4.7.2 and had no issues whatsoever. Some people seem to have bumped into the same problem (see http://goo.gl/1onBz or http://goo.gl/JEnuk) and the suggested fix is to put #undef _GLIBCXX_ATOMIC_BUILTINS #undef _GLIBCXX_USE_INT128 in a header and pre-include it for nvcc by calling it like this: nvcc --pre-include undef_atomics_int128.h The same problem occurs in SuSE 12.2/x64 with it's default 4.7.2 (20120920). Another possible fix on SuSE 12.2: install the (older) gcc repository from 12.1/x64 (with lower priority), install the gcc/g++ 4.6 from there as an alternative compiler and select the active gcc through the update-alternatives --config gcc mechanism. This works very well. Thanks for the info. The Ubuntu/Linaro version must have a fix for this. Unfortunately, we can't do much about it and gcc 4.7 is anyway blocked by the CUDA 5.0 headers. FYI: Verlet scheme nonbonded kernels (and probably the group scheme as well), especially with AVX, can be quite a bit slower with older gcc versions. I find it really annoying (and stupid) that NVIDIA did not fix their compiler to work with gcc 4.7 which had already been out for almost a half a year at the time of the CUDA 5.0 release. -- Szilárd Regards M. -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/**mailman/listinfo/gmx-usershttp://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/** Support/Mailing_Lists/Searchhttp://www.gromacs.org/Support/Mailing_Lists/Searchbefore posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/**Support/Mailing_Listshttp://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU compatibility
Correct, C1060 does not have the CUDA 2.0 compute capability required for GROMACS 4.6. We will not have the ability to support GPU cards of lower capability in the future. Unfortunately, your only GROMACS options are probably to use the OpenMM functionality in 4.5.x (which is still present in 4.6, works as far as we know, but is not in our regular test suite and the feature is probably headed for deprecation). This will not perform as well as the new native GPU acceleration, and supports a smaller range of features, but might be better than wasting the GPUs. Regards, Mark On Mon, Dec 10, 2012 at 7:50 AM, Cara Kreck cara_...@hotmail.com wrote: Hi, We've got a GPU cluster in our group and have really been looking forward to running gromacs on it with full functionality. Unfortunately, it looks like our NVIDIA Tesla C1060 cards aren't supported by the 4.6 beta. I was just wondering if there was any chance that they would be supported in the full version? These cards are only a couple of years old now and were bought specifically for running MD. Thanks, Cara -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU warnings
gcc 4.7.2 is not supported by any CUDA version. I suggest that you just fix it by editing the include/host_config.h and changing the version check macro (line 82 AFAIK). I've never had real problems with using new and officially not supported gcc-s, the version check is more of a promise from NVIDIA that we've tested thoroughly internally and we more or less vouch for thins combination. Cheers, -- Szilárd PS: Disclamer: I don't take responsibility if your machine goes up in flames! ;) Hi Szilárd,, I tried to compile gromacs-4.6beta1, is this the version you suggested? If not, please indicate how to download the source cause I am confused with all these development versions. Anyway, this is the error I get with 4.6beta1, gcc 4.7.2 and cuda 5: [ 0%] Building NVCC (Device) object src/gromacs/gmxlib/cuda_tools/CMakeFiles/cuda_tools.dir//./cuda_tools_generated_cudautils.cu.o /usr/lib/gcc/x86_64-redhat-linux/4.7.2/../../../../include/c++/4.7.2/ext/atomicity.h(48): error: identifier __atomic_fetch_add is undefined /usr/lib/gcc/x86_64-redhat-linux/4.7.2/../../../../include/c++/4.7.2/ext/atomicity.h(52): error: identifier __atomic_fetch_add is undefined 2 errors detected in the compilation of /tmp/tmpxft_2394_-9_cudautils.compute_30.cpp1.ii. CMake Error at cuda_tools_generated_cudautils.cu.o.cmake:252 (message): Error generating file /home/thomas/Programs/gromacs-4.6-beta1_gnu_cuda5_build/src/gromacs/gmxlib/cuda_tools/CMakeFiles/cuda_tools.dir//./cuda_tools_generated_cudautils.cu.o gmake[3]: *** [src/gromacs/gmxlib/cuda_tools/CMakeFiles/cuda_tools.dir/./cuda_tools_generated_cudautils.cu.o] Error 1 gmake[2]: *** [src/gromacs/gmxlib/cuda_tools/CMakeFiles/cuda_tools.dir/all] Error 2 gmake[1]: *** [src/programs/mdrun/CMakeFiles/mdrun.dir/rule] Error 2 gmake: *** [mdrun] Error 2 Unless I am missing something, cuda 5 does not support gcc 4.7.2. Thomas -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
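The version-check edit described above is a one-line change in the CUDA toolkit's include/host_config.h. Roughly what the guard looks like in CUDA 5.0 and how it can be relaxed (exact wording and line numbers vary between toolkit versions, so treat this as a sketch):

/* original guard: reject anything newer than gcc 4.6 */
#if __GNUC__ > 4 || (__GNUC__ == 4 && __GNUC_MINOR__ > 6)
#error -- unsupported GNU version! gcc 4.7 and up are not supported!
#endif

/* relaxed guard: accept gcc 4.7 at your own risk */
#if __GNUC__ > 4 || (__GNUC__ == 4 && __GNUC_MINOR__ > 7)

As the disclaimer in the thread says, this only silences the check; the combination stays unsupported, and the __atomic_fetch_add errors seen with 4.6beta1 may still need the --pre-include workaround from the other thread.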
Re: [gmx-users] GPU warnings
On Sun, Nov 25, 2012 at 8:47 PM, Thomas Evangelidis teva...@gmail.com wrote: Hi Szilárd, I was able to run code compiled with icc 13 on Fedora 17, but as I don't have Intel Compiler v13 on this machine I can't check it now. Please check if it works for you with gcc 4.7.2 (which is the default) and let me know if you succeed. The performance difference between icc and gcc on your processor should be negligible with GPU runs and at most 5-10% with CPU-only runs. As the issue is quite annoying, I'll try to have a look later, probably after the beta is out. gcc 4.7.2 is not supported by any CUDA version. I suggest that you just fix it by editing include/host_config.h and changing the version check macro (line 82, AFAIK). I've never had real problems using new, officially unsupported gcc versions; the version check is more of a promise from NVIDIA along the lines of "we've tested this combination thoroughly internally and we more or less vouch for it." Cheers, -- Szilárd PS: Disclaimer: I don't take responsibility if your machine goes up in flames! ;) Thomas -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU warnings
Hi Szilárd, I was able to run code compiled with icc 13 on Fedora 17, but as I don't have Intel Compiler v13 on this machine I can't check it now. Please check if it works for you with gcc 4.7.2 (which is the default) and let me know if you succeed. The performance difference between icc and gcc on your processor should be negligible with GPU runs and at most 5-10% with CPU-only runs. As the issue is quite annoying, I'll try to have a look later, probably after the beta is out. gcc 4.7.2 is not supported by any CUDA version. Thomas -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU warnings
On Mon, Nov 19, 2012 at 6:25 PM, Szilárd Páll szilard.p...@cbr.su.sewrote: On Mon, Nov 19, 2012 at 4:09 PM, Thomas Evangelidis teva...@gmail.comwrote: Hi Szilárd, I compiled with the Intel compilers, not gcc. In case I am missing something, these are the versions I have: Indeed, I see it now in the log file. Let me try with icc 13 and will get back to you. I was able to run code compiled with icc 13 on Fedora 17, but as I don't have Intel Compiler v13 on this machine I can't check it now. Please check if it works for you with gcc 4.7.2 (which is the default) and let me know if you succeed. The performance difference between icc and gcc on your processor should be negligible with GPU runs and at most 5-10% with CPU-only runs. As the issue is quite annoying, I'll try to have a look later, probably after the beta is out. Cheers, Sz. glibc.i6862.15-57.fc17 @updates glibc.x86_64 2.15-57.fc17 @updates glibc-common.x86_64 2.15-57.fc17 @updates glibc-devel.i686 2.15-57.fc17 @updates glibc-devel.x86_642.15-57.fc17 @updates glibc-headers.x86_64 2.15-57.fc17 @updates gcc.x86_644.7.2-2.fc17 @updates gcc-c++.x86_644.7.2-2.fc17 @updates gcc-gfortran.x86_64 4.7.2-2.fc17 @updates libgcc.i686 4.7.2-2.fc17 @updates libgcc.x86_64 4.7.2-2.fc17 @updates Thomas On 19 November 2012 16:57, Szilárd Páll szilard.p...@cbr.su.se wrote: Thomas Albert, We are unable to reproduce the issue on FC 17 with glibc 2.15-58 and gcc 4.7.2. Please try to update your packages (you should have updates available for glibc), try recompiling with the latest 4.6 code and report back whether you succeed. Cheers, -- Szilárd On Fri, Nov 16, 2012 at 4:31 PM, Szilárd Páll szilard.p...@cbr.su.se wrote: Hi Albert, Apologies for hijacking your thread. Do you happen to have Fedora 17 as well? -- Szilárd On Sun, Nov 4, 2012 at 10:55 AM, Albert mailmd2...@gmail.com wrote: hello: I am running Gromacs 4.6 GPU on a workstation with two GTX 660 Ti (2 x 1344 CUDA cores), and I got the following warnings: thank you very much. ---**messages--** - WARNING: On node 0: oversubscribing the available 0 logical CPU cores per node with 2 MPI processes. This will cause considerable performance loss! 2 GPUs detected on host boreas: #0: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible #1: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible 2 GPUs auto-selected to be used for this run: #0, #1 Using CUDA 8x8x8 non-bonded kernels Making 1D domain decomposition 1 x 2 x 1 * WARNING * WARNING * WARNING * WARNING * WARNING * WARNING * We have just committed the new CPU detection code in this branch, and will commit new SSE/AVX kernels in a few days. However, this means that currently only the NxN kernels are accelerated! In the mean time, you might want to avoid production runs in 4.6. -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/**mailman/listinfo/gmx-users http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/** Support/Mailing_Lists/Search http://www.gromacs.org/Support/Mailing_Lists/Searchbefore posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/**Support/Mailing_Lists http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! 
* Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- == Thomas Evangelidis PhD student University of Athens Faculty of Pharmacy Department of Pharmaceutical Chemistry Panepistimioupoli-Zografou 157 71 Athens GREECE email: tev...@pharm.uoa.gr teva...@gmail.com website: https://sites.google.com/site/thomasevangelidishomepage/ -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list.
Re: [gmx-users] GPU warnings
Thomas Albert, We are unable to reproduce the issue on FC 17 with glibc 2.15-58 and gcc 4.7.2. Please try to update your packages (you should have updates available for glibc), try recompiling with the latest 4.6 code and report back whether you succeed. Cheers, -- Szilárd On Fri, Nov 16, 2012 at 4:31 PM, Szilárd Páll szilard.p...@cbr.su.sewrote: Hi Albert, Apologies for hijacking your thread. Do you happen to have Fedora 17 as well? -- Szilárd On Sun, Nov 4, 2012 at 10:55 AM, Albert mailmd2...@gmail.com wrote: hello: I am running Gromacs 4.6 GPU on a workstation with two GTX 660 Ti (2 x 1344 CUDA cores), and I got the following warnings: thank you very much. ---**messages--** - WARNING: On node 0: oversubscribing the available 0 logical CPU cores per node with 2 MPI processes. This will cause considerable performance loss! 2 GPUs detected on host boreas: #0: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible #1: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible 2 GPUs auto-selected to be used for this run: #0, #1 Using CUDA 8x8x8 non-bonded kernels Making 1D domain decomposition 1 x 2 x 1 * WARNING * WARNING * WARNING * WARNING * WARNING * WARNING * We have just committed the new CPU detection code in this branch, and will commit new SSE/AVX kernels in a few days. However, this means that currently only the NxN kernels are accelerated! In the mean time, you might want to avoid production runs in 4.6. -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/**mailman/listinfo/gmx-usershttp://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/** Support/Mailing_Lists/Searchhttp://www.gromacs.org/Support/Mailing_Lists/Searchbefore posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/**Support/Mailing_Listshttp://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU warnings
Hi Szilárd, I compiled with the Intel compilers, not gcc. In case I am missing something, these are the versions I have: glibc.i6862.15-57.fc17 @updates glibc.x86_64 2.15-57.fc17 @updates glibc-common.x86_64 2.15-57.fc17 @updates glibc-devel.i686 2.15-57.fc17 @updates glibc-devel.x86_642.15-57.fc17 @updates glibc-headers.x86_64 2.15-57.fc17 @updates gcc.x86_644.7.2-2.fc17 @updates gcc-c++.x86_644.7.2-2.fc17 @updates gcc-gfortran.x86_64 4.7.2-2.fc17 @updates libgcc.i686 4.7.2-2.fc17 @updates libgcc.x86_64 4.7.2-2.fc17 @updates Thomas On 19 November 2012 16:57, Szilárd Páll szilard.p...@cbr.su.se wrote: Thomas Albert, We are unable to reproduce the issue on FC 17 with glibc 2.15-58 and gcc 4.7.2. Please try to update your packages (you should have updates available for glibc), try recompiling with the latest 4.6 code and report back whether you succeed. Cheers, -- Szilárd On Fri, Nov 16, 2012 at 4:31 PM, Szilárd Páll szilard.p...@cbr.su.se wrote: Hi Albert, Apologies for hijacking your thread. Do you happen to have Fedora 17 as well? -- Szilárd On Sun, Nov 4, 2012 at 10:55 AM, Albert mailmd2...@gmail.com wrote: hello: I am running Gromacs 4.6 GPU on a workstation with two GTX 660 Ti (2 x 1344 CUDA cores), and I got the following warnings: thank you very much. ---**messages--** - WARNING: On node 0: oversubscribing the available 0 logical CPU cores per node with 2 MPI processes. This will cause considerable performance loss! 2 GPUs detected on host boreas: #0: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible #1: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible 2 GPUs auto-selected to be used for this run: #0, #1 Using CUDA 8x8x8 non-bonded kernels Making 1D domain decomposition 1 x 2 x 1 * WARNING * WARNING * WARNING * WARNING * WARNING * WARNING * We have just committed the new CPU detection code in this branch, and will commit new SSE/AVX kernels in a few days. However, this means that currently only the NxN kernels are accelerated! In the mean time, you might want to avoid production runs in 4.6. -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/**mailman/listinfo/gmx-users http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/** Support/Mailing_Lists/Search http://www.gromacs.org/Support/Mailing_Lists/Searchbefore posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/**Support/Mailing_Lists http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- == Thomas Evangelidis PhD student University of Athens Faculty of Pharmacy Department of Pharmaceutical Chemistry Panepistimioupoli-Zografou 157 71 Athens GREECE email: tev...@pharm.uoa.gr teva...@gmail.com website: https://sites.google.com/site/thomasevangelidishomepage/ -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. 
Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU warnings
On Mon, Nov 19, 2012 at 4:09 PM, Thomas Evangelidis teva...@gmail.comwrote: Hi Szilárd, I compiled with the Intel compilers, not gcc. In case I am missing something, these are the versions I have: Indeed, I see it now in the log file. Let me try with icc 13 and will get back to you. glibc.i6862.15-57.fc17 @updates glibc.x86_64 2.15-57.fc17 @updates glibc-common.x86_64 2.15-57.fc17 @updates glibc-devel.i686 2.15-57.fc17 @updates glibc-devel.x86_642.15-57.fc17 @updates glibc-headers.x86_64 2.15-57.fc17 @updates gcc.x86_644.7.2-2.fc17 @updates gcc-c++.x86_644.7.2-2.fc17 @updates gcc-gfortran.x86_64 4.7.2-2.fc17 @updates libgcc.i686 4.7.2-2.fc17 @updates libgcc.x86_64 4.7.2-2.fc17 @updates Thomas On 19 November 2012 16:57, Szilárd Páll szilard.p...@cbr.su.se wrote: Thomas Albert, We are unable to reproduce the issue on FC 17 with glibc 2.15-58 and gcc 4.7.2. Please try to update your packages (you should have updates available for glibc), try recompiling with the latest 4.6 code and report back whether you succeed. Cheers, -- Szilárd On Fri, Nov 16, 2012 at 4:31 PM, Szilárd Páll szilard.p...@cbr.su.se wrote: Hi Albert, Apologies for hijacking your thread. Do you happen to have Fedora 17 as well? -- Szilárd On Sun, Nov 4, 2012 at 10:55 AM, Albert mailmd2...@gmail.com wrote: hello: I am running Gromacs 4.6 GPU on a workstation with two GTX 660 Ti (2 x 1344 CUDA cores), and I got the following warnings: thank you very much. ---**messages--** - WARNING: On node 0: oversubscribing the available 0 logical CPU cores per node with 2 MPI processes. This will cause considerable performance loss! 2 GPUs detected on host boreas: #0: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible #1: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible 2 GPUs auto-selected to be used for this run: #0, #1 Using CUDA 8x8x8 non-bonded kernels Making 1D domain decomposition 1 x 2 x 1 * WARNING * WARNING * WARNING * WARNING * WARNING * WARNING * We have just committed the new CPU detection code in this branch, and will commit new SSE/AVX kernels in a few days. However, this means that currently only the NxN kernels are accelerated! In the mean time, you might want to avoid production runs in 4.6. -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/**mailman/listinfo/gmx-users http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/** Support/Mailing_Lists/Search http://www.gromacs.org/Support/Mailing_Lists/Searchbefore posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/**Support/Mailing_Lists http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? 
Read http://www.gromacs.org/Support/Mailing_Lists -- == Thomas Evangelidis PhD student University of Athens Faculty of Pharmacy Department of Pharmaceutical Chemistry Panepistimioupoli-Zografou 157 71 Athens GREECE email: tev...@pharm.uoa.gr teva...@gmail.com website: https://sites.google.com/site/thomasevangelidishomepage/ -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU warnings
Hi Thomas, The output you get means that you don't have any of the macros we try to use although your man pages seem to be referring to them. Hence, I'm really clueless why is this happening. Could you please file a bug report on redmine.gromacs.org and add both the initial output as well as my patch and the resulting output. Don't forget to specify version of software you were using. Thanks, -- Szilárd On Thu, Nov 15, 2012 at 3:53 PM, Thomas Evangelidis teva...@gmail.comwrote: Hi Szilárd, This is the warning message I get this time: WARNING: Oversubscribing the available -66 logical CPU cores with 1 thread-MPI threads. This will cause considerable performance loss! I have also attached the md.log file. thanks, Thomas On 14 November 2012 19:48, Szilárd Páll szilard.p...@cbr.su.se wrote: Hi Thomas, Could you please try applying the attached patch (git apply hardware_detect.patch in the 4.6 source root) and let me know what the output is? This should show which sysconf macro is used and what its return value is as well as indicate if none of the macros are in fact defined by your headers. Thanks, -- Szilárd On Sat, Nov 10, 2012 at 5:24 PM, Thomas Evangelidis teva...@gmail.comwrote: On 10 November 2012 03:21, Szilárd Páll szilard.p...@cbr.su.se wrote: Hi, You must have an odd sysconf version! Could you please check what is the sysconf system variable's name in the sysconf man page (man sysconf) where it says something like: _SC_NPROCESSORS_ONLN The number of processors currently online. The first line should be one of the following: _SC_NPROCESSORS_ONLN, _SC_NPROC_ONLN, _SC_NPROCESSORS_CONF, _SC_NPROC_CONF, but I guess yours is something different. The following text is taken from man sysconf: These values also exist, but may not be standard. - _SC_PHYS_PAGES The number of pages of physical memory. Note that it is possible for the product of this value and the value of _SC_PAGE_SIZE to overflow. - _SC_AVPHYS_PAGES The number of currently available pages of physical memory. - _SC_NPROCESSORS_CONF The number of processors configured. - _SC_NPROCESSORS_ONLN The number of processors currently online (available). Can you also check what your glibc version is? $ yum list installed | grep glibc glibc.i6862.15-57.fc17 @updates glibc.x86_64 2.15-57.fc17 @updates glibc-common.x86_64 2.15-57.fc17 @updates glibc-devel.i686 2.15-57.fc17 @updates glibc-devel.x86_642.15-57.fc17 @updates glibc-headers.x86_64 2.15-57.fc17 @updates On Fri, Nov 9, 2012 at 5:51 PM, Thomas Evangelidis teva...@gmail.comwrote: I get these two warnings when I run the dhfr/GPU/dhfr-solv-PME.bench benchmark with the following command line: mdrun_intel_cuda5 -v -s topol.tpr -testverlet WARNING: Oversubscribing the available 0 logical CPU cores with 1 thread-MPI threads. 0 logical CPU cores? Isn't this bizarre? My CPU is Intel Core i7-3610QM That is bizzarre. Could you run with -debug 1 and have a look at the mdrun.debug output which should contain a message like: Detected N processors, will use this as the number of supported hardware threads. I'm wondering, is N=0 in your case!? It says Detected 0 processors, will use this as the number of supported hardware threads. (2.3 GHz). Unlike Albert, I don't see any performance loss, I get 13.4 ns/day on a single core with 1 GPU and 13.2 ns/day with GROMACS v4.5.5 on 4 cores (8 threads) without the GPU. Yet, I don't see any performance gain with more that 4 -nt threads. 
mdrun_intel_cuda5 -v -nt 2 -s topol.tpr -testverlet : 15.4 ns/day mdrun_intel_cuda5 -v -nt 3 -s topol.tpr -testverlet : 16.0 ns/day mdrun_intel_cuda5 -v -nt 4 -s topol.tpr -testverlet : 16.3 ns/day mdrun_intel_cuda5 -v -nt 6 -s topol.tpr -testverlet : 16.2 ns/day mdrun_intel_cuda5 -v -nt 8 -s topol.tpr -testverlet : 15.4 ns/day I guess there is not much point in not using all cores, is it? Note that the performance drops after 4 threads because Hyper-Threading with OpenMP doesn't always help. I have also attached my log file (from mdrun_intel_cuda5 -v -s topol.tpr -testverlet) in case you find it helpful. I don't see it attached. I have attached both mdrun_intel_cuda5.debug and md.log files. They will possibly be filtered by the mailing list but will be delivered to your email. thanksm Thomas -- == Thomas Evangelidis PhD student University of Athens Faculty of Pharmacy Department of Pharmaceutical Chemistry Panepistimioupoli-Zografou 157 71 Athens GREECE email: tev...@pharm.uoa.gr teva...@gmail.com
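A quick way to see what the C library itself reports for the processor-count variables discussed in this thread (getconf and nproc are standard Linux tools; a healthy i7-3610QM should report 8, not 0):
  getconf _NPROCESSORS_ONLN   # processors currently online, as sysconf sees them
  getconf _NPROCESSORS_CONF   # processors configured
  nproc                       # coreutils wrapper around the same information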
Re: [gmx-users] GPU warnings
Hi Albert, Apologies for hijacking your thread. Do you happen to have Fedora 17 as well? -- Szilárd On Sun, Nov 4, 2012 at 10:55 AM, Albert mailmd2...@gmail.com wrote: hello: I am running Gromacs 4.6 GPU on a workstation with two GTX 660 Ti (2 x 1344 CUDA cores), and I got the following warnings: thank you very much. ---**messages--**- WARNING: On node 0: oversubscribing the available 0 logical CPU cores per node with 2 MPI processes. This will cause considerable performance loss! 2 GPUs detected on host boreas: #0: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible #1: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible 2 GPUs auto-selected to be used for this run: #0, #1 Using CUDA 8x8x8 non-bonded kernels Making 1D domain decomposition 1 x 2 x 1 * WARNING * WARNING * WARNING * WARNING * WARNING * WARNING * We have just committed the new CPU detection code in this branch, and will commit new SSE/AVX kernels in a few days. However, this means that currently only the NxN kernels are accelerated! In the mean time, you might want to avoid production runs in 4.6. -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/**mailman/listinfo/gmx-usershttp://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/** Support/Mailing_Lists/Searchhttp://www.gromacs.org/Support/Mailing_Lists/Searchbefore posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/**Support/Mailing_Listshttp://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU warnings
On 11/15/12 9:53 AM, Thomas Evangelidis wrote: Hi Szilárd, This is the warning message I get this time: WARNING: Oversubscribing the available -66 logical CPU cores with 1 thread-MPI threads. This will cause considerable performance loss! I have also attached the md.log file. Attachments are rejected by the mailing list. They either have to be copied and pasted, linked, or sent to an individual specifically off-list. -Justin -- Justin A. Lemkul, Ph.D. Research Scientist Department of Biochemistry Virginia Tech Blacksburg, VA jalemkul[at]vt.edu | (540) 231-9080 http://www.bevanlab.biochem.vt.edu/Pages/Personal/justin -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU warnings
Hi Thomas, Could you please try applying the attached patch (git apply hardware_detect.patch in the 4.6 source root) and let me know what the output is? This should show which sysconf macro is used and what its return value is as well as indicate if none of the macros are in fact defined by your headers. Thanks, -- Szilárd On Sat, Nov 10, 2012 at 5:24 PM, Thomas Evangelidis teva...@gmail.comwrote: On 10 November 2012 03:21, Szilárd Páll szilard.p...@cbr.su.se wrote: Hi, You must have an odd sysconf version! Could you please check what is the sysconf system variable's name in the sysconf man page (man sysconf) where it says something like: _SC_NPROCESSORS_ONLN The number of processors currently online. The first line should be one of the following: _SC_NPROCESSORS_ONLN, _SC_NPROC_ONLN, _SC_NPROCESSORS_CONF, _SC_NPROC_CONF, but I guess yours is something different. The following text is taken from man sysconf: These values also exist, but may not be standard. - _SC_PHYS_PAGES The number of pages of physical memory. Note that it is possible for the product of this value and the value of _SC_PAGE_SIZE to overflow. - _SC_AVPHYS_PAGES The number of currently available pages of physical memory. - _SC_NPROCESSORS_CONF The number of processors configured. - _SC_NPROCESSORS_ONLN The number of processors currently online (available). Can you also check what your glibc version is? $ yum list installed | grep glibc glibc.i6862.15-57.fc17 @updates glibc.x86_64 2.15-57.fc17 @updates glibc-common.x86_64 2.15-57.fc17 @updates glibc-devel.i686 2.15-57.fc17 @updates glibc-devel.x86_642.15-57.fc17 @updates glibc-headers.x86_64 2.15-57.fc17 @updates On Fri, Nov 9, 2012 at 5:51 PM, Thomas Evangelidis teva...@gmail.comwrote: I get these two warnings when I run the dhfr/GPU/dhfr-solv-PME.bench benchmark with the following command line: mdrun_intel_cuda5 -v -s topol.tpr -testverlet WARNING: Oversubscribing the available 0 logical CPU cores with 1 thread-MPI threads. 0 logical CPU cores? Isn't this bizarre? My CPU is Intel Core i7-3610QM That is bizzarre. Could you run with -debug 1 and have a look at the mdrun.debug output which should contain a message like: Detected N processors, will use this as the number of supported hardware threads. I'm wondering, is N=0 in your case!? It says Detected 0 processors, will use this as the number of supported hardware threads. (2.3 GHz). Unlike Albert, I don't see any performance loss, I get 13.4 ns/day on a single core with 1 GPU and 13.2 ns/day with GROMACS v4.5.5 on 4 cores (8 threads) without the GPU. Yet, I don't see any performance gain with more that 4 -nt threads. mdrun_intel_cuda5 -v -nt 2 -s topol.tpr -testverlet : 15.4 ns/day mdrun_intel_cuda5 -v -nt 3 -s topol.tpr -testverlet : 16.0 ns/day mdrun_intel_cuda5 -v -nt 4 -s topol.tpr -testverlet : 16.3 ns/day mdrun_intel_cuda5 -v -nt 6 -s topol.tpr -testverlet : 16.2 ns/day mdrun_intel_cuda5 -v -nt 8 -s topol.tpr -testverlet : 15.4 ns/day I guess there is not much point in not using all cores, is it? Note that the performance drops after 4 threads because Hyper-Threading with OpenMP doesn't always help. I have also attached my log file (from mdrun_intel_cuda5 -v -s topol.tpr -testverlet) in case you find it helpful. I don't see it attached. I have attached both mdrun_intel_cuda5.debug and md.log files. They will possibly be filtered by the mailing list but will be delivered to your email. 
thanksm Thomas -- == Thomas Evangelidis PhD student University of Athens Faculty of Pharmacy Department of Pharmaceutical Chemistry Panepistimioupoli-Zografou 157 71 Athens GREECE email: tev...@pharm.uoa.gr teva...@gmail.com website: https://sites.google.com/site/thomasevangelidishomepage/ -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU warnings
On 10 November 2012 03:21, Szilárd Páll szilard.p...@cbr.su.se wrote: Hi, You must have an odd sysconf version! Could you please check what is the sysconf system variable's name in the sysconf man page (man sysconf) where it says something like: _SC_NPROCESSORS_ONLN The number of processors currently online. The first line should be one of the following: _SC_NPROCESSORS_ONLN, _SC_NPROC_ONLN, _SC_NPROCESSORS_CONF, _SC_NPROC_CONF, but I guess yours is something different. The following text is taken from man sysconf: These values also exist, but may not be standard. - _SC_PHYS_PAGES The number of pages of physical memory. Note that it is possible for the product of this value and the value of _SC_PAGE_SIZE to overflow. - _SC_AVPHYS_PAGES The number of currently available pages of physical memory. - _SC_NPROCESSORS_CONF The number of processors configured. - _SC_NPROCESSORS_ONLN The number of processors currently online (available). Can you also check what your glibc version is? $ yum list installed | grep glibc glibc.i6862.15-57.fc17 @updates glibc.x86_64 2.15-57.fc17 @updates glibc-common.x86_64 2.15-57.fc17 @updates glibc-devel.i686 2.15-57.fc17 @updates glibc-devel.x86_642.15-57.fc17 @updates glibc-headers.x86_64 2.15-57.fc17 @updates On Fri, Nov 9, 2012 at 5:51 PM, Thomas Evangelidis teva...@gmail.comwrote: I get these two warnings when I run the dhfr/GPU/dhfr-solv-PME.bench benchmark with the following command line: mdrun_intel_cuda5 -v -s topol.tpr -testverlet WARNING: Oversubscribing the available 0 logical CPU cores with 1 thread-MPI threads. 0 logical CPU cores? Isn't this bizarre? My CPU is Intel Core i7-3610QM That is bizzarre. Could you run with -debug 1 and have a look at the mdrun.debug output which should contain a message like: Detected N processors, will use this as the number of supported hardware threads. I'm wondering, is N=0 in your case!? It says Detected 0 processors, will use this as the number of supported hardware threads. (2.3 GHz). Unlike Albert, I don't see any performance loss, I get 13.4 ns/day on a single core with 1 GPU and 13.2 ns/day with GROMACS v4.5.5 on 4 cores (8 threads) without the GPU. Yet, I don't see any performance gain with more that 4 -nt threads. mdrun_intel_cuda5 -v -nt 2 -s topol.tpr -testverlet : 15.4 ns/day mdrun_intel_cuda5 -v -nt 3 -s topol.tpr -testverlet : 16.0 ns/day mdrun_intel_cuda5 -v -nt 4 -s topol.tpr -testverlet : 16.3 ns/day mdrun_intel_cuda5 -v -nt 6 -s topol.tpr -testverlet : 16.2 ns/day mdrun_intel_cuda5 -v -nt 8 -s topol.tpr -testverlet : 15.4 ns/day I guess there is not much point in not using all cores, is it? Note that the performance drops after 4 threads because Hyper-Threading with OpenMP doesn't always help. I have also attached my log file (from mdrun_intel_cuda5 -v -s topol.tpr -testverlet) in case you find it helpful. I don't see it attached. I have attached both mdrun_intel_cuda5.debug and md.log files. They will possibly be filtered by the mailing list but will be delivered to your email. thanksm Thomas -- == Thomas Evangelidis PhD student University of Athens Faculty of Pharmacy Department of Pharmaceutical Chemistry Panepistimioupoli-Zografou 157 71 Athens GREECE email: tev...@pharm.uoa.gr teva...@gmail.com website: https://sites.google.com/site/thomasevangelidishomepage/ -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! 
* Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU warnings
Hi, On Tue, Nov 6, 2012 at 12:03 AM, Thomas Evangelidis teva...@gmail.comwrote: Hi, I get these two warnings when I run the dhfr/GPU/dhfr-solv-PME.bench benchmark with the following command line: mdrun_intel_cuda5 -v -s topol.tpr -testverlet WARNING: Oversubscribing the available 0 logical CPU cores with 1 thread-MPI threads. 0 logical CPU cores? Isn't this bizarre? My CPU is Intel Core i7-3610QM That is bizzarre. Could you run with -debug 1 and have a look at the mdrun.debug output which should contain a message like: Detected N processors, will use this as the number of supported hardware threads. I'm wondering, is N=0 in your case!? (2.3 GHz). Unlike Albert, I don't see any performance loss, I get 13.4 ns/day on a single core with 1 GPU and 13.2 ns/day with GROMACS v4.5.5 on 4 cores (8 threads) without the GPU. Yet, I don't see any performance gain with more that 4 -nt threads. mdrun_intel_cuda5 -v -nt 2 -s topol.tpr -testverlet : 15.4 ns/day mdrun_intel_cuda5 -v -nt 3 -s topol.tpr -testverlet : 16.0 ns/day mdrun_intel_cuda5 -v -nt 4 -s topol.tpr -testverlet : 16.3 ns/day mdrun_intel_cuda5 -v -nt 6 -s topol.tpr -testverlet : 16.2 ns/day mdrun_intel_cuda5 -v -nt 8 -s topol.tpr -testverlet : 15.4 ns/day I guess there is not much point in not using all cores, is it? Note that the performance drops after 4 threads because Hyper-Threading with OpenMP doesn't always help. I have also attached my log file (from mdrun_intel_cuda5 -v -s topol.tpr -testverlet) in case you find it helpful. I don't see it attached. -- Szilárd Thanks, Thomas On 5 November 2012 18:54, Szilárd Páll szilard.p...@cbr.su.se wrote: The first warning indicates that you are starting more threads than the hardware supports which would explain the poor performance. Could share a log file of the suspiciously slow run as well as the command line you used to start mdrun? Cheers, -- Szilárd On Sun, Nov 4, 2012 at 5:32 PM, Albert mailmd2...@gmail.com wrote: well, IC. the performance is rather poor than GTX590. 32ns/day vs 4 ns/day probably that's also something related to the warnings? THX On 11/04/2012 01:59 PM, Justin Lemkul wrote: On 11/4/12 4:55 AM, Albert wrote: hello: I am running Gromacs 4.6 GPU on a workstation with two GTX 660 Ti (2 x 1344 CUDA cores), and I got the following warnings: thank you very much. ---**messages--** - WARNING: On node 0: oversubscribing the available 0 logical CPU cores per node with 2 MPI processes. This will cause considerable performance loss! 2 GPUs detected on host boreas: #0: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible #1: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible 2 GPUs auto-selected to be used for this run: #0, #1 Using CUDA 8x8x8 non-bonded kernels Making 1D domain decomposition 1 x 2 x 1 * WARNING * WARNING * WARNING * WARNING * WARNING * WARNING * We have just committed the new CPU detection code in this branch, and will commit new SSE/AVX kernels in a few days. However, this means that currently only the NxN kernels are accelerated! In the mean time, you might want to avoid production runs in 4.6. I can't address the first warning, but the second is fairly obvious. You're not using an official release, you're using the development version - let the user beware. The code is not yet production-ready. 
-Justin -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/**mailman/listinfo/gmx-users http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/** Support/Mailing_Lists/Search http://www.gromacs.org/Support/Mailing_Lists/Searchbefore posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/**Support/Mailing_Lists http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- == Thomas Evangelidis PhD student University of Athens Faculty of Pharmacy Department of Pharmaceutical Chemistry Panepistimioupoli-Zografou 157 71 Athens GREECE email: tev...@pharm.uoa.gr teva...@gmail.com website:
Re: [gmx-users] GPU warnings
Hi, You must have an odd sysconf version! Could you please check what is the sysconf system variable's name in the sysconf man page (man sysconf) where it says something like: _SC_NPROCESSORS_ONLN The number of processors currently online. The first line should be one of the following: _SC_NPROCESSORS_ONLN, _SC_NPROC_ONLN, _SC_NPROCESSORS_CONF, _SC_NPROC_CONF, but I guess yours is something different. Can you also check what your glibc version is? Thanks, -- Szilárd On Fri, Nov 9, 2012 at 5:51 PM, Thomas Evangelidis teva...@gmail.comwrote: I get these two warnings when I run the dhfr/GPU/dhfr-solv-PME.bench benchmark with the following command line: mdrun_intel_cuda5 -v -s topol.tpr -testverlet WARNING: Oversubscribing the available 0 logical CPU cores with 1 thread-MPI threads. 0 logical CPU cores? Isn't this bizarre? My CPU is Intel Core i7-3610QM That is bizzarre. Could you run with -debug 1 and have a look at the mdrun.debug output which should contain a message like: Detected N processors, will use this as the number of supported hardware threads. I'm wondering, is N=0 in your case!? It says Detected 0 processors, will use this as the number of supported hardware threads. (2.3 GHz). Unlike Albert, I don't see any performance loss, I get 13.4 ns/day on a single core with 1 GPU and 13.2 ns/day with GROMACS v4.5.5 on 4 cores (8 threads) without the GPU. Yet, I don't see any performance gain with more that 4 -nt threads. mdrun_intel_cuda5 -v -nt 2 -s topol.tpr -testverlet : 15.4 ns/day mdrun_intel_cuda5 -v -nt 3 -s topol.tpr -testverlet : 16.0 ns/day mdrun_intel_cuda5 -v -nt 4 -s topol.tpr -testverlet : 16.3 ns/day mdrun_intel_cuda5 -v -nt 6 -s topol.tpr -testverlet : 16.2 ns/day mdrun_intel_cuda5 -v -nt 8 -s topol.tpr -testverlet : 15.4 ns/day I guess there is not much point in not using all cores, is it? Note that the performance drops after 4 threads because Hyper-Threading with OpenMP doesn't always help. I have also attached my log file (from mdrun_intel_cuda5 -v -s topol.tpr -testverlet) in case you find it helpful. I don't see it attached. I have attached both mdrun_intel_cuda5.debug and md.log files. They will possibly be filtered by the mailing list but will be delivered to your email. thanksm Thomas -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU warnings
The first warning indicates that you are starting more threads than the hardware supports which would explain the poor performance. Could share a log file of the suspiciously slow run as well as the command line you used to start mdrun? Cheers, -- Szilárd On Sun, Nov 4, 2012 at 5:32 PM, Albert mailmd2...@gmail.com wrote: well, IC. the performance is rather poor than GTX590. 32ns/day vs 4 ns/day probably that's also something related to the warnings? THX On 11/04/2012 01:59 PM, Justin Lemkul wrote: On 11/4/12 4:55 AM, Albert wrote: hello: I am running Gromacs 4.6 GPU on a workstation with two GTX 660 Ti (2 x 1344 CUDA cores), and I got the following warnings: thank you very much. ---**messages--** - WARNING: On node 0: oversubscribing the available 0 logical CPU cores per node with 2 MPI processes. This will cause considerable performance loss! 2 GPUs detected on host boreas: #0: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible #1: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible 2 GPUs auto-selected to be used for this run: #0, #1 Using CUDA 8x8x8 non-bonded kernels Making 1D domain decomposition 1 x 2 x 1 * WARNING * WARNING * WARNING * WARNING * WARNING * WARNING * We have just committed the new CPU detection code in this branch, and will commit new SSE/AVX kernels in a few days. However, this means that currently only the NxN kernels are accelerated! In the mean time, you might want to avoid production runs in 4.6. I can't address the first warning, but the second is fairly obvious. You're not using an official release, you're using the development version - let the user beware. The code is not yet production-ready. -Justin -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/**mailman/listinfo/gmx-usershttp://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/** Support/Mailing_Lists/Searchhttp://www.gromacs.org/Support/Mailing_Lists/Searchbefore posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/**Support/Mailing_Listshttp://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU warnings
Hi, I get these two warnings when I run the dhfr/GPU/dhfr-solv-PME.bench benchmark with the following command line: mdrun_intel_cuda5 -v -s topol.tpr -testverlet WARNING: Oversubscribing the available 0 logical CPU cores with 1 thread-MPI threads. 0 logical CPU cores? Isn't this bizarre? My CPU is Intel Core i7-3610QM (2.3 GHz). Unlike Albert, I don't see any performance loss, I get 13.4 ns/day on a single core with 1 GPU and 13.2 ns/day with GROMACS v4.5.5 on 4 cores (8 threads) without the GPU. Yet, I don't see any performance gain with more that 4 -nt threads. mdrun_intel_cuda5 -v -nt 2 -s topol.tpr -testverlet : 15.4 ns/day mdrun_intel_cuda5 -v -nt 3 -s topol.tpr -testverlet : 16.0 ns/day mdrun_intel_cuda5 -v -nt 4 -s topol.tpr -testverlet : 16.3 ns/day mdrun_intel_cuda5 -v -nt 6 -s topol.tpr -testverlet : 16.2 ns/day mdrun_intel_cuda5 -v -nt 8 -s topol.tpr -testverlet : 15.4 ns/day I have also attached my log file (from mdrun_intel_cuda5 -v -s topol.tpr -testverlet) in case you find it helpful. Thanks, Thomas On 5 November 2012 18:54, Szilárd Páll szilard.p...@cbr.su.se wrote: The first warning indicates that you are starting more threads than the hardware supports which would explain the poor performance. Could share a log file of the suspiciously slow run as well as the command line you used to start mdrun? Cheers, -- Szilárd On Sun, Nov 4, 2012 at 5:32 PM, Albert mailmd2...@gmail.com wrote: well, IC. the performance is rather poor than GTX590. 32ns/day vs 4 ns/day probably that's also something related to the warnings? THX On 11/04/2012 01:59 PM, Justin Lemkul wrote: On 11/4/12 4:55 AM, Albert wrote: hello: I am running Gromacs 4.6 GPU on a workstation with two GTX 660 Ti (2 x 1344 CUDA cores), and I got the following warnings: thank you very much. ---**messages--** - WARNING: On node 0: oversubscribing the available 0 logical CPU cores per node with 2 MPI processes. This will cause considerable performance loss! 2 GPUs detected on host boreas: #0: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible #1: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible 2 GPUs auto-selected to be used for this run: #0, #1 Using CUDA 8x8x8 non-bonded kernels Making 1D domain decomposition 1 x 2 x 1 * WARNING * WARNING * WARNING * WARNING * WARNING * WARNING * We have just committed the new CPU detection code in this branch, and will commit new SSE/AVX kernels in a few days. However, this means that currently only the NxN kernels are accelerated! In the mean time, you might want to avoid production runs in 4.6. I can't address the first warning, but the second is fairly obvious. You're not using an official release, you're using the development version - let the user beware. The code is not yet production-ready. -Justin -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/**mailman/listinfo/gmx-users http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/** Support/Mailing_Lists/Search http://www.gromacs.org/Support/Mailing_Lists/Searchbefore posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/**Support/Mailing_Lists http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! 
* Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- == Thomas Evangelidis PhD student University of Athens Faculty of Pharmacy Department of Pharmaceutical Chemistry Panepistimioupoli-Zografou 157 71 Athens GREECE email: tev...@pharm.uoa.gr teva...@gmail.com website: https://sites.google.com/site/thomasevangelidishomepage/ -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
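A hedged re-creation of the -nt scan Thomas reports above, using the binary name and topol.tpr from the thread; it collects the ns/day line from each run's log:
  for nt in 1 2 3 4 6 8; do
      mdrun_intel_cuda5 -v -nt $nt -s topol.tpr -testverlet -g nt${nt}.log
      grep -H "Performance:" nt${nt}.log
  done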
Re: [gmx-users] GPU warnings
On 11/4/12 4:55 AM, Albert wrote: hello: I am running Gromacs 4.6 GPU on a workstation with two GTX 660 Ti (2 x 1344 CUDA cores), and I got the following warnings: thank you very much. ---messages--- WARNING: On node 0: oversubscribing the available 0 logical CPU cores per node with 2 MPI processes. This will cause considerable performance loss! 2 GPUs detected on host boreas: #0: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible #1: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible 2 GPUs auto-selected to be used for this run: #0, #1 Using CUDA 8x8x8 non-bonded kernels Making 1D domain decomposition 1 x 2 x 1 * WARNING * WARNING * WARNING * WARNING * WARNING * WARNING * We have just committed the new CPU detection code in this branch, and will commit new SSE/AVX kernels in a few days. However, this means that currently only the NxN kernels are accelerated! In the mean time, you might want to avoid production runs in 4.6. I can't address the first warning, but the second is fairly obvious. You're not using an official release, you're using the development version - let the user beware. The code is not yet production-ready. -Justin -- Justin A. Lemkul, Ph.D. Research Scientist Department of Biochemistry Virginia Tech Blacksburg, VA jalemkul[at]vt.edu | (540) 231-9080 http://www.bevanlab.biochem.vt.edu/Pages/Personal/justin -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU warnings
I'm also getting the first warning (oversubscribing the available...) and see no obvious performance gain. Do you know how to avoid that? thanks, Thomas On 4 November 2012 14:59, Justin Lemkul jalem...@vt.edu wrote: On 11/4/12 4:55 AM, Albert wrote: hello: I am running Gromacs 4.6 GPU on a workstation with two GTX 660 Ti (2 x 1344 CUDA cores), and I got the following warnings: thank you very much. ---messages--- WARNING: On node 0: oversubscribing the available 0 logical CPU cores per node with 2 MPI processes. This will cause considerable performance loss! 2 GPUs detected on host boreas: #0: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible #1: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible 2 GPUs auto-selected to be used for this run: #0, #1 Using CUDA 8x8x8 non-bonded kernels Making 1D domain decomposition 1 x 2 x 1 * WARNING * WARNING * WARNING * WARNING * WARNING * WARNING * We have just committed the new CPU detection code in this branch, and will commit new SSE/AVX kernels in a few days. However, this means that currently only the NxN kernels are accelerated! In the mean time, you might want to avoid production runs in 4.6. I can't address the first warning, but the second is fairly obvious. You're not using an official release, you're using the development version - let the user beware. The code is not yet production-ready. -Justin -- == Justin A. Lemkul, Ph.D. Research Scientist Department of Biochemistry Virginia Tech Blacksburg, VA jalemkul[at]vt.edu | (540) 231-9080 http://www.bevanlab.biochem.vt.edu/Pages/Personal/justin == -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- == Thomas Evangelidis PhD student University of Athens Faculty of Pharmacy Department of Pharmaceutical Chemistry Panepistimioupoli-Zografou 157 71 Athens GREECE email: tev...@pharm.uoa.gr teva...@gmail.com website: https://sites.google.com/site/thomasevangelidishomepage/ -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
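One stop-gap while the core-detection bug is being chased down is simply to pin the thread count by hand; the value 8 below is an assumption matching the i7-3610QM discussed elsewhere in this thread:
  mdrun -v -nt 8 -s topol.tpr -testverlet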
Re: [gmx-users] GPU warnings
Well, I see. The performance is rather poor compared to the GTX 590: 32 ns/day vs 4 ns/day. Probably that's also related to the warnings? Thanks. On 11/04/2012 01:59 PM, Justin Lemkul wrote: On 11/4/12 4:55 AM, Albert wrote: hello: I am running Gromacs 4.6 GPU on a workstation with two GTX 660 Ti (2 x 1344 CUDA cores), and I got the following warnings: thank you very much. ---messages--- WARNING: On node 0: oversubscribing the available 0 logical CPU cores per node with 2 MPI processes. This will cause considerable performance loss! 2 GPUs detected on host boreas: #0: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible #1: NVIDIA GeForce GTX 660 Ti, compute cap.: 3.0, ECC: no, stat: compatible 2 GPUs auto-selected to be used for this run: #0, #1 Using CUDA 8x8x8 non-bonded kernels Making 1D domain decomposition 1 x 2 x 1 * WARNING * WARNING * WARNING * WARNING * WARNING * WARNING * We have just committed the new CPU detection code in this branch, and will commit new SSE/AVX kernels in a few days. However, this means that currently only the NxN kernels are accelerated! In the mean time, you might want to avoid production runs in 4.6. I can't address the first warning, but the second is fairly obvious. You're not using an official release, you're using the development version - let the user beware. The code is not yet production-ready. -Justin -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users * Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! * Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. * Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU-C2075-simulation-solw or GPU only running -reg
On 10/21/12 3:38 PM, venkatesh s wrote: Respected Gromacs people's, my query is my system very slow? how can i improve the speed, its running like or equal to (25 minutes) Intel Core I 7 processors only. Here i am given my entire system information,and i found my system 8 core not taking job (GPU only running). mdrun-gpu -device OpenMM:platform=Cuda,memtest=15,deviceid=0,force-device=yes -v -deffnm nvt Non-supported GPU selected (#0, Tesla C2075), forced continuing.Note, that the simulation can be slow or it migth even crash. Pre-simulation ~15s memtest in progress... Memory test completed without errors. Back Off! I just backed up nvt.log to ./#nvt.log.1# Getting Loaded... Reading file nvt.tpr, VERSION 4.5.5 (single precision) Loaded with Money Back Off! I just backed up nvt.trr to ./#nvt.trr.1# Back Off! I just backed up nvt.edr to ./#nvt.edr.1# WARNING: OpenMM supports only Andersen thermostat with the md/md-vv/md-vv-avek integrators. WARNING: OpenMM provides contraints as a combination of SHAKE, SETTLE and CCMA. Accuracy is based on the SHAKE tolerance set by the shake_tol option. WARNING: Non-supported GPU selected (#0, Tesla C2075), forced continuing.Note, that the simulation can be slow or it migth even crash. Pre-simulation ~15s memtest in progress...done, no errors detected starting mdrun 'Protein in water' 5 steps,100.0 ps. OpenMM run - timing based on wallclock. NODE (s) Real (s) (%) Time: 1319.043 1319.043100.0 21:59 (Mnbf/s) (MFlops) (ns/day) (hour/ns) Performance: 0.000 0.006 6.550 3.664 NVIDIA-SMI -l +--+ | NVIDIA-SMI 3.295.59 Driver Version: 295.59 | |---+--+--+ | Nb. Name | Bus IdDisp. | Volatile ECC SB / DB | | Fan Temp Power Usage /Cap | Memory Usage | GPU Util. Compute M. | |===+==+==| | 0. Tesla C2075 | :01:00.0 On | 0 0 | | 30% 75 C P0 150W / 225W | 8% 435MB / 5375MB | 95% Default| |---+--+--| | Compute processes: GPU Memory | | GPU PID Process name Usage | |=| | 0. 5889 mdrun-gpu 372MB | +-+ system: top top - 22:48:22 up 13 min, 4 users, load average: 0.19, 0.18, 0.09 Tasks: 308 total, 2 running, 304 sleeping, 2 stopped, 0 zombie Cpu0 : 16.4%us, 1.7%sy, 0.0%ni, 81.9%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st Cpu1 : 5.4%us, 0.7%sy, 0.0%ni, 94.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st Cpu2 : 9.3%us, 0.7%sy, 0.0%ni, 90.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st Cpu3 : 0.0%us, 0.7%sy, 0.0%ni, 99.3%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st Cpu4 : 13.0%us, 0.7%sy, 0.0%ni, 86.4%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st Cpu5 : 1.0%us, 0.0%sy, 0.0%ni, 99.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st Cpu6 : 0.3%us, 0.3%sy, 0.0%ni, 99.3%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st Cpu7 : 0.0%us, 0.0%sy, 0.0%ni,100.0%id, 0.0%wa, 0.0%hi, 0.0%si, 0.0%st Mem: 12188656k total, 1191628k used, 10997028k free,34804k buffers Swap:0k total,0k used,0k free, 418428k cached system? 
protein +sol + NA total atom(nvt.gro) 158 residues 10742234646 npt.mdp file ; Run parameters integrator= md-vv; nsteps= 5; 2 * 5 = 100 ps dt= 0.002; 2 fs ; Output control nstxout= 100; save coordinates every 0.2 ps nstvout= 100; save velocities every 0.2 ps nstenergy= 100; save energies every 0.2 ps nstlog= 100; update log file every 0.2 ps ; Bond parameters continuation= yes; Restarting after NVT constraint_algorithm = lincs; holonomic constraints constraints= all-bonds; all bonds (even heavy atom-H bonds) constrained lincs_iter= 1; accuracy of LINCS lincs_order= 4; also related to accuracy ; Neighborsearching ns_type= grid; search neighboring grid cells nstlist= 5; 10 fs rlist= 1.0; short-range neighborlist cutoff (in nm) rcoulomb= 1.0; short-range electrostatic cutoff (in nm) rvdw= 1.0; short-range van der Waals cutoff (in nm) ; Electrostatics coulombtype= PME; Particle Mesh Ewald for long-range electrostatics pme_order= 4; cubic interpolation fourierspacing= 0.16; grid spacing for FFT ; Temperature coupling is on
Re: [gmx-users] GPU-C2075-simulation-solw -reg
On 10/20/12 1:34 PM, venkatesh s wrote: Respected Gromacs users, I started the energy simulation but it is slow (showing the following): Getting Loaded... Reading file em.tpr, VERSION 4.5.5 (single precision) Loaded with Money WARNING: Non-supported GPU selected (#0, Tesla C2075), forced continuing. Note, that the simulation can be slow or it might even crash. Pre-simulation ~15s memtest in progress...done, no errors detected starting mdrun 'Protein in water' 5 steps, 50.0 ps. What should I do to increase the GPU speed? Kindly provide a prompt solution. No one can suggest a solution without a better statement of the problem. What is your system? How many atoms does it have? How fast is it running? What is your .mdp file? How do the benchmark systems perform? -Justin -- Justin A. Lemkul, Ph.D. Research Scientist Department of Biochemistry Virginia Tech Blacksburg, VA jalemkul[at]vt.edu | (540) 231-9080 http://www.bevanlab.biochem.vt.edu/Pages/Personal/justin
Re: [gmx-users] GPU -simulation error -reg
On 10/14/12 8:01 AM, venkatesh s wrote: Respected Gromacs people, my system contains protein + peptide (normally I use the lysozyme tutorial md.mdp; I only change the length of the run). While running mdrun-gpu -v -deffnm md_0_1 I got a fatal error like this: -- Getting Loaded... Reading file md_0_1.tpr, VERSION 4.5.5 (single precision) Loaded with Money WARNING: OpenMM does not support leap-frog, will use velocity-verlet integrator. WARNING: OpenMM supports only Andersen thermostat with the md/md-vv/md-vv-avek integrators. --- Program mdrun-gpu, VERSION 4.5.5 Source code file: /opt/softwares/compile/gromacs-4.5.5/src/kernel/openmm_wrapper.cpp, line: 580 Fatal error: OpenMM does not support multiple temperature coupling groups. For more information and tips for troubleshooting, please check the GROMACS website at http://www.gromacs.org/Documentation/Errors --- Kindly provide a prompt answer. The error message is fairly self-explanatory. You are using multiple temperature coupling groups (tc-grps in the .mdp file). You can't do that when running on GPU. Set tc-grps = System. -Justin -- Justin A. Lemkul, Ph.D. Research Scientist Department of Biochemistry Virginia Tech Blacksburg, VA jalemkul[at]vt.edu | (540) 231-9080 http://www.bevanlab.biochem.vt.edu/Pages/Personal/justin
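Concretely, the fix Justin describes is a one-line change in the .mdp. A sketch, assuming the poster's file uses the usual Protein / Non-Protein split from the lysozyme tutorial he mentions:

  ; before -- rejected by mdrun-gpu (OpenMM)
  tc-grps = Protein Non-Protein
  tau_t   = 0.1     0.1
  ref_t   = 300     300

  ; after -- a single coupling group for the whole system
  tc-grps = System
  tau_t   = 0.1
  ref_t   = 300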
Re: [gmx-users] GPU
On Wed, Jun 13, 2012 at 3:59 AM, Mark Abraham mark.abra...@anu.edu.au wrote:

On 12/06/2012 10:49 PM, Ehud Schreiber wrote:

Message: 4 Date: Mon, 11 Jun 2012 15:54:39 +1000 From: Mark Abraham mark.abra...@anu.edu.au Subject: Re: [gmx-users] GPU To: Discussion list for GROMACS users gmx-users@gromacs.org Message-ID: 4fd5881f.3040...@anu.edu.au Content-Type: text/plain; charset=ISO-8859-1; format=flowed

On 11/06/2012 2:32 AM, ifat shub wrote: Hi, If I understand correctly, currently the Gromacs GPU acceleration does not support energy minimization. Is this so? Are there any plans to include it in the 4.6 version or in a later one (i.e. to allow, say, integrator = steep or cg in mdrun-gpu)? I would find such options extremely useful.

EM is normally so quick that it's not worth putting much effort into accelerating it, compared to the CPU-months that are spent doing subsequent MD. Mark

Currently, my main use of Gromacs entails running multiple minimizations on an ensemble of states. Moreover, these states are not obtained using molecular dynamics but rather using the Concoord algorithm. Therefore, for me the bottleneck is not md but rather minimizations (specifically, cg) and so their acceleration on GPUs would be very advantageous. If such usage is not totally idiosyncratic, I hope the development team would reconsider GPU accelerating also minimizations. I suspect this would not be technically too complex given the work already done on dynamics.

I suspect the upcoming 4.6 release will have GPU-accelerated EM available as a side effect of the new Verlet pair-list scheme for computing non-bonded interactions. This development is unrelated to previous GPU efforts, I understand. See http://www.gromacs.org/Documentation/Acceleration_and_parallelization and http://www.gromacs.org/Documentation/Cut-off_schemes for some advance details. When you hear a call for alpha testers in the next few months, you might want to spend some time on that so that you're sure GROMACS will best meet your future needs. :-) Mark

It does work and has been tested extensively. We are working on the final details, but you can get the code from the nbnxn_hybrid_acc branch -- it's pretty safe to use it for non-production purposes! The pages Mark linked are the resources you want to start with before you start using the NxN kernels.

Cheers, -- Szilárd
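For anyone wanting to try the new native GPU path discussed here once 4.6 appears, the relevant .mdp change is the cut-off scheme. A minimal sketch (option names as described on the Cut-off_schemes page linked above; the rest of the .mdp stays as usual):

  cutoff-scheme = Verlet   ; required for the new NxN / GPU non-bonded kernels
  nstlist       = 10       ; with the Verlet scheme this acts as a minimum and may be increased by mdrun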
Re: [gmx-users] GPU
Message: 4 Date: Mon, 11 Jun 2012 15:54:39 +1000 From: Mark Abraham mark.abra...@anu.edu.au Subject: Re: [gmx-users] GPU To: Discussion list for GROMACS users gmx-users@gromacs.org Message-ID: 4fd5881f.3040...@anu.edu.au Content-Type: text/plain; charset=ISO-8859-1; format=flowed On 11/06/2012 2:32 AM, ifat shub wrote: Hi, If I understand correctly, currently the Gromacs GPU acceleration does not support energy minimization. Is this so? Are there any plans to include it in the 4.6 version or in a later one (i.e. to allow, say, integrator = steep or cg in mdrun-gpu)? I would find such options extremely useful. EM is normally so quick that it's not worth putting much effort into accelerating it, compared to the CPU-months that are spent doing subsequent MD. Mark Currently, my main use of Gromacs entails running multiple minimizations on an ensemble of states. Moreover, these states are not obtained using molecular dynamics but rather using the Concoord algorithm. Therefore, for me the bottleneck is not md but rather minimizations (specifically, cg) and so their acceleration on GPUs would be very advantageous. If such usage is not totally idiosyncratic, I hope the development team would reconsider GPU accelerating also minimizations. I suspect this would not be technically too complex given the work already done on dynamics. Thanks, Ehud Schreiber. -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU
On 12/06/2012 10:49 PM, Ehud Schreiber wrote: Message: 4 Date: Mon, 11 Jun 2012 15:54:39 +1000 From: Mark Abrahammark.abra...@anu.edu.au Subject: Re: [gmx-users] GPU To: Discussion list for GROMACS usersgmx-users@gromacs.org Message-ID:4fd5881f.3040...@anu.edu.au Content-Type: text/plain; charset=ISO-8859-1; format=flowed On 11/06/2012 2:32 AM, ifat shub wrote: Hi, If I understand correctly, currently the Gromacs GPU acceleration does not support energy minimization. Is this so? Are there any plans to include it in the 4.6 version or in a later one (i.e. to allow, say, integrator = steep or cg in mdrun-gpu)? I would find such options extremely useful. EM is normally so quick that it's not worth putting much effort into accelerating it, compared to the CPU-months that are spent doing subsequent MD. Mark Currently, my main use of Gromacs entails running multiple minimizations on an ensemble of states. Moreover, these states are not obtained using molecular dynamics but rather using the Concoord algorithm. Therefore, for me the bottleneck is not md but rather minimizations (specifically, cg) and so their acceleration on GPUs would be very advantageous. If such usage is not totally idiosyncratic, I hope the development team would reconsider GPU accelerating also minimizations. I suspect this would not be technically too complex given the work already done on dynamics. I suspect the upcoming 4.6 release will have GPU-accelerated EM available as a side effect of the new Verlet pair-list scheme for computing non-bonded interactions. This development is unrelated to previous GPU efforts, I understand. See http://www.gromacs.org/Documentation/Acceleration_and_parallelization and http://www.gromacs.org/Documentation/Cut-off_schemes for some advance details. When you hear a call for alpha testers in the next few months, you might want to spend some time on that so that you're sure GROMACS will best meet your future needs. :-) Mark -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU
On 11/06/2012 2:32 AM, ifat shub wrote: Hi, If I understand correctly, currently the Gromacs GPU acceleration does not support energy minimization. Is this so? Are there any plans to include it in the 4.6 version or in a later one (i.e. to allow, say, integrator = steep or cg in mdrun-gpu)? I would find such options extremely useful. EM is normally so quick that it's not worth putting much effort into accelerating it, compared to the CPU-months that are spent doing subsequent MD. Mark -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
Re: [gmx-users] GPU crashes
Did you play with the time step? Just currious, but I woundered what happened with 0.0008, 0.0005, 0.0002. I found if I had a good behaving protein, as soon as I added a small (non-protein) molecule which rotated wildly while attached to the protein, it would crash unless I reduced the time step to the above when constraints were removed after EQ ... always it seemed to me it didnt like the rotation or bond angles, seeing them as a violation but acted like it was an amino acid? (the same bond type but with wider rotation as one end wasnt fixed to a chain) If your loop moves via backbone, the calculated angles, bonds or whatever might appear to the computer to be violating the parameter settings for problems, errors, etc as it cant track them fast enough over the time step. Ie atom 1-2-3 and then delta 1-2-3 with xyz parameters, but then the particular set has additional rotation, etc and may include the chain atoms which bend wildly (n-Ca-Cb-Cg maybe a dihedral) but probab ly not this. Just a thought but probably not the right answere as well, it might be the way it is broken down (above) over GPUs, which convert everything to matricies (non-standard just for basic math operations not real matricies per say) for exicution and then some library problem which would not account for long range rapid (0.0005) movements at the chain (Ca,N,O to something else) and then tries to apply these to Cb-Cg-O-H, etc using the initial points while looking at the parameters for say a single amino acid...Maybe the constraints would cause this, which would make it a pain to EQ, but this allowed me to increase the time step, but would ruin the experiment I had worked on as I needed it unconstrained to show it didnt float away when proteins were pulled, etc...I was using a different integrator though...just normal MD. ANd your cutoffs for vdw, etc...Why are they 0? I dont know if this means a defautl set is then used...but if not ? Wouldnt they try integrating using both types of formula, or would it be just using coulumb or vice versa? (dont know what that would do to the code but assume it means no vdw, and all coulumb but then zeros are alwyas a problem for computers). Thats my thoughts on that. Probably something else though. Good luck, Stephan Original-Nachricht Datum: Wed, 06 Jun 2012 18:42:45 -0400 Von: Justin A. Lemkul jalem...@vt.edu An: Discussion list for GROMACS users gmx-users@gromacs.org Betreff: [gmx-users] GPU crashes Hi All, I'm wondering if anyone has experienced what I'm seeing with Gromacs 4.5.5 on GPU. It seems that certain systems fail inexplicably. The system I am working with is a heterodimeric protein complex bound to DNA. After about 1 ns of simulation time using mdrun-gpu, all the energies become NaN. The simulations don't stop, they just carry on merrily producing nonsense. I would love to see some action regarding http://redmine.gromacs.org/issues/941 for this reason ;) I ran simulations of each of the components of the system individually - each protein alone, and DNA - to try to track down what might be causing this problem. The DNA simulation is perfectly stable out to 10 ns, but each protein fails within 2 ns. Each protein has two domains with a flexible linker, and it seems that as soon as the linker flexes a bit, the simulations go poof. Well-behaved proteins like lysozyme and DHFR (from the benchmark set) seem fine, but anything that twitches even a small amount fails. 
This is very unfortunate for us, as we are hoping to see domain motions on a feasible time scale using implicit solvent on GPU hardware. Has anyone seen anything like this? Our Gromacs implementation is being run on an x86_64 Linux system with Tesla S2050 GPU cards. The CUDA version is 3.1 and Gromacs is linked against OpenMM-2.0. An .mdp file is appended below. I have also tested finite values for cutoffs, but the results were worse (failures occurred more quickly). I have not been able to use the latest git version of Gromacs to test whether anything has been fixed, but will post separately to gmx-developers regarding the reasons for that soon. -Justin

=== md.mdp ===
title      = Implicit solvent test
; Run parameters
integrator = sd
dt         = 0.002
nsteps     = 500        ; 1 ps (10 ns)
nstcomm    = 1
comm_mode  = angular    ; non-periodic system
; Output parameters
nstxout    = 0
nstvout    = 0
nstfout    = 0
nstxtcout  = 1000       ; every 2 ps
nstlog     = 5000       ; every 10 ps
nstenergy  = 1000       ; every 2 ps
; Bond parameters
constraint_algorithm = lincs
constraints          = all-bonds
continuation         = no         ; starting up
; required cutoffs for implicit
nstlist    = 0
ns_type    = grid
rlist      = 0
rcoulomb   = 0
rvdw       = 0
Re: [gmx-users] GPU crashes
On 6/7/12 3:57 AM, lloyd riggs wrote: Did you play with the time step? Just currious, but I woundered what happened with 0.0008, 0.0005, 0.0002. I found if I had a good behaving protein, as soon as I added a small (non-protein) molecule which rotated wildly while attached to the protein, it would crash unless I reduced the time step to the above when constraints were removed after EQ ... always it seemed to me it didnt like the rotation or bond angles, seeing them as a violation but acted like it was an amino acid? (the same bond type but with wider rotation as one end wasnt fixed to a chain) If your loop moves via backbone, the calculated angles, bonds or whatever might appear to the computer to be violating the parameter settings for problems, errors, etc as it cant track them fast enough over the time step. Ie atom 1-2-3 and then delta 1-2-3 with xyz parameters, but then the particular set has additional rotation, etc and may include the chain atoms which bend wildly (n-Ca-Cb-Cg maybe a dihedral) but proba! bly not this. Just a thought but probably not the right answere as well, it might be the way it is broken down (above) over GPUs, which convert everything to matricies (non-standard just for basic math operations not real matricies per say) for exicution and then some library problem which would not account for long range rapid (0.0005) movements at the chain (Ca,N,O to something else) and then tries to apply these to Cb-Cg-O-H, etc using the initial points while looking at the parameters for say a single amino acid...Maybe the constraints would cause this, which would make it a pain to EQ, but this allowed me to increase the time step, but would ruin the experiment I had worked on as I needed it unconstrained to show it didnt float away when proteins were pulled, etc...I was using a different integrator though...just normal MD. I have long wondered if constraints were properly handled by the OpenMM library. I am constraining all bonds, so in principle, dt of 0.002 should not be a problem. The note printed indicates that the constraint algorithm is changed from the one selected (LINCS) to whatever OpenMM uses (SHAKE and a few others in combination). Perhaps I can try running without constraints and a reduced dt, but I'd like to avoid it. I wish I could efficiently test to see if this behavior was GPU-specific, but unfortunately the non-GPU implementation of the implicit code can currently only be run in serial or on 2 CPU due to an existing bug. I can certainly test it, but due to the large number of atoms, it will take several days to even approach 1 ns. ANd your cutoffs for vdw, etc...Why are they 0? I dont know if this means a defautl set is then used...but if not ? Wouldnt they try integrating using both types of formula, or would it be just using coulumb or vice versa? (dont know what that would do to the code but assume it means no vdw, and all coulumb but then zeros are alwyas a problem for computers). The setup is for the all-vs-all kernels. Setting cutoffs equal to zero and using a fixed neighbor list triggers these special optimized kernels. I have also noticed that long, finite cutoffs (on the order of 4.0 nm) lead to unacceptable energy drift and structural instability in well-behaved systems (even the benchmarks). For instance, the backbone RMSD of lysozyme is twice as large in the case of a 4.0-nm cutoff relative to the all-vs-all setup, and the energy drift is quite substantial. -Justin -- Justin A. Lemkul, Ph.D. 
Research Scientist Department of Biochemistry Virginia Tech Blacksburg, VA jalemkul[at]vt.edu | (540) 231-9080 http://www.bevanlab.biochem.vt.edu/Pages/Personal/justin -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
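The unconstrained, reduced-timestep test Justin mentions would amount to something like the following in the .mdp (a sketch only; the dt values follow the suggestions earlier in the thread):

  constraints = none      ; take the OpenMM SHAKE/SETTLE/CCMA combination out of the picture
  dt          = 0.0005    ; 0.5 fs, small enough to integrate unconstrained bonds to hydrogen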
Re: [gmx-users] GPU gets faster with more molecules in system
On 25/01/2011 8:25 AM, Christian Mötzing wrote: Hi, I compiled mdrun-gpu and tried some waterbox systems with different atom counts.

  atoms  |  GPU     |  CPU
  2.400  |  1.015s  |  774s
  4.800  |  1.225s  |  1.202s
  9.600  |  1.142s  |  1.353s
  19.200 |  2.984s  |  2.812s

Why does the system with 9.600 atoms finish faster than the one with 4.800? I triple-checked the simulations and GROMACS itself reports the atom counts as above, so I don't think there is a mistake there. A diff of md.log only shows differences in output values for each step. Is there any explanation for this behaviour?

As a guess, the cost of overheads for molecular simulations tends to have a weaker dependence on system size than the cost of computation (or none at all). Only once the latter dominates the cost do you see scaling with system size. I expect you'd see similar behaviour running systems with 64, 128, 256, 512 atoms on 64 processors. Mark
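As a rough illustration of Mark's point (made-up numbers, not measurements from the runs above): if every run paid a fixed ~900 s of overhead that does not depend on atom count, plus ~0.05 s of computation per atom, then 2.400 atoms would cost about 1020 s, 4.800 about 1140 s and 9.600 about 1380 s -- quadrupling the atoms adds only ~35% to the wall time, and ordinary run-to-run noise can easily swap the order of the two middle sizes.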
Re: [gmx-users] gpu
Hi, Did you read this? http://www.gromacs.org/gpu Rossen On 11/7/10 1:23 PM, Erik Wensink wrote: Dear gmx-users, How to invoke the gpu for simulations, e.g. is there (compiler) flag? Cheers, Erik
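For the 4.5 series the short version of that page is: GPU (OpenMM) support is a separate build that produces an mdrun-gpu binary, which is then pointed at a device at run time. A sketch of the procedure (the cmake flag and device string follow the 4.5-era documentation and the commands quoted elsewhere in this archive; details may differ between versions):

  cmake .. -DGMX_OPENMM=ON        # configure the OpenMM-accelerated build
  make mdrun                      # builds mdrun-gpu
  mdrun-gpu -device "OpenMM:platform=Cuda,memtest=15,deviceid=0" -deffnm md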
Re: [gmx-users] gpu
tnx. Erik --- On Sun, 11/7/10, Rossen Apostolov ros...@kth.se wrote: From: Rossen Apostolov ros...@kth.se Subject: Re: [gmx-users] gpu To: gmx-users@gromacs.org Date: Sunday, November 7, 2010, 4:27 PM Hi, Did you read this? http://www.gromacs.org/gpu Rossen On 11/7/10 1:23 PM, Erik Wensink wrote: Dear gmx-users, How to invoke the gpu for simulations, e.g. is there (compiler) flag? Cheers, Erik
Re: [gmx-users] GPU slower than I7
Hi, My OS is Fedora 13 (64 bits) and I used gcc 4.4.4. I ran the program you sent me. Bellow are the results of 5 runs. As you can see the results are rougly the same [ren...@scrat ~]$ ./time 2.09 2.102991 [ren...@scrat ~]$ ./time 2.09 2.102808 [ren...@scrat ~]$ ./time 2.09 2.104577 [ren...@scrat ~]$ ./time 2.09 2.103943 [ren...@scrat ~]$ ./time 2.09 2.104471 Bellow are part of the /src/configure.h . . . /* Define to 1 if you have the MSVC _aligned_malloc() function. */ /* #undef HAVE__ALIGNED_MALLOC */ /* Define to 1 if you have the gettimeofday() function. */ #define HAVE_GETTIMEOFDAY /* Define to 1 if you have the cbrt() function. */ #define HAVE_CBRT . . . Is this OK? Renato 2010/10/22 Roland Schulz rol...@utk.edu: Hi, On Fri, Oct 22, 2010 at 3:20 PM, Renato Freitas renato...@gmail.com wrote: Do you think that the NODE and Real time difference could be attributed to some compilation problem in the mdrun-gpu. Despite I'm asking this I didn't get any error in the compilation. It is very odd that these are different for you system. What operating system and compiler do you use? Is HAVE_GETTIMEOFDAY set in src/config.h? I attached a small test program which uses the two different timers used for NODE and Real time. You can compile it with cc time.c -o time and run it with ./time. Do you get roughly the same time twice with the test program or do you see the same discrepancy as with GROMACS? Roland Thanks, Renato 2010/10/22 Szilárd Páll szilard.p...@cbr.su.se: Hi Renato, First of all, what you're seeing is pretty normal, especially that you have a CPU that is crossing the border of insane :) Why is it normal? The PME algorithms are just simply not very well not well suited for current GPU architectures. With an ill-suited algorithm you won't be able to see the speedups you can often see in other application areas - -even more so that you're comparing to Gromacs on a i7 980X. For more info + benchmarks see the Gromacs-GPU page: http://www.gromacs.org/gpu However, there is one strange thing you also pointed out. The fact that the NODE and Real time in your mdrun-gpu timing summary is not the same, but has 3x deviation is _very_ unusual. I've ran mdrun-gpu on quite a wide variety of hardware but I've never seen those two counter deviate. It might be an artifact from the cycle counters used internally that behave in an unusual way on your CPU. One other thing I should point out is that you would be better off using the standard mdrun which in 4.5 by default has thread-support and therefore will run on a single cpu/node without MPI! Cheers, -- Szilárd On Thu, Oct 21, 2010 at 9:18 PM, Renato Freitas renato...@gmail.com wrote: Hi gromacs users, I have installed the lastest version of gromacs (4.5.1) in an i7 980X (6 cores or 12 with HT on; 3.3 GHz) with 12GB of RAM and compiled its mpi version. Also I compiled the GPU-accelerated version of gromacs. Then I did a 2 ns simulation using a small system (11042 atoms) to compare the performance of mdrun-gpu vs mdrun_mpi. The results that I got are bellow: My *.mdp is: constraints = all-bonds integrator = md dt = 0.002 ; ps ! nsteps = 100 ; total 2000 ps. 
nstlist = 10 ns_type = grid coulombtype = PME rvdw = 0.9 rlist = 0.9 rcoulomb = 0.9 fourierspacing = 0.10 pme_order = 4 ewald_rtol = 1e-5 vdwtype = cut-off pbc = xyz epsilon_rf = 0 comm_mode = linear nstxout = 1000 nstvout = 0 nstfout = 0 nstxtcout = 1000 nstlog = 1000 nstenergy = 1000 ; Berendsen temperature coupling is on in four groups tcoupl = berendsen tc-grps = system tau-t = 0.1 ref-t = 298 ; Pressure coupling is on Pcoupl = berendsen pcoupltype = isotropic tau_p = 0.5 compressibility = 4.5e-5 ref_p = 1.0 ; Generate velocites is on at 298 K. gen_vel = no RUNNING GROMACS ON GPU mdrun-gpu -s topol.tpr -v out Here is a part of the md.log: Started mdrun on node 0 Wed Oct 20 09:52:09 2010 . . . R E A L C Y C L E A N D T I M E A C C O U N T I N G Computing: Nodes Number G-Cycles Seconds % -- Write traj. 1 1021 106.075 31.7 0.2 Rest 1 64125.577 19178.6 99.8
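Roland's time.c attachment is not preserved in the archive, but a minimal reconstruction along the lines he describes -- one CPU-time counter and one wall-clock counter, printed one after the other -- might look like this (an assumption: that the two timers are clock() and gettimeofday(); compile with cc time.c -o time -lm):

  #include <stdio.h>
  #include <time.h>
  #include <sys/time.h>
  #include <math.h>

  int main(void)
  {
      struct timeval w0, w1;
      clock_t        c0, c1;
      double         x = 0.0;
      long           i;

      gettimeofday(&w0, NULL);          /* wall-clock start ("Real" time) */
      c0 = clock();                     /* CPU-clock start ("NODE" time)  */

      for (i = 0; i < 50000000L; i++)   /* burn some CPU for a couple of seconds */
          x += sin((double) i);

      c1 = clock();
      gettimeofday(&w1, NULL);

      printf("%.2f\n", (double) (c1 - c0) / CLOCKS_PER_SEC);
      printf("%f\n", (w1.tv_sec - w0.tv_sec) + (w1.tv_usec - w0.tv_usec) * 1e-6);

      return x > 1e300;                 /* keep the loop from being optimised away */
  }

On a machine that is not oversubscribed, the two printed times should agree closely, as they do in Renato's output above.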
Re: [gmx-users] GPU slower than I7
Hi Renato, First of all, what you're seeing is pretty normal, especially that you have a CPU that is crossing the border of insane :) Why is it normal? The PME algorithms are just simply not very well not well suited for current GPU architectures. With an ill-suited algorithm you won't be able to see the speedups you can often see in other application areas - -even more so that you're comparing to Gromacs on a i7 980X. For more info + benchmarks see the Gromacs-GPU page: http://www.gromacs.org/gpu However, there is one strange thing you also pointed out. The fact that the NODE and Real time in your mdrun-gpu timing summary is not the same, but has 3x deviation is _very_ unusual. I've ran mdrun-gpu on quite a wide variety of hardware but I've never seen those two counter deviate. It might be an artifact from the cycle counters used internally that behave in an unusual way on your CPU. One other thing I should point out is that you would be better off using the standard mdrun which in 4.5 by default has thread-support and therefore will run on a single cpu/node without MPI! Cheers, -- Szilárd On Thu, Oct 21, 2010 at 9:18 PM, Renato Freitas renato...@gmail.com wrote: Hi gromacs users, I have installed the lastest version of gromacs (4.5.1) in an i7 980X (6 cores or 12 with HT on; 3.3 GHz) with 12GB of RAM and compiled its mpi version. Also I compiled the GPU-accelerated version of gromacs. Then I did a 2 ns simulation using a small system (11042 atoms) to compare the performance of mdrun-gpu vs mdrun_mpi. The results that I got are bellow: My *.mdp is: constraints = all-bonds integrator = md dt = 0.002 ; ps ! nsteps = 100 ; total 2000 ps. nstlist = 10 ns_type = grid coulombtype = PME rvdw = 0.9 rlist = 0.9 rcoulomb = 0.9 fourierspacing = 0.10 pme_order = 4 ewald_rtol = 1e-5 vdwtype = cut-off pbc = xyz epsilon_rf = 0 comm_mode = linear nstxout = 1000 nstvout = 0 nstfout = 0 nstxtcout = 1000 nstlog = 1000 nstenergy = 1000 ; Berendsen temperature coupling is on in four groups tcoupl = berendsen tc-grps = system tau-t = 0.1 ref-t = 298 ; Pressure coupling is on Pcoupl = berendsen pcoupltype = isotropic tau_p = 0.5 compressibility = 4.5e-5 ref_p = 1.0 ; Generate velocites is on at 298 K. gen_vel = no RUNNING GROMACS ON GPU mdrun-gpu -s topol.tpr -v out Here is a part of the md.log: Started mdrun on node 0 Wed Oct 20 09:52:09 2010 . . . R E A L C Y C L E A N D T I M E A C C O U N T I N G Computing: Nodes Number G-Cycles Seconds % -- Write traj. 1 1021 106.075 31.7 0.2 Rest 1 64125.577 19178.6 99.8 -- Total 1 64231.652 19210.3 100.0 -- NODE (s) Real (s) (%) Time: 6381.840 19210.349 33.2 1h46:21 (Mnbf/s) (MFlops) (ns/day) (hour/ns) Performance: 0.000 0.001 27.077 0.886 Finished mdrun on node 0 Wed Oct 20 15:12:19 2010 RUNNING GROMACS ON MPI mpirun -np 6 mdrun_mpi -s topol.tpr -npme 3 -v out Here is a part of the md.log: Started mdrun on node 0 Wed Oct 20 18:30:52 2010 R E A L C Y C L E A N D T I M E A C C O U N T I N G Computing: Nodes Number G-Cycles Seconds % -- Domain decomp. 3 11 1452.166 434.7 0.6 DD comm. load 3 10001 0.745 0.2 0.0 Send X to PME 3 101 249.003 74.5 0.1 Comm. coord. 3 101 637.329 190.8 0.3 Neighbor search 3 11 8738.669 2616.0 3.5 Force 3 101 99210.202 29699.2 39.2 Wait + Comm. F 3 101 3361.591 1006.3 1.3 PME mesh 3 101 66189.554 19814.2 26.2 Wait +
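Szilárd's last suggestion amounts to dropping mpirun and letting the built-in thread parallelism pick up all cores, along these lines (a sketch; the thread count assumes the i7 980X with HT described above):

  mdrun -nt 12 -s topol.tpr -v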
Re: [gmx-users] GPU slower than I7
Hi Roland, In fact I get better performance values using different rcoulomb, fourierspacing and the values of -npme suggested by g_tune_pme using -nt=12. The simulation using GPU was carried out using the dedicated machine, no other programs was runnig, even the graphical interface was stopped. About the CPU vs GPU simulation time, Szilárd explained that the PME algorithms still are not very well suited for current GPU architectures. I just don't know why the NODE and REAL times are not equal. Thanks, Renato 2010/10/21 Roland Schulz rol...@utk.edu: On Thu, Oct 21, 2010 at 5:53 PM, Renato Freitas renato...@gmail.com wrote: Thanks Roland. I will do a newer test using the fourier spacing equal to 0.11. I'd also suggest to look at g_tune_pme and run with different rcoulomb, fourier_spacing. As long as the ratio is the same you get the same accuracy. And you should get better performance (especially on the GPU) for longer cut-off and larger grid-spacing. However, about the performance of GPU versus CPU (mpi) let me try to explain it better: GPU NODE (s) Real (s) (%) Time: 6381.840 19210.349 33.2 1h46:21 (Mnbf/s) (MFlops) (ns/day) (hour/ns) Performance: 0.000 0.001 27.077 0.886 MPI NODE (s) Real (s) (%) Time: 12621.257 12621.257 100.0 3h30:21 (Mnbf/s) (GFlops) (ns/day) (hour/ns) Performance: 388.633 28.773 13.691 1.753 Yes. Sorry I didn't realize that NODE time and Real time is different. Did you run the GPU calculation on a desktop machine which was also doing other things at the time. This might explain it. As far as I know for a dedicated machine not running any other programs NODE and Real time should be the same. Looking abobe we can see that the gromacs prints in the output that the simulation is faster when the GPU is used. But this is not the reality. The truth is that simulation time with MPI was 106 min faster thatn that with GPU. It seems correct to you? As I said before, I was expecting that GPU should take a lower time than the 6 core MPI. Well the exact time depends on a lot of factors. And you probably can speed up both. But I would expect them to be both about similar fast. Roland -- gmx-users mailing list gmx-us...@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. Can't post? Read http://www.gromacs.org/Support/Mailing_Lists -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. Can't post? Read http://www.gromacs.org/Support/Mailing_Lists
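For reference, the g_tune_pme run being described would look roughly like this (a sketch; -nt is the thread count the poster mentions, and -launch starts the production run with the best settings found):

  g_tune_pme -nt 12 -s topol.tpr -launch

The tool benchmarks short runs with different numbers of dedicated PME ranks (the -npme values referred to above) and reports which split performs best on the machine at hand.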
Re: [gmx-users] GPU slower than I7
Hi Szilárd, Thans for your explanation. Do you know if there will be a new improvement of PME algorithms to take the full advantage of GPU video cards? Do you think that the NODE and Real time difference could be attributed to some compilation problem in the mdrun-gpu. Despite I'm asking this I didn't get any error in the compilation. Thanks, Renato 2010/10/22 Szilárd Páll szilard.p...@cbr.su.se: Hi Renato, First of all, what you're seeing is pretty normal, especially that you have a CPU that is crossing the border of insane :) Why is it normal? The PME algorithms are just simply not very well not well suited for current GPU architectures. With an ill-suited algorithm you won't be able to see the speedups you can often see in other application areas - -even more so that you're comparing to Gromacs on a i7 980X. For more info + benchmarks see the Gromacs-GPU page: http://www.gromacs.org/gpu However, there is one strange thing you also pointed out. The fact that the NODE and Real time in your mdrun-gpu timing summary is not the same, but has 3x deviation is _very_ unusual. I've ran mdrun-gpu on quite a wide variety of hardware but I've never seen those two counter deviate. It might be an artifact from the cycle counters used internally that behave in an unusual way on your CPU. One other thing I should point out is that you would be better off using the standard mdrun which in 4.5 by default has thread-support and therefore will run on a single cpu/node without MPI! Cheers, -- Szilárd On Thu, Oct 21, 2010 at 9:18 PM, Renato Freitas renato...@gmail.com wrote: Hi gromacs users, I have installed the lastest version of gromacs (4.5.1) in an i7 980X (6 cores or 12 with HT on; 3.3 GHz) with 12GB of RAM and compiled its mpi version. Also I compiled the GPU-accelerated version of gromacs. Then I did a 2 ns simulation using a small system (11042 atoms) to compare the performance of mdrun-gpu vs mdrun_mpi. The results that I got are bellow: My *.mdp is: constraints = all-bonds integrator = md dt = 0.002 ; ps ! nsteps = 100 ; total 2000 ps. nstlist = 10 ns_type = grid coulombtype = PME rvdw = 0.9 rlist = 0.9 rcoulomb = 0.9 fourierspacing = 0.10 pme_order = 4 ewald_rtol = 1e-5 vdwtype = cut-off pbc = xyz epsilon_rf = 0 comm_mode = linear nstxout = 1000 nstvout = 0 nstfout = 0 nstxtcout = 1000 nstlog = 1000 nstenergy = 1000 ; Berendsen temperature coupling is on in four groups tcoupl = berendsen tc-grps = system tau-t = 0.1 ref-t = 298 ; Pressure coupling is on Pcoupl = berendsen pcoupltype = isotropic tau_p = 0.5 compressibility = 4.5e-5 ref_p = 1.0 ; Generate velocites is on at 298 K. gen_vel = no RUNNING GROMACS ON GPU mdrun-gpu -s topol.tpr -v out Here is a part of the md.log: Started mdrun on node 0 Wed Oct 20 09:52:09 2010 . . . R E A L C Y C L E A N D T I M E A C C O U N T I N G Computing: Nodes Number G-Cycles Seconds % -- Write traj. 1 1021 106.075 31.7 0.2 Rest 1 64125.577 19178.6 99.8 -- Total 1 64231.652 19210.3 100.0 -- NODE (s) Real (s) (%) Time: 6381.840 19210.349 33.2 1h46:21 (Mnbf/s) (MFlops) (ns/day) (hour/ns) Performance: 0.000 0.001 27.077 0.886 Finished mdrun on node 0 Wed Oct 20 15:12:19 2010 RUNNING GROMACS ON MPI mpirun -np 6 mdrun_mpi -s topol.tpr -npme 3 -v out Here is a part of the md.log: Started mdrun on node 0 Wed Oct 20 18:30:52 2010 R E A L C Y C L E A N D T I M E A C C O U N T I N G Computing: Nodes Number G-Cycles Seconds % -- Domain decomp. 3 11 1452.166 434.7 0.6 DD comm. load 3 10001 0.745 0.2 0.0 Send X to PME 3 101 249.003 74.5 0.1 Comm.
Re: [gmx-users] GPU slower than I7
Hi, On Fri, Oct 22, 2010 at 3:20 PM, Renato Freitas renato...@gmail.com wrote: Do you think that the NODE and Real time difference could be attributed to some compilation problem in the mdrun-gpu. Despite I'm asking this I didn't get any error in the compilation. It is very odd that these are different for you system. What operating system and compiler do you use? Is HAVE_GETTIMEOFDAY set in src/config.h? I attached a small test program which uses the two different timers used for NODE and Real time. You can compile it with cc time.c -o time and run it with ./time. Do you get roughly the same time twice with the test program or do you see the same discrepancy as with GROMACS? Roland Thanks, Renato 2010/10/22 Szilárd Páll szilard.p...@cbr.su.se: Hi Renato, First of all, what you're seeing is pretty normal, especially that you have a CPU that is crossing the border of insane :) Why is it normal? The PME algorithms are just simply not very well not well suited for current GPU architectures. With an ill-suited algorithm you won't be able to see the speedups you can often see in other application areas - -even more so that you're comparing to Gromacs on a i7 980X. For more info + benchmarks see the Gromacs-GPU page: http://www.gromacs.org/gpu However, there is one strange thing you also pointed out. The fact that the NODE and Real time in your mdrun-gpu timing summary is not the same, but has 3x deviation is _very_ unusual. I've ran mdrun-gpu on quite a wide variety of hardware but I've never seen those two counter deviate. It might be an artifact from the cycle counters used internally that behave in an unusual way on your CPU. One other thing I should point out is that you would be better off using the standard mdrun which in 4.5 by default has thread-support and therefore will run on a single cpu/node without MPI! Cheers, -- Szilárd On Thu, Oct 21, 2010 at 9:18 PM, Renato Freitas renato...@gmail.com wrote: Hi gromacs users, I have installed the lastest version of gromacs (4.5.1) in an i7 980X (6 cores or 12 with HT on; 3.3 GHz) with 12GB of RAM and compiled its mpi version. Also I compiled the GPU-accelerated version of gromacs. Then I did a 2 ns simulation using a small system (11042 atoms) to compare the performance of mdrun-gpu vs mdrun_mpi. The results that I got are bellow: My *.mdp is: constraints = all-bonds integrator = md dt = 0.002; ps ! nsteps = 100 ; total 2000 ps. nstlist = 10 ns_type = grid coulombtype= PME rvdw= 0.9 rlist = 0.9 rcoulomb= 0.9 fourierspacing = 0.10 pme_order = 4 ewald_rtol = 1e-5 vdwtype = cut-off pbc = xyz epsilon_rf= 0 comm_mode = linear nstxout = 1000 nstvout = 0 nstfout = 0 nstxtcout = 1000 nstlog = 1000 nstenergy = 1000 ; Berendsen temperature coupling is on in four groups tcoupl = berendsen tc-grps = system tau-t = 0.1 ref-t = 298 ; Pressure coupling is on Pcoupl = berendsen pcoupltype = isotropic tau_p = 0.5 compressibility = 4.5e-5 ref_p = 1.0 ; Generate velocites is on at 298 K. gen_vel = no RUNNING GROMACS ON GPU mdrun-gpu -s topol.tpr -v out Here is a part of the md.log: Started mdrun on node 0 Wed Oct 20 09:52:09 2010 . . . 
R E A L   C Y C L E   A N D   T I M E   A C C O U N T I N G

 Computing:      Nodes  Number    G-Cycles    Seconds      %
 ------------------------------------------------------------
 Write traj.         1    1021     106.075       31.7     0.2
 Rest                1           64125.577    19178.6    99.8
 ------------------------------------------------------------
 Total               1           64231.652    19210.3   100.0
 ------------------------------------------------------------

                NODE (s)   Real (s)      (%)
 Time:          6381.840  19210.349     33.2
                1h46:21
               (Mnbf/s)   (MFlops)  (ns/day)  (hour/ns)
 Performance:     0.000      0.001    27.077     0.886

Finished mdrun on node 0 Wed Oct 20 15:12:19 2010

RUNNING GROMACS ON MPI

mpirun -np 6 mdrun_mpi -s topol.tpr -npme 3 -v out

Here is a part of the md.log: Started mdrun on node 0 Wed Oct 20 18:30:52 2010 R E A L C Y C L E A N D T I
Re: [gmx-users] GPU slower than I7
On Thu, Oct 21, 2010 at 3:18 PM, Renato Freitas renato...@gmail.com wrote: Hi gromacs users, I have installed the lastest version of gromacs (4.5.1) in an i7 980X (6 cores or 12 with HT on; 3.3 GHz) with 12GB of RAM and compiled its mpi version. Also I compiled the GPU-accelerated version of gromacs. Then I did a 2 ns simulation using a small system (11042 atoms) to compare the performance of mdrun-gpu vs mdrun_mpi. The results that I got are bellow: My *.mdp is: constraints = all-bonds integrator = md dt = 0.002; ps ! nsteps = 100 ; total 2000 ps. nstlist = 10 ns_type = grid coulombtype= PME rvdw= 0.9 rlist = 0.9 rcoulomb= 0.9 fourierspacing = 0.10 pme_order = 4 ewald_rtol = 1e-5 vdwtype = cut-off pbc = xyz epsilon_rf= 0 comm_mode = linear nstxout = 1000 nstvout = 0 nstfout = 0 nstxtcout = 1000 nstlog = 1000 nstenergy = 1000 ; Berendsen temperature coupling is on in four groups tcoupl = berendsen tc-grps = system tau-t = 0.1 ref-t = 298 ; Pressure coupling is on Pcoupl = berendsen pcoupltype = isotropic tau_p = 0.5 compressibility = 4.5e-5 ref_p = 1.0 ; Generate velocites is on at 298 K. gen_vel = no RUNNING GROMACS ON GPU mdrun-gpu -s topol.tpr -v out Here is a part of the md.log: Started mdrun on node 0 Wed Oct 20 09:52:09 2010 . . . R E A L C Y C L E A N D T I M E A C C O U N T I N G Computing: Nodes Number G-CyclesSeconds % -- Write traj.1 1021106.075 31.7 0.2 Rest 1 64125.577 19178.6 99.8 -- Total 1 64231.652 19210.3 100.0 -- NODE (s)Real (s)(%) Time:6381.84019210.349 33.2 1h46:21 (Mnbf/s) (MFlops) (ns/day)(hour/ns) Performance:0.000 0.001 27.077 0.886 Finished mdrun on node 0 Wed Oct 20 15:12:19 2010 RUNNING GROMACS ON MPI mpirun -np 6 mdrun_mpi -s topol.tpr -npme 3 -v out Here is a part of the md.log: Started mdrun on node 0 Wed Oct 20 18:30:52 2010 R E A L C Y C L E A N D T I M E A C C O U N T I N G Computing: Nodes Number G-CyclesSeconds % -- Domain decomp. 3 11 1452.166 434.7 0.6 DD comm. load 3 100010.745 0.2 0.0 Send X to PME 3 101249.003 74.5 0.1 Comm. coord. 3 101 637.329190.8 0.3 Neighbor search3 11 8738.669 2616.0 3.5 Force 3 101 99210.202 29699.239.2 Wait + Comm. F 3 101 3361.591 1006.3 1.3 PME mesh 3 101 66189.554 19814.2 26.2 Wait + Comm. X/F3 60294.513 8049.5 23.8 Wait + Recv. PME F 3 101801.897240.1 0.3 Write traj. 3 1015 33.464 10.0 0.0 Update 3 1013295.820 986.6 1.3 Constraints 3 101 6317.568 1891.2 2.5 Comm. energies 3 12 70.784 21.2 0.0 Rest3 2314.844 693.0 0.9 -- Total6 252968.14875727.5 100.0 -- -- PME redist. X/F3 2021945.551 582.4 0.8 PME spread/gather 3 202
Re: [gmx-users] GPU slower than I7
Thanks Roland. I will do a newer test using the fourier spacing equal to 0.11. However, about the performance of GPU versus CPU (mpi) let me try to explain it better: The simulation using gromacs with GPU started and finished: Started mdrun on node 0 Wed Oct 20 09:52:09 2010 Finished mdrun on node 0 Wed Oct 20 15:12:19 2010 Total time = 320 min The simulation using gromacs with mpi started and finished: Started mdrun on node 0 Wed Oct 20 18:30:52 2010 Finished mdrun on node 0 Wed Oct 20 22:01:14 2010 Total time = 211 min Based on this numbers, it was the CPU with mpi that was faster than the GPU, by aproximately 106 min. But looking at the end of each output I have: GPU NODE (s)Real (s)(%) Time:6381.84019210.34933.2 1h46:21 (Mnbf/s) (MFlops) (ns/day)(hour/ns) Performance:0.000 0.001 27.077 0.886 MPI NODE (s) Real (s)(%) Time:12621.257 12621.257 100.0 3h30:21 (Mnbf/s) (GFlops) (ns/day)(hour/ns) Performance: 388.633 28.77313.691 1.753 Looking abobe we can see that the gromacs prints in the output that the simulation is faster when the GPU is used. But this is not the reality. The truth is that simulation time with MPI was 106 min faster thatn that with GPU. It seems correct to you? As I said before, I was expecting that GPU should take a lower time than the 6 core MPI. Thanks, Renato 2010/10/21 Roland Schulz rol...@utk.edu: On Thu, Oct 21, 2010 at 3:18 PM, Renato Freitas renato...@gmail.com wrote: Hi gromacs users, I have installed the lastest version of gromacs (4.5.1) in an i7 980X (6 cores or 12 with HT on; 3.3 GHz) with 12GB of RAM and compiled its mpi version. Also I compiled the GPU-accelerated version of gromacs. Then I did a 2 ns simulation using a small system (11042 atoms) to compare the performance of mdrun-gpu vs mdrun_mpi. The results that I got are bellow: My *.mdp is: constraints = all-bonds integrator = md dt = 0.002 ; ps ! nsteps = 100 ; total 2000 ps. nstlist = 10 ns_type = grid coulombtype = PME rvdw = 0.9 rlist = 0.9 rcoulomb = 0.9 fourierspacing = 0.10 pme_order = 4 ewald_rtol = 1e-5 vdwtype = cut-off pbc = xyz epsilon_rf = 0 comm_mode = linear nstxout = 1000 nstvout = 0 nstfout = 0 nstxtcout = 1000 nstlog = 1000 nstenergy = 1000 ; Berendsen temperature coupling is on in four groups tcoupl = berendsen tc-grps = system tau-t = 0.1 ref-t = 298 ; Pressure coupling is on Pcoupl = berendsen pcoupltype = isotropic tau_p = 0.5 compressibility = 4.5e-5 ref_p = 1.0 ; Generate velocites is on at 298 K. gen_vel = no RUNNING GROMACS ON GPU mdrun-gpu -s topol.tpr -v out Here is a part of the md.log: Started mdrun on node 0 Wed Oct 20 09:52:09 2010 . . . R E A L C Y C L E A N D T I M E A C C O U N T I N G Computing: Nodes Number G-Cycles Seconds % -- Write traj. 1 1021 106.075 31.7 0.2 Rest 1 64125.577 19178.6 99.8 -- Total 1 64231.652 19210.3 100.0 -- NODE (s) Real (s) (%) Time: 6381.840 19210.349 33.2 1h46:21 (Mnbf/s) (MFlops) (ns/day) (hour/ns) Performance: 0.000 0.001 27.077 0.886 Finished mdrun on node 0 Wed Oct 20 15:12:19 2010 RUNNING GROMACS ON MPI mpirun -np 6 mdrun_mpi -s topol.tpr -npme 3 -v out Here is a part of the md.log: Started mdrun on node 0 Wed Oct 20 18:30:52 2010 R E A L C Y C L E A N D T I M E A C C O U N T I N G Computing: Nodes Number G-Cycles Seconds % -- Domain decomp. 3 11 1452.166 434.7 0.6 DD comm. load 3
Re: [gmx-users] GPU slower than I7
On Thu, Oct 21, 2010 at 5:53 PM, Renato Freitas renato...@gmail.com wrote: Thanks Roland. I will do a newer test using the fourier spacing equal to 0.11. I'd also suggest to look at g_tune_pme and run with different rcoulomb, fourier_spacing. As long as the ratio is the same you get the same accuracy. And you should get better performance (especially on the GPU) for longer cut-off and larger grid-spacing. However, about the performance of GPU versus CPU (mpi) let me try to explain it better: GPU NODE (s)Real (s)(%) Time:6381.84019210.34933.2 1h46:21 (Mnbf/s) (MFlops) (ns/day)(hour/ns) Performance:0.000 0.001 27.077 0.886 MPI NODE (s) Real (s)(%) Time:12621.257 12621.257 100.0 3h30:21 (Mnbf/s) (GFlops) (ns/day)(hour/ns) Performance: 388.633 28.77313.691 1.753 Yes. Sorry I didn't realize that NODE time and Real time is different. Did you run the GPU calculation on a desktop machine which was also doing other things at the time. This might explain it. As far as I know for a dedicated machine not running any other programs NODE and Real time should be the same. Looking abobe we can see that the gromacs prints in the output that the simulation is faster when the GPU is used. But this is not the reality. The truth is that simulation time with MPI was 106 min faster thatn that with GPU. It seems correct to you? As I said before, I was expecting that GPU should take a lower time than the 6 core MPI. Well the exact time depends on a lot of factors. And you probably can speed up both. But I would expect them to be both about similar fast. Roland -- gmx-users mailing listgmx-users@gromacs.org http://lists.gromacs.org/mailman/listinfo/gmx-users Please search the archive at http://www.gromacs.org/Support/Mailing_Lists/Search before posting! Please don't post (un)subscribe requests to the list. Use the www interface or send it to gmx-users-requ...@gromacs.org. Can't post? Read http://www.gromacs.org/Support/Mailing_Lists