Hi all,

I have an issue doing parallel runs where the simulation would just hang at seemingly random intervals anywhere from an hour to a day. There are no error messages reported in the logs and nothing funny from dmesg.

My set up is two dual-core Pentium D. I run with -np 4 to take advantage of all cores.

When I issue a top command when the run is frozen, I notice that mdrun_mpi is at 0% CPU usage and sometimes sleeping on the "slave" node. On the "master" node, CPU usage is close to max. There is no network activity according to the blinking lights on my switch as well.

When I do a run with -np 2 where one process is run on each computer, the run seems to carry on stably.

I am using:
Gromacs 3.3.1
Lam 7.1.2
Parallel Knoppix Kernel version 2.6.12

Gromacs was compiled from source.

I understand from searching the archives that a few people have had a problem similar to mine, but I could not fine a straight answer.

Thanks!
Jason


_______________________________________________
gmx-users mailing list    gmx-users@gromacs.org
http://www.gromacs.org/mailman/listinfo/gmx-users
Please don't post (un)subscribe requests to the list. Use the www interface or send it to [EMAIL PROTECTED]
Can't post? Read http://www.gromacs.org/mailing_lists/users.php

Reply via email to