Try to remove the "lapw0" string from the .machines file, so it reads:
1:nx1 1:nx1 1:nx62 1:nx62 granularity:1 extrafine:1 If it will not work, also try running lapw0 in serial mode : lapw0:nx1 1:nx1 1:nx1 1:nx62 1:nx62 granularity:1 extrafine:1 also, take a look at the scripts which generate the proper .machines file: http://www.wien2k.at/reg_user/faq/pbs.html regards, Yurko On 23 February 2010 06:24, Zhiyong Zhang <zyzhang at stanford.edu> wrote: > OK. Here are some more clues about the problem: > > forrtl: severe (64): input conversion error, unit 19, file > /home/zzhang/wien2k-runs/lapw/TiC/TiC.vns > Image ? ? ? ? ? ? ?PC ? ? ? ? ? ? ? ?Routine ? ? ? ? ? ?Line ? ? ? ?Source > lapw1 ? ? ? ? ? ? ?00000000004E6F1E ?Unknown ? ? ? ? ? ? ? Unknown ?Unknown > lapw1 ? ? ? ? ? ? ?00000000004E611A ?Unknown ? ? ? ? ? ? ? Unknown ?Unknown > lapw1 ? ? ? ? ? ? ?000000000049FB76 ?Unknown ? ? ? ? ? ? ? Unknown ?Unknown > lapw1 ? ? ? ? ? ? ?000000000046D75A ?Unknown ? ? ? ? ? ? ? Unknown ?Unknown > lapw1 ? ? ? ? ? ? ?000000000046CD76 ?Unknown ? ? ? ? ? ? ? Unknown ?Unknown > lapw1 ? ? ? ? ? ? ?0000000000486885 ?Unknown ? ? ? ? ? ? ? Unknown ?Unknown > lapw1 ? ? ? ? ? ? ?00000000004540F8 ?rdswar_ ? ? ? ? ? ? ? ? ? ?29 > ?rdswar_tmp_.F > lapw1 ? ? ? ? ? ? ?0000000000435FD3 ?inilpw_ ? ? ? ? ? ? ? ? ? 393 ?inilpw.f > lapw1 ? ? ? ? ? ? ?0000000000438224 ?MAIN__ ? ? ? ? ? ? ? ? ? ? 41 > ?lapw1_tmp_.F > lapw1 ? ? ? ? ? ? ?0000000000404422 ?Unknown ? ? ? ? ? ? ? Unknown ?Unknown > libc.so.6 ? ? ? ? ?0000003E1251C40B ?Unknown ? ? ? ? ? ? ? Unknown ?Unknown > lapw1 ? ? ? ? ? ? ?000000000040436A ?Unknown ? ? ? ? ? ? ? Unknown ?Unknown > > I checked the TiC.vns in the parallel calculation and found the following > (Please note the NaN entries): > > ? ? TOTAL POTENTIAL IN INTERSTITIAL > > ? ? ? ? ? ? ? ?136 NUMBER OF PW > ? ? ? 0 ? ?0 ? ?0 NaN ? ? ? ? ? ? ? ?0.000000000000E+00 > ? ? ?-1 ? -1 ? -1 0.966480192428E-08 0.000000000000E+00 > ? ? ? 0 ? ?0 ? -2 0.237305964226E-06 0.000000000000E+00 > ? ? ? 0 ? -2 ? -2 0.383070560427E-08 0.000000000000E+00 > ? ? ?-1 ? -1 ? -3-0.108089242452E-08 0.000000000000E+00 > > However, in the TiC.vns from the serial run, which seem to have worked fine, > I found the following: > > ? ? TOTAL POTENTIAL IN INTERSTITIAL > > ? ? ? ? ? ? ? ?136 NUMBER OF PW > ? ? ? 0 ? ?0 ? ?0-0.227173083856E-01 0.000000000000E+00 > ? ? ?-1 ? -1 ? -1 0.114592956480E-02 0.000000000000E+00 > ? ? ? 0 ? ?0 ? -2-0.115420958078E-01 0.000000000000E+00 > ? ? ? 0 ? -2 ? -2 0.184312999415E-01 0.000000000000E+00 > ? ? ?-1 ? -1 ? -3-0.137802961139E-03 0.000000000000E+00 > ? ? ?-2 ? -2 ? -2-0.285539143809E-02 0.000000000000E+00 > > Does anybody have any clue about the problem? > > Thanks again, > > Zhiyong > > > ----- Original Message ----- > From: "Zhiyong Zhang" <zyzhang at stanford.edu> > To: "A Mailing list for WIEN2k users" <wien at zeus.theochem.tuwien.ac.at> > Sent: Monday, February 22, 2010 9:15:44 PM GMT -08:00 US/Canada Pacific > Subject: Re: [Wien] parallel wien2k > > Hello Ricardo and All, > > Thank you for the information. I think you are right that part of the problem > is because no forces printed. The example I am using is the TiC in the user > guide. when I used "run_lapw -i 40 0.001 -I" in serial mode it worked fine. > > The problem "/home/zzhang/wien2k/lapw1para lapw1.def" seems to be due to the > .machines file definition. If I remove the "lapw1:nx1 ?nx1 ?nx62 ?nx62" from > the .machines file ans use the following .machines file, > > lapw0:nx1 ?nx1 ?nx62 ?nx62 > 1:nx1 > 1:nx1 > 1:nx62 > 1:nx62 > granularity:1 > extrafine:1 > > Then the LAPW1 can run in parallel. > > Does this mean that lapw1/2 can only be run in k-point parallel mode, not > fine grain MPI mode? > > How ever, I still got the following error in TiC.dayfile: > > 4 number_of_parallel_jobs > ? ? nx1(11) 0.226u 0.017s 0.31 76.18% ? ? ?0+0k 0+0io 0pf+0w > ? ? nx1(11) 0.224u 0.009s 0.31 73.04% ? ? ?0+0k 0+0io 0pf+0w > ? ? nx62(11) 0.222u 0.008s 0.32 71.21% ? ? ?0+0k 0+0io 0pf+0w > ? ? nx62(11) 0.222u 0.010s 0.26 88.21% ? ? ?0+0k 0+0io 0pf+0w > ? ? nx1(1) 0.224u 0.008s 0.26 88.89% ? ? ?0+0k 0+0io 0pf+0w > ? ? nx1(1) 0.223u 0.008s 0.26 88.17% ? ? ?0+0k 0+0io 0pf+0w > ? ? nx62(1) 0.222u 0.009s 0.26 86.19% ? ? ?0+0k 0+0io 0pf+0w > ** ?LAPW1 crashed! > 0.062u 0.436s 0:11.45 4.2% ? ? ?0+0k 0+0io 0pf+0w > error: command ? /home/zzhang/wien2k/lapw1para lapw1.def ? failed > > Which files should I read to find possible causes of the crash? I looked the > *.error files but can't seem to find anything useful. > > Best, > Zhiyong > > > > ----- Original Message ----- > From: "Ricardo Faccio" <rfaccio at fq.edu.uy> > To: "A Mailing list for WIEN2k users" <wien at zeus.theochem.tuwien.ac.at> > Sent: Monday, February 22, 2010 8:28:35 PM GMT -08:00 US/Canada Pacific > Subject: Re: [Wien] parallel wien2k > > Hi Zhiyong > What is your test case? remember that forces are printed if you have atoms > located in general positions. For example, Fe in the bcc space group, will > not print forces, since all atoms have the same symmetric environment. > Regards > Ricardo > > -- > ?------------------------------------------------------------------------- > ----- ? Dr. Ricardo Faccio > > ?Mail: Cryssmat-Lab., C?tedra de F?sica, DETEMA > ?Facultad de Qu?mica, Universidad de la Rep?blica > ? ? ? Av. Gral. Flores 2124, C.C. 1157 > ? ? ? C.P. 11800, Montevideo, Uruguay. > ?E-mail: rfaccio at fq.edu.uy > ?Phone: 598 2 9241860 Int. 109 > ? ? ? ? ? ? 598 2 9290705 > ?Fax: ? ?598 2 9241906 > ?Web: ?http://cryssmat.fq.edu.uy/ricardo/ricardo.htm > >> Dear All, >> >> >> >> I am trying to test wien2k in parallel mode and I got into some problem. I >> am using >> >> >> >> run_lapw -p -i 40 -fc 0.001 -I >> >> >> >> If I use a number of 0.001 for the option fc above, I got the following >> error: >> >> >> >> Force-convergence not possible. Forces not present. >> >> >> >> If I do not use a number for the -fc option, but use "run_lapw -p -i 40 >> -fc >> -I" instead >> >> >> >> Then lapw0 finishes without a problem but the program doesn't branch to >> lapw1. An error message is generated when doing the test >> >> >> >> "if ($fcut == "0") goto lapw1 >> >> >> >> I was able to do "run_lapw -p -i 40 -I", without the "-fc" option at all >> and >> was able to finish "lapw0 -p" and then start "lapw1 -p" but got into the >> following error: >> >> >> >> error: command ? /home/zzhang/wien2k/lapw1para lapw1.def ? failed >> >> >> >> Does anybody have similar problems and know how to fix this? >> >> >> >> It does the following: >> >> >> >> running LAPW1 in parallel mode (using .machines) >> >> >> >> and the .machines file is as follows: >> >> >> >> # >> >> lapw0:nx1 ?nx1 ?nx62 ?nx62 >> >> lapw1:nx1 ?nx1 ?nx62 ?nx62 >> >> lapw2:nx1 ?nx1 ?nx62 ?nx62 >> >> 1:nx1 >> >> 1:nx1 >> >> 1:nx62 >> >> 1:nx62 >> >> granularity:1 >> >> extrafine:1 >> >> >> >> Thanks, >> >> Zhiyong >> >> >> >> _______________________________________________ >> Wien mailing list >> Wien at zeus.theochem.tuwien.ac.at >> http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien >> > > > _______________________________________________ > Wien mailing list > Wien at zeus.theochem.tuwien.ac.at > http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien > _______________________________________________ > Wien mailing list > Wien at zeus.theochem.tuwien.ac.at > http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien > _______________________________________________ > Wien mailing list > Wien at zeus.theochem.tuwien.ac.at > http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien > -- Yurko (aka Yuriy, Iurii, Jurij etc) Natanzon PhD student Department for Structural Research (NZ31) Henryk Niewodnicza?ski Institute of Nuclear Physics Polish Academy of Sciences ul. Radzikowskiego 152, 31-342 Krakow, Poland E-mail: Yurii.Natanzon at ifj.edu.pl, yurko.natanzon at gmail.com