Re: [Wien] lapwso_mpi error

Gavin Abo Thu, 17 Nov 2016 05:08:04 -0800

So you are using the ifort version with the unformatted file read bug.Based on the Intel page at the link in the previous post below, did youtry recompiling lapwso_mpi with -O0 or revert to one of the versions ofifort that Intel mentioned to see if it fixed the problem or not?


On 11/14/2016 8:34 AM, Md. Fhokrul Islam wrote:

Hi Gavin,
Thanks for your suggestion. Yes, I am using 16.0.3.210 version ofifort. Debugging such a
big file with 'od' seems to be difficult but I will try witha smaller system and see if I get the
same error.



Fhokrul



------------------------------------------------------------------------
*From:* Wien <wien-boun...@zeus.theochem.tuwien.ac.at> on behalf ofGavin Abo <gs...@crimson.ua.edu>
*Sent:* Sunday, November 13, 2016 11:40 PM
*To:* A Mailing list for WIEN2k users
*Subject:* Re: [Wien] lapwso_mpi error
Ok, I agree that it is likely not due to the set up of the scratchdirectory.
What version of ifort was used? If you happened to use 16.0.3.210,maybe it is caused by an ifort bug [https://software.intel.com/en-us/articles/read-failure-unformatted-file-io-psxe-16-update-3].
Perhaps you can use the linux "od" command to try to troubleshot andidentify what the data mismatch is between the writing and reading ofthe 3Mn.vectordn_1 file, similar to what is described on the web pages at:
https://software.intel.com/en-us/forums/intel-fortran-compiler-for-linux-and-mac-os-x/topic/269993

https://software.intel.com/en-us/forums/intel-fortran-compiler-for-linux-and-mac-os-x/topic/270436

https://software.intel.com/en-us/forums/intel-fortran-compiler-for-linux-and-mac-os-x/topic/268503
Though, it might be harder to diagnose with the large 3Mn.vectordn_1,which looks to be about 12 GB. So you may want to create a mpi SOcalculation that creates a smaller case.vectordn_1 for that.
On 11/13/2016 7:30 AM, Md. Fhokrul Islam wrote:
Hi Gavin,
In my .bashrc scratch is defined as $SCRATCH = ./ so if I use thecommand
echo $SCRATCH, it always returns ./
For large jobs, I use local temporary directory that is associatedwith each node
in our system and is given by $SNIC_TMP. This temporary directory iscreated
on fly, so I set $SCRATCH = $SNIC_TMP in my job submission script. AsI said
this set up works fine if I do MPI calculations without spin-orbitand I get converged
results. But if I submit the job after initializing with spin-orbit,it crashes at lapwso.
SO I think problem is probably not due to the set up with scratchdirectory, it is
something to do with MPI version of LAPWSO.



Thanks for your comment.


Fhokrul

_______________________________________________
Wien mailing list
Wien@zeus.theochem.tuwien.ac.at
http://zeus.theochem.tuwien.ac.at/mailman/listinfo/wien
SEARCH the MAILING-LIST at:  
http://www.mail-archive.com/wien@zeus.theochem.tuwien.ac.at/index.html

Re: [Wien] lapwso_mpi error

Reply via email to