Dear Professor Giannozzi,

Thank you so much for your answer. You are right, I did not really think
about the parallelization for the initial SCF calculation, I was more
puzzled by the pp.x calculation. If I understand you correctly, using
something like "pp.x -nk 16" might also help speed up the LDOS calculation
as well? But there also might be an upper limit of how fast this performs
and it could be that I simply have to wait a while to obtain the LDOS? As
in, except for my suboptimal parallelization strategy, the long runtime of
pp.x I observed was not due to wrong usage but simply to computational load
of the problem?

Kind regards
Lenz Fiedler

PhD Student (HZDR / CASUS)


Am Di., 16. März 2021 um 16:36 Uhr schrieb Paolo Giannozzi <
p.gianno...@gmail.com>:

> Your supercell is small but you have a high cutoff and a large number of
> k-points, which makes the overall computational load quite heavy. Moreover
> your choice of parallelization is less than optimal: you should use both
> k-point and plane-wave parallelization, something like "mpirun -np 160 pw.x
> -nk 16" or so. For reference: I ran your Fe.pw.scf.in input in 20' on a
> small 36-core machine with two (powerful) GPU's. The nscf calculation took
> 1h20' for the calculation and is taking as much time if not more to write
> the files. Also note that not all postprocessing calculations are optimized
> for large runs.
>
> Paolo
>
> On Mon, Mar 15, 2021 at 7:23 PM Lenz Fiedler <fiedler.l...@gmail.com>
> wrote:
>
>> (resent because something went wrong with the header line)
>>
>> Hi users,
>>
>> I am experiencing a problem with pp.x. I want to calculate the LDOS for a
>> 16 Fe atom supercell using a 560 value energy grid and the 3D grid given by
>> the DFT calculation (100x100x100 grid points). I have performed the DFT
>> calculation by first doing a SCF calculation (8x8x8 k-points) and then a
>> non-SCF calculation (14x14x14 k-points) successfully. Now I am trying to
>> do:
>>
>> mpirun -np 160 pp.x -in Fe.pp.ldos.in
>>
>> but the calculation takes way longer than I anticipated. The entire DFT
>> calculation took less than a day, while for the LDOS, after about 12 hours,
>> only 20 tmp files had been written. Am I doing something wrong? Or is this
>> expected? At this rate, calculating the LDOS would take days, which is why
>> I am assuming I am doing something wrong.
>> Please find my output and input files here:
>>
>>
>> https://drive.google.com/drive/folders/1R-m5jlw1bcNxe3nBXjc4dms-IoMMp4Ir?usp=sharing
>> (the pp.out file is from a run where I investigated if a lower number of
>> CPUs helps, but that was slower, as expected.)
>>
>> Kind regards
>> Lenz
>>
>> PhD Student (HZDR / CASUS)
>> _______________________________________________
>> Quantum ESPRESSO is supported by MaX (www.max-centre.eu)
>> users mailing list users@lists.quantum-espresso.org
>> https://lists.quantum-espresso.org/mailman/listinfo/users
>
>
>
> --
> Paolo Giannozzi, Dip. Scienze Matematiche Informatiche e Fisiche,
> Univ. Udine, via delle Scienze 206, 33100 Udine, Italy
> Phone +39-0432-558216, fax +39-0432-558222
>
> _______________________________________________
> Quantum ESPRESSO is supported by MaX (www.max-centre.eu)
> users mailing list users@lists.quantum-espresso.org
> https://lists.quantum-espresso.org/mailman/listinfo/users
_______________________________________________
Quantum ESPRESSO is supported by MaX (www.max-centre.eu)
users mailing list users@lists.quantum-espresso.org
https://lists.quantum-espresso.org/mailman/listinfo/users

Reply via email to