Re: [Pw_forum] GGA+U

2017-02-13 Thread Giuseppe Mattioli

Dear F. Marsusi

> Is this what one expected always from +U calculations,

As I told you before nothing can be considered as "expected" when you apply the 
U correction to a strongly hybridized sp2 system such as a pi-
conjugated molecule or graphene. In my experience this is not a good idea, 
because you obtain, for example, a higher ionization energy (and this is 
right when you correct LDA/GGA delocalization errors, and in agreement with a 
correction with exact exchange like B3LYP or HSE) but also a lower 
electron affinity (and this is wrong, and the opposite behavior of a correction 
with exact exchange). So you are walking in "terra incognita" where 
nothing can be given for granted.
HTH
Giuseppe

On Friday, February 10, 2017 08:38:37 PM FARAH MARSUSI wrote:
> Dear Giuseppe,
> 
> Many thanks for your quick response. As you have correctly guessed, the 
> system is an organic material (fluorinated graphene). From U=0 to 3.4, M is
> increasing gradually up to the expected value, M remains constant till U=3.6 
> eV, then by increasing U, again M is reduced very soon to zero. Is
> this what one expected always from +U calculations, when goes up further from 
> the optimum U value? From your answer, it is concluded to be a
> strange behavior due to the organic nature of this material.
> 
> Best regards,
> F. Marsusi,
> 
> Amirkabir University of Technology.​
> 
> On Fri, 02/10/2017 08:09 PM, Giuseppe Mattioli  
> wrote:
> > Dear F. Marsusi
> > First of all you are not providing any detail of your system, so we cannot 
> > even guess what is "natural" for it.
> > However, you are using the Hubbard U correction in a semiempirical way, and 
> > there is therefore no way to choose the U value but that reproducing
> > some measured parameter, and not very much to do if unexpected or strange 
> > results appear at a given (unphysical?, problematic?) U value.
> > In my experience the application of U to organic molecules has been always 
> > problematic due to the strong hybridization between C 2s and C 2p
> > orbitals, and to the application of the correction to 2p orbitals only.
> > HTH
> > Giuseppe
> > 
> > On Friday, February 10, 2017 07:25:10 PM FARAH MARSUSI wrote:
> > > Dear all,
> > > 
> > > By GGA+U as implemented in QE, the correct magnetization (M) and band gap 
> > > was obtained. The correct U value for each atom was obtained by
> > > intensive
> > > step by step runs to reach gradually the experimental M value, and 
> > > therefore band gap. All results are OK till now (the U value itself also= 
> > > 3.4
> > > eV
> > > for carbon and fluorine is acceptable ). However, by enlarging the 
> > > obtained U value a bit (to 3.7 eV), the predicted M come back to that of 
> > > pure
> > > GGA. Is it natural, or a problem exist?
> > > 
> > > 
> > > Best regards,
> > > F. Marsusi,
> > 
> > 
> > - Article premier - Les hommes naissent et demeurent
> > libres et égaux en droits. Les distinctions sociales
> > ne peuvent être fondées que sur l'utilité commune
> > - Article 2 - Le but de toute association politique
> > est la conservation des droits naturels et
> > imprescriptibles de l'homme. Ces droits sont la liberté,
> > la propriété, la sûreté et la résistance à l'oppression.
> > 
> > 
> >Giuseppe Mattioli
> >CNR - ISTITUTO DI STRUTTURA DELLA MATERIA
> >v. Salaria Km 29,300 - C.P. 10
> >I 00015 - Monterotondo Stazione (RM), Italy
> >Tel + 39 06 90672342 - Fax +39 06 90672316
> >E-mail: " target="_blank">
> >http://www.ism.cnr.it/en/staff/giuseppe-mattioli/
> >ResearcherID: F-6308-2012
> > 
> > ___
> > Pw_forum mailing list
> > Pw_forum@pwscf.org
> > http://pwscf.org/mailman/listinfo/pw_forum
> > 
> > --
> > This email was Anti Virus checked by Astaro Security Gateway. 
> > http://www.sophos.com


- Article premier - Les hommes naissent et demeurent
libres et égaux en droits. Les distinctions sociales
ne peuvent être fondées que sur l'utilité commune
- Article 2 - Le but de toute association politique
est la conservation des droits naturels et 
imprescriptibles de l'homme. Ces droits sont la liberté,
la propriété, la sûreté et la résistance à l'oppression.


   Giuseppe Mattioli
   CNR - ISTITUTO DI STRUTTURA DELLA MATERIA   
   v. Salaria Km 29,300 - C.P. 10
   I 00015 - Monterotondo Stazione (RM), Italy
   Tel + 39 06 90672342 - Fax +39 06 90672316
   E-mail: 
   http://www.ism.cnr.it/en/staff/giuseppe-mattioli/
   ResearcherID: F-6308-2012

___
Pw_forum mailing list
Pw_forum@pwscf.org
http://pwscf.org/mailman/listinfo/pw_forum


Re: [Pw_forum] PW.x homogeneous electric field berry phase calculation in trigonal cell

2017-02-13 Thread Louis Fry-Bouriaux
Hi Lorenzo,


I have got it to work using both automatic and manual k-points, the mistake 
was to specify both 'berry=.true' and 'lelfield=.true.' at the same time and 
using 'nscf' calculation. The process I was going by was the description in the 
header of bp_c_phase.f90, when as you alluded to the electric field code is 
very similar but separate so the latter was the correct way to go with 'scf' 
calculation.


For 'berry=.true.', one needs to run a normal 'scf' with 'lelfield=.false.' to 
get the ground state first, then the polarisation can be calculated (P=0 which 
is a good sign for my system) as described in bp_c_phase.f90. I will now test 
other gdir than 3.


For 'lelfield=.true.', one can just specify a value of the field, both auto and 
manual work (again with gdir=3 so far). Afterwards one can apparently do a 
'nscf' run with 'berry=.true.' (and 'lelfield=.false.') which reports the 
polarisation in the same format as the 0 field calculation above which is now 
non-zero as expected.


Example 10 was helpful too, if anyone reads this that should be a good point of 
reference.


Finally when you were talking about the bottleneck, I suppose you were talking 
about the efield code, my impression so far is this is not a problem using 4 
processors, I will also test using 8 and compare the time taken. I have no idea 
how fast it 'should' be with proper parallisation, assuming it is possible to 
parallelise.


Kindest regards,

Louis


From: pw_forum-boun...@pwscf.org  on behalf of 
Louis Fry-Bouriaux 
Sent: 09 February 2017 11:38:01
To: PWSCF Forum
Subject: Re: [Pw_forum] PW.x homogeneous electric field berry phase calculation 
in trigonal cell


Hi Lorenzo,


   Thanks for clarifying!


   Yes that was quite silly of me to have nosym=.false., but yes it has no 
effect I still get the 'wrong g' error.


   I will now manually specify the k-points and relay what happens later..


   I am very interested in solving the bottleneck issue as I wish to perform a 
much more intensive calculation using a large amorphous cell, and later move on 
to using ESM, so I am debugging as best as I can to first understand the code 
and what exactly causes the bottleneck. Are you able to share any additional 
technical information on this issue?


Thank you so much for you help and assistance,

Louis


From: pw_forum-boun...@pwscf.org  on behalf of 
Lorenzo Paulatto 
Sent: 08 February 2017 19:57:19
To: PWSCF Forum
Subject: Re: [Pw_forum] PW.x homogeneous electric field berry phase calculation 
in trigonal cell

On Wednesday, February 8, 2017 5:46:25 PM CET Louis Fry-Bouriaux wrote:
> In your last email you mentioned the field 'pdir', I have not been able
> to find this in the documentation, is this what you meant?

It is the internal name of a variable in the subroutine that computes the
macroscopic polarisation, but I think it is always gdir in the end.

> as this is where the error is when I use other than gdir=3 (I added ln 364
> to identify what fails exactly)

I never did this myself, but from the code expects strings of k-points, one
after each other. Each string is formed by nppstr k-points, aligned in the
direction gdir; there can be as many string as you want, in principle to form
a grid in the two directions orthogonal to gdir.

Maybe in the case gdir=3 these string are formed "spontaneously" when using k-
points automatic, but it does not work in the other cases. The polarisation
(berry=.true.) and the electric field (lefield=.true.) codes are very similar,
but not identical, in theory the second is more general, I'm not too familiar
with either.

I've noticed that you set nosym=.false., I'm quite sure that that swith is
overridden with lefield, but to be sure, could you try to sisable it?




___
Pw_forum mailing list
Pw_forum@pwscf.org
http://pwscf.org/mailman/listinfo/pw_forum
___
Pw_forum mailing list
Pw_forum@pwscf.org
http://pwscf.org/mailman/listinfo/pw_forum

[Pw_forum] function "dgamma"

2017-02-13 Thread Mahmoud Payami Shabestari
Dear QE developers,

Kindly, in the subroutine "vnonloc" of program "hgh2qe.f90", the function 
"dgamma" is used but is not already defined. So the compilation is failed. 
Should I borrow some single-argument gamma function from somewhere to 
succeed?

Best regards,
Mahmoud Payami, AEOI___
Pw_forum mailing list
Pw_forum@pwscf.org
http://pwscf.org/mailman/listinfo/pw_forum

Re: [Pw_forum] PW.x homogeneous electric field berry phase calculation in trigonal cell

2017-02-13 Thread Lorenzo Paulatto
On Monday, February 13, 2017 11:43:08 AM CET Louis Fry-Bouriaux wrote:
> Finally when you were talking about the bottleneck, I suppose you were
> talking about the efield code, my impression so far is this is not a
> problem using 4 processors, I will also test using 8 and compare the time
> taken. I have no idea how fast it 'should' be with proper parallisation,
> assuming it is possible to parallelise.

When you increase the number of CPUs, you would expect the time to decreased 
linearly, if over a certain number of CPUs it stops decreasing or if it 
decreases slower than linear, it is a bottleneck. This will always happen 
eventually, but with berry/lefield it happens much sooner.

Thank you for reporting back! I hope this information will be useful to future 
users

-- 
Dr. Lorenzo Paulatto 
IdR @ IMPMC -- CNRS & Université Paris 6
phone: +33 (0)1 442 79822 / skype: paulatz
www:   http://www-int.impmc.upmc.fr/~paulatto/
mail:  23-24/423 Boîte courrier 115, 4 place Jussieu 75252 Paris Cédex 05

___
Pw_forum mailing list
Pw_forum@pwscf.org
http://pwscf.org/mailman/listinfo/pw_forum


Re: [Pw_forum] function "dgamma"

2017-02-13 Thread Paolo Giannozzi
Hi Mahmoud, "dgamma" is an intrinsic fortran mathematical function
that is not present on all compilers. Try with gfortran as in the
README file: the compilation of "hgh2qe.f90" is independent from the
rest of QE

Paolo

On Mon, Feb 13, 2017 at 1:13 PM, Mahmoud Payami Shabestari
 wrote:
>
> Dear QE developers,
>
> Kindly, in the subroutine "vnonloc" of program "hgh2qe.f90", the function
> "dgamma" is used but is not already defined. So the compilation is failed.
> Should I borrow some single-argument gamma function from somewhere to
> succeed?
>
> Best regards,
> Mahmoud Payami, AEOI
>
> ___
> Pw_forum mailing list
> Pw_forum@pwscf.org
> http://pwscf.org/mailman/listinfo/pw_forum



-- 
Paolo Giannozzi, Dip. Scienze Matematiche Informatiche e Fisiche,
Univ. Udine, via delle Scienze 208, 33100 Udine, Italy
Phone +39-0432-558216, fax +39-0432-558222
___
Pw_forum mailing list
Pw_forum@pwscf.org
http://pwscf.org/mailman/listinfo/pw_forum


Re: [Pw_forum] function "dgamma"

2017-02-13 Thread Mahmoud Payami Shabestari
Dear Paolo,

Thank you very much for your reply.
I had also checked by using "gfortran" and it failed. Maybe because my 
gfortran it too old (version 4.1.2).
Ok, I will make it.

Thank you again,
Best regards
Mahmoud
-Original Message-
From: Paolo Giannozzi 
To: PWSCF Forum 
Date: Mon, 13 Feb 2017 14:28:58 +0100
Subject: Re: [Pw_forum] function "dgamma"


Hi Mahmoud, "dgamma" is an intrinsic fortran mathematical function
that is not present on all compilers. Try with gfortran as in the
README file: the compilation of "hgh2qe.f90" is independent from the
rest of QE

Paolo

On Mon, Feb 13, 2017 at 1:13 PM, Mahmoud Payami Shabestari
mailto:mpayami%40aeoi.org.ir]> wrote:
>
> Dear QE developers,
>
> Kindly, in the subroutine "vnonloc" of program "hgh2qe.f90", the function
> "dgamma" is used but is not already defined. So the compilation is failed.
> Should I borrow some single-argument gamma function from somewhere to
> succeed?
>
> Best regards,
> Mahmoud Payami, AEOI
>
> ___
> Pw_forum mailing list
> Pw_forum@pwscf.org [mailto:Pw_forum%40pwscf.org]
> http://pwscf.org/mailman/listinfo/pw_forum 
[http://pwscf.org/mailman/listinfo/pw_forum]



--
Paolo Giannozzi, Dip. Scienze Matematiche Informatiche e Fisiche,
Univ. Udine, via delle Scienze 208, 33100 Udine, Italy
Phone +39-0432-558216, fax +39-0432-558222
___
Pw_forum mailing list
Pw_forum@pwscf.org [mailto:Pw_forum%40pwscf.org]
http://pwscf.org/mailman/listinfo/pw_forum 
[http://pwscf.org/mailman/listinfo/pw_forum]___
Pw_forum mailing list
Pw_forum@pwscf.org
http://pwscf.org/mailman/listinfo/pw_forum

[Pw_forum] How to understand Local DOS of a certain atom in qe example

2017-02-13 Thread 朱林光
Dear all qe users:
   I  have  a problem of understanding Local DOS of a certain atom in qe 
example,especially the term irmin(*,*) and irmax in the following input file,I 
don't understand why and how we can do the LDOS calculations of a certain atom 
only by defining the value of irmin and irmax?especially how can we use the 
value of irmin to specify the LDOS we are going to calculate is above the 
surface Al not above the surface As? for example  irmin(1,1)= 0, irmax(1,1)= 2, 
irmin(2,1)= 0, irmax(2,1)= 2, irmin(3,1)=63, irmax(3,1)=65,
 &projwfc
prefix  = 'AlAs110'
outdir='$TMP_DIR/',
ngauss=0
degauss=0.01
DeltaE=0.02
tdosinboxes=.true.
plotboxes=.true.
n_proj_boxes=8

!! Boxes centered on the first vacuum layer:
  !! 1) above the surface Al
irmin(1,1)= 0, irmax(1,1)= 2, irmin(2,1)= 0, irmax(2,1)= 2, irmin(3,1)=63, 
irmax(3,1)=65,
  !! 2) above the surface As
irmin(1,2)= 9, irmax(1,2)=11, irmin(2,2)= 5, irmax(2,2)= 7, irmin(3,2)=63, 
irmax(3,2)=65,
  !! 3) above the 2nd layer Al
irmin(1,3)= 9, irmax(1,3)=11, irmin(2,3)=14, irmax(2,3)=16, irmin(3,3)=63, 
irmax(3,3)=65,
  !! 4) as large as the surface unit cell
irmin(1,4)= 1, irmax(1,4)=18, irmin(2,4)= 1, irmax(2,4)=27, irmin(3,4)=63, 
irmax(3,4)=65,

!! Same as above, centered on the second vacuum layer:
irmin(1,5)= 0, irmax(1,5)= 2, irmin(2,5)= 0, irmax(2,5)= 2, irmin(3,5)=72, 
irmax(3,5)=74,
irmin(1,6)= 9, irmax(1,6)=11, irmin(2,6)= 5, irmax(2,6)= 7, irmin(3,6)=72, 
irmax(3,6)=74,
irmin(1,7)= 9, irmax(1,7)=11, irmin(2,7)=14, irmax(2,7)=16, irmin(3,7)=72, 
irmax(3,7)=74,
irmin(1,8)= 1, irmax(1,8)=18, irmin(2,8)= 1, irmax(2,8)=27, irmin(3,8)=72, 
irmax(3,8)=74,
 /
___
Pw_forum mailing list
Pw_forum@pwscf.org
http://pwscf.org/mailman/listinfo/pw_forum


[Pw_forum] Fw: Re: Re: optimization problems of vanadium oxide

2017-02-13 Thread WANG YUANQING
Dear all,

Hi. Still I have not fixed the relaxation problem. But I have some new 
findings. The output of the vc-relax calculation shows that the force in y 
position is totally zero (in each step). As can be seen from the input file, I 
did not set this constraint on the relaxation. Does this result mean something 
wrong in input? It is really appreciated if someone can answer this. Thank you 
in advance.

Best,

Yuanqing Wang


--- Original Message 
Subject: Re: Re: [Pw_forum] optimization problems of vanadium oxide
Date: Tue, 17 Jan 2017 18:33:24 +0900
From: WANG YUANQING 
To: PWSCF Forum 
Cc: 

Dear Manu,

Hi. Sorry for the late reply. I tried your settings (cov_thr and bfgs), 
however, it does not help. The force is still very large during relaxation. I 
found two oxygens tend to approach potassium very closely during relaxation, 
which is not reasonable.

Besides, I want to move the atoms in all directions. Is there anyone who can 
suggest me some alternative methods? Thank you in advance!

Best,

Yuanqing

 
Date: Thu, 5 Jan 2017 12:42:07 -0500
From: Manu Hegde 
Subject: Re: [Pw_forum] optimization problems of vanadium oxide
To: PWSCF Forum 
Message-ID:

Content-Type: text/plain; charset="utf-8"

I am not expert in this but i can suggest few things try conv_thr=10^-8
and use ion/cell_dynamics=bfgs. Also which direction you want to move the
atoms?. I mean cell_dofree?. all the xyz direction?
Hope it helps you.
Manu


On Thu, Jan 5, 2017 at 12:35 PM,   
wrote:

> Dear QE users,
>
> I am trying to optimize the crystal structure of K2V8O21. The input file
> is given below. I tried many different sets (vc-realx and relax.
> constrained or not), or optimization algorithm (BFGS or damp).  However, I
> cannot get converged result. Can someone give me some suggestions? Thank
> you very much!
>
> Best,
>
> Yuanqing Wang
>
> postdoctor
> RIKEN, Japan
>
> --One input file---
> &control
> calculation='vc-relax',
> restart_mode = 'from_scratch',
> prefix = 'k2v8o21',
> outdir = './',
> tprnfor = .TRUE.
> pseudo_dir = '/pseudo/',
> nstep=200
>  /
>  &system
> ibrav = 0,
> nat= 62, ntyp= 3,
> ecutwfc = 150,
> ecutrho = 600,
> tot_charge=0,
> occupations='smearing', smearing='mp', degauss=0.03
>  /
>  &electrons
>mixing_beta = 0.3
>  /
>  &ions
>   ion_dynamics='damp'
>  /
>  &CELL
>   cell_dynamics='damp-pr'
>  /
> ATOMIC_SPECIES
>  K  39.0983 K.pz-hgh.UPF
>  V  50.942  V.pz-hgh.UPF
>  O  15.999  O.pz-hgh.UPF
> CELL_PARAMETERS angstrom
>  13.746484076   0.0  -0.193637952
>   0.0   3.360623858   0.0
>  -0.429832895   0.0  13.681512366
> ATOMIC_POSITIONS angstrom
> K   6.984312493   0.0   4.188728225 0 0 0
> K   6.332338675   0.0   9.299146598 0 0 0
> K   0.43470   1.680311929   4.286848404 0 0 0
> K  13.205507698   1.680311929   9.201026418 0 0 0
> V  10.878749236   1.680311929  12.433287018
> V   2.437901942   1.680311929   1.054587499
> V   4.008352623   0.0  12.539262335
> V   9.308298554   0.0   0.948612182
> V  11.369939010   0.0   6.612591821
> V   1.946712158   0.0   6.875283001
> V   4.495045176   1.680311929   6.720753553
> V   8.821605991   1.680311929   6.767121270
> V  11.150388833   0.0   3.743423629
> V   2.166262335   0.0   9.744451194
> V   4.296183684   1.680311929   3.847414705
> V   9.020467484   1.680311929   9.640460118
> V  12.578611942   0.0   1.090039654
> V   0.738039229   0.0  12.397835067
> V   5.704373956   1.680311929   1.184004573
> V   7.612277214   1.680311929  12.303870148
> O  12.349856377   0.0   5.231345609
> O   0.966794818   0.0   8.256528398
> O   5.490033518   1.680311929   5.354972202
> O   7.826617675   1.680311929   8.132901806
> O  10.708076383   1.680311929   6.225623611
> O   2.608574796   1.680311929   7.262250803
> O   3.828996973   0.0   6.322373797
> O   9.487654207   0.0   7.165500618
> O  12.137386541   1.680311929   0.583012891
> O   1.179264646   1.680311929  12.904861320
> O   5.265470757  -0.0   0.685756586
> O   8.051180430   0.0  12.802117625
> O  11.927069357   1.680311929  11.327461557
> O   1.389581812   1.680311929   2.160413266
> O   5.060238394   0.0  11.437026216
> O   8.256412774   0.0   2.050848606
> O  10.802990364   0.0   1.920124897
> O   2.513660822   0.0  11.567749315
> O   3.937601468   1.680311929   2.030377822
> O   9.379049718   1.680311929  11.457496389
> O   9.720341944   0.0   4.289255974
> O   3.596309249   0.0   9.198618033
> O   2.865221165   1.680311929   4.391289428
> O  10.451430029   1.680311929   9.096584579
> O   1.048281817   0.0   5

[Pw_forum] General quenstion about metallic supercell

2017-02-13 Thread Afshin Arjangmehr
Dear all,

I have a question regarding constructing supercells in QE and performing
vc-relax. Based on recommendations available on pwscf website, I used
xcrysden to generate a rather large cell (consisting 125 atoms of
vanadium, available in attached file). But, when I modify my input script
and again visualize the structure, "translational asymmetric cell" view
shows a very strange configuration of atoms. Also, performing vc-relax
takes incredibly long time (about 6 days on 64-cores).
Is there something wrong with my method? Is this a correct super cell?
I use QE5.4.0 compiled with ifort.
-
Method:

1- Constructing a simple bcc unit cell (~ in1.txt)
2- Visualizing using xcrysden, changing to primitive cell mode, repeating
cell in x,y and z-direction (5x5x5) and saving configuration (~ data.xyz)
3- Extracting atomic positions from saved configuration and modifying the
input script (in2.txt)

Thank you very much for your response.

-- 
With Best Regards
Afshin Arjhangemehr
PhD candidate in Radiation Application
Shahid Beheshti University G.C, Tehran, IRAN
(+98) 912 439 20 64
 &CONTROL
 calculation = 'scf' ,
restart_mode = 'from_scratch' ,

etot_conv_thr = 1.0E-8  ,
   forc_conv_thr = 1.0D-8 ,
   outdir='/home/koma/software/QE-USER/Output',
   pseudo_dir = '/home/koma/software/QE-USER/Pseudo-potential',
tprnfor   = .true.
tstress = .true.
   /
&SYSTEM
ibrav = 3,
celldm(1) = 5.62586,
nat = 1,
ntyp = 1,
ecutwfc = 100
occupations = 'smearing' ,
degauss= 0.01 ,
smearing= 'gaussian',
nspin=2 ,
starting_magnetization(1) = 0.2 ,
 /
&ELECTRONS
  conv_thr = 1.D-8 ,
 /
&IONS
  ion_dynamics= 'bfgs',
  upscale  = 70.d0,
  pot_extrapolation = 'atomic',
  wfc_extrapolation = 'none' ,
/
ATOMIC_SPECIES
V50.6941V.pbe-spn-kjpaw_psl.0.2.3.UPF
ATOMIC_POSITIONS (angstrom)
   V 0   0  0 
K_POINTS {automatic}
16 16 16 0 0 0  


data.xyz
Description: Protein Databank data
 &CONTROL
calculation = 'vc-relax' ,
restart_mode = 'from_scratch' ,
prefix = 'V_125atoms' ,
etot_conv_thr = 1.0E-2  ,
forc_conv_thr = 1.0D-2 ,
outdir='/share/users/mahdiani/espresso/Output',
pseudo_dir = '/share/users/mahdiani/espresso/PS',
tprnfor   = .true.
tstress = .true.
 /
&SYSTEM
ibrav = 3,
celldm(1) = 28.6293 ,
nat = 125,
ntyp = 1,
ecutwfc = 47 ,
occupations = 'smearing' ,
degauss= 0.05 ,
smearing= 'gaussian',
nspin=2 ,
starting_magnetization(1) = 0.05 ,
 /
&ELECTRONS
  conv_thr = 1.D-2 ,
 /
&IONS
  ion_dynamics= 'bfgs',
  upscale  = 70.d0,
  pot_extrapolation = 'atomic',
  wfc_extrapolation = 'none' ,
 /
&CELL
   cell_dynamics = 'bfgs' ,
   cell_factor = 2 ,
 /
ATOMIC_SPECIES
V50.6941V.pbe-spn-kjpaw_psl.0.2.3.UPF
ATOMIC_POSITIONS (angstrom)
   V0.000.000.00
   V   -1.5149967090   -1.51499670901.5149967090
   V   -3.0299934180   -3.02999341803.0299934180
   V   -4.5449901270   -4.54499012704.5449901270
   V   -6.0599868360   -6.05998683606.0599868360
   V   -1.51499670901.51499670901.5149967090
   V   -3.02999341800.003.0299934180
   V   -4.5449901270   -1.51499670904.5449901270
   V   -6.0599868360   -3.02999341806.0599868360
   V   -7.5749835450   -4.54499012707.5749835450
   V   -3.02999341803.02999341803.0299934180
   V   -4.54499012701.51499670904.5449901270
   V   -6.05998683600.006.0599868360
   V   -7.5749835450   -1.51499670907.5749835450
   V   -9.0899802540   -3.02999341809.0899802540
   V   -4.54499012704.54499012704.5449901270
   V   -6.05998683603.02999341806.0599868360
   V   -7.57498354501.51499670907.5749835450
   V   -9.08998025400.009.0899802540
   V  -10.6049769630   -1.5149967090   10.6049769630
   V   -6.05998683606.05998683606.0599868360
   V   -7.57498354504.54499012707.5749835450
   V   -9.08998025403.02999341809.0899802540
   V  -10.60497696301.5149967090   10.6049769630
   V  -12.11997367

Re: [Pw_forum] PW.x homogeneous electric field berry phase calculation in trigonal cell

2017-02-13 Thread Louis Fry-Bouriaux
Thanks Lorenzo I hope so too, I think the best references are Examples 4 and 
10, I have this tendency to just go ahead once I get something working, need to 
work on that :P


Indeed I have reproduced almost exactly what you have said. What I can 
confirm when using bp_c_phase (no electric field):


- all gdir work, only gdir=3 has a notable improvement in performance.

- when gdir=3, up to 4 processors scaling is good, on 8 it is terrible it 
actually takes longer, WALL time is notably larger than CPU time.

- the call to 'CALL mp_sum(aux_g(:), intra_bgrp_comm )' is made when gdir != 3.


My current understanding is that mp_sum takes the trace of the 'aux_g' matrix, 
whereas for gdir=3 there is significantly less code that ends up building the 
matrix 'aux' which is finally used to build 'mat'. The matrix 'evc' represents 
the wavefunctions built using plane waves, but 'evc' is used in many files. 
Since bp_c_phase is executed last, 'evc' has already been built and is only 
read in this file. With this and comparing the output I notice that performance 
when gdir=3 is better for almost all routines.. I will continue debugging 
tomorrow on the 8 processor machine where the differences are much more 
noticeable.. Do you think I should contact Paolo Giannozzi directly to better 
understand what is going on here?


Thanks so much [??]

Louis


From: pw_forum-boun...@pwscf.org  on behalf of 
Lorenzo Paulatto 
Sent: 13 February 2017 13:04:22
To: PWSCF Forum
Subject: Re: [Pw_forum] PW.x homogeneous electric field berry phase calculation 
in trigonal cell

On Monday, February 13, 2017 11:43:08 AM CET Louis Fry-Bouriaux wrote:
> Finally when you were talking about the bottleneck, I suppose you were
> talking about the efield code, my impression so far is this is not a
> problem using 4 processors, I will also test using 8 and compare the time
> taken. I have no idea how fast it 'should' be with proper parallisation,
> assuming it is possible to parallelise.

When you increase the number of CPUs, you would expect the time to decreased
linearly, if over a certain number of CPUs it stops decreasing or if it
decreases slower than linear, it is a bottleneck. This will always happen
eventually, but with berry/lefield it happens much sooner.

Thank you for reporting back! I hope this information will be useful to future
users

--
Dr. Lorenzo Paulatto
IdR @ IMPMC -- CNRS & Université Paris 6
phone: +33 (0)1 442 79822 / skype: paulatz
www:   http://www-int.impmc.upmc.fr/~paulatto/
mail:  23-24/423 Boîte courrier 115, 4 place Jussieu 75252 Paris Cédex 05

___
Pw_forum mailing list
Pw_forum@pwscf.org
http://pwscf.org/mailman/listinfo/pw_forum
___
Pw_forum mailing list
Pw_forum@pwscf.org
http://pwscf.org/mailman/listinfo/pw_forum