A suggestion here: transiesta usually starts with a standard siesta run, if 
you’re not restarting from a .TSDE file for instance - this may explain the 
speedup you get. 

arthur


> On 29 Aug 2016, at 15:52, Yangchuan Li <liyangch...@utexas.edu> wrote:
> 
> Hi Nick,
> 
> Thank you for your answers. I appreciate it.
> Recently, I do some other tests and I put it here.
> 
> cores    Transiesta                Tbtrans
> 32      8h 19m 41s 557ms    47m 46s 89ms
> 48      6h 29m 41s 770ms    47m 37s 510ms
> 64      4h 18m 26s 283ms    47m 32s 215ms
> 96      4h 20m 11s 997ms    47m 28s 189ms
> 
> The number of energy points is 43. I have two questions for the result. 
> 1. Why it takes longer time on 48 node if transiesta parallels across energy 
> points?
> 2. Why it takes the same time for tbtrans if tbtrans parallels across energy 
> points?
> 
> Another question is on the linear scalability for atom number. I have read 
> this article: The SIESTA method; developments and applicability 
> <http://iopscience.iop.org/article/10.1088/0953-8984/20/6/064208/meta;jsessionid=6E37FEDC0B8943CD097C536638603E54.c4.iopscience.cld.iop.org>.
>  It said siesta was coupled to several linear-scaling solvers. And also said 
> other method scaling as N^3. Does this means siesta scaling between linear 
> and N^3? What about transiesta and tbtrans?
> 
> Thanks for your time. Really appreciate your help.
> 
> Best,
> Yangchuan 
> ME | UT Austin
> 
> On Wed, Aug 24, 2016 at 3:58 PM, Nick Papior <nickpap...@gmail.com 
> <mailto:nickpap...@gmail.com>> wrote:
> 
> 
> 
> On 24 Aug 2016 21:55, "Yangchuan Li" <liyangch...@utexas.edu 
> <mailto:liyangch...@utexas.edu>> wrote:
> >
> > Hi Nick,
> >
> > I really appreciate your answer. It makes my work more efficient.
> > Recently I also do a siesta speed test. It's a geometry optimization 
> > running. I select k grid to be 1x1x4. And I run on 64, 128, 256 cores. 
> > There is no speed up as a result.
> > So, is it right to make the following summary?
> > 1. siesta is parallel across k points (points in .KP file which after 
> > symmetry analysis). 
> No, by default siesta is parallel over orbitals. Unless you specifically 
> request k parallelization. 
> However, there is a limit to the scalability (for any code), say if your 
> system have very few orbitals then parallelization to many cores will be 
> dominated by overhead. 
> > 2. transiesta is parallel across energy points, which is the sum of three 
> > variable (TS.ComplexContour.NumCircle, TS.ComplexContour.NumLine, 
> > TS.ComplexContour.NumPoles)
> Yes. 
> > 3. tbtrans is parallel across k points, which is not related with kz. But 
> > the kxy point after symmetry analysis.
> No, tbtrans is also parallel across energy points. 
> > 4. There is no need to use cores more than energy points.
> For tbtrans and transiesta, this is currently true. 
> 
> > Is there anything wrong with my understanding? 
> > Thanks for your time.
> >
> > Best regards,
> > Yangchuan Li
> > Graduate research assistant
> > ME | UT Austin
> >
> > On Fri, Aug 19, 2016 at 4:12 PM, Nick Papior <nickpap...@gmail.com 
> > <mailto:nickpap...@gmail.com>> wrote:
> >>
> >> Transiesta does not calculate the transmission, so the number of energy 
> >> points is irrelevant for this calculation.
> >>
> >> The number of points for transiesta is determined from the flags explained 
> >> in section 8.7.3 of siesta 4.0 manual. 
> >> See here: 
> >> https://launchpad.net/siesta/4.0/4.0/+download/siesta.pdf 
> >> <https://launchpad.net/siesta/4.0/4.0/+download/siesta.pdf>
> >>
> >> --
> >>
> >> Kind regards Nick Papior
> >>
> >>
> >> On 19 Aug 2016 22:16, "Yangchuan Li" <liyangch...@utexas.edu 
> >> <mailto:liyangch...@utexas.edu>> wrote:
> >>>
> >>> Hi Nick,
> >>>
> >>> Thank you for your answer. Yes, you are right. In the output of tbtrans 
> >>> the number of energy points is 43. 
> >>> But how can I know the number ahead? In electrode the k-points are 
> >>> 1x1x10, in system the k-pts are 1x1x100. And I select 121 points in 
> >>> transmission vs energy plot. How transiesta select the final points to be 
> >>> 43? 
> >>> Hope you can answer this so I can use resources efficiently. I appreciate 
> >>> your answer. Thanks. 
> >>>
> >>> Yangchuan
> >>> Graduate research assistant
> >>> ME | UT Austin
> >>>
> >>>
> >>> On Thu, Aug 18, 2016 at 12:00 AM, Nick Papior <nickpap...@gmail.com 
> >>> <mailto:nickpap...@gmail.com>> wrote:
> >>>>
> >>>> This is not surprising, transiesta is only parallelized across energy 
> >>>> points. This means you probably have between 33 and 64 energy points on 
> >>>> your contour.
> >>>>
> >>>> 2016-08-17 23:00 GMT+02:00 Yangchuan Li <liyangch...@utexas.edu 
> >>>> <mailto:liyangch...@utexas.edu>>:
> >>>>>
> >>>>> Dear siesta users,
> >>>>>
> >>>>> I did a speed up test of transiesta (version 4.0 stable)  on a cluster. 
> >>>>> I run a job of 380 atoms with different number of cores. And here is 
> >>>>> the result. The running time doesn't include tbtrans running.
> >>>>> cores    Transiesta          
> >>>>> 32      8h 19m 41s 557ms    
> >>>>> 64      4h 18m 26s 283ms   
> >>>>> 96      4h 20m 11s 997ms   
> >>>>> 128     4h 18m 9s 189ms     
> >>>>> 256     4h 18m 37s 196ms    
> >>>>> 512     4h 19m 8s 984ms   
> >>>>>
> >>>>> It seems strange because from 32 to 64 I got a perfect speed up, while 
> >>>>> increase the cores from 64 doesn't give me anything. 
> >>>>> So, why this happened and is there a way to speed up more?
> >>>>> Any comments will be appreciated. Thanks.
> >>>>>
> >>>>> Yangchuan
> >>>>> Graduate research assistant
> >>>>> ME | UT Austin
> >>>>
> >>>>
> >>>>
> >>>>
> >>>> -- 
> >>>> Kind regards Nick
> >>>
> >>>
> >
> 
> --
> Kind regards Nick 
> 

Responder a