By the way, the workstation has a total of 36 cores / 72 threads, so using 
mpirun -np 20 is possible (and should be equivalent) on both platforms. 

Thanks,
cap79

> On Feb 1, 2017, at 2:52 PM, Andy Witzig <cap1...@icloud.com> wrote:
> 
> Hi all,
> 
> I’m testing my application on an SMP workstation (dual Intel Xeon E5-2697 v4 
> 2.3 GHz Broadwell processors, boost 2.8-3.1 GHz, 128 GB RAM) and am 
> seeing a 4x performance drop compared to a cluster system with 2.6 GHz Intel 
> Haswell processors, 20 cores/node, and 128 GB RAM/node.  On both systems the 
> application was compiled using OpenMPI 1.6.4.  I have tried running:
> 
> mpirun -np 20 $EXECUTABLE $INPUT_FILE
> mpirun -np 20 --mca btl self,sm $EXECUTABLE $INPUT_FILE
> 
> and others, but cannot achieve the same performance on the workstation as is 
> seen on the cluster.  The workstation outperforms the cluster on other 
> non-MPI, multi-threaded applications, so I don’t think it’s a hardware issue.
> 
> Any help you can provide would be appreciated.
> 
> Thanks,
> cap79
> _______________________________________________
> users mailing list
> users@lists.open-mpi.org
> https://rfd.newmexicoconsortium.org/mailman/listinfo/users
