Saliya,

There are several things at play here:
1) which collective module is used?
2) if the tuned collective module is used, then which algorithm is used?
3) which btl is used?

First, the btl is independent of the collective module.
That means that when you perform a collective operation, intra-node communications will (most likely) use the sm or vader btl, which is optimized for shared memory, while inter-node communications will use openib, tcp, or whatever other btl is available.
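For example (just a sketch: ./my_app is a placeholder, and the btl components actually available depend on how your Open MPI was built), you can list the btl components and select them explicitly at run time:

  ompi_info | grep "MCA btl"                         # list the btl components in this build
  mpirun --mca btl self,vader,tcp -np 24 ./my_app    # vader for intra-node, tcp for inter-node traffic
  mpirun --mca btl_base_verbose 30 -np 24 ./my_app   # print which btl is selected for each peer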

There is also a collective module called coll_sm; if I understand correctly, it works only on single-node communicators and avoids using any btl when possible.
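If you want to look at or experiment with coll_sm (again a sketch; parameter names and defaults can differ between Open MPI versions), you can dump its parameters and raise its priority for a single-node run:

  ompi_info --param coll sm --level 9                # show coll_sm parameters, e.g. coll_sm_priority
  mpirun --mca coll_sm_priority 100 -np 12 ./my_app  # let coll_sm win the selection on node-local communicators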

Collective modules have different priorities, and they do not necessarily implement all collective operations. For example, the inter module does not implement barrier on an intra-communicator; conversely, the tuned module does not implement barrier on an inter-communicator.
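You can check which collective components are present in your build and what priority each one carries with ompi_info (the exact output format varies between versions):

  ompi_info | grep "MCA coll"                              # list the collective components (tuned, sm, inter, ...)
  ompi_info --param coll tuned --level 9 | grep priority   # e.g. coll_tuned_priority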

In most cases (e.g. the default configuration with an intra-communicator), the tuned collective module is used. Each operation has several implementations, and one is chosen based on communicator size and message size. This can be overridden via environment variables and a config file, as previously described by George.
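Concretely, the two parameters you quoted below can be set as environment variables (with the OMPI_MCA_ prefix), on the mpirun command line, or in an MCA parameter file such as $HOME/.openmpi/mca-params.conf (./my_app is a placeholder):

  export OMPI_MCA_coll_tuned_use_dynamic_rules=1
  export OMPI_MCA_coll_tuned_allgatherv_algorithm=3
  mpirun -np 24 ./my_app

  # or equivalently on the command line
  mpirun --mca coll_tuned_use_dynamic_rules 1 --mca coll_tuned_allgatherv_algorithm 3 -np 24 ./my_app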

Last but not least, some collective modules (hierarch, ml, ?) implement hierarchical collectives, which means they should be optimized for multi-node runs with multiple tasks per node. That being said, ml is not production ready, and I am not sure whether hierarch is actively maintained.
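If you want to control which collective components are even considered (a sketch; the component names depend on your build, and the ^ prefix means "exclude"), you can pass an explicit component list:

  mpirun --mca coll ^ml -np 24 ./my_app                          # use the defaults, but exclude ml
  mpirun --mca coll tuned,sm,libnbc,basic,self -np 24 ./my_app   # restrict the selection to this list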

I hope this helps.

Gilles

On 7/9/2015 5:37 AM, Saliya Ekanayake wrote:
Hi,

I see the same collective operation (say, allgatherv) implemented in different ways under the tuned, sm, and inter packages. I read in the documentation [1] that these get picked depending on the transport.

Say I run 12 procs per node on 2 nodes, totaling 24 procs. If I call the allgatherv collective, will it pick the shared-memory version to communicate between procs on the same node and use another for inter-node communication? If so, how can I know/control this?

Also, if I force the algorithm as,

coll_tuned_use_dynamic_rules = 1
coll_tuned_allgatherv_algorithm = 3

will it lose the advantage of shared memory?

[1] https://www.open-mpi.org/faq/?category=sm

Thank you,
Saliya

--
Saliya Ekanayake
Ph.D. Candidate | Research Assistant
School of Informatics and Computing | Digital Science Center
Indiana University, Bloomington
Cell 812-391-4914
http://saliya.org

