Ralph Castain wrote:

Hi Bryan

I have seen similar issues on LANL clusters when message sizes were fairly large. How big are your buffers when you call Allreduce? Can you send us your Allreduce call params (e.g., the reduce operation, datatype, num elements)?

If you don't want to send that to the list, you can send it to me at LANL.

I haven't seen any updates on this. Please tell me Bryan sent info to Ralph at LANL and Ralph nailed this one. Please! :^)

Reply via email to