[OMPI devel] allocating sm memory with page alignment

Eugene Loh Fri, 29 Aug 2008 20:49:20 -0400

(I'm new to Open MPI.)

I'm looking at the sm BTL.

In mca_btl_sm_add_procs(), there's a loop over peer processes, with acall to ompi_fifo_init(). That is, one call to ompi_fifo_init() foreach connection (sender/receiver pair).

In ompi_fifo_init(), there's an allocation ofsizeof(ompi_cb_fifo_wrapper_t), and a call to ompi_cb_fifo_init(), whichin turn has two allocations: one of a bunch of pointers and another ofsizeof(ompi_cb_fifo_ctl_t).


In short, for each connection, there are three allocations:

*) sizeof(ompi_cb_fifo_wrapper_t)... about 64 bytes on LP64
*) a bunch of pointers... about 1 Kbyte on LP64
*) sizeof(ompi_cb_fifo_ctl_t)... about 12 bytes

Let me say this yet another way. For N local processes, there areN*(N-1) per-connection allocations, most of which are 64 bytes or smaller.

BUT, in ompi_fifo_init() and ompi_cb_fifo_init(), we ask for pagealignment of each allocation. Further, in mca_mpool_sm_alloc() thatalignment is further reinforced to be on page boundaries.

As the number of local processes increases, therefore theseper-connection allocations become very costly. For 8K pages, forexample, and 100 on-node processes, we're talking 3*100*100*8K = 240Mbytes. For 512 on-node processes (yes, we have nodes this big), that's6 Gbyte... most of which is unused. (E.g., allocating more than an 8Kpage when we only need 64 or 12 bytes.)

Okay, long intro. Let me start with a short question: do we reallyneed page alignment for these allocations? Would cacheline alignment beokay?


(I imagine I'll have follow-up questions once the answers start to roll in.)

[OMPI devel] allocating sm memory with page alignment

Reply via email to