General questions about Arrow & Plasma

Matthias Vallentin Thu, 16 Nov 2017 07:30:39 -0800

Two question about Plasma; my use case is sharing Arrow data between aC++ and Python application (eventually also R).1. What's the typical memory allocation procedure when using Plasma andArrow? Do I first construct a builder, populate it, finish it, and*then* copy it into mmaped buffer? Or do I obtain mmaped buffer fromPlasma first, in which the builder operates incrementally until it'sfull? If I understand it correctly, a Plasma buffer has a fixed size,so I wonder how you accommodate the fact that the Arrow builderconstructs a record batches incrementally, while at the same timeavoiding extra copying of large memory chunks after finishing thebuilder.

1. Do I need Plasma to exchange the mmapped buffers between the twoapps? Or could I mmap my Arrow data manually and tell pyarrow througha different mechanism to obtain the shared buffer?

   Matthias

General questions about Arrow & Plasma

Reply via email to