chineking created ARROW-2879:
--------------------------------
Summary: Arrow plasma can only use a small part of specified
shared memory
Key: ARROW-2879
URL: https://issues.apache.org/jira/browse/ARROW-2879
Project: Apache Arrow
Issue Type: Bug
Components: Python
Reporter: chineking
Hi, thanks for the great job of arrow, it helps us a lot.
However, we encounter a problem when we were using plasma.
The sample code:
{code:python}
import numpy as np
import pyarrow as pa
import pyarrow.plasma as plasma
client = plasma.connect("/tmp/plasma", "", 0)
puts = []
nbytes = 0
while True:
a = np.ones((1000, 1000))
try:
oid = client.put(a)
puts.append(client.get(oid))
nbytes += a.nbytes
except pa.lib.PlasmaStoreFull:
print('use nbytes', nbytes)
break
{code}
We start a plasma store with 1G memory, but the nbytes output above is only
496000000, which cannot even reach half of the memory we specified.
I cannot figure out why plasma can only use such a small part of shared memory.
Could anybody help me? Thanks a lot.
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)