Uninitialized buffer memory leads to buffer warnings

2018-07-03 Thread Dimitri Vorona
Hi all, currently, running json-integration-test with valgrind leads to the following warning: "Syscall param write(buf) points to uninitialised byte(s)". This is caused by PrimitiveBufferBuilder not initializing its data memory. Note: we initialize null_bitmap_data_ by zeroing, i.e. setting all v

Re: Uninitialized buffer memory leads to buffer warnings

2018-07-03 Thread Antoine Pitrou
Hi, Writing uninitialized memory risks leaking private data. This might lead to security issues. I'd go for option 2. Option 3 sounds much more costly (we would be zero-initializing large memory areas instead of small padding areas). Regards, Antoine.

Re: Uninitialized buffer memory leads to buffer warnings

2018-07-03 Thread Dimitri Vorona
Hi, I misunderstood the issue: the zeroing of the padding is correctly handled by the writers, as long as they are careful not to read beyond the specified size of the buffers (as opposed to their capacity). The valgrind warning came from the way AppendNull(s) is implemented in some of the builders

Re: Uninitialized buffer memory leads to buffer warnings

2018-07-03 Thread Wes McKinney
Hi Dimitri, Zeroing the padding is probably a good idea; I'd be interested to look at the diff to see how many code paths are impacted. We already have a number of places where we are zeroing buffers but, as you have found, it is not 100% consistent. - Wes
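
The thread above turns on two points: the difference between a buffer's size and its capacity, and what AppendNull leaves in the value slot. The following is a minimal C++ sketch of both, not Arrow's actual implementation. `SketchInt32Builder`, `RoundUpTo64`, `ZeroPadding`, and the 64-byte rounding are all hypothetical names and choices made for illustration; only the idea of writing a defined value for nulls and zeroing the padding comes from the discussion.

```cpp
#include <cassert>
#include <cstddef>
#include <cstdint>
#include <cstdlib>
#include <cstring>

// Hypothetical helper (not Arrow's API): round a byte count up to a multiple of 64.
inline std::size_t RoundUpTo64(std::size_t nbytes) {
  return (nbytes + 63) / 64 * 64;
}

// Hypothetical int32 builder. Its memory comes from realloc, so it is NOT
// zero-initialized, much like the data memory described in the thread.
class SketchInt32Builder {
 public:
  SketchInt32Builder() = default;
  SketchInt32Builder(const SketchInt32Builder&) = delete;
  SketchInt32Builder& operator=(const SketchInt32Builder&) = delete;
  ~SketchInt32Builder() { std::free(data_); }

  void Append(std::int32_t value) {
    Reserve(length_ + 1);
    data_[length_++] = value;
  }

  // Store a defined value in the null slot instead of skipping over it, so a
  // writer that copies size_bytes() bytes never sees uninitialized memory.
  void AppendNull() {
    Reserve(length_ + 1);
    data_[length_++] = 0;
  }

  // Zero the bytes between the logical size and the allocated capacity.
  void ZeroPadding() {
    if (data_ == nullptr) return;
    assert(size_bytes() <= capacity_bytes_);
    std::memset(reinterpret_cast<std::uint8_t*>(data_) + size_bytes(), 0,
                capacity_bytes_ - size_bytes());
  }

  std::size_t size_bytes() const { return length_ * sizeof(std::int32_t); }
  std::size_t capacity_bytes() const { return capacity_bytes_; }
  const std::uint8_t* bytes() const {
    return reinterpret_cast<const std::uint8_t*>(data_);
  }

 private:
  // Grow the allocation so it can hold at least n_values elements.
  void Reserve(std::size_t n_values) {
    const std::size_t needed = RoundUpTo64(n_values * sizeof(std::int32_t));
    if (needed <= capacity_bytes_) return;
    void* p = std::realloc(data_, needed);
    if (p == nullptr) std::abort();
    data_ = static_cast<std::int32_t*>(p);
    capacity_bytes_ = needed;
  }

  std::int32_t* data_ = nullptr;
  std::size_t length_ = 0;
  std::size_t capacity_bytes_ = 0;
};
```

In this sketch, a writer that emits only `size_bytes()` bytes is safe once AppendNull stores a value, while a writer that emits the full `capacity_bytes()` also needs `ZeroPadding()` to have run first. Zeroing only the padding touches at most 63 bytes per buffer here, which is the cost argument made against zero-initializing whole allocations.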

[jira] [Created] (ARROW-2787) Memory Issue passing table from python to c++ via cython

2018-07-03 Thread Joseph Toth (JIRA)
Joseph Toth created ARROW-2787: Summary: Memory Issue passing table from python to c++ via cython Key: ARROW-2787 URL: https://issues.apache.org/jira/browse/ARROW-2787 Project: Apache Arrow Iss

[jira] [Created] (ARROW-2788) [Plasma] Defining Delete semantics

2018-07-03 Thread Philipp Moritz (JIRA)
Philipp Moritz created ARROW-2788: Summary: [Plasma] Defining Delete semantics Key: ARROW-2788 URL: https://issues.apache.org/jira/browse/ARROW-2788 Project: Apache Arrow Issue Type: Improve

[jira] [Created] (ARROW-2789) [JS] Minor DataFrame improvements

2018-07-03 Thread Brian Hulette (JIRA)
Brian Hulette created ARROW-2789: Summary: [JS] Minor DataFrame improvements Key: ARROW-2789 URL: https://issues.apache.org/jira/browse/ARROW-2789 Project: Apache Arrow Issue Type: Improvemen