Re: [JAVA] SIMD vectorized fill of ArrowBuf from Java primitive type array?

2018-07-23 Thread Siddharth Teotia
Also look here to see how validity and data are copied independently between two vectors bypassing all Arrow APIs and directly manipulating memory. The link points to

Re: [JAVA] SIMD vectorized fill of ArrowBuf from Java primitive type array?

2018-07-23 Thread Siddharth Teotia
Eric, you can take a look here at how we try to optimize the copy (validity and data) in/out of vectors. We try to start with word-wise copy (64 column values and thus 64 valid
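
For readers of the archive, here is a rough sketch of what a word-wise validity copy could look like in plain Java. It is not the code the message links to. The class name `WordWiseValidityCopy`, the method `copyValidity`, and the use of `java.nio.ByteBuffer` in place of Arrow buffers are all assumptions made for illustration. The sketch moves 8 bytes (64 validity bits) per step and finishes the tail byte by byte.

```java
import java.nio.ByteBuffer;

// Hypothetical helper, not part of the Arrow API.
final class WordWiseValidityCopy {

    // Copies the validity bits for `valueCount` values from src to dst.
    // Both buffers are assumed to start at a byte boundary and to hold at
    // least (valueCount + 7) / 8 bytes. Whole bytes are copied, so any
    // padding bits in the last byte are carried over as well.
    static void copyValidity(ByteBuffer src, ByteBuffer dst, int valueCount) {
        int byteCount = (valueCount + 7) / 8;
        int offset = 0;

        // Main loop: one 64-bit word covers 64 validity bits.
        while (byteCount - offset >= 8) {
            dst.putLong(offset, src.getLong(offset));
            offset += 8;
        }

        // Tail: fewer than 8 bytes remain, copy them one at a time.
        while (offset < byteCount) {
            dst.put(offset, src.get(offset));
            offset++;
        }
    }
}
```

The absolute `getLong`/`putLong` calls leave the buffer positions untouched, and because both buffers use the same byte order the bytes land in the same places they came from.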

Re: [JAVA] SIMD vectorized fill of ArrowBuf from Java primitive type array?

2018-07-23 Thread Wes McKinney
hi Eric, Antoine recently did some work on faster bitsetting in C++ by unrolling the main loop to set one byte at a time https://github.com/apache/arrow/blob/27b869ae5df31f3be61e76e9d96ea7d9b557/cpp/src/arrow/util/bit-util.h#L598 This yielded major speedups when setting a lot of bits. A simi
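
As a rough Java analogue of the byte-at-a-time idea described above (a sketch only, not a port of the linked C++ code), the loop below builds each bitmap byte from eight input flags in one unrolled expression and writes it once. The class `BytewiseBitPacker`, the method `pack`, and the `boolean[]` input are hypothetical names chosen for illustration. Bits are packed least-significant-bit first, which is the bit order Arrow validity bitmaps use.

```java
// Hypothetical helper, not part of the Arrow API.
final class BytewiseBitPacker {

    // Packs one flag per value into a bitmap, least-significant bit first.
    static byte[] pack(boolean[] valid) {
        int n = valid.length;
        byte[] bitmap = new byte[(n + 7) / 8];
        int fullBytes = n / 8;
        int i = 0;

        // Unrolled main loop: assemble eight bits, then store one byte.
        for (int b = 0; b < fullBytes; b++, i += 8) {
            bitmap[b] = (byte) (
                  (valid[i]     ? 0x01 : 0)
                | (valid[i + 1] ? 0x02 : 0)
                | (valid[i + 2] ? 0x04 : 0)
                | (valid[i + 3] ? 0x08 : 0)
                | (valid[i + 4] ? 0x10 : 0)
                | (valid[i + 5] ? 0x20 : 0)
                | (valid[i + 6] ? 0x40 : 0)
                | (valid[i + 7] ? 0x80 : 0));
        }

        // Remaining values (fewer than eight) are set bit by bit.
        for (; i < n; i++) {
            if (valid[i]) {
                bitmap[i >> 3] |= (byte) (1 << (i & 7));
            }
        }
        return bitmap;
    }
}
```

The point of the unrolling is that the destination is written once per eight values instead of being read, modified, and written for every single bit.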

[JAVA] SIMD vectorized fill of ArrowBuf from Java primitive type array?

2018-07-23 Thread Eric Wohlstadter
Hi all, I work on a project that uses the Arrow streaming format to transfer data between Java processes. We're also following the progress on Java support for Plasma, and may decide to use Plasma as well. We typically use a pattern like this to fill Arrow vectors from Java arrays: int[] inputValues