[jira] [Created] (ARROW-6343) [Java] [Vector] Fix allocation helper
Praveen Kumar Desabandu created ARROW-6343: -- Summary: [Java] [Vector] Fix allocation helper Key: ARROW-6343 URL: https://issues.apache.org/jira/browse/ARROW-6343 Project: Apache Arrow Issue Type: Bug Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Allocation helper, not passing around value count for varchar vectors. -- This message was sent by Atlassian Jira (v8.3.2#803003)
[jira] [Created] (ARROW-6332) [Java] [CPP] Handle size of varchar vectors correctly
Praveen Kumar Desabandu created ARROW-6332: -- Summary: [Java] [CPP] Handle size of varchar vectors correctly Key: ARROW-6332 URL: https://issues.apache.org/jira/browse/ARROW-6332 Project: Apache Arrow Issue Type: Bug Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Currently we default the initial size of variable width vectors. This limits that maximum size of these buffers to 3969 -- This message was sent by Atlassian Jira (v8.3.2#803003)
[jira] [Created] (ARROW-6268) Empty buffer should have a valid address
Praveen Kumar Desabandu created ARROW-6268: -- Summary: Empty buffer should have a valid address Key: ARROW-6268 URL: https://issues.apache.org/jira/browse/ARROW-6268 Project: Apache Arrow Issue Type: Bug Components: Java Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu As part of arrow buffer refactoring, the empty buffers are created with invalid addresses. This breaks client code that assumed that even empty buffers are valid. Return the address of the static EMPTY buffer. -- This message was sent by Atlassian JIRA (v7.6.14#76016)
[jira] [Created] (ARROW-5900) [Gandiva] [Java] Decimal precision,scale bounds check
Praveen Kumar Desabandu created ARROW-5900: -- Summary: [Gandiva] [Java] Decimal precision,scale bounds check Key: ARROW-5900 URL: https://issues.apache.org/jira/browse/ARROW-5900 Project: Apache Arrow Issue Type: Improvement Components: Java Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Currently we accept decimal precision of values, need bounds checking that it is between 1-38 inclusive. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5860) [Java] [Vector] Fix decimal byte setter
Praveen Kumar Desabandu created ARROW-5860: -- Summary: [Java] [Vector] Fix decimal byte setter Key: ARROW-5860 URL: https://issues.apache.org/jira/browse/ARROW-5860 Project: Apache Arrow Issue Type: Bug Components: Java Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5824) [Gandiva] [C++] Fix decimal null
Praveen Kumar Desabandu created ARROW-5824: -- Summary: [Gandiva] [C++] Fix decimal null Key: ARROW-5824 URL: https://issues.apache.org/jira/browse/ARROW-5824 Project: Apache Arrow Issue Type: Bug Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Decimal literal uses hard code precision and scale. It should use the input precision and scale instead. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5776) [Gandiva][Crossbow] Revert template to have commit ids.
Praveen Kumar Desabandu created ARROW-5776: -- Summary: [Gandiva][Crossbow] Revert template to have commit ids. Key: ARROW-5776 URL: https://issues.apache.org/jira/browse/ARROW-5776 Project: Apache Arrow Issue Type: Bug Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu We are dependent on the commit ids being present in the cross bow travis templates so that we can sync our builds against the same commit id that was used to create the artifacts. So reverting back fetch-head to give back arrow-head. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5737) Gandiva not building in manylinux
Praveen Kumar Desabandu created ARROW-5737: -- Summary: Gandiva not building in manylinux Key: ARROW-5737 URL: https://issues.apache.org/jira/browse/ARROW-5737 Project: Apache Arrow Issue Type: Bug Components: C++ - Gandiva Reporter: Praveen Kumar Desabandu The gandiva many linux builds have started failing post [https://github.com/apache/arrow/commit/0fc5bc429fbe527b1e42db4307cde8d0ce2818c6] ninja is unable to interpret the [make_precompiled_bitcode.py|https://github.com/apache/arrow/commit/0fc5bc429fbe527b1e42db4307cde8d0ce2818c6#diff-456ea80d0a4228a2dbf98b5d47615e07] correctly. [~pitrou] - I tried to fix it but did not make much progress :) Could you please help out. Error is "File "/arrow/cpp/src/gandiva/precompiled/../make_precompiled_bitcode.py", line 22 marker = B"" SyntaxError: invalid syntax -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5723) [Gandiva][Crossbow] Builds failing
Praveen Kumar Desabandu created ARROW-5723: -- Summary: [Gandiva][Crossbow] Builds failing Key: ARROW-5723 URL: https://issues.apache.org/jira/browse/ARROW-5723 Project: Apache Arrow Issue Type: Bug Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Latest builds are failing. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5696) [Gandiva] [C++] Introduce castVarcharVarchar
Praveen Kumar Desabandu created ARROW-5696: -- Summary: [Gandiva] [C++] Introduce castVarcharVarchar Key: ARROW-5696 URL: https://issues.apache.org/jira/browse/ARROW-5696 Project: Apache Arrow Issue Type: Task Components: C++ - Gandiva Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5637) [Gandiva] [Java]Complete IN Expression
Praveen Kumar Desabandu created ARROW-5637: -- Summary: [Gandiva] [Java]Complete IN Expression Key: ARROW-5637 URL: https://issues.apache.org/jira/browse/ARROW-5637 Project: Apache Arrow Issue Type: Task Components: C++ - Gandiva, Java Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Complete Java/JNI components for Gandiva In Expression. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5485) [Gandiva][Crossbow] OSx builds failing
Praveen Kumar Desabandu created ARROW-5485: -- Summary: [Gandiva][Crossbow] OSx builds failing Key: ARROW-5485 URL: https://issues.apache.org/jira/browse/ARROW-5485 Project: Apache Arrow Issue Type: Task Components: Packaging Affects Versions: 0.14.0 Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Fix For: 0.14.0 OSX builds are failing for the last 3 days. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5476) [Java][Memory] Fix Netty ArrowBuf Slice
Praveen Kumar Desabandu created ARROW-5476: -- Summary: [Java][Memory] Fix Netty ArrowBuf Slice Key: ARROW-5476 URL: https://issues.apache.org/jira/browse/ARROW-5476 Project: Apache Arrow Issue Type: Task Affects Versions: 0.14.0 Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Fix For: 0.14.0 The slice of netty arrow buf depends on arrow buf reader and writer indexes, but arrow buf is supposed to only track memory addr + length and there are places where the arrow buf indexes are not in sync with netty. So slice should use the indexes in Netty Arrow Buf instead. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5443) [Gandiva][Crossbow] Turn parquet encryption off
Praveen Kumar Desabandu created ARROW-5443: -- Summary: [Gandiva][Crossbow] Turn parquet encryption off Key: ARROW-5443 URL: https://issues.apache.org/jira/browse/ARROW-5443 Project: Apache Arrow Issue Type: Task Components: Packaging Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Fix For: 0.14.0 Turn off Parquet Encryption for Gandiva crossbow, since it pulls in additional dependencies that is not required/present currently. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5434) Introduce wrappers for backward compatibility.
Praveen Kumar Desabandu created ARROW-5434: -- Summary: Introduce wrappers for backward compatibility. Key: ARROW-5434 URL: https://issues.apache.org/jira/browse/ARROW-5434 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu * Introduce some wrapper methods in ArrowBuf to reduce amount of client changes. * Changes were introduced as part of patch to support arrow buffers on random memory. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-5164) [Gandiva] [C++] Introduce 32bit hash functions
Praveen Kumar Desabandu created ARROW-5164: -- Summary: [Gandiva] [C++] Introduce 32bit hash functions Key: ARROW-5164 URL: https://issues.apache.org/jira/browse/ARROW-5164 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4959) [Gandiva][Crossbow] Builds broken
Praveen Kumar Desabandu created ARROW-4959: -- Summary: [Gandiva][Crossbow] Builds broken Key: ARROW-4959 URL: https://issues.apache.org/jira/browse/ARROW-4959 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Looks like cross bow builds for Gandiva is broken for the last few days. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4913) [Java][Memory] Limit number of ledgers and arrowbufs
Praveen Kumar Desabandu created ARROW-4913: -- Summary: [Java][Memory] Limit number of ledgers and arrowbufs Key: ARROW-4913 URL: https://issues.apache.org/jira/browse/ARROW-4913 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Block allocations if the number of BufferLedgers or ArrowBufs are above a configurable limit. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4808) [Java][Vector] Convenience methods for setting decimal vector
Praveen Kumar Desabandu created ARROW-4808: -- Summary: [Java][Vector] Convenience methods for setting decimal vector Key: ARROW-4808 URL: https://issues.apache.org/jira/browse/ARROW-4808 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Provide more convenience methods to set decimal vector, specifically # Accept arrow buffers encode in little endian bytes that are of size < 16 bytes # Accept arrow buffers that are encoded in big endian and could of size <=16 bytes -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4539) [Java]List vector child value count not set correctly
Praveen Kumar Desabandu created ARROW-4539: -- Summary: [Java]List vector child value count not set correctly Key: ARROW-4539 URL: https://issues.apache.org/jira/browse/ARROW-4539 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Fix For: 0.14.0 We are not correctly processing list vectors that could have null values. The child value count would be off there by losing data in variable width vectors. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4371) [Gandiva] Generate object ids for modules
Praveen Kumar Desabandu created ARROW-4371: -- Summary: [Gandiva] Generate object ids for modules Key: ARROW-4371 URL: https://issues.apache.org/jira/browse/ARROW-4371 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Generate object identifiers for Projector and Filter, so that we can test the external layer to C++ if the identifier is the same to ensure that caching is not broken. Introduce JNI api to query for the same. However this is not of high priority since we have tests in the core gandiva layer for the same. Current tests depend on time but it is causing spurious failures. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4358) [Gandiva][Crossbow] Trusty build broken
Praveen Kumar Desabandu created ARROW-4358: -- Summary: [Gandiva][Crossbow] Trusty build broken Key: ARROW-4358 URL: https://issues.apache.org/jira/browse/ARROW-4358 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu As a side effect of [https://github.com/apache/arrow/commit/1b8a7bc3baa4bce660c18a13934115d55f8733df,] java builds on trusty are broken due to removal of travis maven in this commit. This Jira is to support both environments.. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4100) [Gandiva][C++] Fix regex to ignore.
Praveen Kumar Desabandu created ARROW-4100: -- Summary: [Gandiva][C++] Fix regex to ignore. Key: ARROW-4100 URL: https://issues.apache.org/jira/browse/ARROW-4100 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4003) [Gandiva][Java] Safeguard jvm before loading the gandiva library
Praveen Kumar Desabandu created ARROW-4003: -- Summary: [Gandiva][Java] Safeguard jvm before loading the gandiva library Key: ARROW-4003 URL: https://issues.apache.org/jira/browse/ARROW-4003 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Today we load the gandiva library always when trying to use the jni bridge, but we have run into issues causing the jvm to crash in untested paths. Proposal is to do load the library in a separate process first and if it works only then load in the current process. This would be done only once at startup/first load. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3983) [Gandiva][Crossbow] Use static boost while packaging
Praveen Kumar Desabandu created ARROW-3983: -- Summary: [Gandiva][Crossbow] Use static boost while packaging Key: ARROW-3983 URL: https://issues.apache.org/jira/browse/ARROW-3983 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Gandiva is getting some transitive dependencies to Boost from Arrow. Since we are using the static version of arrow in the packaged gandiva library, it was thought that we would be using the static versions of boost. This holds true in linux where there is no dependency on shared arrow library, but in mac there seems to be a dependency on shared boost libraries even for the static arrow library. So using "ARROW_BOOST_USE_SHARED" to force use the boost static libraries while packaging Gandiva in Crossbow. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3973) [Gandiva][Java] Move the benchmark tests out of unit test scope.
Praveen Kumar Desabandu created ARROW-3973: -- Summary: [Gandiva][Java] Move the benchmark tests out of unit test scope. Key: ARROW-3973 URL: https://issues.apache.org/jira/browse/ARROW-3973 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Fix For: 0.12.0 -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3970) [Gandiva][C++] Remove unnecessary boost dependencies
Praveen Kumar Desabandu created ARROW-3970: -- Summary: [Gandiva][C++] Remove unnecessary boost dependencies Key: ARROW-3970 URL: https://issues.apache.org/jira/browse/ARROW-3970 Project: Apache Arrow Issue Type: Task Affects Versions: 0.12.0 Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Remove unnecessary dynamic dependencies on Boost since we are anyway using the static versions. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3902) [Gandiva] [C++] Remove static c++ linked in Gandiva.
Praveen Kumar Desabandu created ARROW-3902: -- Summary: [Gandiva] [C++] Remove static c++ linked in Gandiva. Key: ARROW-3902 URL: https://issues.apache.org/jira/browse/ARROW-3902 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Hi, [~wesm_impala_7e40], I am looking into switching Gandiva Redhat developer toolchain. We are not too familiar with it and not sure the effort required there. In the meanwhile for the short term, can we turn get Crossbow builds to only do static linking for Dremio builds (through a travis env variable)? and Arrow can ship Gandiva linked to std-c++ dynamically? We can then move to redhat toolchain for 0.13 version of Arrow? Thx. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3859) [Java] Fix ComplexWriter backward incompatible change
Praveen Kumar Desabandu created ARROW-3859: -- Summary: [Java] Fix ComplexWriter backward incompatible change Key: ARROW-3859 URL: https://issues.apache.org/jira/browse/ARROW-3859 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Fix For: 0.12.0 This commit [https://github.com/apache/arrow/commit/a56c009257a71979d5ed0b021197c7a9d5ed5021] changed the default behavior for some of the methods to be non-backward compatible. Will raise the PR to revert it to previous behavior while adhering to check style guidelines. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3860) [Gandiva] [C++] Fix packaging broken recently
Praveen Kumar Desabandu created ARROW-3860: -- Summary: [Gandiva] [C++] Fix packaging broken recently Key: ARROW-3860 URL: https://issues.apache.org/jira/browse/ARROW-3860 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Fix For: 0.12.0 This [commit|https://github.com/apache/arrow/commit/ba2b2ea2301f067cc95306e11546ddb6d402a55c#diff-d5e5df5984ba660e999a7c657039f6af] broke gandiva packaging by removing static linking of std c++, since dremio consumes a fat jar that includes packaged gandiva native libraries we would need to statically link std c++. As suggested in the commit message will re-introduce it as a CMake Flag. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3609) [Gandiva] Move benchmark tests out of unit test
Praveen Kumar Desabandu created ARROW-3609: -- Summary: [Gandiva] Move benchmark tests out of unit test Key: ARROW-3609 URL: https://issues.apache.org/jira/browse/ARROW-3609 Project: Apache Arrow Issue Type: Task Components: C++, Gandiva Reporter: Praveen Kumar Desabandu Fix For: 0.12.0 Currently the benchmarks are run as integ tests. We should move them out as gbenchmark tests. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3489) [Gandiva] Support for in expressions
Praveen Kumar Desabandu created ARROW-3489: -- Summary: [Gandiva] Support for in expressions Key: ARROW-3489 URL: https://issues.apache.org/jira/browse/ARROW-3489 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Assignee: Praveen Kumar Desabandu Fix For: 0.12.0 Add support for in-expressions to gandiva. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3471) [Gandiva] Investigate caching isomorphic expressions
Praveen Kumar Desabandu created ARROW-3471: -- Summary: [Gandiva] Investigate caching isomorphic expressions Key: ARROW-3471 URL: https://issues.apache.org/jira/browse/ARROW-3471 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Fix For: 0.12.0 Two expressions say add(a+b) and add(c+d), could potentially be reused if the only thing differing are the names. Test E2E. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3386) Platform independent gandiva jar
Praveen Kumar Desabandu created ARROW-3386: -- Summary: Platform independent gandiva jar Key: ARROW-3386 URL: https://issues.apache.org/jira/browse/ARROW-3386 Project: Apache Arrow Issue Type: Task Reporter: Praveen Kumar Desabandu Currently we only package .so for the gandiva jar, we would need a packaged lib for windows and mac. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3385) Deploy gandiva snapshot jars automatically
Praveen Kumar Desabandu created ARROW-3385: -- Summary: Deploy gandiva snapshot jars automatically Key: ARROW-3385 URL: https://issues.apache.org/jira/browse/ARROW-3385 Project: Apache Arrow Issue Type: Task Components: Gandiva Reporter: Praveen Kumar Desabandu Deploy the packaged gandiva jar to maven snapshot repo from travis. Do this only for commits that modify gandiva. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3384) Sync remaining commits from gandiva repo
Praveen Kumar Desabandu created ARROW-3384: -- Summary: Sync remaining commits from gandiva repo Key: ARROW-3384 URL: https://issues.apache.org/jira/browse/ARROW-3384 Project: Apache Arrow Issue Type: Task Components: C++, Gandiva Reporter: Praveen Kumar Desabandu After initial merge some new commits were done in gandiva, we need to port them to the arrow repo. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3383) Gandiva Java in travis ci
Praveen Kumar Desabandu created ARROW-3383: -- Summary: Gandiva Java in travis ci Key: ARROW-3383 URL: https://issues.apache.org/jira/browse/ARROW-3383 Project: Apache Arrow Issue Type: Task Components: Gandiva, Java Reporter: Praveen Kumar Desabandu Enable and test for gandiva java in travis ci. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-3382) Gandiva CPP in travis CI.
Praveen Kumar Desabandu created ARROW-3382: -- Summary: Gandiva CPP in travis CI. Key: ARROW-3382 URL: https://issues.apache.org/jira/browse/ARROW-3382 Project: Apache Arrow Issue Type: Task Components: C++ Reporter: Praveen Kumar Desabandu Integrate and test Gandiva-Cpp in travis. This would unblock new PRs to gandiva. -- This message was sent by Atlassian JIRA (v7.6.3#76005)