[jira] [Comment Edited] (ARROW-4144) [Java] Arrow-to-JDBC

2020-06-01 Thread Micah Kornfield (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17123402#comment-17123402
 ] 

Micah Kornfield edited comment on ARROW-4144 at 6/2/20, 6:56 AM:
-

[~uwe] have you come across a use-case for writing to JDBC sources?


was (Author: emkornfi...@gmail.com):
@uwe have you come across a use-case for writing to JDBC sources?

> [Java] Arrow-to-JDBC
> 
>
> Key: ARROW-4144
> URL: https://issues.apache.org/jira/browse/ARROW-4144
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Java
>Reporter: Michael Pigott
>Assignee: Chen
>Priority: Major
>
> ARROW-1780 reads a query from a JDBC data source and converts the ResultSet 
> to an Arrow VectorSchemaRoot.  However, there is no built-in adapter for 
> writing an Arrow VectorSchemaRoot back to the database.
> ARROW-3966 adds JDBC field metadata:
>  * The Catalog Name
>  * The Table Name
>  * The Field Name
>  * The Field Type
> We can use this information to ask for the field information from the 
> database via the 
> [DatabaseMetaData|https://docs.oracle.com/javase/7/docs/api/java/sql/DatabaseMetaData.html]
>  object.  We can then create INSERT or UPDATE statements based on the [list 
> of primary 
> keys|https://docs.oracle.com/javase/7/docs/api/java/sql/DatabaseMetaData.html#getPrimaryKeys(java.lang.String,%20java.lang.String,%20java.lang.String)]
>  in the table:
>  * If the value in the VectorSchemaRoot corresponding to the primary key is 
> NULL, insert that record into the database.
>  * If the value in the VectorSchemaRoot corresponding to the primary key is 
> not NULL, update the existing record in the database.
> We can also perform the same data conversion in reverse based on the field 
> types queried from the database.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (ARROW-4144) [Java] Arrow-to-JDBC

2020-06-01 Thread Micah Kornfield (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17123402#comment-17123402
 ] 

Micah Kornfield commented on ARROW-4144:


@uwe have you come across a use-case for writing to JDBC sources?

> [Java] Arrow-to-JDBC
> 
>
> Key: ARROW-4144
> URL: https://issues.apache.org/jira/browse/ARROW-4144
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Java
>Reporter: Michael Pigott
>Assignee: Chen
>Priority: Major
>
> ARROW-1780 reads a query from a JDBC data source and converts the ResultSet 
> to an Arrow VectorSchemaRoot.  However, there is no built-in adapter for 
> writing an Arrow VectorSchemaRoot back to the database.
> ARROW-3966 adds JDBC field metadata:
>  * The Catalog Name
>  * The Table Name
>  * The Field Name
>  * The Field Type
> We can use this information to ask for the field information from the 
> database via the 
> [DatabaseMetaData|https://docs.oracle.com/javase/7/docs/api/java/sql/DatabaseMetaData.html]
>  object.  We can then create INSERT or UPDATE statements based on the [list 
> of primary 
> keys|https://docs.oracle.com/javase/7/docs/api/java/sql/DatabaseMetaData.html#getPrimaryKeys(java.lang.String,%20java.lang.String,%20java.lang.String)]
>  in the table:
>  * If the value in the VectorSchemaRoot corresponding to the primary key is 
> NULL, insert that record into the database.
>  * If the value in the VectorSchemaRoot corresponding to the primary key is 
> not NULL, update the existing record in the database.
> We can also perform the same data conversion in reverse based on the field 
> types queried from the database.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (ARROW-8972) [Java] Support range value comparison for large varchar/varbinary vectors

2020-06-01 Thread Micah Kornfield (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Micah Kornfield resolved ARROW-8972.

Resolution: Fixed

> [Java] Support range value comparison for large varchar/varbinary vectors
> -
>
> Key: ARROW-8972
> URL: https://issues.apache.org/jira/browse/ARROW-8972
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: Java
>Reporter: Liya Fan
>Assignee: Liya Fan
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Support comparing a range of values for LargeVarCharVector and 
> LargeVarBinaryVector.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-9000) Java build crashes with JDK14

2020-06-01 Thread Micah Kornfield (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-9000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Micah Kornfield updated ARROW-9000:
---
Component/s: Java

> Java build crashes with JDK14
> -
>
> Key: ARROW-9000
> URL: https://issues.apache.org/jira/browse/ARROW-9000
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Java
>Reporter: Laurent Goujon
>Assignee: Laurent Goujon
>Priority: Minor
>  Labels: pull-request-available
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Current master tree does not build with JDK14. The issue seems to be caused 
> by error prone plugin:
> {noformat}
> [ERROR] Failed to execute goal 
> org.apache.maven.plugins:maven-compiler-plugin:3.6.2:compile 
> (default-compile) on project arrow-memory: Compilation failure
> [ERROR] 
> /Users/laurent/devel/arrow/java/memory/src/main/java/org/apache/arrow/memory/BufferLedger.java:[545,15]
>  error: An unhandled exception was thrown by the Error Prone static analysis 
> plugin.
> [ERROR]  Please report this at 
> https://github.com/google/error-prone/issues/new and include the following:
> [ERROR]   
> [ERROR]  error-prone version: 2.3.3
> [ERROR]  BugPattern: TypeParameterUnusedInFormals
> [ERROR]  Stack Trace:
> [ERROR]  java.lang.NoSuchFieldError: bound
> [ERROR]   at 
> com.google.errorprone.bugpatterns.TypeParameterUnusedInFormals.matchMethod(TypeParameterUnusedInFormals.java:71)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.processMatchers(ErrorProneScanner.java:433)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitMethod(ErrorProneScanner.java:725)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitMethod(ErrorProneScanner.java:150)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.tree.JCTree$JCMethodDecl.accept(JCTree.java:916)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreePathScanner.scan(TreePathScanner.java:82)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:71)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:45)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scanAndReduce(TreeScanner.java:90)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scan(TreeScanner.java:105)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scanAndReduce(TreeScanner.java:113)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.visitClass(TreeScanner.java:187)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitClass(ErrorProneScanner.java:535)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitClass(ErrorProneScanner.java:150)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.tree.JCTree$JCClassDecl.accept(JCTree.java:823)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreePathScanner.scan(TreePathScanner.java:82)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:71)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:45)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scan(TreeScanner.java:105)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scanAndReduce(TreeScanner.java:113)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.visitCompilationUnit(TreeScanner.java:144)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitCompilationUnit(ErrorProneScanner.java:546)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitCompilationUnit(ErrorProneScanner.java:150)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.tree.JCTree$JCCompilationUnit.accept(JCTree.java:603)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreePathScanner.scan(TreePathScanner.java:56)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:55)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScannerTransformer.apply(ErrorProneScannerTransformer.java:43)
> [ERROR]   at 
> com.google.errorprone.ErrorProneAnalyzer.finished(ErrorProneAnalyzer.java:151)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.api.MultiTaskListener.finished(MultiTaskListener.java:132)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.JavaCompiler.flow(JavaCompiler.java:1423)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.JavaCompiler.flow(JavaCompiler.java:1370)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.JavaCompiler.compile(JavaCompiler.java:959)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.Main.compile(Main.java:316)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.Main.compile(Main.java:176)
> [ERROR]   at jdk.compiler/com.sun.tools

[jira] [Updated] (ARROW-9000) [Java] build crashes with JDK14

2020-06-01 Thread Micah Kornfield (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-9000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Micah Kornfield updated ARROW-9000:
---
Summary: [Java] build crashes with JDK14  (was: Java build crashes with 
JDK14)

> [Java] build crashes with JDK14
> ---
>
> Key: ARROW-9000
> URL: https://issues.apache.org/jira/browse/ARROW-9000
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Java
>Reporter: Laurent Goujon
>Assignee: Laurent Goujon
>Priority: Minor
>  Labels: pull-request-available
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Current master tree does not build with JDK14. The issue seems to be caused 
> by error prone plugin:
> {noformat}
> [ERROR] Failed to execute goal 
> org.apache.maven.plugins:maven-compiler-plugin:3.6.2:compile 
> (default-compile) on project arrow-memory: Compilation failure
> [ERROR] 
> /Users/laurent/devel/arrow/java/memory/src/main/java/org/apache/arrow/memory/BufferLedger.java:[545,15]
>  error: An unhandled exception was thrown by the Error Prone static analysis 
> plugin.
> [ERROR]  Please report this at 
> https://github.com/google/error-prone/issues/new and include the following:
> [ERROR]   
> [ERROR]  error-prone version: 2.3.3
> [ERROR]  BugPattern: TypeParameterUnusedInFormals
> [ERROR]  Stack Trace:
> [ERROR]  java.lang.NoSuchFieldError: bound
> [ERROR]   at 
> com.google.errorprone.bugpatterns.TypeParameterUnusedInFormals.matchMethod(TypeParameterUnusedInFormals.java:71)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.processMatchers(ErrorProneScanner.java:433)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitMethod(ErrorProneScanner.java:725)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitMethod(ErrorProneScanner.java:150)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.tree.JCTree$JCMethodDecl.accept(JCTree.java:916)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreePathScanner.scan(TreePathScanner.java:82)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:71)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:45)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scanAndReduce(TreeScanner.java:90)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scan(TreeScanner.java:105)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scanAndReduce(TreeScanner.java:113)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.visitClass(TreeScanner.java:187)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitClass(ErrorProneScanner.java:535)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitClass(ErrorProneScanner.java:150)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.tree.JCTree$JCClassDecl.accept(JCTree.java:823)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreePathScanner.scan(TreePathScanner.java:82)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:71)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:45)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scan(TreeScanner.java:105)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scanAndReduce(TreeScanner.java:113)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.visitCompilationUnit(TreeScanner.java:144)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitCompilationUnit(ErrorProneScanner.java:546)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitCompilationUnit(ErrorProneScanner.java:150)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.tree.JCTree$JCCompilationUnit.accept(JCTree.java:603)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreePathScanner.scan(TreePathScanner.java:56)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:55)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScannerTransformer.apply(ErrorProneScannerTransformer.java:43)
> [ERROR]   at 
> com.google.errorprone.ErrorProneAnalyzer.finished(ErrorProneAnalyzer.java:151)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.api.MultiTaskListener.finished(MultiTaskListener.java:132)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.JavaCompiler.flow(JavaCompiler.java:1423)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.JavaCompiler.flow(JavaCompiler.java:1370)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.JavaCompiler.compile(JavaCompiler.java:959)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.Main.compile(Main.java:316)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.Main.co

[jira] [Resolved] (ARROW-9000) [Java] build crashes with JDK14

2020-06-01 Thread Micah Kornfield (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-9000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Micah Kornfield resolved ARROW-9000.

Resolution: Fixed

> [Java] build crashes with JDK14
> ---
>
> Key: ARROW-9000
> URL: https://issues.apache.org/jira/browse/ARROW-9000
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Java
>Reporter: Laurent Goujon
>Assignee: Laurent Goujon
>Priority: Minor
>  Labels: pull-request-available
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
> Current master tree does not build with JDK14. The issue seems to be caused 
> by error prone plugin:
> {noformat}
> [ERROR] Failed to execute goal 
> org.apache.maven.plugins:maven-compiler-plugin:3.6.2:compile 
> (default-compile) on project arrow-memory: Compilation failure
> [ERROR] 
> /Users/laurent/devel/arrow/java/memory/src/main/java/org/apache/arrow/memory/BufferLedger.java:[545,15]
>  error: An unhandled exception was thrown by the Error Prone static analysis 
> plugin.
> [ERROR]  Please report this at 
> https://github.com/google/error-prone/issues/new and include the following:
> [ERROR]   
> [ERROR]  error-prone version: 2.3.3
> [ERROR]  BugPattern: TypeParameterUnusedInFormals
> [ERROR]  Stack Trace:
> [ERROR]  java.lang.NoSuchFieldError: bound
> [ERROR]   at 
> com.google.errorprone.bugpatterns.TypeParameterUnusedInFormals.matchMethod(TypeParameterUnusedInFormals.java:71)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.processMatchers(ErrorProneScanner.java:433)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitMethod(ErrorProneScanner.java:725)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitMethod(ErrorProneScanner.java:150)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.tree.JCTree$JCMethodDecl.accept(JCTree.java:916)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreePathScanner.scan(TreePathScanner.java:82)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:71)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:45)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scanAndReduce(TreeScanner.java:90)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scan(TreeScanner.java:105)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scanAndReduce(TreeScanner.java:113)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.visitClass(TreeScanner.java:187)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitClass(ErrorProneScanner.java:535)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitClass(ErrorProneScanner.java:150)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.tree.JCTree$JCClassDecl.accept(JCTree.java:823)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreePathScanner.scan(TreePathScanner.java:82)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:71)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:45)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scan(TreeScanner.java:105)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scanAndReduce(TreeScanner.java:113)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.visitCompilationUnit(TreeScanner.java:144)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitCompilationUnit(ErrorProneScanner.java:546)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitCompilationUnit(ErrorProneScanner.java:150)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.tree.JCTree$JCCompilationUnit.accept(JCTree.java:603)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreePathScanner.scan(TreePathScanner.java:56)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:55)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScannerTransformer.apply(ErrorProneScannerTransformer.java:43)
> [ERROR]   at 
> com.google.errorprone.ErrorProneAnalyzer.finished(ErrorProneAnalyzer.java:151)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.api.MultiTaskListener.finished(MultiTaskListener.java:132)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.JavaCompiler.flow(JavaCompiler.java:1423)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.JavaCompiler.flow(JavaCompiler.java:1370)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.JavaCompiler.compile(JavaCompiler.java:959)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.Main.compile(Main.java:316)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.Main.compile(Main.java:176)
> [ERROR]   at jdk.compiler/com.sun

[jira] [Commented] (ARROW-8999) [Python][C++] Non-deterministic segfault in "AMD64 MacOS 10.15 Python 3.7" build

2020-06-01 Thread Yibo Cai (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-8999?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17121531#comment-17121531
 ] 

Yibo Cai commented on ARROW-8999:
-

Looks like some hidden bugs like https://github.com/apache/arrow/pull/7123.
I met with same error once last week.
My patch https://github.com/apache/arrow/pull/7135 handles buffer directly, 
would double check if all boundary conditions are safe.

> [Python][C++] Non-deterministic segfault in "AMD64 MacOS 10.15 Python 3.7" 
> build
> 
>
> Key: ARROW-8999
> URL: https://issues.apache.org/jira/browse/ARROW-8999
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Python
>Reporter: Wes McKinney
>Priority: Major
> Fix For: 1.0.0
>
>
> I've been seeing this segfault periodically the last week, does anyone have 
> an idea what might be wrong?
> https://github.com/apache/arrow/pull/7273/checks?check_run_id=717249862



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (ARROW-4144) [Java] Arrow-to-JDBC

2020-06-01 Thread Michael Pigott (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17121516#comment-17121516
 ] 

Michael Pigott commented on ARROW-4144:
---

Hi, and thanks for taking a look at this!  I saw Arrow as a tool that
could be used to transfer data between systems, and not just for
in-memory analysis.  However, I don't need this today, and if nobody
else has asked for it, then there might be more valuable JIRA tickets
than mine for you to dedicate your time to?

Thanks again!
Mike



> [Java] Arrow-to-JDBC
> 
>
> Key: ARROW-4144
> URL: https://issues.apache.org/jira/browse/ARROW-4144
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Java
>Reporter: Michael Pigott
>Assignee: Chen
>Priority: Major
>
> ARROW-1780 reads a query from a JDBC data source and converts the ResultSet 
> to an Arrow VectorSchemaRoot.  However, there is no built-in adapter for 
> writing an Arrow VectorSchemaRoot back to the database.
> ARROW-3966 adds JDBC field metadata:
>  * The Catalog Name
>  * The Table Name
>  * The Field Name
>  * The Field Type
> We can use this information to ask for the field information from the 
> database via the 
> [DatabaseMetaData|https://docs.oracle.com/javase/7/docs/api/java/sql/DatabaseMetaData.html]
>  object.  We can then create INSERT or UPDATE statements based on the [list 
> of primary 
> keys|https://docs.oracle.com/javase/7/docs/api/java/sql/DatabaseMetaData.html#getPrimaryKeys(java.lang.String,%20java.lang.String,%20java.lang.String)]
>  in the table:
>  * If the value in the VectorSchemaRoot corresponding to the primary key is 
> NULL, insert that record into the database.
>  * If the value in the VectorSchemaRoot corresponding to the primary key is 
> not NULL, update the existing record in the database.
> We can also perform the same data conversion in reverse based on the field 
> types queried from the database.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (ARROW-4144) [Java] Arrow-to-JDBC

2020-06-01 Thread Chen (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Chen reassigned ARROW-4144:
---

Assignee: Chen

> [Java] Arrow-to-JDBC
> 
>
> Key: ARROW-4144
> URL: https://issues.apache.org/jira/browse/ARROW-4144
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Java
>Reporter: Michael Pigott
>Assignee: Chen
>Priority: Major
>
> ARROW-1780 reads a query from a JDBC data source and converts the ResultSet 
> to an Arrow VectorSchemaRoot.  However, there is no built-in adapter for 
> writing an Arrow VectorSchemaRoot back to the database.
> ARROW-3966 adds JDBC field metadata:
>  * The Catalog Name
>  * The Table Name
>  * The Field Name
>  * The Field Type
> We can use this information to ask for the field information from the 
> database via the 
> [DatabaseMetaData|https://docs.oracle.com/javase/7/docs/api/java/sql/DatabaseMetaData.html]
>  object.  We can then create INSERT or UPDATE statements based on the [list 
> of primary 
> keys|https://docs.oracle.com/javase/7/docs/api/java/sql/DatabaseMetaData.html#getPrimaryKeys(java.lang.String,%20java.lang.String,%20java.lang.String)]
>  in the table:
>  * If the value in the VectorSchemaRoot corresponding to the primary key is 
> NULL, insert that record into the database.
>  * If the value in the VectorSchemaRoot corresponding to the primary key is 
> not NULL, update the existing record in the database.
> We can also perform the same data conversion in reverse based on the field 
> types queried from the database.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (ARROW-8992) [CI][C++] march not passing correctly for docker-compose run

2020-06-01 Thread Elliott Kipp (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-8992?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17121506#comment-17121506
 ] 

Elliott Kipp commented on ARROW-8992:
-

I only modified the volumes section of the docker-compose, (per 
[https://github.com/apache/arrow/pull/6907#issuecomment-612940401]) I didn't 
apply the rest of the patch. I believe I did a pull of the master at one point 
too, with the same result, but I'll double check tomorrow. 

> [CI][C++] march not passing correctly for docker-compose run
> 
>
> Key: ARROW-8992
> URL: https://issues.apache.org/jira/browse/ARROW-8992
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Packaging, Python
>Affects Versions: 0.17.0, 0.17.1
> Environment: Mendel Linux 4.0
>Reporter: Elliott Kipp
>Priority: Critical
> Attachments: CMakeError.log, CMakeOutput.log
>
>
> [https://github.com/apache/arrow/issues/7307]
> Building on the new ASUS Tinker Edge T, running Mendel Linux 4.0 (Day). 
> docker-compose build commands work fine with no errors:
>  DEBIAN=10 ARCH=arm64v8 docker-compose build debian-cpp && DEBIAN=10 
> ARCH=arm64v8 docker-compose build debian-python
> DEBIAN=10 ARCH=arm64v8 docker-compose run debian-python - fails with the 
> following:
> – Running cmake for pyarrow
>  cmake -DPYTHON_EXECUTABLE=/usr/local/bin/python -G Ninja 
> -DPYARROW_BUILD_CUDA=off -DPYARROW_BUILD_FLIGHT=on -DPYARROW_BUILD_GANDIVA=on 
> -DPYARROW_BUILD_DATASET=on -DPYARROW_BUILD_ORC=on -DPYARROW_BUILD_PARQUET=on 
> -DPYARROW_BUILD_PLASMA=on -DPYARROW_BUILD_S3=off -DPYARROW_BUILD_HDFS=off 
> -DPYARROW_USE_TENSORFLOW=off -DPYARROW_BUNDLE_ARROW_CPP=off 
> -DPYARROW_BUNDLE_BOOST=off -DPYARROW_GENERATE_COVERAGE=off 
> -DPYARROW_BOOST_USE_SHARED=on -DPYARROW_PARQUET_USE_SHARED=on 
> -DCMAKE_BUILD_TYPE=debug /arrow/python
>  – The C compiler identification is GNU 8.3.0
>  – The CXX compiler identification is GNU 8.3.0
>  – Check for working C compiler: /usr/lib/ccache/gcc
>  – Check for working C compiler: /usr/lib/ccache/gcc – works
>  – Detecting C compiler ABI info
>  – Detecting C compiler ABI info - done
>  – Detecting C compile features
>  – Detecting C compile features - done
>  – Check for working CXX compiler: /usr/lib/ccache/g++
>  – Check for working CXX compiler: /usr/lib/ccache/g++ – works
>  – Detecting CXX compiler ABI info
>  – Detecting CXX compiler ABI info - done
>  – Detecting CXX compile features
>  – Detecting CXX compile features - done
>  – System processor: aarch64
>  – Performing Test CXX_SUPPORTS_ARMV8_ARCH
>  – Performing Test CXX_SUPPORTS_ARMV8_ARCH - Failed
>  – Arrow build warning level: PRODUCTION
>  CMake Error at cmake_modules/SetupCxxFlags.cmake:338 (message):
>  Unsupported arch flag: -march=.
>  Call Stack (most recent call first):
>  CMakeLists.txt:100 (include)
> – Configuring incomplete, errors occurred!
>  See also "/build/python/temp.linux-aarch64-3.7/CMakeFiles/CMakeOutput.log".
>  See also "/build/python/temp.linux-aarch64-3.7/CMakeFiles/CMakeError.log".
>  error: command 'cmake' failed with exit status 1
> Tried the tarball release for both 0.17.0 and 0.17.1, same result. Also tried 
> compiling manually (following these instructions: 
> [https://dzone.com/articles/building-pyarrow-with-cuda-support]) with the 
> same result.
> Only modifications I made to source are editing the docker-compose volumes, 
> as described here: [https://github.com/apache/arrow/pull/6907]
> Jira opened, per request at: [https://github.com/apache/arrow/issues/7307]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (ARROW-4144) [Java] Arrow-to-JDBC

2020-06-01 Thread Chen (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-4144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17121502#comment-17121502
 ] 

Chen commented on ARROW-4144:
-

is it necessary to do it,?Arrow is designed for in-memory data analysis, the 
data won't be modified. 

recently i pay much  attention on Arrow, i want to make a contribution to the 
project.

if this issue is necessary, i am happy to resolve it.

> [Java] Arrow-to-JDBC
> 
>
> Key: ARROW-4144
> URL: https://issues.apache.org/jira/browse/ARROW-4144
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Java
>Reporter: Michael Pigott
>Priority: Major
>
> ARROW-1780 reads a query from a JDBC data source and converts the ResultSet 
> to an Arrow VectorSchemaRoot.  However, there is no built-in adapter for 
> writing an Arrow VectorSchemaRoot back to the database.
> ARROW-3966 adds JDBC field metadata:
>  * The Catalog Name
>  * The Table Name
>  * The Field Name
>  * The Field Type
> We can use this information to ask for the field information from the 
> database via the 
> [DatabaseMetaData|https://docs.oracle.com/javase/7/docs/api/java/sql/DatabaseMetaData.html]
>  object.  We can then create INSERT or UPDATE statements based on the [list 
> of primary 
> keys|https://docs.oracle.com/javase/7/docs/api/java/sql/DatabaseMetaData.html#getPrimaryKeys(java.lang.String,%20java.lang.String,%20java.lang.String)]
>  in the table:
>  * If the value in the VectorSchemaRoot corresponding to the primary key is 
> NULL, insert that record into the database.
>  * If the value in the VectorSchemaRoot corresponding to the primary key is 
> not NULL, update the existing record in the database.
> We can also perform the same data conversion in reverse based on the field 
> types queried from the database.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Closed] (ARROW-4530) [C++] Review Aggregate kernel state allocation/ownership semantics

2020-06-01 Thread Wes McKinney (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-4530?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney closed ARROW-4530.
---
Resolution: Later

> [C++] Review Aggregate kernel state allocation/ownership semantics
> --
>
> Key: ARROW-4530
> URL: https://issues.apache.org/jira/browse/ARROW-4530
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Francois Saint-Jacques
>Priority: Major
>  Labels: analytics
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Closed] (ARROW-8955) [C++] Use kernels for casting Scalar values instead of bespoke implementation

2020-06-01 Thread Wes McKinney (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8955?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney closed ARROW-8955.
---
Fix Version/s: (was: 1.0.0)
   Resolution: Duplicate

duplicate of ARROW-9006

> [C++] Use kernels for casting Scalar values instead of bespoke implementation
> -
>
> Key: ARROW-8955
> URL: https://issues.apache.org/jira/browse/ARROW-8955
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Priority: Major
>
> See details of casting in arrow/scalar.cc



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (ARROW-8844) [C++] Optimize TransferBitmap unaligned case

2020-06-01 Thread Wes McKinney (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8844?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney resolved ARROW-8844.
-
Fix Version/s: 1.0.0
   Resolution: Fixed

Issue resolved by pull request 7300
[https://github.com/apache/arrow/pull/7300]

> [C++] Optimize TransferBitmap unaligned case
> 
>
> Key: ARROW-8844
> URL: https://issues.apache.org/jira/browse/ARROW-8844
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Yibo Cai
>Assignee: Yibo Cai
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.0.0
>
>  Time Spent: 4.5h
>  Remaining Estimate: 0h
>
> TransferBitmap(CopyBitmap, InvertBitmap) unaligned case is processed 
> bit-by-bit[1]. Similar trick in this PR[2] may also be helpful here to 
> improve performance by processing in words.
> [1] 
> https://github.com/apache/arrow/blob/e5a33f1220705aec6a224b55d2a6f47fbd957603/cpp/src/arrow/util/bit_util.cc#L121-L134
> [2] https://github.com/apache/arrow/pull/7135



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-9006) [C++] Use Cast kernels to implement Scalar::Parse and Scalar::CastTo

2020-06-01 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-9006:
---

 Summary: [C++] Use Cast kernels to implement Scalar::Parse and 
Scalar::CastTo
 Key: ARROW-9006
 URL: https://issues.apache.org/jira/browse/ARROW-9006
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C++
Reporter: Wes McKinney
 Fix For: 1.0.0


We should not maintain distinct (and possibly differently behaving) 
implementations of elementwise array casting and scalar casting. The new 
kernels framework provides for relatively easily generating kernels that can 
process arrays or scalars. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (ARROW-8929) [C++] Change compute::Arity:VarArgs min_args default to 0

2020-06-01 Thread Wes McKinney (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney resolved ARROW-8929.
-
Resolution: Fixed

Issue resolved by pull request 7322
[https://github.com/apache/arrow/pull/7322]

> [C++] Change compute::Arity:VarArgs min_args default to 0
> -
>
> Key: ARROW-8929
> URL: https://issues.apache.org/jira/browse/ARROW-8929
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.0.0
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> The issue of minimum number of arguments is separate from providing an 
> {{InputType}} for input type checking. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-9005) Support sort expression

2020-06-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-9005?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-9005:
--
Labels: pull-request-available  (was: )

> Support sort expression
> ---
>
> Key: ARROW-9005
> URL: https://issues.apache.org/jira/browse/ARROW-9005
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust - DataFusion
>Reporter: QP Hou
>Assignee: QP Hou
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-9005) Support sort expression

2020-06-01 Thread QP Hou (Jira)
QP Hou created ARROW-9005:
-

 Summary: Support sort expression
 Key: ARROW-9005
 URL: https://issues.apache.org/jira/browse/ARROW-9005
 Project: Apache Arrow
  Issue Type: Improvement
  Components: Rust - DataFusion
Reporter: QP Hou
Assignee: QP Hou






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-9004) [C++][Gandiva] Upgrade to LLVM 10

2020-06-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-9004?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-9004:
--
Labels: pull-request-available  (was: )

> [C++][Gandiva] Upgrade to LLVM 10
> -
>
> Key: ARROW-9004
> URL: https://issues.apache.org/jira/browse/ARROW-9004
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++ - Gandiva
>Reporter: Kouhei Sutou
>Assignee: Kouhei Sutou
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-9004) [C++][Gandiva] Upgrade to LLVM 10

2020-06-01 Thread Kouhei Sutou (Jira)
Kouhei Sutou created ARROW-9004:
---

 Summary: [C++][Gandiva] Upgrade to LLVM 10
 Key: ARROW-9004
 URL: https://issues.apache.org/jira/browse/ARROW-9004
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C++ - Gandiva
Reporter: Kouhei Sutou
Assignee: Kouhei Sutou






--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (ARROW-3520) [C++] Implement List Flatten kernel

2020-06-01 Thread Wes McKinney (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-3520?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney reassigned ARROW-3520:
---

Assignee: Wes McKinney

> [C++] Implement List Flatten kernel
> ---
>
> Key: ARROW-3520
> URL: https://issues.apache.org/jira/browse/ARROW-3520
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Wes McKinney
>Priority: Major
> Fix For: 2.0.0
>
>
> see also ARROW-45



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-8929) [C++] Change compute::Arity:VarArgs min_args default to 0

2020-06-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-8929:
--
Labels: pull-request-available  (was: )

> [C++] Change compute::Arity:VarArgs min_args default to 0
> -
>
> Key: ARROW-8929
> URL: https://issues.apache.org/jira/browse/ARROW-8929
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.0.0
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> The issue of minimum number of arguments is separate from providing an 
> {{InputType}} for input type checking. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-9003) [C++] Add VectorFunction wrapping arrow::Concatenate

2020-06-01 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-9003:
---

 Summary: [C++] Add VectorFunction wrapping arrow::Concatenate
 Key: ARROW-9003
 URL: https://issues.apache.org/jira/browse/ARROW-9003
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C++
Reporter: Wes McKinney
 Fix For: 1.0.0


This would be a varargs function 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (ARROW-8929) [C++] Change compute::Arity:VarArgs min_args default to 0

2020-06-01 Thread Wes McKinney (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8929?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney reassigned ARROW-8929:
---

Assignee: Wes McKinney

> [C++] Change compute::Arity:VarArgs min_args default to 0
> -
>
> Key: ARROW-8929
> URL: https://issues.apache.org/jira/browse/ARROW-8929
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Wes McKinney
>Priority: Major
> Fix For: 1.0.0
>
>
> The issue of minimum number of arguments is separate from providing an 
> {{InputType}} for input type checking. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-8985) [Format] Add "byte width" field with default of 16 to Decimal Flatbuffers type for forward compatibility

2020-06-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8985?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-8985:
--
Labels: pull-request-available  (was: )

> [Format] Add "byte width" field with default of 16 to Decimal Flatbuffers 
> type for forward compatibility
> 
>
> Key: ARROW-8985
> URL: https://issues.apache.org/jira/browse/ARROW-8985
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Format
>Reporter: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.0.0
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> This will permit larger or smaller decimals to be added to the format later 
> without having to add a new Type union value



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-8896) [C++] Reimplement dictionary unpacking in Cast kernels using Take

2020-06-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-8896:
--
Labels: pull-request-available  (was: )

> [C++] Reimplement dictionary unpacking in Cast kernels using Take
> -
>
> Key: ARROW-8896
> URL: https://issues.apache.org/jira/browse/ARROW-8896
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.0.0
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> As suggested by [~apitrou] this should yield less code to maintain



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (ARROW-8896) [C++] Reimplement dictionary unpacking in Cast kernels using Take

2020-06-01 Thread Wes McKinney (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8896?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney reassigned ARROW-8896:
---

Assignee: Wes McKinney

> [C++] Reimplement dictionary unpacking in Cast kernels using Take
> -
>
> Key: ARROW-8896
> URL: https://issues.apache.org/jira/browse/ARROW-8896
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Wes McKinney
>Priority: Major
> Fix For: 1.0.0
>
>
> As suggested by [~apitrou] this should yield less code to maintain



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-9002) [C++] Unable to load libjvm on ppc64le architecture for hdfs.connect()

2020-06-01 Thread Yuan Cheng (Jira)
Yuan Cheng created ARROW-9002:
-

 Summary: [C++] Unable to load libjvm on ppc64le architecture for 
hdfs.connect()
 Key: ARROW-9002
 URL: https://issues.apache.org/jira/browse/ARROW-9002
 Project: Apache Arrow
  Issue Type: Bug
  Components: C++
Affects Versions: 0.17.1
 Environment: Ubuntu 16.04.4 LTS
Reporter: Yuan Cheng
 Attachments: patch.txt





--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (ARROW-8931) [Rust] Support lexical sort in arrow compute kernel

2020-06-01 Thread Neville Dipale (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8931?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Neville Dipale resolved ARROW-8931.
---
Fix Version/s: 1.0.0
   Resolution: Fixed

Issue resolved by pull request 7265
[https://github.com/apache/arrow/pull/7265]

> [Rust] Support lexical sort in arrow compute kernel
> ---
>
> Key: ARROW-8931
> URL: https://issues.apache.org/jira/browse/ARROW-8931
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust
>Reporter: QP Hou
>Assignee: QP Hou
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.0.0
>
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-8917) [C++][Compute] Formalize "metafunction" concept

2020-06-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-8917:
--
Labels: pull-request-available  (was: )

> [C++][Compute] Formalize "metafunction" concept
> ---
>
> Key: ARROW-8917
> URL: https://issues.apache.org/jira/browse/ARROW-8917
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Wes McKinney
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.0.0
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> A metafunction is a function that provides the {{Execute}} API but does not 
> contain any kernels. Such functions can also handle non-Array/Scalar inputs 
> like RecordBatch or Table. 
> This will enable bindings to invoke such functions (like take, filter) like
> {code}
> call_function('take', [table, indices])
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-9001) [R] Box outputs as correct type in call_function

2020-06-01 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-9001:
---

 Summary: [R] Box outputs as correct type in call_function
 Key: ARROW-9001
 URL: https://issues.apache.org/jira/browse/ARROW-9001
 Project: Apache Arrow
  Issue Type: Improvement
  Components: R
Reporter: Wes McKinney
 Fix For: 1.0.0


This would prevent segfaults by putting the SEXP in the wrong kind of R6 
container



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (ARROW-8843) [C++] Optimize BitmapEquals unaligned case

2020-06-01 Thread Francois Saint-Jacques (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8843?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Francois Saint-Jacques resolved ARROW-8843.
---
Fix Version/s: 1.0.0
   Resolution: Fixed

Issue resolved by pull request 7285
[https://github.com/apache/arrow/pull/7285]

> [C++] Optimize BitmapEquals unaligned case
> --
>
> Key: ARROW-8843
> URL: https://issues.apache.org/jira/browse/ARROW-8843
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Yibo Cai
>Assignee: Yibo Cai
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.0.0
>
>  Time Spent: 1h 50m
>  Remaining Estimate: 0h
>
> BitmapEquals unaligned case compares two bitmap bit-by-bit[1]. Similar tricks 
> in this PR[2] may also be helpful here to improve performance by processing 
> in words.
> [1] 
> https://github.com/apache/arrow/blob/e5a33f1220705aec6a224b55d2a6f47fbd957603/cpp/src/arrow/util/bit_util.cc#L248-L254
> [2] https://github.com/apache/arrow/pull/7135



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (ARROW-8485) [Integration][Java] Implement extension types integration

2020-06-01 Thread David Li (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8485?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

David Li resolved ARROW-8485.
-
Resolution: Fixed

Issue resolved by pull request 7270
[https://github.com/apache/arrow/pull/7270]

> [Integration][Java] Implement extension types integration
> -
>
> Key: ARROW-8485
> URL: https://issues.apache.org/jira/browse/ARROW-8485
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Integration, Java
>Reporter: Antoine Pitrou
>Assignee: Ryan Murray
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.0.0
>
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> Java should support the extension type integration tests added in ARROW-5649.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (ARROW-8997) [Archery] Benchmark formatter should have friendly units

2020-06-01 Thread Francois Saint-Jacques (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Francois Saint-Jacques resolved ARROW-8997.
---
Fix Version/s: 1.0.0
   Resolution: Fixed

Issue resolved by pull request 7316
[https://github.com/apache/arrow/pull/7316]

> [Archery] Benchmark formatter should have friendly units
> 
>
> Key: ARROW-8997
> URL: https://issues.apache.org/jira/browse/ARROW-8997
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Archery
>Reporter: Francois Saint-Jacques
>Assignee: Francois Saint-Jacques
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 1.0.0
>
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>
> The current output is not friendly to glance at. Usage of humanfriendly can 
> help here.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-9000) Java build crashes with JDK14

2020-06-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-9000?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-9000:
--
Labels: pull-request-available  (was: )

> Java build crashes with JDK14
> -
>
> Key: ARROW-9000
> URL: https://issues.apache.org/jira/browse/ARROW-9000
> Project: Apache Arrow
>  Issue Type: Bug
>Reporter: Laurent Goujon
>Assignee: Laurent Goujon
>Priority: Minor
>  Labels: pull-request-available
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Current master tree does not build with JDK14. The issue seems to be caused 
> by error prone plugin:
> {noformat}
> [ERROR] Failed to execute goal 
> org.apache.maven.plugins:maven-compiler-plugin:3.6.2:compile 
> (default-compile) on project arrow-memory: Compilation failure
> [ERROR] 
> /Users/laurent/devel/arrow/java/memory/src/main/java/org/apache/arrow/memory/BufferLedger.java:[545,15]
>  error: An unhandled exception was thrown by the Error Prone static analysis 
> plugin.
> [ERROR]  Please report this at 
> https://github.com/google/error-prone/issues/new and include the following:
> [ERROR]   
> [ERROR]  error-prone version: 2.3.3
> [ERROR]  BugPattern: TypeParameterUnusedInFormals
> [ERROR]  Stack Trace:
> [ERROR]  java.lang.NoSuchFieldError: bound
> [ERROR]   at 
> com.google.errorprone.bugpatterns.TypeParameterUnusedInFormals.matchMethod(TypeParameterUnusedInFormals.java:71)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.processMatchers(ErrorProneScanner.java:433)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitMethod(ErrorProneScanner.java:725)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitMethod(ErrorProneScanner.java:150)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.tree.JCTree$JCMethodDecl.accept(JCTree.java:916)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreePathScanner.scan(TreePathScanner.java:82)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:71)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:45)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scanAndReduce(TreeScanner.java:90)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scan(TreeScanner.java:105)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scanAndReduce(TreeScanner.java:113)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.visitClass(TreeScanner.java:187)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitClass(ErrorProneScanner.java:535)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitClass(ErrorProneScanner.java:150)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.tree.JCTree$JCClassDecl.accept(JCTree.java:823)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreePathScanner.scan(TreePathScanner.java:82)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:71)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:45)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scan(TreeScanner.java:105)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.scanAndReduce(TreeScanner.java:113)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreeScanner.visitCompilationUnit(TreeScanner.java:144)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitCompilationUnit(ErrorProneScanner.java:546)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScanner.visitCompilationUnit(ErrorProneScanner.java:150)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.tree.JCTree$JCCompilationUnit.accept(JCTree.java:603)
> [ERROR]   at 
> jdk.compiler/com.sun.source.util.TreePathScanner.scan(TreePathScanner.java:56)
> [ERROR]   at com.google.errorprone.scanner.Scanner.scan(Scanner.java:55)
> [ERROR]   at 
> com.google.errorprone.scanner.ErrorProneScannerTransformer.apply(ErrorProneScannerTransformer.java:43)
> [ERROR]   at 
> com.google.errorprone.ErrorProneAnalyzer.finished(ErrorProneAnalyzer.java:151)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.api.MultiTaskListener.finished(MultiTaskListener.java:132)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.JavaCompiler.flow(JavaCompiler.java:1423)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.JavaCompiler.flow(JavaCompiler.java:1370)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.JavaCompiler.compile(JavaCompiler.java:959)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.Main.compile(Main.java:316)
> [ERROR]   at 
> jdk.compiler/com.sun.tools.javac.main.Main.compile(Main.java:176)
> [ERROR]   at jdk.compiler/com.sun.tools.javac.Ma

[jira] [Created] (ARROW-9000) Java build crashes with JDK14

2020-06-01 Thread Laurent Goujon (Jira)
Laurent Goujon created ARROW-9000:
-

 Summary: Java build crashes with JDK14
 Key: ARROW-9000
 URL: https://issues.apache.org/jira/browse/ARROW-9000
 Project: Apache Arrow
  Issue Type: Bug
Reporter: Laurent Goujon
Assignee: Laurent Goujon


Current master tree does not build with JDK14. The issue seems to be caused by 
error prone plugin:
{noformat}
[ERROR] Failed to execute goal 
org.apache.maven.plugins:maven-compiler-plugin:3.6.2:compile (default-compile) 
on project arrow-memory: Compilation failure
[ERROR] 
/Users/laurent/devel/arrow/java/memory/src/main/java/org/apache/arrow/memory/BufferLedger.java:[545,15]
 error: An unhandled exception was thrown by the Error Prone static analysis 
plugin.
[ERROR]  Please report this at 
https://github.com/google/error-prone/issues/new and include the following:
[ERROR]   
[ERROR]  error-prone version: 2.3.3
[ERROR]  BugPattern: TypeParameterUnusedInFormals
[ERROR]  Stack Trace:
[ERROR]  java.lang.NoSuchFieldError: bound
[ERROR] at 
com.google.errorprone.bugpatterns.TypeParameterUnusedInFormals.matchMethod(TypeParameterUnusedInFormals.java:71)
[ERROR] at 
com.google.errorprone.scanner.ErrorProneScanner.processMatchers(ErrorProneScanner.java:433)
[ERROR] at 
com.google.errorprone.scanner.ErrorProneScanner.visitMethod(ErrorProneScanner.java:725)
[ERROR] at 
com.google.errorprone.scanner.ErrorProneScanner.visitMethod(ErrorProneScanner.java:150)
[ERROR] at 
jdk.compiler/com.sun.tools.javac.tree.JCTree$JCMethodDecl.accept(JCTree.java:916)
[ERROR] at 
jdk.compiler/com.sun.source.util.TreePathScanner.scan(TreePathScanner.java:82)
[ERROR] at com.google.errorprone.scanner.Scanner.scan(Scanner.java:71)
[ERROR] at com.google.errorprone.scanner.Scanner.scan(Scanner.java:45)
[ERROR] at 
jdk.compiler/com.sun.source.util.TreeScanner.scanAndReduce(TreeScanner.java:90)
[ERROR] at 
jdk.compiler/com.sun.source.util.TreeScanner.scan(TreeScanner.java:105)
[ERROR] at 
jdk.compiler/com.sun.source.util.TreeScanner.scanAndReduce(TreeScanner.java:113)
[ERROR] at 
jdk.compiler/com.sun.source.util.TreeScanner.visitClass(TreeScanner.java:187)
[ERROR] at 
com.google.errorprone.scanner.ErrorProneScanner.visitClass(ErrorProneScanner.java:535)
[ERROR] at 
com.google.errorprone.scanner.ErrorProneScanner.visitClass(ErrorProneScanner.java:150)
[ERROR] at 
jdk.compiler/com.sun.tools.javac.tree.JCTree$JCClassDecl.accept(JCTree.java:823)
[ERROR] at 
jdk.compiler/com.sun.source.util.TreePathScanner.scan(TreePathScanner.java:82)
[ERROR] at com.google.errorprone.scanner.Scanner.scan(Scanner.java:71)
[ERROR] at com.google.errorprone.scanner.Scanner.scan(Scanner.java:45)
[ERROR] at 
jdk.compiler/com.sun.source.util.TreeScanner.scan(TreeScanner.java:105)
[ERROR] at 
jdk.compiler/com.sun.source.util.TreeScanner.scanAndReduce(TreeScanner.java:113)
[ERROR] at 
jdk.compiler/com.sun.source.util.TreeScanner.visitCompilationUnit(TreeScanner.java:144)
[ERROR] at 
com.google.errorprone.scanner.ErrorProneScanner.visitCompilationUnit(ErrorProneScanner.java:546)
[ERROR] at 
com.google.errorprone.scanner.ErrorProneScanner.visitCompilationUnit(ErrorProneScanner.java:150)
[ERROR] at 
jdk.compiler/com.sun.tools.javac.tree.JCTree$JCCompilationUnit.accept(JCTree.java:603)
[ERROR] at 
jdk.compiler/com.sun.source.util.TreePathScanner.scan(TreePathScanner.java:56)
[ERROR] at com.google.errorprone.scanner.Scanner.scan(Scanner.java:55)
[ERROR] at 
com.google.errorprone.scanner.ErrorProneScannerTransformer.apply(ErrorProneScannerTransformer.java:43)
[ERROR] at 
com.google.errorprone.ErrorProneAnalyzer.finished(ErrorProneAnalyzer.java:151)
[ERROR] at 
jdk.compiler/com.sun.tools.javac.api.MultiTaskListener.finished(MultiTaskListener.java:132)
[ERROR] at 
jdk.compiler/com.sun.tools.javac.main.JavaCompiler.flow(JavaCompiler.java:1423)
[ERROR] at 
jdk.compiler/com.sun.tools.javac.main.JavaCompiler.flow(JavaCompiler.java:1370)
[ERROR] at 
jdk.compiler/com.sun.tools.javac.main.JavaCompiler.compile(JavaCompiler.java:959)
[ERROR] at 
jdk.compiler/com.sun.tools.javac.main.Main.compile(Main.java:316)
[ERROR] at 
jdk.compiler/com.sun.tools.javac.main.Main.compile(Main.java:176)
[ERROR] at jdk.compiler/com.sun.tools.javac.Main.compile(Main.java:57)
[ERROR] at jdk.compiler/com.sun.tools.javac.Main.main(Main.java:43)
[ERROR] 
[ERROR] -> [Help 1]
[ERROR] 
[ERROR] To see the full stack trace of the errors, re-run Maven with the -e 
switch.
[ERROR] Re-run Maven using the -X switch to enable full debug logging.
[ERROR] 
[ERROR] For more information about the errors and possible solutions, please 
read the following articles:

[jira] [Resolved] (ARROW-7784) [C++] diff.cc is extremely slow to compile

2020-06-01 Thread Ben Kietzman (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-7784?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ben Kietzman resolved ARROW-7784.
-
Resolution: Fixed

Issue resolved by pull request 7311
[https://github.com/apache/arrow/pull/7311]

> [C++] diff.cc is extremely slow to compile
> --
>
> Key: ARROW-7784
> URL: https://issues.apache.org/jira/browse/ARROW-7784
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Antoine Pitrou
>Assignee: Wes McKinney
>Priority: Minor
>  Labels: pull-request-available
> Fix For: 1.0.0
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> This comes up especially when doing an optimized build. {{diff.cc}} is always 
> enabled even if all components are disabled, and it takes multiple seconds to 
> compile. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (ARROW-8990) [C++] Benchmark hash table against thirdparty options, possibly vendor a thirdparty hash table library

2020-06-01 Thread Maarten Breddels (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-8990?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17121205#comment-17121205
 ] 

Maarten Breddels commented on ARROW-8990:
-

FYI, I've been using that library and 
[https://github.com/skarupke/flat_hash_map] for Vaex. After some benchmarking 
settled for the tsl one, but my research/benchmark wasn't very thorough, 
because the idea was I could easily switch if needed. But because the 
performance was great, I never looked back actually, so I'd be interested in 
the benchmark result.

By the same author, the [https://github.com/Tessil/hat-trie] library can also 
be very interesting to take a look at.

> [C++] Benchmark hash table against thirdparty options, possibly vendor a 
> thirdparty hash table library
> --
>
> Key: ARROW-8990
> URL: https://issues.apache.org/jira/browse/ARROW-8990
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Priority: Major
>
> While we have our own hash table implementation, it would be worthwhile to 
> set up some benchmarks so that we can compare against std::unordered_map and 
> some other thirdparty libraries for hash tables to know whether we should 
> possibly use a thirdparty library. See e.g.
> https://tessil.github.io/2016/08/29/benchmark-hopscotch-map.html
> Libraries to consider: 
> * https://github.com/sparsehash/sparsehash



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-8997) [Archery] Benchmark formatter should have friendly units

2020-06-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-8997:
--
Labels: pull-request-available  (was: )

> [Archery] Benchmark formatter should have friendly units
> 
>
> Key: ARROW-8997
> URL: https://issues.apache.org/jira/browse/ARROW-8997
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Archery
>Reporter: Francois Saint-Jacques
>Assignee: Francois Saint-Jacques
>Priority: Minor
>  Labels: pull-request-available
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> The current output is not friendly to glance at. Usage of humanfriendly can 
> help here.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (ARROW-8997) [Archery] Benchmark formatter should have friendly units

2020-06-01 Thread Francois Saint-Jacques (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Francois Saint-Jacques reassigned ARROW-8997:
-

Assignee: Francois Saint-Jacques

> [Archery] Benchmark formatter should have friendly units
> 
>
> Key: ARROW-8997
> URL: https://issues.apache.org/jira/browse/ARROW-8997
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Archery
>Reporter: Francois Saint-Jacques
>Assignee: Francois Saint-Jacques
>Priority: Minor
>
> The current output is not friendly to glance at. Usage of humanfriendly can 
> help here.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (ARROW-7009) [C++] Refactor filter/take kernels to use Datum instead of overloads

2020-06-01 Thread Wes McKinney (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-7009?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney reassigned ARROW-7009:
---

Assignee: Wes McKinney  (was: Ben Kietzman)

> [C++] Refactor filter/take kernels to use Datum instead of overloads
> 
>
> Key: ARROW-7009
> URL: https://issues.apache.org/jira/browse/ARROW-7009
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: C++
>Reporter: Neal Richardson
>Assignee: Wes McKinney
>Priority: Minor
> Fix For: 1.0.0
>
>
> Followup to ARROW-6784. See discussion on 
> [https://github.com/apache/arrow/pull/5686,|https://github.com/apache/arrow/pull/5686]
>  as well as ARROW-6959.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (ARROW-8917) [C++][Compute] Formalize "metafunction" concept

2020-06-01 Thread Wes McKinney (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8917?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney reassigned ARROW-8917:
---

Assignee: Wes McKinney

> [C++][Compute] Formalize "metafunction" concept
> ---
>
> Key: ARROW-8917
> URL: https://issues.apache.org/jira/browse/ARROW-8917
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Assignee: Wes McKinney
>Priority: Major
> Fix For: 1.0.0
>
>
> A metafunction is a function that provides the {{Execute}} API but does not 
> contain any kernels. Such functions can also handle non-Array/Scalar inputs 
> like RecordBatch or Table. 
> This will enable bindings to invoke such functions (like take, filter) like
> {code}
> call_function('take', [table, indices])
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-8992) [CI][C++] march not passing correctly for docker-compose run

2020-06-01 Thread Wes McKinney (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8992?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney updated ARROW-8992:

Summary: [CI][C++] march not passing correctly for docker-compose run  
(was: march not passing correctly for docker-compose run)

> [CI][C++] march not passing correctly for docker-compose run
> 
>
> Key: ARROW-8992
> URL: https://issues.apache.org/jira/browse/ARROW-8992
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Packaging, Python
>Affects Versions: 0.17.0, 0.17.1
> Environment: Mendel Linux 4.0
>Reporter: Elliott Kipp
>Priority: Critical
> Attachments: CMakeError.log, CMakeOutput.log
>
>
> [https://github.com/apache/arrow/issues/7307]
> Building on the new ASUS Tinker Edge T, running Mendel Linux 4.0 (Day). 
> docker-compose build commands work fine with no errors:
>  DEBIAN=10 ARCH=arm64v8 docker-compose build debian-cpp && DEBIAN=10 
> ARCH=arm64v8 docker-compose build debian-python
> DEBIAN=10 ARCH=arm64v8 docker-compose run debian-python - fails with the 
> following:
> – Running cmake for pyarrow
>  cmake -DPYTHON_EXECUTABLE=/usr/local/bin/python -G Ninja 
> -DPYARROW_BUILD_CUDA=off -DPYARROW_BUILD_FLIGHT=on -DPYARROW_BUILD_GANDIVA=on 
> -DPYARROW_BUILD_DATASET=on -DPYARROW_BUILD_ORC=on -DPYARROW_BUILD_PARQUET=on 
> -DPYARROW_BUILD_PLASMA=on -DPYARROW_BUILD_S3=off -DPYARROW_BUILD_HDFS=off 
> -DPYARROW_USE_TENSORFLOW=off -DPYARROW_BUNDLE_ARROW_CPP=off 
> -DPYARROW_BUNDLE_BOOST=off -DPYARROW_GENERATE_COVERAGE=off 
> -DPYARROW_BOOST_USE_SHARED=on -DPYARROW_PARQUET_USE_SHARED=on 
> -DCMAKE_BUILD_TYPE=debug /arrow/python
>  – The C compiler identification is GNU 8.3.0
>  – The CXX compiler identification is GNU 8.3.0
>  – Check for working C compiler: /usr/lib/ccache/gcc
>  – Check for working C compiler: /usr/lib/ccache/gcc – works
>  – Detecting C compiler ABI info
>  – Detecting C compiler ABI info - done
>  – Detecting C compile features
>  – Detecting C compile features - done
>  – Check for working CXX compiler: /usr/lib/ccache/g++
>  – Check for working CXX compiler: /usr/lib/ccache/g++ – works
>  – Detecting CXX compiler ABI info
>  – Detecting CXX compiler ABI info - done
>  – Detecting CXX compile features
>  – Detecting CXX compile features - done
>  – System processor: aarch64
>  – Performing Test CXX_SUPPORTS_ARMV8_ARCH
>  – Performing Test CXX_SUPPORTS_ARMV8_ARCH - Failed
>  – Arrow build warning level: PRODUCTION
>  CMake Error at cmake_modules/SetupCxxFlags.cmake:338 (message):
>  Unsupported arch flag: -march=.
>  Call Stack (most recent call first):
>  CMakeLists.txt:100 (include)
> – Configuring incomplete, errors occurred!
>  See also "/build/python/temp.linux-aarch64-3.7/CMakeFiles/CMakeOutput.log".
>  See also "/build/python/temp.linux-aarch64-3.7/CMakeFiles/CMakeError.log".
>  error: command 'cmake' failed with exit status 1
> Tried the tarball release for both 0.17.0 and 0.17.1, same result. Also tried 
> compiling manually (following these instructions: 
> [https://dzone.com/articles/building-pyarrow-with-cuda-support]) with the 
> same result.
> Only modifications I made to source are editing the docker-compose volumes, 
> as described here: [https://github.com/apache/arrow/pull/6907]
> Jira opened, per request at: [https://github.com/apache/arrow/issues/7307]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (ARROW-8995) [C++] Scalar formatting code used in array/diff.cc should be reusable

2020-06-01 Thread Wes McKinney (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-8995?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17121145#comment-17121145
 ] 

Wes McKinney commented on ARROW-8995:
-

Since there is already {{Scalar::ToString}}, let's use that. Does not seem 
justified to have more than one way to format scalar values. 

> [C++] Scalar formatting code used in array/diff.cc should be reusable
> -
>
> Key: ARROW-8995
> URL: https://issues.apache.org/jira/browse/ARROW-8995
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Wes McKinney
>Priority: Major
>
> Formatting Array values as strings is not specific to the diff.cc code, so it 
> may make sense to move this code elsewhere where it can be used generally 
> (perhaps a method like {{Array::FormatValue}}?). 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (ARROW-5854) [Python] Expose compare kernels on Array class

2020-06-01 Thread Wes McKinney (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-5854?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney resolved ARROW-5854.
-
Resolution: Fixed

Issue resolved by pull request 7273
[https://github.com/apache/arrow/pull/7273]

> [Python] Expose compare kernels on Array class
> --
>
> Key: ARROW-5854
> URL: https://issues.apache.org/jira/browse/ARROW-5854
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Python
>Reporter: Joris Van den Bossche
>Assignee: Joris Van den Bossche
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.0.0
>
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> Expose the compare kernel for comparing with scalar or array (ARROW-3087, 
> ARROW-4990) on the python Array class.
> This can implement the {{\_\_eq\_\_}} et al dunder methods on the Array class.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-8999) [Python][C++] Non-deterministic segfault in "AMD64 MacOS 10.15 Python 3.7" build

2020-06-01 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8999:
---

 Summary: [Python][C++] Non-deterministic segfault in "AMD64 MacOS 
10.15 Python 3.7" build
 Key: ARROW-8999
 URL: https://issues.apache.org/jira/browse/ARROW-8999
 Project: Apache Arrow
  Issue Type: Bug
  Components: Python
Reporter: Wes McKinney
 Fix For: 1.0.0


I've been seeing this segfault periodically the last week, does anyone have an 
idea what might be wrong?

https://github.com/apache/arrow/pull/7273/checks?check_run_id=717249862



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (ARROW-6482) [Rust] Investigate enabling features in regex crate to reduce compile times

2020-06-01 Thread Paddy Horan (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-6482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17121073#comment-17121073
 ] 

Paddy Horan commented on ARROW-6482:


Yep, it's the unicode features I had in mind.

> [Rust] Investigate enabling features in regex crate to reduce compile times
> ---
>
> Key: ARROW-6482
> URL: https://issues.apache.org/jira/browse/ARROW-6482
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust
>Affects Versions: 0.14.1
>Reporter: Paddy Horan
>Priority: Minor
>  Labels: beginner
> Fix For: 1.0.0
>
>
> The regex crate recently added a feature flag to reduce compile times and 
> binary size if certain unicode related features are not needed.  We should 
> investigate using this feature.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-8998) [Python] Make NumPy an optional runtime dependency

2020-06-01 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8998:
---

 Summary: [Python] Make NumPy an optional runtime dependency
 Key: ARROW-8998
 URL: https://issues.apache.org/jira/browse/ARROW-8998
 Project: Apache Arrow
  Issue Type: New Feature
  Components: Python
Reporter: Wes McKinney


Since in the relatively near future, one will be able to do non-trivial 
analytical operations and query processing natively on Arrow data structures 
through pyarrow, it does not make sense to require users to always install 
NumPy when that install pyarrow. I propose to split the NumPy-depending parts 
of libarrow_python into a libarrow_numpy (which also must be bundled) and 
moving this part of the codebase into a separate Cython module.

This refactoring should be relatively painless though there may be a number of 
packaging details to chase up since this would introduce a new shared library 
to be installed in various packaging targets. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-8998) [Python] Make NumPy an optional runtime dependency

2020-06-01 Thread Wes McKinney (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8998?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Wes McKinney updated ARROW-8998:

Description: 
Since in the relatively near future, one will be able to do non-trivial 
analytical operations and query processing natively on Arrow data structures 
through pyarrow, it does not make sense to require users to always install 
NumPy when they install pyarrow. I propose to split the NumPy-depending parts 
of libarrow_python into a libarrow_numpy (which also must be bundled) and 
moving this part of the codebase into a separate Cython module.

This refactoring should be relatively painless though there may be a number of 
packaging details to chase up since this would introduce a new shared library 
to be installed in various packaging targets. 

  was:
Since in the relatively near future, one will be able to do non-trivial 
analytical operations and query processing natively on Arrow data structures 
through pyarrow, it does not make sense to require users to always install 
NumPy when that install pyarrow. I propose to split the NumPy-depending parts 
of libarrow_python into a libarrow_numpy (which also must be bundled) and 
moving this part of the codebase into a separate Cython module.

This refactoring should be relatively painless though there may be a number of 
packaging details to chase up since this would introduce a new shared library 
to be installed in various packaging targets. 


> [Python] Make NumPy an optional runtime dependency
> --
>
> Key: ARROW-8998
> URL: https://issues.apache.org/jira/browse/ARROW-8998
> Project: Apache Arrow
>  Issue Type: New Feature
>  Components: Python
>Reporter: Wes McKinney
>Priority: Major
>
> Since in the relatively near future, one will be able to do non-trivial 
> analytical operations and query processing natively on Arrow data structures 
> through pyarrow, it does not make sense to require users to always install 
> NumPy when they install pyarrow. I propose to split the NumPy-depending parts 
> of libarrow_python into a libarrow_numpy (which also must be bundled) and 
> moving this part of the codebase into a separate Cython module.
> This refactoring should be relatively painless though there may be a number 
> of packaging details to chase up since this would introduce a new shared 
> library to be installed in various packaging targets. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-8997) [Archery] Benchmark formatter should have friendly units

2020-06-01 Thread Francois Saint-Jacques (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Francois Saint-Jacques updated ARROW-8997:
--
Issue Type: Improvement  (was: Bug)

> [Archery] Benchmark formatter should have friendly units
> 
>
> Key: ARROW-8997
> URL: https://issues.apache.org/jira/browse/ARROW-8997
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Archery
>Reporter: Francois Saint-Jacques
>Priority: Minor
>
> The current output is not friendly to glance at. Usage of humanfriendly can 
> help here.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-8997) [Archery] Benchmark formatter should have friendly units

2020-06-01 Thread Francois Saint-Jacques (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Francois Saint-Jacques updated ARROW-8997:
--
Priority: Minor  (was: Major)

> [Archery] Benchmark formatter should have friendly units
> 
>
> Key: ARROW-8997
> URL: https://issues.apache.org/jira/browse/ARROW-8997
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Archery
>Reporter: Francois Saint-Jacques
>Priority: Minor
>
> The current output is not friendly to glance at. Usage of humanfriendly can 
> help here.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-8997) [Archery] Benchmark formatter should have friendly units

2020-06-01 Thread Francois Saint-Jacques (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Francois Saint-Jacques updated ARROW-8997:
--
Component/s: Archery

> [Archery] Benchmark formatter should have friendly units
> 
>
> Key: ARROW-8997
> URL: https://issues.apache.org/jira/browse/ARROW-8997
> Project: Apache Arrow
>  Issue Type: Bug
>  Components: Archery
>Reporter: Francois Saint-Jacques
>Priority: Major
>
> The current output is not friendly to glance at. Usage of humanfriendly can 
> help here.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-8997) [Archery] Benchmark formatter should have friendly units

2020-06-01 Thread Francois Saint-Jacques (Jira)
Francois Saint-Jacques created ARROW-8997:
-

 Summary: [Archery] Benchmark formatter should have friendly units
 Key: ARROW-8997
 URL: https://issues.apache.org/jira/browse/ARROW-8997
 Project: Apache Arrow
  Issue Type: Bug
Reporter: Francois Saint-Jacques


The current output is not friendly to glance at. Usage of humanfriendly can 
help here.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (ARROW-8986) [Archery][ursabot] Fix benchmark diff checkout of origin/master

2020-06-01 Thread Francois Saint-Jacques (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8986?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Francois Saint-Jacques reassigned ARROW-8986:
-

Assignee: Francois Saint-Jacques

> [Archery][ursabot] Fix benchmark diff checkout of origin/master
> ---
>
> Key: ARROW-8986
> URL: https://issues.apache.org/jira/browse/ARROW-8986
> Project: Apache Arrow
>  Issue Type: Bug
>Reporter: Francois Saint-Jacques
>Assignee: Francois Saint-Jacques
>Priority: Minor
>
> https://github.com/apache/arrow/pull/7300#issuecomment-635967095



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (ARROW-6482) [Rust] Investigate enabling features in regex crate to reduce compile times

2020-06-01 Thread Neville Dipale (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-6482?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17120956#comment-17120956
 ] 

Neville Dipale commented on ARROW-6482:
---

[~paddyhoran] I've briefly looked at this, and it seems like the only feature 
we'd want to remove would be Unicode support 
([https://docs.rs/regex/1.3.9/regex/#crate-features]). 

> [Rust] Investigate enabling features in regex crate to reduce compile times
> ---
>
> Key: ARROW-6482
> URL: https://issues.apache.org/jira/browse/ARROW-6482
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Rust
>Affects Versions: 0.14.1
>Reporter: Paddy Horan
>Priority: Minor
>  Labels: beginner
> Fix For: 1.0.0
>
>
> The regex crate recently added a feature flag to reduce compile times and 
> binary size if certain unicode related features are not needed.  We should 
> investigate using this feature.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (ARROW-5213) [Format] Script for updating various checked-in Flatbuffers files

2020-06-01 Thread Neville Dipale (Jira)


[ 
https://issues.apache.org/jira/browse/ARROW-5213?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17120952#comment-17120952
 ] 

Neville Dipale commented on ARROW-5213:
---

[~andygrove] I'd like to remove this from the 1.0.0 milestone. We can't build 
files until the next flatbuffers release due to a bug in the codegen of 
required fields, and the other already known bugs. There's an alternative 
flatbuf implementation being worked on at [https://github.com/butte-rs/butte], 
but I don't know what its timelines are. Perhaps until one of the 2 options is 
fine, we could continue manually generating the files?

> [Format] Script for updating various checked-in Flatbuffers files
> -
>
> Key: ARROW-5213
> URL: https://issues.apache.org/jira/browse/ARROW-5213
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: Developer Tools, Format, Go, Rust
>Reporter: Wes McKinney
>Assignee: Andy Grove
>Priority: Minor
> Fix For: 1.0.0
>
>
> Some subprojects have begun checking in generated Flatbuffers files to source 
> control. This presents a maintainability issue when there are additions or 
> changes made to the .fbs sources. It would be useful to be able to automate 
> the update of these files so it doesn't have to happen on a manual / 
> case-by-case basis



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (ARROW-8996) [C++] Runtime SSE path for Aggregate Sum kernel of dense

2020-06-01 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/ARROW-8996?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated ARROW-8996:
--
Labels: pull-request-available  (was: )

> [C++] Runtime SSE path for Aggregate Sum kernel of dense
> 
>
> Key: ARROW-8996
> URL: https://issues.apache.org/jira/browse/ARROW-8996
> Project: Apache Arrow
>  Issue Type: Improvement
>  Components: C++
>Reporter: Frank Du
>Assignee: Frank Du
>Priority: Major
>  Labels: pull-request-available
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Thanks to the new kernel framework, now it can override a SIMD kernel version 
> at runtime.
>  
> Below is the things we need implemented for a SSE version of sum kernel to 
> dense data:
> 1. Add SSE intrinsic path for aggregate sum dense.
>  2. Add build support to append the compiler flag for SIMD code file.
>  3. Register the SSE version at runtime as the CPU feature.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (ARROW-8996) [C++] Runtime SSE path for Aggregate Sum kernel of dense

2020-06-01 Thread Frank Du (Jira)
Frank Du created ARROW-8996:
---

 Summary: [C++] Runtime SSE path for Aggregate Sum kernel of dense
 Key: ARROW-8996
 URL: https://issues.apache.org/jira/browse/ARROW-8996
 Project: Apache Arrow
  Issue Type: Improvement
  Components: C++
Reporter: Frank Du
Assignee: Frank Du


Thanks to the new kernel framework, now it can override a SIMD kernel version 
at runtime.

 

Below is the things we need implemented for a SSE version of sum kernel to 
dense data:

1. Add SSE intrinsic path for aggregate sum dense.
 2. Add build support to append the compiler flag for SIMD code file.
 3. Register the SSE version at runtime as the CPU feature.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)