[jira] [Updated] (ARROW-10337) [C++] More liberal parsing of ISO8601 timestamps with fractional seconds

2020-10-19 Thread Wes McKinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10337?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Wes McKinney updated ARROW-10337: - Summary: [C++] More liberal parsing of ISO8601 timestamps with fractional seconds (was: More

[jira] [Commented] (ARROW-10309) [Ruby] gem install red-arrow fails

2020-10-19 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17217230#comment-17217230 ] Kouhei Sutou commented on ARROW-10309: -- Really? {{yum install -y ruby}} installs Ruby 2.0.0 not

[jira] [Created] (ARROW-10351) [C++][Flight] See if reading/writing to gRPC get/put streams asynchronously helps performance

2020-10-19 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-10351: Summary: [C++][Flight] See if reading/writing to gRPC get/put streams asynchronously helps performance Key: ARROW-10351 URL: https://issues.apache.org/jira/browse/ARROW-10351

[jira] [Updated] (ARROW-10342) Arrow vector in Java(Scala) allocate byteBuffer error while read the bytes from Python pyarrow

2020-10-19 Thread Litchy Soong (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10342?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Litchy Soong updated ARROW-10342: - Description: I am using scala arrow 1.0.1 and pyarrow 1.0.1 Following error occurs when scala

[jira] [Commented] (ARROW-5409) [C++] Improvement for IsIn Kernel when right array is small

2020-10-19 Thread David Sherrier (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5409?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17217214#comment-17217214 ] David Sherrier commented on ARROW-5409: --- Hey Wes I added a benchmark (attached here) and found that

[jira] [Updated] (ARROW-5409) [C++] Improvement for IsIn Kernel when right array is small

2020-10-19 Thread David Sherrier (Jira)
[ https://issues.apache.org/jira/browse/ARROW-5409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Sherrier updated ARROW-5409: -- Attachment: set_lookup_benchmark > [C++] Improvement for IsIn Kernel when right array is small

[jira] [Updated] (ARROW-10270) [R] Fix CSV timestamp_parsers test on R-devel

2020-10-19 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10270?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson updated ARROW-10270: Fix Version/s: (was: 2.0.0) 3.0.0 > [R] Fix CSV timestamp_parsers

[jira] [Commented] (ARROW-10350) [Rust] parquet_derive crate cannot be published to crates.io

2020-10-19 Thread Neville Dipale (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10350?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17217141#comment-17217141 ] Neville Dipale commented on ARROW-10350: I added them as part of another commit, but the

[jira] [Created] (ARROW-10350) [Rust] parquet_derive crate cannot be published to crates.io

2020-10-19 Thread Andy Grove (Jira)
Andy Grove created ARROW-10350: -- Summary: [Rust] parquet_derive crate cannot be published to crates.io Key: ARROW-10350 URL: https://issues.apache.org/jira/browse/ARROW-10350 Project: Apache Arrow

[jira] [Commented] (ARROW-10349) [Python] build and publish aarch64 wheels

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17217022#comment-17217022 ] Antoine Pitrou commented on ARROW-10349: Hmm, I just learned that manylinux is

[jira] [Commented] (ARROW-10349) [Python] build and publish aarch64 wheels

2020-10-19 Thread Jonathan Swinney (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10349?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17217020#comment-17217020 ] Jonathan Swinney commented on ARROW-10349: -- This is partially fixed by

[jira] [Updated] (ARROW-10349) [Python] build and publish aarch64 wheels

2020-10-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10349?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10349: --- Labels: pull-request-available (was: ) > [Python] build and publish aarch64 wheels >

[jira] [Created] (ARROW-10349) [Python] build and publish aarch64 wheels

2020-10-19 Thread Jonathan Swinney (Jira)
Jonathan Swinney created ARROW-10349: Summary: [Python] build and publish aarch64 wheels Key: ARROW-10349 URL: https://issues.apache.org/jira/browse/ARROW-10349 Project: Apache Arrow

[jira] [Updated] (ARROW-10348) [C++] Fix crash on invalid Parquet file (OSS-Fuzz)

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-10348: --- Fix Version/s: (was: 2.0.0) 3.0.0 > [C++] Fix crash on invalid

[jira] [Resolved] (ARROW-10348) [C++] Fix crash on invalid Parquet file (OSS-Fuzz)

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-10348. Fix Version/s: (was: 3.0.0) 2.0.0 Resolution: Fixed Issue

[jira] [Commented] (ARROW-10309) [Ruby] gem install red-arrow fails

2020-10-19 Thread Bhargav Parsi (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10309?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216893#comment-17216893 ] Bhargav Parsi commented on ARROW-10309: --- Yes, We use `yum install -y ruby` in our docker

[jira] [Updated] (ARROW-10348) [C++] Fix crash on invalid Parquet file (OSS-Fuzz)

2020-10-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10348?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-10348: --- Labels: pull-request-available (was: ) > [C++] Fix crash on invalid Parquet file

[jira] [Created] (ARROW-10348) [C++] Fix crash on invalid Parquet file (OSS-Fuzz)

2020-10-19 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-10348: -- Summary: [C++] Fix crash on invalid Parquet file (OSS-Fuzz) Key: ARROW-10348 URL: https://issues.apache.org/jira/browse/ARROW-10348 Project: Apache Arrow

[jira] [Updated] (ARROW-10347) [Python][Dataset] Test behaviour in case of duplicate partition field / data column

2020-10-19 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10347?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-10347: -- Description: See

[jira] [Commented] (ARROW-10337) More liberal parsing of ISO8601 timestamps with fractional seconds

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10337?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216821#comment-17216821 ] Antoine Pitrou commented on ARROW-10337: Thanks for the report. I agree a PR would be welcome

[jira] [Created] (ARROW-10347) [Python][Dataset] Test behaviour in case of duplicate partition field / data column

2020-10-19 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-10347: - Summary: [Python][Dataset] Test behaviour in case of duplicate partition field / data column Key: ARROW-10347 URL:

[jira] [Commented] (ARROW-10346) [Python] Default S3 region is eu-central-1 even with LANG=C

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216812#comment-17216812 ] Antoine Pitrou commented on ARROW-10346: Perhaps in the Python test file instead? > [Python]

[jira] [Commented] (ARROW-10346) [Python] Default S3 region is eu-central-1 even with LANG=C

2020-10-19 Thread Uwe Korn (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216808#comment-17216808 ] Uwe Korn commented on ARROW-10346: -- {{AWS_CONFIG_FILE=/dev/null}} was sufficient for the tests to pass,

[jira] [Commented] (ARROW-10346) [Python] Default S3 region is eu-central-1 even with LANG=C

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216776#comment-17216776 ] Antoine Pitrou commented on ARROW-10346: Perhaps you can retry with

[jira] [Updated] (ARROW-9991) [C++] split kernels for strings/binary

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9991: -- Fix Version/s: (was: 2.0.0) 3.0.0 > [C++] split kernels for

[jira] [Resolved] (ARROW-9991) [C++] split kernels for strings/binary

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9991?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-9991. --- Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 8271

[jira] [Commented] (ARROW-10346) [Python] Default S3 region is eu-central-1 even with LANG=C

2020-10-19 Thread Uwe Korn (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216765#comment-17216765 ] Uwe Korn commented on ARROW-10346: -- Yes, there was a user-config changing this. A bit confusing to me

[jira] [Updated] (ARROW-10346) [Python] Default S3 region is eu-central-1 even with LANG=C

2020-10-19 Thread Uwe Korn (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10346?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Uwe Korn updated ARROW-10346: - Priority: Minor (was: Major) > [Python] Default S3 region is eu-central-1 even with LANG=C >

[jira] [Commented] (ARROW-10346) [Python] Default S3 region is eu-central-1 even with LANG=C

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10346?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216721#comment-17216721 ] Antoine Pitrou commented on ARROW-10346: That must be because it's picking up your local AWS

[jira] [Updated] (ARROW-9164) [C++] Provide APIs for adding "docstrings" to arrow::compute::Function classes that can be accessed by bindings

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-9164: -- Fix Version/s: (was: 2.0.0) 3.0.0 > [C++] Provide APIs for adding

[jira] [Resolved] (ARROW-9164) [C++] Provide APIs for adding "docstrings" to arrow::compute::Function classes that can be accessed by bindings

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9164?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-9164. --- Fix Version/s: (was: 3.0.0) 2.0.0 Resolution: Fixed Issue

[jira] [Commented] (ARROW-10261) [Rust] [BREAKING] Lists should take Field instead of DataType

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10261?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216687#comment-17216687 ] Antoine Pitrou commented on ARROW-10261: Indeed, C++ uses {{List}}. > [Rust] [BREAKING] Lists

[jira] [Resolved] (ARROW-10106) [FlightRPC][Java] Expose onIsReady() callback on OutboundStreamListener

2020-10-19 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-10106. -- Resolution: Fixed Resolved by

[jira] [Updated] (ARROW-10106) [FlightRPC][Java] Expose onIsReady() callback on OutboundStreamListener

2020-10-19 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10106?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-10106: - Fix Version/s: 3.0.0 > [FlightRPC][Java] Expose onIsReady() callback on OutboundStreamListener >

[jira] [Resolved] (ARROW-10203) [Doc] Capture guidance for endianness support in contributors guide.

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10203?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-10203. Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 8374

[jira] [Commented] (ARROW-10345) [C++] NaN breaks sorting

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10345?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216674#comment-17216674 ] Antoine Pitrou commented on ARROW-10345: cc [~yibo] > [C++] NaN breaks sorting >

[jira] [Created] (ARROW-10345) [C++] NaN breaks sorting

2020-10-19 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-10345: -- Summary: [C++] NaN breaks sorting Key: ARROW-10345 URL: https://issues.apache.org/jira/browse/ARROW-10345 Project: Apache Arrow Issue Type: Bug

[jira] [Updated] (ARROW-10241) [C++][Compute] Add variance kernel benchmark

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-10241: --- Fix Version/s: (was: 2.0.0) 3.0.0 > [C++][Compute] Add variance

[jira] [Resolved] (ARROW-10241) [C++][Compute] Add variance kernel benchmark

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou resolved ARROW-10241. Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 8407

[jira] [Commented] (ARROW-10343) [C++] Unable to parse strings into timestamps

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10343?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216616#comment-17216616 ] Antoine Pitrou commented on ARROW-10343: Thanks for the report. This appears to work on git

[jira] [Updated] (ARROW-10343) [C++] Unable to parse strings into timestamps

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-10343: --- Summary: [C++] Unable to parse strings into timestamps (was: Unable to parse strings into

[jira] [Updated] (ARROW-10343) [C++] Unable to parse strings into timestamps

2020-10-19 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-10343: --- Component/s: C++ > [C++] Unable to parse strings into timestamps >

[jira] [Updated] (ARROW-10344) [Python] Get all columns names (or schema) from Feather file, before loading whole Feather file

2020-10-19 Thread Gert Hulselmans (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10344?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Gert Hulselmans updated ARROW-10344: Description: Is there a way to get all column names (or schema) from a Feather file

[jira] [Created] (ARROW-10344) [Python] Get all columns names from Feather file, before loading whole Feather file

2020-10-19 Thread Gert Hulselmans (Jira)
Gert Hulselmans created ARROW-10344: --- Summary: [Python] Get all columns names from Feather file, before loading whole Feather file Key: ARROW-10344 URL: https://issues.apache.org/jira/browse/ARROW-10344

[jira] [Commented] (ARROW-10159) [Rust][DataFusion] Add support for Dictionary types in data fusion

2020-10-19 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216587#comment-17216587 ] Andrew Lamb commented on ARROW-10159: - Good call [~nevi_me] -- I have indeed completed the work I

[jira] [Resolved] (ARROW-10159) [Rust][DataFusion] Add support for Dictionary types in data fusion

2020-10-19 Thread Andrew Lamb (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10159?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Andrew Lamb resolved ARROW-10159. - Resolution: Fixed All subtasks completed > [Rust][DataFusion] Add support for Dictionary types

[jira] [Resolved] (ARROW-10310) [C++][Gandiva] Add single argument round() in Gandiva

2020-10-19 Thread Praveen Kumar (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10310?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Praveen Kumar resolved ARROW-10310. --- Fix Version/s: 2.0.0 Resolution: Fixed Issue resolved by pull request 8467

[jira] [Commented] (ARROW-10315) [C++] CSV skip wrong rows

2020-10-19 Thread Maciej (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10315?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216554#comment-17216554 ] Maciej commented on ARROW-10315: Emitting nulls wouldn't work for me. I may stick with checking the file

[jira] [Closed] (ARROW-10314) [C++] CSV wrong row number in error message

2020-10-19 Thread Maciej (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10314?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Maciej closed ARROW-10314. -- Resolution: Feedback Received > [C++] CSV wrong row number in error message >

[jira] [Commented] (ARROW-10314) [C++] CSV wrong row number in error message

2020-10-19 Thread Maciej (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10314?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216547#comment-17216547 ] Maciej commented on ARROW-10314: OK, thanks for the answer. > [C++] CSV wrong row number in error

[jira] [Closed] (ARROW-8435) [Python] A TypeError is raised while token expires during writing to S3

2020-10-19 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche closed ARROW-8435. Resolution: Feedback Received > [Python] A TypeError is raised while token expires

[jira] [Commented] (ARROW-8435) [Python] A TypeError is raised while token expires during writing to S3

2020-10-19 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8435?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17216535#comment-17216535 ] Joris Van den Bossche commented on ARROW-8435: -- Given this is also tracked in the s3fs issue

[jira] [Updated] (ARROW-9963) [Python] Recognize datetime.timezone.utc as UTC on conversion python->pyarrow

2020-10-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-9963: -- Labels: pull-request-available (was: ) > [Python] Recognize datetime.timezone.utc as UTC on

[jira] [Updated] (ARROW-10343) Unable to parse strings into timestamps

2020-10-19 Thread Niclas Roos (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Niclas Roos updated ARROW-10343: Description: Hi, I'm working with parquet files generated by a AWS RDS Postgres snapshot export. 

[jira] [Updated] (ARROW-10343) Unable to parse strings into timestamps

2020-10-19 Thread Niclas Roos (Jira)
[ https://issues.apache.org/jira/browse/ARROW-10343?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Niclas Roos updated ARROW-10343: Priority: Minor (was: Major) > Unable to parse strings into timestamps >

[jira] [Created] (ARROW-10343) Unable to parse strings into timestamps

2020-10-19 Thread Niclas Roos (Jira)
Niclas Roos created ARROW-10343: --- Summary: Unable to parse strings into timestamps Key: ARROW-10343 URL: https://issues.apache.org/jira/browse/ARROW-10343 Project: Apache Arrow Issue Type: Bug