[jira] [Updated] (ARROW-13172) Make TYPE_WIDTH in Vector public

2021-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13172: --- Labels: pull-request-available (was: ) > Make TYPE_WIDTH in Vector public > ---

[jira] [Created] (ARROW-13172) Make TYPE_WIDTH in Vector public

2021-06-24 Thread Eduard Tudenhoefner (Jira)
Eduard Tudenhoefner created ARROW-13172: --- Summary: Make TYPE_WIDTH in Vector public Key: ARROW-13172 URL: https://issues.apache.org/jira/browse/ARROW-13172 Project: Apache Arrow Issue T

[jira] [Updated] (ARROW-13154) [C++] Unions can not have 126 and 127 as type_codes

2021-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13154?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13154: --- Labels: pull-request-available (was: ) > [C++] Unions can not have 126 and 127 as type_code

[jira] [Updated] (ARROW-13054) [C++] Add TemporalOptions

2021-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13054?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13054: --- Labels: pull-request-available (was: ) > [C++] Add TemporalOptions > --

[jira] [Created] (ARROW-13171) [R] Add binding for str_pad()

2021-06-24 Thread Ian Cook (Jira)
Ian Cook created ARROW-13171: Summary: [R] Add binding for str_pad() Key: ARROW-13171 URL: https://issues.apache.org/jira/browse/ARROW-13171 Project: Apache Arrow Issue Type: Improvement

[jira] [Updated] (ARROW-12716) [C++] Left/right/center string padding kernels

2021-06-24 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook updated ARROW-12716: - Summary: [C++] Left/right/center string padding kernels (was: [C++] Left/right string padding kernels)

[jira] [Resolved] (ARROW-12869) [R] Bindings for utf8_reverse and ascii_reverse

2021-06-24 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook resolved ARROW-12869. -- Resolution: Fixed Issue resolved by pull request 10589 [https://github.com/apache/arrow/pull/10589] >

[jira] [Updated] (ARROW-13169) [R] [C++] sorted partition keys can cause issues

2021-06-24 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-13169: --- Description: _This is a regression after 4.0.1 so is not a live-bug in a release version of

[jira] [Updated] (ARROW-13169) [R] [C++] sorted partition keys can cause issues

2021-06-24 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mauricio 'Pachá' Vargas Sepúlveda updated ARROW-13169: -- Description: _This is a regression after 4.0.1 so is n

[jira] [Commented] (ARROW-13160) [CI][C++] Use binary caching for vcpkg builds

2021-06-24 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17369060#comment-17369060 ] Kouhei Sutou commented on ARROW-13160: -- We already cache vcpkg for macOS: https://

[jira] [Updated] (ARROW-13169) [R] [C++] sorted partition keys can cause issues

2021-06-24 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-13169: --- Priority: Blocker (was: Major) > [R] [C++] sorted partition keys can cause issues > ---

[jira] [Updated] (ARROW-13169) [R] [C++] sorted partition keys can cause issues

2021-06-24 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-13169: --- Description: When a partition key happens to be ordered, on large (>=1e7 rows), the partiti

[jira] [Updated] (ARROW-13169) [R] [C++] sorted partition keys can cause issues

2021-06-24 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-13169: --- Component/s: C++ > [R] [C++] sorted partition keys can cause issues > --

[jira] [Updated] (ARROW-13169) [R] [C++] sorted partition keys can cause issues

2021-06-24 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-13169: --- Description: When a partition key happens to be ordered, on large (>=1e7 rows), the partiti

[jira] [Assigned] (ARROW-12714) [C++] String title case kernel

2021-06-24 Thread Eduardo Ponce (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12714?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Eduardo Ponce reassigned ARROW-12714: - Assignee: Eduardo Ponce > [C++] String title case kernel >

[jira] [Updated] (ARROW-13169) [R] [C++] sorted partition keys can cause issues

2021-06-24 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-13169: --- Summary: [R] [C++] sorted partition keys can cause issues (was: [R] group_by + write_datase

[jira] [Comment Edited] (ARROW-13117) [R] Retain schema in new Expressions

2021-06-24 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368971#comment-17368971 ] Ian Cook edited comment on ARROW-13117 at 6/24/21, 7:10 PM:

[jira] [Created] (ARROW-13170) [C++] Reducing branching in compute/kernels/vector_selection.cc

2021-06-24 Thread Niranda Perera (Jira)
Niranda Perera created ARROW-13170: -- Summary: [C++] Reducing branching in compute/kernels/vector_selection.cc Key: ARROW-13170 URL: https://issues.apache.org/jira/browse/ARROW-13170 Project: Apache A

[jira] [Assigned] (ARROW-13136) [C++] Add a "coalesce" variadic scalar kernel

2021-06-24 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13136?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li reassigned ARROW-13136: Assignee: David Li > [C++] Add a "coalesce" variadic scalar kernel >

[jira] [Comment Edited] (ARROW-13169) [R] group_by + write_dataset skips some countries with UN COMTRADE / BACI datasets

2021-06-24 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17369036#comment-17369036 ] Mauricio 'Pachá' Vargas Sepúlveda edited comment on ARROW-13169 at 6/24/21, 6:46 PM:

[jira] [Commented] (ARROW-13169) [R] group_by + write_dataset skips some countries with UN COMTRADE / BACI datasets

2021-06-24 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17369036#comment-17369036 ] Mauricio 'Pachá' Vargas Sepúlveda commented on ARROW-13169: --- t

[jira] [Updated] (ARROW-13169) [R] group_by + write_dataset skips some countries with UN COMTRADE / BACI datasets

2021-06-24 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mauricio 'Pachá' Vargas Sepúlveda updated ARROW-13169: -- Attachment: screenshot-1.png > [R] group_by + write_da

[jira] [Updated] (ARROW-13169) [R] group_by + write_dataset skips some countries with UN COMTRADE / BACI datasets

2021-06-24 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jonathan Keane updated ARROW-13169: --- Affects Version/s: (was: 4.0.1) > [R] group_by + write_dataset skips some countries with

[jira] [Commented] (ARROW-13157) [C++] Implement ignore_case option for find_substring

2021-06-24 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17369031#comment-17369031 ] Ian Cook commented on ARROW-13157: -- RE2 also treats everything between {{\Q}} and {{\E}

[jira] [Commented] (ARROW-13169) [R] group_by + write_dataset skips some countries with UN COMTRADE / BACI datasets

2021-06-24 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17369029#comment-17369029 ] Mauricio 'Pachá' Vargas Sepúlveda commented on ARROW-13169: --- s

[jira] [Updated] (ARROW-13157) [C++] Implement ignore_case option for find_substring

2021-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13157: --- Labels: pull-request-available (was: ) > [C++] Implement ignore_case option for find_substr

[jira] [Commented] (ARROW-13169) [R] group_by + write_dataset skips some countries with UN COMTRADE / BACI datasets

2021-06-24 Thread Jonathan Keane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17369021#comment-17369021 ] Jonathan Keane commented on ARROW-13169: What version of Arrow are you using to

[jira] [Commented] (ARROW-13161) [C++][Dataset] Allow setting FragmentReadahead to 0 in ScannerBuilder

2021-06-24 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17369016#comment-17369016 ] Weston Pace commented on ARROW-13161: - David is right, batch readahead is largely ig

[jira] [Updated] (ARROW-13169) [R] group_by + write_dataset skips some countries with UN COMTRADE / BACI datasets

2021-06-24 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mauricio 'Pachá' Vargas Sepúlveda updated ARROW-13169: -- Description: A bit of context: the data for this exam

[jira] [Updated] (ARROW-13169) [R] group_by + write_dataset skips some countries with UN COMTRADE / BACI datasets

2021-06-24 Thread Jira
[ https://issues.apache.org/jira/browse/ARROW-13169?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Mauricio 'Pachá' Vargas Sepúlveda updated ARROW-13169: -- Description: A bit of context: the data for this exam

[jira] [Created] (ARROW-13169) [R] group_by + write_dataset skips some countries with UN COMTRADE / BACI datasets

2021-06-24 Thread Jira
Mauricio 'Pachá' Vargas Sepúlveda created ARROW-13169: - Summary: [R] group_by + write_dataset skips some countries with UN COMTRADE / BACI datasets Key: ARROW-13169 URL: https://issues.apache.o

[jira] [Assigned] (ARROW-13157) [C++] Implement ignore_case option for find_substring

2021-06-24 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13157?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li reassigned ARROW-13157: Assignee: David Li > [C++] Implement ignore_case option for find_substring > ---

[jira] [Updated] (ARROW-13168) [C++] Timezone database configuration and access

2021-06-24 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc updated ARROW-13168: --- Description: Note: currently timezone database is not available on windows so timezone aware opera

[jira] [Closed] (ARROW-13167) [C++] Type determination kernels ("type", "type_id")

2021-06-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13167?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou closed ARROW-13167. -- Resolution: Won't Do Sorry, but rejecting this. > [C++] Type determination kernels ("type", "

[jira] [Commented] (ARROW-13167) [C++] Type determination kernels ("type", "type_id")

2021-06-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368979#comment-17368979 ] Antoine Pitrou commented on ARROW-13167: An SQL frontend for Arrow could support

[jira] [Commented] (ARROW-13167) [C++] Type determination kernels ("type", "type_id")

2021-06-24 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368978#comment-17368978 ] Ian Cook commented on ARROW-13167: -- From the perspective of a user of a compute API, i

[jira] [Comment Edited] (ARROW-13167) [C++] Type determination kernels ("type", "type_id")

2021-06-24 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368978#comment-17368978 ] Ian Cook edited comment on ARROW-13167 at 6/24/21, 5:16 PM:

[jira] [Commented] (ARROW-13167) [C++] Type determination kernels ("type", "type_id")

2021-06-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368974#comment-17368974 ] Antoine Pitrou commented on ARROW-13167: I don't understand why this needs to be

[jira] [Created] (ARROW-13168) [C++] Timezone database configuration and access

2021-06-24 Thread Rok Mihevc (Jira)
Rok Mihevc created ARROW-13168: -- Summary: [C++] Timezone database configuration and access Key: ARROW-13168 URL: https://issues.apache.org/jira/browse/ARROW-13168 Project: Apache Arrow Issue Typ

[jira] [Commented] (ARROW-13117) [R] Retain schema in new Expressions

2021-06-24 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13117?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368971#comment-17368971 ] Ian Cook commented on ARROW-13117: -- ARROW-13167 would eliminate the need for all these

[jira] [Created] (ARROW-13167) [C++] Type determination kernels ("type", "type_id")

2021-06-24 Thread Ian Cook (Jira)
Ian Cook created ARROW-13167: Summary: [C++] Type determination kernels ("type", "type_id") Key: ARROW-13167 URL: https://issues.apache.org/jira/browse/ARROW-13167 Project: Apache Arrow Issue Typ

[jira] [Updated] (ARROW-13104) [C++] ByteStreamSplit implementation uses invalid pointer cast

2021-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13104: --- Labels: pull-request-available (was: ) > [C++] ByteStreamSplit implementation uses invalid

[jira] [Assigned] (ARROW-13104) [C++] ByteStreamSplit implementation uses invalid pointer cast

2021-06-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13104?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou reassigned ARROW-13104: -- Assignee: Antoine Pitrou > [C++] ByteStreamSplit implementation uses invalid pointer

[jira] [Commented] (ARROW-13118) [R] Improve handling of R scalars in some nse_funcs

2021-06-24 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13118?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368959#comment-17368959 ] Ian Cook commented on ARROW-13118: -- Yes, the examples above are simplistic; a case that

[jira] [Commented] (ARROW-13151) [Python] Unable to read single child field of struct column from Parquet

2021-06-24 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368952#comment-17368952 ] Micah Kornfield commented on ARROW-13151: - As an aside it is ugly that we need t

[jira] [Commented] (ARROW-13151) [Python] Unable to read single child field of struct column from Parquet

2021-06-24 Thread Micah Kornfield (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368946#comment-17368946 ] Micah Kornfield commented on ARROW-13151: - This seems like a legitimate bug, I w

[jira] [Assigned] (ARROW-12944) [C++] String capitalize kernel

2021-06-24 Thread Niranda Perera (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12944?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Niranda Perera reassigned ARROW-12944: -- Assignee: Niranda Perera > [C++] String capitalize kernel > -

[jira] [Assigned] (ARROW-12946) [C++] String swap case kernel

2021-06-24 Thread Niranda Perera (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Niranda Perera reassigned ARROW-12946: -- Assignee: Niranda Perera > [C++] String swap case kernel > --

[jira] [Commented] (ARROW-13160) [CI][C++] Use binary caching for vcpkg builds

2021-06-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368933#comment-17368933 ] Antoine Pitrou commented on ARROW-13160: Hmm, GHA caching seems to be per-branch

[jira] [Commented] (ARROW-13150) [Python] combine_chunks fails on column of table, but does not error on table itself

2021-06-24 Thread Gert Hulselmans (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368929#comment-17368929 ] Gert Hulselmans commented on ARROW-13150: - https://issues.apache.org/jira/browse

[jira] [Resolved] (ARROW-13022) [R] bindings for lubridate's year, isoyear, quarter, month, day, wday, yday, isoweek, hour, minute, and second functions

2021-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13022?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-13022. - Resolution: Fixed Issue resolved by pull request 10507 [https://github.com/apache/arrow/

[jira] [Resolved] (ARROW-13037) [R] Incorrect param when creating Expression crashes R

2021-06-24 Thread Neal Richardson (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13037?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Neal Richardson resolved ARROW-13037. - Resolution: Fixed Issue resolved by pull request 10584 [https://github.com/apache/arrow/

[jira] [Commented] (ARROW-13150) [Python] combine_chunks fails on column of table, but does not error on table itself

2021-06-24 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368907#comment-17368907 ] David Li commented on ARROW-13150: -- I think ARROW-7245 and possibly ARROW-9003 would be

[jira] [Comment Edited] (ARROW-13157) [C++] Implement ignore_case option for find_substring

2021-06-24 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368896#comment-17368896 ] David Li edited comment on ARROW-13157 at 6/24/21, 2:43 PM:

[jira] [Commented] (ARROW-13157) [C++] Implement ignore_case option for find_substring

2021-06-24 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13157?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368896#comment-17368896 ] David Li commented on ARROW-13157: -- IIRC, this should be doable, but requires some trou

[jira] [Created] (ARROW-13166) Java Dataset API ScanOptions expansion

2021-06-24 Thread Sebastiaan Alvarez Rodriguez (Jira)
Sebastiaan Alvarez Rodriguez created ARROW-13166: Summary: Java Dataset API ScanOptions expansion Key: ARROW-13166 URL: https://issues.apache.org/jira/browse/ARROW-13166 Project: Apache

[jira] [Resolved] (ARROW-12870) [R] Bindings for stringr::str_like

2021-06-24 Thread Ian Cook (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ian Cook resolved ARROW-12870. -- Resolution: Fixed Issue resolved by pull request 10590 [https://github.com/apache/arrow/pull/10590] >

[jira] [Updated] (ARROW-13151) [Python] Unable to read single child field of struct column from Parquet

2021-06-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-13151: -- Summary: [Python] Unable to read single child field of struct column from Parq

[jira] [Commented] (ARROW-13151) [Python] Unable to read column of `list>`

2021-06-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13151?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368879#comment-17368879 ] Joris Van den Bossche commented on ARROW-13151: --- Reading the file itself s

[jira] [Commented] (ARROW-13150) [Python] combine_chunks fails on column of table, but does not error on table itself

2021-06-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13150?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368877#comment-17368877 ] Joris Van den Bossche commented on ARROW-13150: --- Both are implemented diff

[jira] [Commented] (ARROW-13161) [C++][Dataset] Allow setting FragmentReadahead to 0 in ScannerBuilder

2021-06-24 Thread Jayjeet Chakraborty (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368872#comment-17368872 ] Jayjeet Chakraborty commented on ARROW-13161: - I see. Looks like fragment re

[jira] [Created] (ARROW-13165) [R] Add bindings for ProjectOptions

2021-06-24 Thread Ian Cook (Jira)
Ian Cook created ARROW-13165: Summary: [R] Add bindings for ProjectOptions Key: ARROW-13165 URL: https://issues.apache.org/jira/browse/ARROW-13165 Project: Apache Arrow Issue Type: Improvement

[jira] [Created] (ARROW-13164) [R] altrep vectors from Array with nulls

2021-06-24 Thread Romain Francois (Jira)
Romain Francois created ARROW-13164: --- Summary: [R] altrep vectors from Array with nulls Key: ARROW-13164 URL: https://issues.apache.org/jira/browse/ARROW-13164 Project: Apache Arrow Issue T

[jira] [Commented] (ARROW-13161) Allow setting FragmentReadahead to 0 in ScannerBuilder

2021-06-24 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368831#comment-17368831 ] David Li commented on ARROW-13161: -- Thanks for clarifying. The sync scanner will be goi

[jira] [Updated] (ARROW-13161) [C++][Dataset] Allow setting FragmentReadahead to 0 in ScannerBuilder

2021-06-24 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-13161: - Summary: [C++][Dataset] Allow setting FragmentReadahead to 0 in ScannerBuilder (was: Allow setting Frag

[jira] [Updated] (ARROW-13161) [C++][Dataset] Allow setting FragmentReadahead to 0 in ScannerBuilder

2021-06-24 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13161?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li updated ARROW-13161: - Labels: dataset datasets (was: ) > [C++][Dataset] Allow setting FragmentReadahead to 0 in ScannerBuilde

[jira] [Comment Edited] (ARROW-13161) Allow setting FragmentReadahead to 0 in ScannerBuilder

2021-06-24 Thread Jayjeet Chakraborty (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368826#comment-17368826 ] Jayjeet Chakraborty edited comment on ARROW-13161 at 6/24/21, 12:51 PM: --

[jira] [Commented] (ARROW-13161) Allow setting FragmentReadahead to 0 in ScannerBuilder

2021-06-24 Thread Jayjeet Chakraborty (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368826#comment-17368826 ] Jayjeet Chakraborty commented on ARROW-13161: - Thanks a lot for the quick re

[jira] [Created] (ARROW-13163) [C++][Gandiva] Implement REPEAT function on Gandiva

2021-06-24 Thread Jira
João Pedro Antunes Ferreira created ARROW-13163: --- Summary: [C++][Gandiva] Implement REPEAT function on Gandiva Key: ARROW-13163 URL: https://issues.apache.org/jira/browse/ARROW-13163 Proj

[jira] [Created] (ARROW-13162) [C++][Gandiva] Add new alias for extract date functions in Gandiva registry

2021-06-24 Thread Jira
João Pedro Antunes Ferreira created ARROW-13162: --- Summary: [C++][Gandiva] Add new alias for extract date functions in Gandiva registry Key: ARROW-13162 URL: https://issues.apache.org/jira/browse/ARRO

[jira] [Updated] (ARROW-13145) [C++][CI] Flight test crashes on MinGW

2021-06-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Antoine Pitrou updated ARROW-13145: --- Fix Version/s: 5.0.0 > [C++][CI] Flight test crashes on MinGW >

[jira] [Resolved] (ARROW-13145) [C++][CI] Flight test crashes on MinGW

2021-06-24 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13145?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-13145. -- Assignee: David Li Resolution: Fixed The new revision fixed the issue (e.g. see https://github.

[jira] [Commented] (ARROW-13161) Allow setting FragmentReadahead to 0 in ScannerBuilder

2021-06-24 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13161?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368817#comment-17368817 ] David Li commented on ARROW-13161: -- [~westonpace] can confirm but it's because places o

[jira] [Updated] (ARROW-13113) [R] use RTasks to manage parallel in converting arrow to R

2021-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13113: --- Labels: pull-request-available (was: ) > [R] use RTasks to manage parallel in converting ar

[jira] [Assigned] (ARROW-11441) [R] Read CSV from character vector

2021-06-24 Thread Nic Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-11441?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nic Crane reassigned ARROW-11441: - Assignee: Nic Crane > [R] Read CSV from character vector > -- >

[jira] [Created] (ARROW-13161) Allow setting FragmentReadahead to 0 in ScannerBuilder

2021-06-24 Thread Jayjeet Chakraborty (Jira)
Jayjeet Chakraborty created ARROW-13161: --- Summary: Allow setting FragmentReadahead to 0 in ScannerBuilder Key: ARROW-13161 URL: https://issues.apache.org/jira/browse/ARROW-13161 Project: Apache A

[jira] [Commented] (ARROW-13145) [C++][CI] Flight test crashes on MinGW

2021-06-24 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13145?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368795#comment-17368795 ] David Li commented on ARROW-13145: -- Upstream released a new package revision that shoul

[jira] [Created] (ARROW-13160) [CI][C++] Use binary caching for vcpkg builds

2021-06-24 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-13160: -- Summary: [CI][C++] Use binary caching for vcpkg builds Key: ARROW-13160 URL: https://issues.apache.org/jira/browse/ARROW-13160 Project: Apache Arrow Issu

[jira] [Commented] (ARROW-13160) [CI][C++] Use binary caching for vcpkg builds

2021-06-24 Thread Antoine Pitrou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13160?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368785#comment-17368785 ] Antoine Pitrou commented on ARROW-13160: [~icook] [~kszucs] > [CI][C++] Use bin

[jira] [Assigned] (ARROW-13137) [C++][Documentation] Make in-table references consistent

2021-06-24 Thread Alenka Frim (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alenka Frim reassigned ARROW-13137: --- Assignee: Alenka Frim > [C++][Documentation] Make in-table references consistent >

[jira] [Created] (ARROW-13159) [Doc][Python] The use of IPython directive or doctest code blocks in the python user guide

2021-06-24 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13159: - Summary: [Doc][Python] The use of IPython directive or doctest code blocks in the python user guide Key: ARROW-13159 URL: https://issues.apache.org/jira/browse/A

[jira] [Updated] (ARROW-13141) [C++][Python] HadoopFileSystem: automatically set CLASSPATH based on HADOOP_HOME env variable?

2021-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13141: --- Labels: filesystem hdfs pull-request-available (was: filesystem hdfs) > [C++][Python] Hadoo

[jira] [Commented] (ARROW-9997) [Python] StructScalar.as_py() fails if the type has duplicate field names

2021-06-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368714#comment-17368714 ] Joris Van den Bossche commented on ARROW-9997: -- I opened ARROW-13158 (and a

[jira] [Updated] (ARROW-13158) [Python] Fix repr and contains of StructScalar with duplicate field names

2021-06-24 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-13158: --- Labels: pull-request-available (was: ) > [Python] Fix repr and contains of StructScalar wit

[jira] [Updated] (ARROW-13141) [C++][Python] HadoopFileSystem: automatically set CLASSPATH based on HADOOP_HOME env variable?

2021-06-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche updated ARROW-13141: -- Fix Version/s: 5.0.0 > [C++][Python] HadoopFileSystem: automatically set CLASS

[jira] [Assigned] (ARROW-13141) [C++][Python] HadoopFileSystem: automatically set CLASSPATH based on HADOOP_HOME env variable?

2021-06-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-13141: - Assignee: Joris Van den Bossche > [C++][Python] HadoopFileSystem: autom

[jira] [Commented] (ARROW-13141) [C++][Python] HadoopFileSystem: automatically set CLASSPATH based on HADOOP_HOME env variable?

2021-06-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368704#comment-17368704 ] Joris Van den Bossche commented on ARROW-13141: --- OK, I will do a quick PR

[jira] [Assigned] (ARROW-13158) [Python] Fix repr and contains of StructScalar with duplicate field names

2021-06-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13158?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Joris Van den Bossche reassigned ARROW-13158: - Assignee: Joris Van den Bossche > [Python] Fix repr and contains of Str

[jira] [Created] (ARROW-13158) [Python] Fix repr and contains of StructScalar with duplicate field names

2021-06-24 Thread Joris Van den Bossche (Jira)
Joris Van den Bossche created ARROW-13158: - Summary: [Python] Fix repr and contains of StructScalar with duplicate field names Key: ARROW-13158 URL: https://issues.apache.org/jira/browse/ARROW-13158

[jira] [Commented] (ARROW-9997) [Python] StructScalar.as_py() fails if the type has duplicate field names

2021-06-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368691#comment-17368691 ] Joris Van den Bossche commented on ARROW-9997: -- Small correction: you can al

[jira] [Commented] (ARROW-9997) [Python] StructScalar.as_py() fails if the type has duplicate field names

2021-06-24 Thread Joris Van den Bossche (Jira)
[ https://issues.apache.org/jira/browse/ARROW-9997?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17368688#comment-17368688 ] Joris Van den Bossche commented on ARROW-9997: -- To restate from my last comm

[jira] [Updated] (ARROW-13151) [Python] Unable to read column of `list>`

2021-06-24 Thread Nic Crane (Jira)
[ https://issues.apache.org/jira/browse/ARROW-13151?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Nic Crane updated ARROW-13151: -- Summary: [Python] Unable to read column of `list>` (was: Unable to read column of `list>`) > [Python