[jira] [Resolved] (ARROW-17470) [CI][GLib] Add more system packages to sync the upstream PKGBUILD

2022-08-19 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17470?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-17470. -- Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 13917

[jira] [Resolved] (ARROW-17478) [C++][Java] Update ORC to 1.7.6

2022-08-19 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou resolved ARROW-17478. -- Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 13926

[jira] [Updated] (ARROW-17478) [C++][Java] Update ORC to 1.7.6

2022-08-19 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-17478: - Summary: [C++][Java] Update ORC to 1.7.6 (was: Update ORC to 1.7.6) > [C++][Java] Update ORC

[jira] [Updated] (ARROW-17478) [C++][Java] Update ORC to 1.7.6

2022-08-19 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou updated ARROW-17478: - Affects Version/s: 9.0.0 (was: 10.0.0) > [C++][Java] Update ORC to

[jira] [Commented] (ARROW-17484) [C++] Substrait to Arrow Aggregate doesn't take the provided Output Type for aggregates

2022-08-19 Thread Vibhatha Lakmal Abeykoon (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17582110#comment-17582110 ] Vibhatha Lakmal Abeykoon commented on ARROW-17484: -- {code:json} "relations": [{

[jira] [Commented] (ARROW-17484) [C++] Substrait to Arrow Aggregate doesn't take the provided Output Type for aggregates

2022-08-19 Thread Vibhatha Lakmal Abeykoon (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17484?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17582109#comment-17582109 ] Vibhatha Lakmal Abeykoon commented on ARROW-17484: -- cc [~westonpace] > [C++] Substrait

[jira] [Created] (ARROW-17484) [C++] Substrait to Arrow Aggregate doesn't take the provided Output Type for aggregates

2022-08-19 Thread Vibhatha Lakmal Abeykoon (Jira)
Vibhatha Lakmal Abeykoon created ARROW-17484: Summary: [C++] Substrait to Arrow Aggregate doesn't take the provided Output Type for aggregates Key: ARROW-17484 URL:

[jira] [Commented] (ARROW-17457) [C++] Substrait End-To-End Tests for Relations

2022-08-19 Thread Vibhatha Lakmal Abeykoon (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17582108#comment-17582108 ] Vibhatha Lakmal Abeykoon commented on ARROW-17457: -- So it would be better to improve

[jira] [Created] (ARROW-17483) Support for 'pa.compute.Expression' in filter argument to 'pa.read_table'

2022-08-19 Thread Patrik Kjærran (Jira)
Patrik Kjærran created ARROW-17483: -- Summary: Support for 'pa.compute.Expression' in filter argument to 'pa.read_table' Key: ARROW-17483 URL: https://issues.apache.org/jira/browse/ARROW-17483

[jira] [Commented] (ARROW-8163) [C++][Dataset] Allow FileSystemDataset's file list to be lazy

2022-08-19 Thread Aldrin Montana (Jira)
[ https://issues.apache.org/jira/browse/ARROW-8163?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17582080#comment-17582080 ] Aldrin Montana commented on ARROW-8163: --- made a comment in ARROW-17306 about a minor PR, but I

[jira] [Commented] (ARROW-17306) [C++] Provide an optimized `GetFileInfoGenerator` specialization for `LocalFileSystem`

2022-08-19 Thread Aldrin Montana (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17306?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17582079#comment-17582079 ] Aldrin Montana commented on ARROW-17306: I get a build error for localfs_benchmark.cc here's

[jira] [Updated] (ARROW-17481) [C++] [Python] Major performance improvements to CSV reading from S3

2022-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17481: --- Labels: pull-request-available (was: ) > [C++] [Python] Major performance improvements to

[jira] [Resolved] (ARROW-12958) [CI][Developer] Build + host the docs for PR branches

2022-08-19 Thread Rok Mihevc (Jira)
[ https://issues.apache.org/jira/browse/ARROW-12958?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Rok Mihevc resolved ARROW-12958. Resolution: Fixed Issue resolved by pull request 13913

[jira] [Updated] (ARROW-17481) Major performance improvements to CSV reading from S3

2022-08-19 Thread Ziheng Wang (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ziheng Wang updated ARROW-17481: Description: The current dataset reader for CSV is pretty slow on EC2 reading from S3. EC2

[jira] [Updated] (ARROW-17481) [C++] [Python] Major performance improvements to CSV reading from S3

2022-08-19 Thread Ziheng Wang (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Ziheng Wang updated ARROW-17481: Summary: [C++] [Python] Major performance improvements to CSV reading from S3 (was: Major

[jira] [Updated] (ARROW-17482) [Go] Remove ValueDescr types

2022-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17482?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17482: --- Labels: pull-request-available (was: ) > [Go] Remove ValueDescr types >

[jira] [Created] (ARROW-17482) [Go] Remove ValueDescr types

2022-08-19 Thread Matthew Topol (Jira)
Matthew Topol created ARROW-17482: - Summary: [Go] Remove ValueDescr types Key: ARROW-17482 URL: https://issues.apache.org/jira/browse/ARROW-17482 Project: Apache Arrow Issue Type: Sub-task

[jira] [Created] (ARROW-17481) Major performance improvements to CSV reading from S3

2022-08-19 Thread Ziheng Wang (Jira)
Ziheng Wang created ARROW-17481: --- Summary: Major performance improvements to CSV reading from S3 Key: ARROW-17481 URL: https://issues.apache.org/jira/browse/ARROW-17481 Project: Apache Arrow

[jira] [Updated] (ARROW-17479) [Go] Add ArraySpan and utilities for compute

2022-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17479?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17479: --- Labels: pull-request-available (was: ) > [Go] Add ArraySpan and utilities for compute >

[jira] [Assigned] (ARROW-17478) Update ORC to 1.7.6

2022-08-19 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned ARROW-17478: - Assignee: William Hyun (was: Dongjoon Hyun) > Update ORC to 1.7.6 >

[jira] [Assigned] (ARROW-17478) Update ORC to 1.7.6

2022-08-19 Thread Dongjoon Hyun (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dongjoon Hyun reassigned ARROW-17478: - Assignee: Dongjoon Hyun (was: William Hyun) > Update ORC to 1.7.6 >

[jira] [Commented] (ARROW-15006) [Python][Doc] Iteratively enable more numpydoc checks

2022-08-19 Thread Bryce Mecum (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15006?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17582012#comment-17582012 ] Bryce Mecum commented on ARROW-15006: - Makes sense and seems pragmatic. I'll start with a patch just

[jira] [Assigned] (ARROW-15006) [Python][Doc] Iteratively enable more numpydoc checks

2022-08-19 Thread Bryce Mecum (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15006?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bryce Mecum reassigned ARROW-15006: --- Assignee: Bryce Mecum > [Python][Doc] Iteratively enable more numpydoc checks >

[jira] [Commented] (ARROW-17461) [R] Table viewer for knitr/notebooks

2022-08-19 Thread Dewey Dunnington (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17582009#comment-17582009 ] Dewey Dunnington commented on ARROW-17461: -- I'll look into this again because it may have

[jira] [Updated] (ARROW-17480) [Java] add setNull() to ValueVector interface

2022-08-19 Thread Larry White (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17480?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Larry White updated ARROW-17480: Labels: Java (was: ) > [Java] add setNull() to ValueVector interface >

[jira] [Created] (ARROW-17480) [Java] add setNull() to ValueVector interface

2022-08-19 Thread Larry White (Jira)
Larry White created ARROW-17480: --- Summary: [Java] add setNull() to ValueVector interface Key: ARROW-17480 URL: https://issues.apache.org/jira/browse/ARROW-17480 Project: Apache Arrow Issue

[jira] [Created] (ARROW-17479) [Go] Add ArraySpan and utilities for compute

2022-08-19 Thread Matthew Topol (Jira)
Matthew Topol created ARROW-17479: - Summary: [Go] Add ArraySpan and utilities for compute Key: ARROW-17479 URL: https://issues.apache.org/jira/browse/ARROW-17479 Project: Apache Arrow Issue

[jira] [Updated] (ARROW-17478) Update ORC to 1.7.6

2022-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17478?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17478: --- Labels: pull-request-available (was: ) > Update ORC to 1.7.6 > --- > >

[jira] [Created] (ARROW-17478) Update ORC to 1.7.6

2022-08-19 Thread William Hyun (Jira)
William Hyun created ARROW-17478: Summary: Update ORC to 1.7.6 Key: ARROW-17478 URL: https://issues.apache.org/jira/browse/ARROW-17478 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-17477) [CI][Docs] Document Docs PR Preview

2022-08-19 Thread Jacob Wujciak-Jens (Jira)
Jacob Wujciak-Jens created ARROW-17477: -- Summary: [CI][Docs] Document Docs PR Preview Key: ARROW-17477 URL: https://issues.apache.org/jira/browse/ARROW-17477 Project: Apache Arrow Issue

[jira] [Updated] (ARROW-15481) [R] [CI] Add a crossbow job that mimics CRAN's old macOS

2022-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15481?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-15481: --- Labels: pull-request-available (was: ) > [R] [CI] Add a crossbow job that mimics CRAN's

[jira] [Assigned] (ARROW-17447) [Docs] Clarify processes for first-time contributors

2022-08-19 Thread Todd Farmer (TEST) (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Farmer (TEST) reassigned ARROW-17447: -- Assignee: Todd Farmer (TEST) > [Docs] Clarify processes for first-time

[jira] [Assigned] (ARROW-17447) [Docs] Clarify processes for first-time contributors

2022-08-19 Thread Todd Farmer (TEST) (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Todd Farmer (TEST) reassigned ARROW-17447: -- Assignee: (was: Todd Farmer (TEST)) > [Docs] Clarify processes for

[jira] [Commented] (ARROW-17439) [R] pull() should compute() not collect()

2022-08-19 Thread SHIMA Tatsuya (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17439?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17581935#comment-17581935 ] SHIMA Tatsuya commented on ARROW-17439: --- Note that in dbplyr and dtplyr, pull returns a vector in

[jira] [Updated] (ARROW-17475) [Go] Function Interface and Registry implementation

2022-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17475?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17475: --- Labels: pull-request-available (was: ) > [Go] Function Interface and Registry

[jira] [Updated] (ARROW-17476) [Release][Packaging] Make binary uploader reusable from datafusion-c

2022-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17476?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17476: --- Labels: pull-request-available (was: ) > [Release][Packaging] Make binary uploader

[jira] [Created] (ARROW-17476) [Release][Packaging] Make binary uploader reusable from datafusion-c

2022-08-19 Thread Kouhei Sutou (Jira)
Kouhei Sutou created ARROW-17476: Summary: [Release][Packaging] Make binary uploader reusable from datafusion-c Key: ARROW-17476 URL: https://issues.apache.org/jira/browse/ARROW-17476 Project: Apache

[jira] [Created] (ARROW-17475) [Go] Function Interface and Registry implementation

2022-08-19 Thread Matthew Topol (Jira)
Matthew Topol created ARROW-17475: - Summary: [Go] Function Interface and Registry implementation Key: ARROW-17475 URL: https://issues.apache.org/jira/browse/ARROW-17475 Project: Apache Arrow

[jira] [Resolved] (ARROW-17467) [Go] Aligned Bitmap Ops mess up the final byte if no trailing bits

2022-08-19 Thread Matthew Topol (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17467?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthew Topol resolved ARROW-17467. --- Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 13915

[jira] [Commented] (ARROW-17457) [C++] Substrait End-To-End Tests for Relations

2022-08-19 Thread Weston Pace (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17457?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17581854#comment-17581854 ] Weston Pace commented on ARROW-17457: - It is a bit spread out but we have some end-to-end tests on

[jira] [Updated] (ARROW-17429) [R] Error messages are not helpful of read_csv_arrow with col_types option

2022-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17429: --- Labels: pull-request-available (was: ) > [R] Error messages are not helpful of

[jira] [Assigned] (ARROW-17429) [R] Error messages are not helpful of read_csv_arrow with col_types option

2022-08-19 Thread SHIMA Tatsuya (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17429?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] SHIMA Tatsuya reassigned ARROW-17429: - Assignee: SHIMA Tatsuya > [R] Error messages are not helpful of read_csv_arrow with

[jira] [Commented] (ARROW-17429) [R] Error messages are not helpful of read_csv_arrow with col_types option

2022-08-19 Thread SHIMA Tatsuya (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17429?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17581850#comment-17581850 ] SHIMA Tatsuya commented on ARROW-17429: --- This issue appears to have been introduced by

[jira] [Commented] (ARROW-15481) [R] [CI] Add a crossbow job that mimics CRAN's old macOS

2022-08-19 Thread Jacob Wujciak-Jens (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15481?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17581831#comment-17581831 ] Jacob Wujciak-Jens commented on ARROW-15481: Technically this has been done with:

[jira] [Resolved] (ARROW-16754) [Java] StructVector's child vectors get unexpectedly reordered after adding vectors with duplicated fields

2022-08-19 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-16754?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-16754. -- Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 13321

[jira] [Resolved] (ARROW-17430) [Java] ListBinder to bind Arrow List type to DB column

2022-08-19 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li resolved ARROW-17430. -- Fix Version/s: 10.0.0 Resolution: Fixed Issue resolved by pull request 13906

[jira] [Assigned] (ARROW-17430) [Java] ListBinder to bind Arrow List type to DB column

2022-08-19 Thread David Li (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17430?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] David Li reassigned ARROW-17430: Assignee: Igor Suhorukov > [Java] ListBinder to bind Arrow List type to DB column >

[jira] [Updated] (ARROW-17449) [Python] Better repr for Buffer, MemoryPool and friends

2022-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17449?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17449: --- Labels: good-first-issue good-second-issue pull-request-available (was: good-first-issue

[jira] [Created] (ARROW-17474) Pandas read_parquet fails on pyarrow level

2022-08-19 Thread Amir Aliev (Jira)
Amir Aliev created ARROW-17474: -- Summary: Pandas read_parquet fails on pyarrow level Key: ARROW-17474 URL: https://issues.apache.org/jira/browse/ARROW-17474 Project: Apache Arrow Issue Type:

[jira] [Assigned] (ARROW-17451) [CI][Java] Move away from Debian 9

2022-08-19 Thread Kouhei Sutou (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Kouhei Sutou reassigned ARROW-17451: Assignee: Kouhei Sutou > [CI][Java] Move away from Debian 9 >

[jira] [Updated] (ARROW-17451) [CI][Java] Move away from Debian 9

2022-08-19 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/ARROW-17451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-17451: --- Labels: pull-request-available (was: ) > [CI][Java] Move away from Debian 9 >

[jira] [Assigned] (ARROW-15409) [C++] The C++ API for writing datasets could be improved

2022-08-19 Thread Alvin Chunga Mamani (Jira)
[ https://issues.apache.org/jira/browse/ARROW-15409?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Alvin Chunga Mamani reassigned ARROW-15409: --- Assignee: Alvin Chunga Mamani > [C++] The C++ API for writing datasets