[jira] [Updated] (ARROW-3133) [C++] Logical boolean kernels in kernels/boolean.cc cannot write into preallocated memory
[ https://issues.apache.org/jira/browse/ARROW-3133?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-3133: -- Labels: pull-request-available (was: ) > [C++] Logical boolean kernels in kernels/boolean.cc cannot write into > preallocated memory > - > > Key: ARROW-3133 > URL: https://issues.apache.org/jira/browse/ARROW-3133 > Project: Apache Arrow > Issue Type: Bug > Components: C++ >Reporter: Wes McKinney >Assignee: Micah Kornfield >Priority: Major > Labels: pull-request-available > Fix For: 0.13.0 > > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (ARROW-2038) [Python] Follow-up bug fixes for s3fs Parquet support
[ https://issues.apache.org/jira/browse/ARROW-2038?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-2038: -- Labels: aws parquet pull-request-available (was: aws parquet) > [Python] Follow-up bug fixes for s3fs Parquet support > - > > Key: ARROW-2038 > URL: https://issues.apache.org/jira/browse/ARROW-2038 > Project: Apache Arrow > Issue Type: Bug > Components: Python >Reporter: Wes McKinney >Priority: Major > Labels: aws, parquet, pull-request-available > Fix For: 0.13.0 > > > see discussion in > https://github.com/apache/arrow/pull/916#issuecomment-360558248 -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Resolved] (ARROW-4132) [GLib] Add more GArrowTable constructors
[ https://issues.apache.org/jira/browse/ARROW-4132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Yosuke Shiro resolved ARROW-4132. - Resolution: Fixed Fix Version/s: 0.12.0 Issue resolved by pull request 3285 [https://github.com/apache/arrow/pull/3285] > [GLib] Add more GArrowTable constructors > > > Key: ARROW-4132 > URL: https://issues.apache.org/jira/browse/ARROW-4132 > Project: Apache Arrow > Issue Type: New Feature > Components: GLib >Reporter: Kouhei Sutou >Assignee: Kouhei Sutou >Priority: Major > Labels: pull-request-available > Fix For: 0.12.0 > > Time Spent: 10m > Remaining Estimate: 0h > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Commented] (ARROW-1983) [Python] Add ability to write parquet `_metadata` file
[ https://issues.apache.org/jira/browse/ARROW-1983?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16730844#comment-16730844 ] Matthew Rocklin commented on ARROW-1983: > If I understand correctly, we need to combine all of the row group metadata > for all files in a directory. Yes. Ideally when writing a row group we would get some metadata object in memory. We would then collect all of those objects and hand them to some `write_metadata` function afterwards. > When a new file is written, does this file have to be updated? Yes, or it can be removed/invalidated. As a side note, this is probably one of a small number of issues that stop Dask Dataframe from using PyArrow by default. Metadata files with full row group information are especially valuable for us, particularly with remote/cloud storage. (I'm going through Dask's parquet handling now) > [Python] Add ability to write parquet `_metadata` file > -- > > Key: ARROW-1983 > URL: https://issues.apache.org/jira/browse/ARROW-1983 > Project: Apache Arrow > Issue Type: Improvement > Components: Python >Reporter: Jim Crist >Assignee: Robert Gruener >Priority: Major > Labels: beginner, parquet > Fix For: 0.13.0 > > > Currently {{pyarrow.parquet}} can only write the {{_common_metadata}} file > (mostly just schema information). It would be useful to add the ability to > write a {{_metadata}} file as well. This should include information about > each row group in the dataset, including summary statistics. Having this > summary file would allow filtering of row groups without needing to access > each file beforehand. > This would require that the user is able to get the written RowGroups out of > a {{pyarrow.parquet.write_table}} call and then give these objects as a list > to new function that then passes them on as C++ objects to {{parquet-cpp}} > that generates the respective {{_metadata}} file. -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (ARROW-4128) [C++][DOCUMENTATION] Update style guide to reflect some more exceptions
[ https://issues.apache.org/jira/browse/ARROW-4128?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-4128: -- Labels: pull-request-available (was: ) > [C++][DOCUMENTATION] Update style guide to reflect some more exceptions > --- > > Key: ARROW-4128 > URL: https://issues.apache.org/jira/browse/ARROW-4128 > Project: Apache Arrow > Issue Type: Bug > Components: C++, Documentation >Reporter: Micah Kornfield >Assignee: Micah Kornfield >Priority: Minor > Labels: pull-request-available > > * It seems we are using doxygen style comments for method/class declaration > ("///") > * Use the macro NULLPTR ([https://arrow.apache.org/docs/cpp/macros_8h.html] -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Updated] (ARROW-4132) [GLib] Add more GArrowTable constructors
[ https://issues.apache.org/jira/browse/ARROW-4132?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated ARROW-4132: -- Labels: pull-request-available (was: ) > [GLib] Add more GArrowTable constructors > > > Key: ARROW-4132 > URL: https://issues.apache.org/jira/browse/ARROW-4132 > Project: Apache Arrow > Issue Type: New Feature > Components: GLib >Reporter: Kouhei Sutou >Assignee: Kouhei Sutou >Priority: Major > Labels: pull-request-available > -- This message was sent by Atlassian JIRA (v7.6.3#76005)
[jira] [Created] (ARROW-4132) [GLib] Add more GArrowTable constructors
Kouhei Sutou created ARROW-4132: --- Summary: [GLib] Add more GArrowTable constructors Key: ARROW-4132 URL: https://issues.apache.org/jira/browse/ARROW-4132 Project: Apache Arrow Issue Type: New Feature Components: GLib Reporter: Kouhei Sutou Assignee: Kouhei Sutou -- This message was sent by Atlassian JIRA (v7.6.3#76005)