[jira] [Created] (ARROW-8370) [C++] Add Result to type / schema APIs

2020-04-08 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-8370: - Summary: [C++] Add Result to type / schema APIs Key: ARROW-8370 URL: https://issues.apache.org/jira/browse/ARROW-8370 Project: Apache Arrow Issue Type: Sub

Re: [DRAFT] Arrow Board Report April 2020

2020-04-08 Thread Wes McKinney
The report is due today. Are there any more comments? On Sat, Apr 4, 2020 at 4:08 PM Wes McKinney wrote: > > ## Description: > > The mission of Apache Arrow is the creation and maintenance of software > related > to columnar in-memory processing and data interchange. The project has some > level

Re: [DRAFT] Arrow Board Report April 2020

2020-04-08 Thread Antoine Pitrou
Is it worth mentioning the OSS-Fuzz integration (and "success story")? Le 08/04/2020 à 15:05, Wes McKinney a écrit : > The report is due today. Are there any more comments? > > On Sat, Apr 4, 2020 at 4:08 PM Wes McKinney wrote: >> >> ## Description: >> >> The mission of Apache Arrow is the cre

Re: [DRAFT] Arrow Board Report April 2020

2020-04-08 Thread Wes McKinney
Yes, definitely, can you propose a paragraph for the Project Activity section? On Wed, Apr 8, 2020 at 8:10 AM Antoine Pitrou wrote: > > > Is it worth mentioning the OSS-Fuzz integration (and "success story")? > > Le 08/04/2020 à 15:05, Wes McKinney a écrit : > > The report is due today. Are there

Re: [DRAFT] Arrow Board Report April 2020

2020-04-08 Thread Antoine Pitrou
- Three months ago, Apache Arrow was accepted for continuous fuzzing in the OSS-Fuzz infrastructure. We have now finally stabilized the situation by fixing all detected issues in the Arrow C++ IPC implementation, and are actively fixing issues in the Arrow C++ Parquet reader. (XXX not sure Parq

Re: [DRAFT] Arrow Board Report April 2020

2020-04-08 Thread Wes McKinney
Sounds good. I think it's fine to mention Parquet since presumably some issues will be fixed that are relevant to Arrow users that don't affect other kinds of Parquet users. On Wed, Apr 8, 2020 at 8:29 AM Antoine Pitrou wrote: > > > - Three months ago, Apache Arrow was accepted for continuous fu

[jira] [Created] (ARROW-8371) [Crossbow] Implement and exercise sanity checks for tasks.yml

2020-04-08 Thread Krisztian Szucs (Jira)
Krisztian Szucs created ARROW-8371: -- Summary: [Crossbow] Implement and exercise sanity checks for tasks.yml Key: ARROW-8371 URL: https://issues.apache.org/jira/browse/ARROW-8371 Project: Apache Arro

[jira] [Created] (ARROW-8372) [C++] Add Result to table / record batch APIs

2020-04-08 Thread Antoine Pitrou (Jira)
Antoine Pitrou created ARROW-8372: - Summary: [C++] Add Result to table / record batch APIs Key: ARROW-8372 URL: https://issues.apache.org/jira/browse/ARROW-8372 Project: Apache Arrow Issue Ty

[jira] [Created] (ARROW-8373) [GLib] Problems resolving gobject-introspection, arrow in Meson builds

2020-04-08 Thread Wes McKinney (Jira)
Wes McKinney created ARROW-8373: --- Summary: [GLib] Problems resolving gobject-introspection, arrow in Meson builds Key: ARROW-8373 URL: https://issues.apache.org/jira/browse/ARROW-8373 Project: Apache Ar

Re: [Python] black vs. autopep8

2020-04-08 Thread Wes McKinney
Circling back on this, it seems there isn't consensus about switching to Black, and using autopep8 at least will give us an easy way to maintain PEP8 compliance and help contributors fix linting failures detected by flake8 (but not all, e.g. unused imports would need to be manually removed). Would

Re: [Python] black vs. autopep8

2020-04-08 Thread Neal Richardson
So autopep8 doesn't fix everything? Sounds inferior to me. That said, I'm in favor of any resolution that increases our automation of this and decreases the energy we expend debating it. Neal On Wed, Apr 8, 2020 at 10:34 AM Wes McKinney wrote: > Circling back on this, it seems there isn't cons

Re: [Python] black vs. autopep8

2020-04-08 Thread Wes McKinney
On Wed, Apr 8, 2020 at 12:47 PM Neal Richardson wrote: > > So autopep8 doesn't fix everything? Sounds inferior to me. That said, I'm > in favor of any resolution that increases our automation of this and > decreases the energy we expend debating it. It does fix everything, where "everything" is c

[jira] [Created] (ARROW-8374) [R] Table to vector of DictonaryType will error when Arrays don't have the same Dictionary per array

2020-04-08 Thread Francois Saint-Jacques (Jira)
Francois Saint-Jacques created ARROW-8374: - Summary: [R] Table to vector of DictonaryType will error when Arrays don't have the same Dictionary per array Key: ARROW-8374 URL: https://issues.apache.org/jira

Re: [Python] black vs. autopep8

2020-04-08 Thread Rok Mihevc
Could we 'tone down' black to get the desired behavior? I'm ok with either tool. Rok On Wed, Apr 8, 2020 at 8:00 PM Wes McKinney wrote: > On Wed, Apr 8, 2020 at 12:47 PM Neal Richardson > wrote: > > > > So autopep8 doesn't fix everything? Sounds inferior to me. That said, I'm > > in favor of a

Re: [Python] black vs. autopep8

2020-04-08 Thread Wes McKinney
I don't think it's possible unfortunately. From the README: "Black reformats entire files in place. It is not configurable." The main concern about Black is the impact that it has on readability. I share this concern as the subjective style choices it makes are quite different from the way I've be

Re: [DRAFT] Arrow Board Report April 2020

2020-04-08 Thread Wes McKinney
Here's the updated board report. I updated it to confirm that Buildkite is indeed finally working on the @apache organization ## Description: The mission of Apache Arrow is the creation and maintenance of software related to columnar in-memory processing and data interchange. The project has some

Re: [Python] black vs. autopep8

2020-04-08 Thread Xinbin Huang
Another option that we can look into is yapf ( https://github.com/google/yapf). It is similar to black but more tweakable. Also, it is recently adopted by the Apache Beam project. PR is here https://github.com/apache/beam/pull/10684/files Bin On Wed, Apr 8, 2020 at 1:18 PM Wes McKinney wrote: >

[jira] [Created] (ARROW-8375) [CI][R] Make Windows tests more verbose in case of segfault

2020-04-08 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-8375: -- Summary: [CI][R] Make Windows tests more verbose in case of segfault Key: ARROW-8375 URL: https://issues.apache.org/jira/browse/ARROW-8375 Project: Apache Arrow

Re: [C++] Compute: Datum and "ChunkedArray&" inputs

2020-04-08 Thread Wes McKinney
Another idea would be to have a variant with const-references instead of shared_ptr. One potential issue with our Datum is that it plays the dual role of transporting both input and output arguments. With outputs it's necessary to be able to convey ownership while with inputs this is less important

[jira] [Created] (ARROW-8376) [R] Add experimental interface to ScanTask/RecordBatch iterators

2020-04-08 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-8376: -- Summary: [R] Add experimental interface to ScanTask/RecordBatch iterators Key: ARROW-8376 URL: https://issues.apache.org/jira/browse/ARROW-8376 Project: Apache Ar

Re: [C++] Compute: Datum and "ChunkedArray&" inputs

2020-04-08 Thread Wes McKinney
On Wed, Apr 8, 2020 at 4:01 PM Wes McKinney wrote: > > Another idea would be to have a variant with const-references instead > of shared_ptr. One potential issue with our Datum is that it plays the > dual role of transporting both input and output arguments. With > outputs it's necessary to be abl

[jira] [Created] (ARROW-8377) [CI][C++][R] Build and run C++ tests on Rtools build

2020-04-08 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-8377: -- Summary: [CI][C++][R] Build and run C++ tests on Rtools build Key: ARROW-8377 URL: https://issues.apache.org/jira/browse/ARROW-8377 Project: Apache Arrow

[jira] [Created] (ARROW-8378) [Python] "empty" dtype metadata leads to wrong Parquet column type

2020-04-08 Thread Diego Argueta (Jira)
Diego Argueta created ARROW-8378: Summary: [Python] "empty" dtype metadata leads to wrong Parquet column type Key: ARROW-8378 URL: https://issues.apache.org/jira/browse/ARROW-8378 Project: Apache Arro

[jira] [Created] (ARROW-8379) [R] Investigate/fix thread safety issues (esp. Windows)

2020-04-08 Thread Neal Richardson (Jira)
Neal Richardson created ARROW-8379: -- Summary: [R] Investigate/fix thread safety issues (esp. Windows) Key: ARROW-8379 URL: https://issues.apache.org/jira/browse/ARROW-8379 Project: Apache Arrow