Re: What's a good starting point?

2020-07-08 Thread Neal Richardson
Hi Chris, some additional thoughts to what Antoine said. Neal On Wed, Jul 8, 2020 at 10:56 AM Antoine Pitrou wrote: > > Hi Chris, > > Le 08/07/2020 à 12:01, Chris Channing a écrit : > > > > I've looked at the contribution guidelines, but rather than arbitrarily > > picking a jira I was hoping

How to specify number of partitions?

2020-07-08 Thread Yash Ganthe
Hi, parquet_writer.write_table(table) This line writes a single file. The documentation says: This creates a single Parquet file. In practice, a Parquet dataset may consist of many files in many directories. We can read a single file back with read_table: Is there a way for PyArrow to create a

Re: What's a good starting point?

2020-07-08 Thread Antoine Pitrou
Hi Chris, Le 08/07/2020 à 12:01, Chris Channing a écrit : > > I've looked at the contribution guidelines, but rather than arbitrarily > picking a jira I was hoping that there was a more structured approach for > newbies documented that I might have missed. A few questions that I have > are:

Re: Arrow sync call July 8 at 12:00 US/Eastern, 16:00 UTC

2020-07-08 Thread Neal Richardson
Attendees: * Projjal Chanda * Rémi Dettai * Supun Kamburugamuve * Ben Kietzman * Steve Kim * Uwe Korn * Micah Kornfield * Rok Mihevc * Patrick Pai * Niranda Perera * Neal Richardson * Joris Van den Bossche Discussion: * 1.0 issues * Custom build of pyarrow questions: directed to mailing list for

Re: [DRAFT] Arrow Board Report July 2020

2020-07-08 Thread Uwe L. Korn
Happy with the current version. I think this gives enough input for the board. We have so much things happening that are much better presented in the process of the 1.0 release. On Wed, Jul 8, 2020, at 12:52 AM, Micah Kornfield wrote: > Worth mentioning the website work? > > On Tue, Jul 7,

Re: [RESULT] [VOTE] Increment MetadataVersion in Schema.fbs from V4 to V5 for 1.0.0 release

2020-07-08 Thread Wes McKinney
Hi folks, This issue is probably the one true "blocker" for the 1.0.0 release. Ideally, all libraries should emit V5 MetadataVersion by default. How V4 handled depends on the willingness to implement compatibility code: * Since V4 is backwards compatible with V5 (except for unions), libraries

[NIGHTLY] Arrow Build Report for Job nightly-2020-07-08-0

2020-07-08 Thread Crossbow
Arrow Build Report for Job nightly-2020-07-08-0 All tasks: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-07-08-0 Failed Tasks: - debian-buster-arm64: URL: https://github.com/ursa-labs/crossbow/branches/all?query=nightly-2020-07-08-0-travis-debian-buster-arm64 -

What's a good starting point?

2020-07-08 Thread Chris Channing
Hey Folks, I'd like to get involved with the project from a development perspective but I'm looking for some guidance on where/how best to start. I've looked at the contribution guidelines, but rather than arbitrarily picking a jira I was hoping that there was a more structured approach for