Re: Policy on access to ursacomputing/crossbow?

2024-05-29 Thread Jonathan Keane
> Regards, > Raúl > > On Sat, 25 May 2024 at 2:02, Jonathan Keane () > wrote: > > > > Over my time with the project I've had access to the github repository > > ursacomputing/crossbow to be able to manually trigger crossbow jobs. I > find > > it in

Policy on access to ursacomputing/crossbow?

2024-05-24 Thread Jonathan Keane
Over my time with the project I've had access to the github repository ursacomputing/crossbow to be able to manually trigger crossbow jobs. I find it incredibly helpful when working on the extended R CI to be able to iterate more quickly than waiting for the comment bot. But also over the time

Re: [ANNOUNCE] New Arrow committer: Bryce Mecum

2024-03-18 Thread Jonathan Keane
Congrats and welcome, Bryce. -Jon On Mon, Mar 18, 2024 at 6:47 AM Antoine Pitrou wrote: > > Congratulations Bryce, and keep up the good work! > > Regards > > Antoine. > > On 18/03/2024 at 03:21, Nic Crane wrote: > > On behalf of the Arrow PMC, I'm happy to announce that Bryce Mecum has > >

Re: New tag for releases for R-universe

2024-02-10 Thread Jonathan Keane
Thanks for this, Nic. And just to clarify: the latest here is the latest _release_ of Apache Arrow with this new setup. Prior to this, the builds available on R-universe were effectively dev builds (commits to main), but with this new tag, R-universe will only have (or at least default to having)

Re: [ANNOUNCE] New Arrow PMC member: Raúl Cumplido

2023-11-13 Thread Jonathan Keane
Congratulations and welcome! -Jon

Re: [ANNOUNCE] New Arrow PMC member: Jonathan Keane

2023-10-23 Thread Jonathan Keane
> > > Congratulations, Jonathan! > > > > > > From: Dane Pitkin > > Sent: Monday, October 16, 2023 11:52 AM > > To: dev@arrow.apache.org > > Subject: Re: [ANNOUNCE] New Arrow PMC member: Jonathan Keane > > >

Re: Help regarding setting up the r package in arrow apache

2023-10-20 Thread Jonathan Keane
> > > [1] https://arrow.apache.org/docs/r/articles/developers/docker.html > > > > On Fri, 20 Oct 2023 at 08:13, Divyansh Khatri < > divyanshkhatri...@gmail.com > > > > > wrote: > > > > > please see this and help me resolve the issue > >

Re: [VOTE][Format] C data interface format strings for Utf8View and BinaryView

2023-10-18 Thread Jonathan Keane
+1 -Jon On Wed, Oct 18, 2023 at 2:26 PM Felipe Oliveira Carvalho < felipe...@gmail.com> wrote: > +1 > > On Wed, Oct 18, 2023 at 2:49 PM Dewey Dunnington > wrote: > > > +1! > > > > On Wed, Oct 18, 2023 at 2:14 PM Matt Topol > wrote: > > > > > > +1 > > > > > > On Wed, Oct 18, 2023 at 1:05 PM

Re: Help regarding setting up the r package in arrow apache

2023-10-18 Thread Jonathan Keane
For development of the R package with docker containers, the link [1] that Nic sent in this same thread is the place to go. In addition to that docker-focused one, there are a handful of others that might prove useful to you in getting your development environment set up [2]. If you run into any

Re: [Vote][Format] (new proposal) C data interface format string for ListView and LargeListView arrays

2023-10-07 Thread Jonathan Keane
+1 -Jon On Sat, Oct 7, 2023 at 3:54 AM Joris Van den Bossche < jorisvandenboss...@gmail.com> wrote: > +1 > > On Sat, 7 Oct 2023 at 10:44, Antoine Pitrou wrote: > > > > > > +1 from me. > > > > But I also reiterate my plea that these existing parsers get fixed so as > > to entirely validate the

Re: [Python][Discuss] PyArrow Dataset as a Python protocol

2023-06-28 Thread Jonathan Keane
> I would understand this objection more if DuckDB hasn't been relying on > being able to pass PyArrow expressions for 18 months now [1]. Unless, do we > just think this isn't widely used enough that we don't care? This isn't a pro or a con of specifically adopting the PyArrow expression

Re: [VOTE] Move issue tracking to GitHub Issues

2022-10-26 Thread Jonathan Keane
+1, I'm very glad to see what will hopefully be a _slightly smoother_ experience for new contributors + issue reporters -Jon On Wed, Oct 26, 2022 at 7:05 PM David Li wrote: > +1 > > On Wed, Oct 26, 2022, at 20:01, Andy Grove wrote: > > +1 > > > > On Wed, Oct 26, 2022 at 5:50 PM L. C. Hsieh

Re: [ANNOUNCE] New Arrow PMC member: Nicola Crane

2022-10-25 Thread Jonathan Keane
Congratulations! Your contributions to the project have been immeasurable. -Jon On Tue, Oct 25, 2022 at 8:12 PM Vibhatha Abeykoon wrote: > Congrats Nic! > > On Wed, Oct 26, 2022 at 5:30 AM Ashish wrote: > > > Congrats ! > > > > On Wednesday, October 26, 2022, Anja wrote: > > > > >

Re: [VOTE] Mark C Stream Interface as Stable

2022-06-08 Thread Jonathan Keane
+1 (non-binding) -Jon On Wed, Jun 8, 2022 at 4:52 PM Jorge Cardoso Leitão wrote: > > Sorry, I got a bit confused on what we were voting on. Thank you for the > clarification. > > +1 > > Best, > Jorge > > > On Wed, Jun 8, 2022 at 9:53 PM Antoine Pitrou wrote: > > > > > On 08/06/2022 at 20:55,

Re: Existence/name/scope for minimal C/C++ Arrow C Data interface helpers

2022-06-03 Thread Jonathan Keane
want to encourage > database driver libraries to add new APIs that emit the Arrow C > interface, we need to make it easier to generate the C interface > without requiring a new library dependency. > > [1]: https://lists.apache.org/thread/gnz1kz2rj3rb8rh8qz7l0mv8lvzq254w > > On M

Re: [Discuss][Java] macOS minimum requirements

2022-06-01 Thread Jonathan Keane
This isn't Java related directly, but for the R bindings we have to support at least 10.13.6 to be on CRAN, so bumping up to 10.13 would be fine for that too. -Jon On Wed, Jun 1, 2022 at 9:24 AM Antoine Pitrou wrote: > > > Sorry, I put "C++" in the title but this really affects Java via JNI. >

Re: Existence/name/scope for minimal C/C++ Arrow C Data interface helpers

2022-05-30 Thread Jonathan Keane
Thanks for working on this. I've heard people asking about something like this from a number of different fronts on top of the obvious use case in geoarrow | other geospatial libraries. I think a minimal piece of Arrow that other packages could depend on without needing to bring in all of arrow

Re: DISCUSS: Stabilize Arrow C Stream Interface?

2022-05-26 Thread Jonathan Keane
I too am +1 (nonbinding) to marking it as stable -Jon On Thu, May 26, 2022 at 1:05 PM Neal Richardson wrote: > +1 from me too to mark it as stable. De facto it is stable: there have been > no modifications to > https://github.com/apache/arrow/blob/master/cpp/src/arrow/c/abi.h since > the >

Re: [VOTE] Release Apache Arrow 7.0.0 - RC8

2022-01-27 Thread Jonathan Keane
+0 most things validate, though I haven't been able to run the C++ tests successfully Thank you for the huge effort Krisztián. I verified the signature + checksums on [3]. I've run the following (on macOS 12.1): The binary verification — successful. I've also run the source verification on: *

Re: [Parquet][C++][Python] Maximum Row Group Length Default

2021-11-17 Thread Jonathan Keane
This doesn't address the large number of row groups ticket that was raised, but for some visibility: there is some work to change the row group sizing based on the size of data instead of a static number of rows [1] as well as exposing a few more knobs to tune [2]. There is a bit of prior art in

Re: Arrow sync call November 10 at 12:00 US/Eastern, 17:00 UTC

2021-11-10 Thread Jonathan Keane
Meeting notes: # Participants Nic Weston David Eduardo Benson Rok Antoine Alenka James Matt Micah # 6.0.1 patch release The RC1 for 6.0.1 is on its way and will have a vote shortly # Flight SQL David wanted to talk about Flight SQL from Dremio. We are close, would like someone to

Re: [DISCUSS] Deprecate user@ in favor for github issues/discussions

2021-09-29 Thread Jonathan Keane
I am also +1 for all of the same reasons both Neal and Philip mention. Lowering that barrier to participation for getting help + having that information more easily findable will make it easiest for folks to use and adopt Arrow. I will add personally I didn't realize I already do this when working

Re: [VOTE] Restart the Julia implementation with new repository and process

2021-09-27 Thread Jonathan Keane
+1 -Jon On Mon, Sep 27, 2021 at 2:26 PM Mauricio Vargas wrote: > > +1 > > On Mon, Sep 27, 2021 at 3:18 PM Neal Richardson > wrote: > > > +1 (binding) > > > > Neal > > > > On Mon, Sep 27, 2021 at 6:54 AM Andrew Lamb wrote: > > > > > +1 (binding) > > > > > > On Mon, Sep 27, 2021 at 12:17 AM

Re: Arrow sync call August 3 at 12:00 US/Eastern, 16:00 UTC

2021-08-04 Thread Jonathan Keane
Notes for the meeting, it was relatively short and sparsely attended this fortnight: Attendees: * David Li * Jonathan Keane * Nic Crane * Neal Richardson Topics discussed * Compute IR proposal: There's been some discussion, check it out * CRAN resubmission, we have the fixes we need, will send

Arrow sync call August 3 at 12:00 US/Eastern, 16:00 UTC

2021-08-03 Thread Jonathan Keane
Hello everyone, Our biweekly sync call is tomorrow (3 August) at 12:00 noon Eastern time. For today's call, let's please use this Google Meet URL (different from the usual one): https://meet.google.com/vbq-yufg-zwr?authuser=0 All are welcome to join. Notes will be shared with the mailing list

[Discuss] If and how we should integrate geospatial data (specs) in Arrow

2021-06-25 Thread Jonathan Keane
Hello, There is an emerging spec[1] for how to store geospatial data in Arrow + pass through parquet files in the geopandas world. There is even a new R package that implements a wrapper to do the same in R[2]. These both define a serialization[3] for storing geospatial data as an Arrow table

Re: [VOTE] Clarify meaning of timestamp without time zone to equal the concept of "LocalDateTime"

2021-06-25 Thread Jonathan Keane
+1 -Jon On Fri, Jun 25, 2021 at 5:30 AM Rok Mihevc wrote: > > +1 (non-binding) > > On Fri, Jun 25, 2021 at 11:21 AM Eduardo Ponce wrote: > > > +1 (non-binding) > > > > On Fri, Jun 25, 2021 at 4:31 AM Joris Peeters > > wrote: > > > > > +1 > > > > > > On Fri, Jun 25, 2021 at 9:29 AM Joris Van

Re: [C++][Discuss] Switch to C++17

2021-06-11 Thread Jonathan Keane
rovide their opinion for the qualitative > > metrics? What is a "good enough" coverage? > > * How do we summarize the results into a binary decision: upgrade vs not > > upgrade? > > * ... > > > > In the end, it might not be worthwhile to go through

Re: [C++][Discuss] Switch to C++17

2021-06-08 Thread Jonathan Keane
I've been digging a bit to try and put numbers on those users that Neal mentions. Specifically, we know that requiring C++17 will mean that R users on Windows using versions of R before 4.0.0 will not be able to compile/install arrow. Although R version 3.6 is no longer supported by CRAN [1], many

Re: [NIGHTLY] Arrow Build Report for Job nightly-2021-06-06-0

2021-06-07 Thread Jonathan Keane
Yes, I absolutely agree that more triaging, visibility, and info into these would be massively helpful for tracking some of these down. The conda-osx-py* builds seem to all be related to this LLVM mismatch https://issues.apache.org/jira/browse/ARROW-12738 which I've clarified more on that ticket.

Re: Moving automated nightly build e-mails to a separate mailing list

2021-05-24 Thread Jonathan Keane
I also very much agree with all of the sentiments above. One of the things that I'm hoping this new site/dashboard/whatever we come up with will have is some more information / context around the failures that hopefully will help make them less overwhelming and have a higher signal to noise

Re: Nightly Builds Repors 2021-05-17

2021-05-18 Thread Jonathan Keane
Thanks for the comments + tickets Krisztián all of those sound like good enhancements to this process. On the point of: >> Error type: Internal > I find it really useful to categorize the errors, especially if we > have an error out of our direct reach. > I can't think of an easy way to automate

Re: String reverse kernel

2021-05-17 Thread Jonathan Keane
Yeah, piggybacking on what Weston said: is the line that we want to draw at code points, combining character sequences, or graphemes [1]? IME, most people would want/assume that combining characters would stay combined in reversals (using Weston's example: "tréma" becoming "aḿert" (though this
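
To illustrate the distinction raised in this message, here is a minimal Python sketch that reverses the decomposed form of "tréma" two ways: one code point at a time, and with each base character kept together with the combining marks that follow it. The function names are hypothetical and chosen for illustration only; this is not the Arrow kernel under discussion, and it does not do full grapheme-cluster segmentation (for example, emoji joined with zero-width joiners), which the Python standard library does not provide.

```python
import unicodedata


def reverse_code_points(text: str) -> str:
    """Reverse naively, one code point at a time."""
    return text[::-1]


def reverse_combining_sequences(text: str) -> str:
    """Reverse while keeping each base character attached to the
    combining marks that immediately follow it."""
    clusters = []
    for ch in text:
        # unicodedata.combining() returns a non-zero class for combining marks.
        if clusters and unicodedata.combining(ch) != 0:
            clusters[-1] += ch
        else:
            clusters.append(ch)
    return "".join(reversed(clusters))


# "tréma" with a decomposed accent: 'e' followed by U+0301 (combining acute).
word = "tre\u0301ma"
naive = reverse_code_points(word)            # the accent ends up after 'm'
grouped = reverse_combining_sequences(word)  # the accent stays on 'e'
```

With the naive version the accent migrates onto the "m", which is the "aḿert" result quoted in the message; the grouped version keeps the accent on the "e".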

Re: [VOTE] Release Apache Arrow 4.0.0 - RC3

2021-04-22 Thread Jonathan Keane
+1 (non-binding) Verified wheels, sources, and binaries on macOS 11.2 using the verification script (except for Java Integration, Glib, and Ruby). Like Antoine I ran into the same issue with Ruby. I also installed Arrow and the R package locally + ran some adhoc tests using some of our

Re: [VOTE] Release Apache Arrow 4.0.0 - RC1

2021-04-20 Thread Jonathan Keane
be expected given > > > the current status of jfrog) but I attempted to install the CentOS 7 > > > RPM and got the following error when I ran `sudo yum update` after > > > installing the arrow repo rpm. > > > > > > > https://apache.jfrog.io/artifactory/arrow/cen

Re: [VOTE] Release Apache Arrow 4.0.0 - RC1

2021-04-20 Thread Jonathan Keane
I'm still working on my verification, but as part of that noticed that https://issues.apache.org/jira/browse/ARROW-12316 which we thought changed the default memory allocator didn't fully accomplish that. Nothing is broken per se, but jemalloc is still the default on macOS. I've made

Re: Setting Affects Version in Arrow Jira bug issues

2021-04-07 Thread Jonathan Keane
I think this proposal is great and will help a lot when scanning through Jira issues. I wonder if it's possible to automate this? I'm thinking something along the lines of: If it's a Type = Bug, could have a yes/no or checkbox where we ask "is this a bug reproducible in the most recent arrow

Re: Arrow sync call March 31 at 12:00 US/Eastern, 16:00 UTC

2021-03-31 Thread Jonathan Keane
Thank you everyone who attended, here are the notes. Attendees: Jonathan Keane Colin Alworth David Sanders Micah Kornfield Rok Mihevc Projjal Chanda Eduardo Ponce Kirill Lykov Discussion: - 4.0 release - zstd compression for the java library (has PR that is approved

Re: Arrow sync call March 31 at 12:00 US/Eastern, 16:00 UTC

2021-03-31 Thread Jonathan Keane
I'm experiencing the same here. On Wed, Mar 31, 2021 at 11:06 AM Kirill Lykov wrote: > Hi, > > I don't know about the others but I cannot join because someone needs to > let me in. > Might be it the problem also for other people? > > On Tue, Mar 30, 2021 at 5:53 PM Neal Richardson < >

[jira] [Created] (ARROW-8734) [R] Compilation error on macOS

2020-05-07 Thread Jonathan Keane (Jira)
Jonathan Keane created ARROW-8734: - Summary: [R] Compilation error on macOS Key: ARROW-8734 URL: https://issues.apache.org/jira/browse/ARROW-8734 Project: Apache Arrow Issue Type: Bug

[jira] [Created] (ARROW-8726) segfault with a mis-specified partition

2020-05-06 Thread Jonathan Keane (Jira)
Jonathan Keane created ARROW-8726: - Summary: segfault with a mis-specified partition Key: ARROW-8726 URL: https://issues.apache.org/jira/browse/ARROW-8726 Project: Apache Arrow Issue Type