[RESULT][VOTE][Julia] Release Apache Arrow Julia 2.6.1 RC1

2023-06-07 Thread Sutou Kouhei
Hi,

The vote carries with 3 +1 binding votes, 1 +1 non-binding
vote and no -1 votes.

I'll publish this release to
https://dist.apache.org/repos/dist/release/arrow/ .

Thanks,
-- 
kou


In <20230607.104808.9345437997937034@clear-code.com>
  "[VOTE][Julia] Release Apache Arrow Julia 2.6.1 RC1" on Wed, 07 Jun 2023 10:48:08 +0900 (JST),
  Sutou Kouhei wrote:

> Hi,
> 
> I would like to propose the following release candidate (RC1) of
> Apache Arrow Julia version 2.6.1.
> 
> This release candidate is based on commit:
> 2d1114e180ef11f9d3bbe310b2eb856550cfbeb3 [1]
> 
> The source release rc1 is hosted at [2].
> 
> Please download, verify checksums and signatures, run the unit tests,
> and vote on the release. See [3] for how to validate a release candidate.
> 
> The vote will be open for at least 24 hours.
> 
> [ ] +1 Release this as Apache Arrow Julia 2.6.1
> [ ] +0
> [ ] -1 Do not release this as Apache Arrow Julia 2.6.1 because...
> 
> [1]: https://github.com/apache/arrow-julia/tree/2d1114e180ef11f9d3bbe310b2eb856550cfbeb3
> [2]: https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-julia-2.6.1-rc1/
> [3]: https://github.com/apache/arrow-julia/blob/main/dev/release/README.md#verify
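
The checksum step in the call for votes can be sketched in Python as
follows. This is only an illustration, not the project's verification
script: the function names are hypothetical, and the layout of the checksum
file (a "<hex digest>  <file name>" line) is an assumption. Signature
verification is not covered; the procedure documented in [3] remains the
reference.

```python
import hashlib
from pathlib import Path


def file_digest(path: Path, algorithm: str = "sha512") -> str:
    """Return the hex digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.new(algorithm)
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(1 << 20), b""):
            digest.update(chunk)
    return digest.hexdigest()


def parse_checksum_line(line: str) -> str:
    """Extract the digest from a '<hex>  <filename>' line (assumed layout)."""
    return line.split()[0].lower()


def checksum_matches(artifact: Path, checksum_file: Path,
                     algorithm: str = "sha512") -> bool:
    """Compare the artifact's digest with the one recorded next to it."""
    expected = parse_checksum_line(checksum_file.read_text())
    return file_digest(artifact, algorithm) == expected
```

A caller would pass the downloaded tarball and its checksum file and treat a
False result as a reason to vote -1 or to investigate further.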


Re: [VOTE][Julia] Release Apache Arrow Julia 2.6.1 RC1

2023-06-07 Thread Sutou Kouhei
+1

I ran the following command line on Debian GNU/Linux sid:

  VERIFY_FORCE_USE_JULIA_BINARY=1 dev/release/verify_rc.sh 2.6.1 1


Thanks,
-- 
kou


Re: [VOTE][Julia] Release Apache Arrow Julia 2.6.1 RC1

2023-06-07 Thread David Li
+1 (Ubuntu Linux 20.04/AMD64)

I had a bit of trouble installing Julia because the script got confused (it
appears 1.9.1 is tagged on GitHub, but binaries are not available yet), but I
don't think that's a problem with our script.

On Tue, Jun 6, 2023, at 22:28, Jacob Quinn wrote:
> +1 (macOS M1)
>
> Cheers,
>
> -Jacob


[RESULT][VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 26.0.0 RC1

2023-06-07 Thread Andy Grove
On Wed, Jun 7, 2023 at 1:18 PM Andy Grove wrote:

> The vote has passed with six votes (four binding). Thank you to all who
> helped with the release verification.
>
> Source release:
> https://dist.apache.org/repos/dist/release/arrow/arrow-datafusion-26.0.0
>
> Crates: https://crates.io/crates/datafusion
>
> On Mon, Jun 5, 2023 at 2:53 PM Andrew Lamb wrote:
>
>> +1 (binding)
>>
>> I ran the tests on an x86_64 mac
>>
>> Thanks for running this process Andy -- it is very much appreciated
>>
>> Andrew
>>
>> On Sat, Jun 3, 2023 at 11:20 AM Andy Grove wrote:
>>
>> > Hi,
>> >
>> > I would like to propose a release of Apache Arrow DataFusion
>> > Implementation, version 26.0.0.
>> >
>> > This release candidate is based on commit:
>> > 06240ab87e7e7d8ac4b43feaa95377bf607d18eb [1]
>> > The proposed release tarball and signatures are hosted at [2].
>> > The changelog is located at [3].
>> >
>> > Please download, verify checksums and signatures, run the unit tests,
>> > and vote on the release. The vote will be open for at least 72 hours.
>> >
>> > Only votes from PMC members are binding, but all members of the
>> > community are encouraged to test the release and vote with
>> > "(non-binding)".
>> >
>> > The standard verification procedure is documented at
>> > https://github.com/apache/arrow-datafusion/blob/main/dev/release/README.md#verifying-release-candidates
>> > .
>> >
>> > [ ] +1 Release this as Apache Arrow DataFusion 26.0.0
>> > [ ] +0
>> > [ ] -1 Do not release this as Apache Arrow DataFusion 26.0.0 because...
>> >
>> > Here is my vote:
>> >
>> > +1
>> >
>> > [1]: https://github.com/apache/arrow-datafusion/tree/06240ab87e7e7d8ac4b43feaa95377bf607d18eb
>> > [2]: https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-datafusion-26.0.0-rc1
>> > [3]: https://github.com/apache/arrow-datafusion/blob/06240ab87e7e7d8ac4b43feaa95377bf607d18eb/CHANGELOG.md
>> >
>>
>


Re: [VOTE][RUST][DataFusion] Release Apache Arrow DataFusion 26.0.0 RC1

2023-06-07 Thread Andy Grove
The vote has passed with six votes (four binding). Thank you to all who
helped with the release verification.

Source release:
https://dist.apache.org/repos/dist/release/arrow/arrow-datafusion-26.0.0

Crates: https://crates.io/crates/datafusion


Re: [DISCUSS] JSON Canonical Extension Type

2023-06-07 Thread Will Jones
Hello,

Sorry this hasn't gotten much attention recently. I just brought this up at
the Arrow community meeting, as I'd like to revive it.

It looks like there is a draft implementation up already [1].

I'm generally supportive of this, but I have a few questions:

1. Would we be able to make this extension type work on top of any of the
string types, including Utf8, LargeUtf8, and the (under consideration [2])
StringView types?
2. Does this imply a potential canonical extension type for every
text-based data format, such as HOCON, XML, and so on? If we agree JSON is
special, I think it's fine to have its own extension type. On the other
hand, it might be worth considering making a generic extension type for
serialized data, that is parameterized by the media type
("application/json" in this case).  This doesn't preclude the possibility
of building an extension type class / struct within Arrow implementations
that is specific to JSON; I don't think there's any hard rule that there
has to be a 1-1 correspondence between extension types in the format and
the concrete data structures in libraries.

Best,

Will Jones

[1] https://github.com/apache/arrow/pull/13901
[2] https://lists.apache.org/thread/w88tpz76ox8h3rxkjl4so6rg3f1rv7wt
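
To make question 2 concrete, here is a minimal Python sketch of a generic
extension type parameterized by media type. Everything in it is
hypothetical: the class name SerializedTextType, the extension name
"example.serialized_text", and the use of JSON to serialize the parameter
are illustrative assumptions, not part of any Arrow specification or
library.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class SerializedTextType:
    """Hypothetical generic extension type for serialized text data."""

    media_type: str
    # Illustrative name only; not a registered canonical extension name.
    extension_name: str = "example.serialized_text"

    def serialize_parameters(self) -> bytes:
        """Encode the type parameter as UTF-8 JSON metadata."""
        return json.dumps({"media_type": self.media_type}).encode("utf-8")

    @classmethod
    def deserialize_parameters(cls, payload: bytes) -> "SerializedTextType":
        """Rebuild the type from metadata written by serialize_parameters."""
        params = json.loads(payload.decode("utf-8"))
        return cls(media_type=params["media_type"])


def json_type() -> SerializedTextType:
    """Library-side convenience: a JSON-specific view of the generic type."""
    return SerializedTextType(media_type="application/json")
```

The json_type() helper shows how a library could still expose a
JSON-specific entry point while the format defines only one generic type,
which is the lack of a 1-1 correspondence mentioned above.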


On Thu, Dec 1, 2022 at 12:23 AM Antoine Pitrou wrote:

>
> HOCON is a superset of JSON, so I'm not sure making it an extension type
> based on JSON would be a good idea.
>
>
> On 01/12/2022 at 06:23, Micah Kornfield wrote:
> >>
> >> Can a logical extension be based on another logical extension?
> >
> > Potentially, but this is mostly an implementation detail; each type
> > should have its own specification IMO.
> >
> > HOCON support might be nice..
> >
> > I'm not sure if this is common enough to warrant a canonical type within
> > Arrow but you are welcome to propose something if you would like.
> >
> > Cheers,
> > Micah
> >
> > On Mon, Nov 28, 2022 at 11:55 AM Lee, David wrote:
> >
> >> Can a logical extension be based on another logical extension?
> >>
> >> HOCON support might be nice..
> >>
> > >> -----Original Message-----
> >> From: Micah Kornfield 
> >> Sent: Monday, November 28, 2022 11:50 AM
> >> To: dev@arrow.apache.org
> >> Subject: Re: [DISCUSS] JSON Canonical Extension Type
> >>
> > >> This seems like a reasonable definition to me.  Since there hasn't been
> > >> much feedback, I think following through with an implementation plus this
> > >> description in a PR would be the next step.  If there isn't further
> > >> feedback on this, once the PR is up we can try to hold a vote (which might
> > >> bring up some more feedback, but hopefully wouldn't cause too much
> > >> implementation churn).
> >>
> >> Thanks,
> >> Micah
> >>
> > >> On Thu, Nov 17, 2022 at 3:58 PM Pradeep Gollakota wrote:
> >>
> >>> Hi folks!
> >>>
> >>> I put together this specification for canonicalizing the JSON type in
> >>> Arrow.
> >>>
> >>> ## Introduction
> > >>> JSON is a widely used text-based data interchange format. There are
> > >>> many use cases where a user has a column whose contents are a
> > >>> JSON-encoded string. BigQuery's [JSON Type][1] and Parquet’s [JSON Logical
> > >>> Type][2] are two such examples.
> > >>>
> > >>> The JSON specification is defined in [RFC-8259][3]. However, many of
> > >>> the most popular parsers support non-standard extensions. Examples of
> > >>> non-standard extensions to JSON include comments, unquoted keys,
> > >>> trailing commas, etc.
> >>>
> >>> ## Extension Specification
> >>> * The name of the extension is `arrow.json`
> >>> * The storage type of the extension is `utf8`
> >>> * The extension type has no parameters
> >>> * The metadata MUST be either empty or a valid JSON object
> >>>  - There is no canonical metadata
> >>>  - Implementations MAY include implementation-specific metadata by
> >>> using a namespaced key. For example `{"google.bigquery": {"my":
> >>> "metadata"}}`
> >>> * Implementations...
> >>>  - MUST produce valid UTF-8 encoded text
> >>>  - SHOULD produce valid standard JSON
> >>>  - MAY produce valid non-standard JSON
> >>>  - MUST support parsing standard JSON
> > >>>  - MAY support parsing non-standard JSON
> >>>  - SHOULD pass through contents that they do not understand
> >>>
> >>> ## Forward compatibility
> >>> In the future we might allow this logical type to annotate a byte
> >>> storage type with a different text encoding.  Implementations
> >>> consuming JSON logical types should verify this.
> >>>
> >>>  [1]:
> >>>
> >>>
> >>
> https://urldefense.com/v3/__https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types*json_type__;Iw!!KSjYCgUGsB4!YhB-EpSLu8HTacaUsWvTVqF0kYh81UlVwNFBAc4-f95F7bGtdGuyWN_JObBkRSee-jTU20_MmGe2WUH8UMqTxPY$
> >>>  [2]:
> >>>
> >>
> 
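
The metadata and parsing rules in the proposal quoted above can be sketched
with the Python standard library alone. This is a rough illustration under
stated assumptions, not an Arrow API: both function names are hypothetical,
and Python's json module only approximates RFC 8259 (it accepts NaN and
Infinity by default, so they are rejected explicitly here).

```python
import json


def parse_extension_metadata(raw: bytes) -> dict:
    """Apply the rule that metadata is either empty or a JSON object."""
    if not raw:
        return {}
    value = json.loads(raw.decode("utf-8"))
    if not isinstance(value, dict):
        raise ValueError("metadata must be empty or a JSON object")
    return value


def _reject_constant(name: str):
    # json.loads calls this hook for NaN, Infinity and -Infinity,
    # none of which are valid in standard JSON.
    raise ValueError(f"non-standard JSON constant: {name}")


def is_standard_json(text: str) -> bool:
    """Return True if the text parses as standard JSON (approximation)."""
    try:
        json.loads(text, parse_constant=_reject_constant)
    except ValueError:  # json.JSONDecodeError is a subclass of ValueError
        return False
    return True
```

A reader that only understands standard JSON could use is_standard_json to
decide whether to parse a value or pass it through unchanged, in line with
the SHOULD rule on contents an implementation does not understand.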

Arrow community meeting June 7 at 16:00 UTC

2023-06-07 Thread Ian Cook
Our next biweekly Arrow community meeting is today at 16:00 UTC / 12:00 EDT.

Zoom meeting URL:
https://zoom.us/j/87649033008?pwd=SitsRHluQStlREM0TjJVYkRibVZsUT09
Meeting ID: 876 4903 3008
Passcode: 958092

Meeting notes will be captured in this Google Doc:
https://docs.google.com/document/d/1xrji8fc6_24TVmKiHJB4ECX1Zy2sy2eRbBjpVJMnPmk/
If you plan to attend this meeting, you are welcome to edit the
document to add the topics that you would like to discuss.

Thanks,
Ian