Re: [VOTE] Release Apache Arrow nanoarrow 0.2.0 - RC0

2023-06-16 Thread Jacob Wujciak-Jens
+1 (non-binding) verified fully on R 4.3 and GCC 12 on manjaro

On Fri, Jun 16, 2023 at 11:13 PM David Li  wrote:

> +1
>
> Tested on Ubuntu 20.04/x86_64
>
> On Fri, Jun 16, 2023, at 16:15, Dewey Dunnington wrote:
> > Hello,
> >
> > I would like to propose the following release candidate (RC0) of
> > Apache Arrow nanoarrow version 0.2.0. This release consists of 17
> > resolved GitHub issues [1].
> >
> > This release candidate is based on commit:
> > a7b824de6cb99ce458e1a5cd311d69588ceb0570 [2]
> >
> > The source release rc0 is hosted at [3].
> > The changelog is located at [4].
> >
> > Please download, verify checksums and signatures, run the unit tests,
> > and vote on the release. See [5] for how to validate a release
> > candidate.
> >
> > The vote will be open for at least 72 hours.
> >
> > [ ] +1 Release this as Apache Arrow nanoarrow 0.2.0
> > [ ] +0
> > [ ] -1 Do not release this as Apache Arrow nanoarrow 0.2.0 because...
> >
> > [0] https://github.com/apache/arrow-nanoarrow
> > [1] https://github.com/apache/arrow-nanoarrow/milestone/2?closed=1
> > [2]
> >
> https://github.com/apache/arrow-nanoarrow/tree/apache-arrow-nanoarrow-0.2.0-rc0
> > [3]
> >
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-nanoarrow-0.2.0-rc0/
> > [4]
> >
> https://github.com/apache/arrow-nanoarrow/blob/apache-arrow-nanoarrow-0.2.0-rc0/CHANGELOG.md
> > [5]
> >
> https://github.com/apache/arrow-nanoarrow/blob/main/dev/release/README.md
>


Re: [VOTE][RUST] Release Apache Arrow Rust 42.0.0 RC1

2023-06-16 Thread L. C. Hsieh
+1 (binding)

Verified on M1 Mac.

Thanks Andrew.

On Fri, Jun 16, 2023 at 11:40 AM Andrew Lamb  wrote:
>
> Hi,
>
> I would like to propose a release of Apache Arrow Rust Implementation,
> version 42.0.0.
>
> Please note that there is one known regression in this release related to
> parsing intervals like '.5 months' [5], but I do not believe it should
> block the release (see [6] for rationale). However, if others feel
> differently, there is a proposed fix [7] and once it is reviewed / merged I
> can create a new RC as well
>
> This release candidate is based on commit:
> 2c7b4efc1701d9db5a0cc6decacf1df22123645f [1]
>
> The proposed release tarball and signatures are hosted at [2].
>
> The changelog is located at [3].
>
> Please download, verify checksums and signatures, run the unit tests,
> and vote on the release. There is a script [4] that automates some of
> the verification.
>
> The vote will be open for at least 72 hours.
>
> [ ] +1 Release this as Apache Arrow Rust
> [ ] +0
> [ ] -1 Do not release this as Apache Arrow Rust  because...
>
> [1]:
> https://github.com/apache/arrow-rs/tree/2c7b4efc1701d9db5a0cc6decacf1df22123645f
> [2]: https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-rs-42.0.0-rc1
> [3]:
> https://github.com/apache/arrow-rs/blob/2c7b4efc1701d9db5a0cc6decacf1df22123645f/CHANGELOG.md
> [4]:
> https://github.com/apache/arrow-rs/blob/master/dev/release/verify-release-candidate.sh
> [5]: https://github.com/apache/arrow-rs/issues/4424
> [6]: https://github.com/apache/arrow-rs/pull/4425#discussion_r1232573299
> [6]: https://github.com/apache/arrow-rs/pull/4425


Re: [VOTE] Release Apache Arrow nanoarrow 0.2.0 - RC0

2023-06-16 Thread David Li
+1 

Tested on Ubuntu 20.04/x86_64

On Fri, Jun 16, 2023, at 16:15, Dewey Dunnington wrote:
> Hello,
>
> I would like to propose the following release candidate (RC0) of
> Apache Arrow nanoarrow version 0.2.0. This release consists of 17
> resolved GitHub issues [1].
>
> This release candidate is based on commit:
> a7b824de6cb99ce458e1a5cd311d69588ceb0570 [2]
>
> The source release rc0 is hosted at [3].
> The changelog is located at [4].
>
> Please download, verify checksums and signatures, run the unit tests,
> and vote on the release. See [5] for how to validate a release
> candidate.
>
> The vote will be open for at least 72 hours.
>
> [ ] +1 Release this as Apache Arrow nanoarrow 0.2.0
> [ ] +0
> [ ] -1 Do not release this as Apache Arrow nanoarrow 0.2.0 because...
>
> [0] https://github.com/apache/arrow-nanoarrow
> [1] https://github.com/apache/arrow-nanoarrow/milestone/2?closed=1
> [2] 
> https://github.com/apache/arrow-nanoarrow/tree/apache-arrow-nanoarrow-0.2.0-rc0
> [3] 
> https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-nanoarrow-0.2.0-rc0/
> [4] 
> https://github.com/apache/arrow-nanoarrow/blob/apache-arrow-nanoarrow-0.2.0-rc0/CHANGELOG.md
> [5] 
> https://github.com/apache/arrow-nanoarrow/blob/main/dev/release/README.md


[VOTE] Release Apache Arrow nanoarrow 0.2.0 - RC0

2023-06-16 Thread Dewey Dunnington
Hello,

I would like to propose the following release candidate (RC0) of
Apache Arrow nanoarrow version 0.2.0. This release consists of 17
resolved GitHub issues [1].

This release candidate is based on commit:
a7b824de6cb99ce458e1a5cd311d69588ceb0570 [2]

The source release rc0 is hosted at [3].
The changelog is located at [4].

Please download, verify checksums and signatures, run the unit tests,
and vote on the release. See [5] for how to validate a release
candidate.

The vote will be open for at least 72 hours.

[ ] +1 Release this as Apache Arrow nanoarrow 0.2.0
[ ] +0
[ ] -1 Do not release this as Apache Arrow nanoarrow 0.2.0 because...

[0] https://github.com/apache/arrow-nanoarrow
[1] https://github.com/apache/arrow-nanoarrow/milestone/2?closed=1
[2] 
https://github.com/apache/arrow-nanoarrow/tree/apache-arrow-nanoarrow-0.2.0-rc0
[3] 
https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-nanoarrow-0.2.0-rc0/
[4] 
https://github.com/apache/arrow-nanoarrow/blob/apache-arrow-nanoarrow-0.2.0-rc0/CHANGELOG.md
[5] https://github.com/apache/arrow-nanoarrow/blob/main/dev/release/README.md


[VOTE][RUST] Release Apache Arrow Rust 42.0.0 RC1

2023-06-16 Thread Andrew Lamb
Hi,

I would like to propose a release of Apache Arrow Rust Implementation,
version 42.0.0.

Please note that there is one known regression in this release related to
parsing intervals like '.5 months' [5], but I do not believe it should
block the release (see [6] for rationale). However, if others feel
differently, there is a proposed fix [7] and once it is reviewed / merged I
can create a new RC as well

This release candidate is based on commit:
2c7b4efc1701d9db5a0cc6decacf1df22123645f [1]

The proposed release tarball and signatures are hosted at [2].

The changelog is located at [3].

Please download, verify checksums and signatures, run the unit tests,
and vote on the release. There is a script [4] that automates some of
the verification.

The vote will be open for at least 72 hours.

[ ] +1 Release this as Apache Arrow Rust
[ ] +0
[ ] -1 Do not release this as Apache Arrow Rust  because...

[1]:
https://github.com/apache/arrow-rs/tree/2c7b4efc1701d9db5a0cc6decacf1df22123645f
[2]: https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-rs-42.0.0-rc1
[3]:
https://github.com/apache/arrow-rs/blob/2c7b4efc1701d9db5a0cc6decacf1df22123645f/CHANGELOG.md
[4]:
https://github.com/apache/arrow-rs/blob/master/dev/release/verify-release-candidate.sh
[5]: https://github.com/apache/arrow-rs/issues/4424
[6]: https://github.com/apache/arrow-rs/pull/4425#discussion_r1232573299
[6]: https://github.com/apache/arrow-rs/pull/4425


Part 2 of the article "Our journey at F5 with Apache Arrow"

2023-06-16 Thread Laurent Quérel
Hi everyone,

I wrote the second part of the article "Our Journey at F5 with Apache
Arrow," published on April 11, 2023. This article discusses three
techniques that have enabled us to enhance both the compression ratio and
memory usage of Apache Arrow buffers within the current version of the OTel
Arrow protocol.

The following PR is available to gather your feedback.
https://github.com/apache/arrow-site/pull/369

You can also read the article on Google Docs if you prefer.
https://docs.google.com/document/d/1K2CqAtF4pZjpiVts8BOcq34sOcNgozvZ9ZZw-_zTv6I/edit?usp=sharing

Cheers, Laurent Querel


Re: Turn a vector of Scalar to an Array/ArrayData of the same datatype

2023-06-16 Thread Li Jin
Thanks Jin! That's perfect.

On Thu, Jun 15, 2023 at 11:21 PM Jin Shang  wrote:

> Hi Li,
>
> I've faced this issue before, and I ended up using a generic ArrayBuilder,
> for example:
>
> ```cpp
> auto type = int32();
> std::vector> scalars = {MakeScalar(1),
> MakeScalar(2)};
>
> ARROW_ASSIGN_OR_RAISE(std::unique_ptr builder,
> MakeBuilder(type));
> ARROW_RETURN_NOT_OK(builder->AppendScalars(scalars));
> ARROW_ASSIGN_OR_RAISE(auto arr, builder->Finish());
> ```
>
> Best,
> Jin
>
>
> On Fri, Jun 16, 2023 at 5:23 AM Li Jin  wrote:
>
> > Hi,
> >
> > I find myself in need of a function to turn a vector of Scalar to an
> Array
> > of the same datatype. The data type is known at the runtime. e.g.
> >
> > shared_ptr concat_scalars(vector values.
> > shared_ptr type);
> >
> > I wonder if I need to use sth like Scalar::Accept(ScalarVisitor*) or is
> > there an easier/better way to achieve this?
> >
> > For context, I am trying to implement UDF support for hash aggregation,
> > each UDF invocation will give me back one Scalar (for each group), and I
> > need to concat them in the HashAggregateKernel finalize method. So
> > performance is not a large concern here, time in the UDF would likely
> > dominate the total runtime.
> >
> > Thanks!
> > Li
> >
>


[RELEASE] Prepare for Arrow Release version 13.0.0

2023-06-16 Thread Raúl Cumplido
Hi,

It seems it was yesterday we were doing a release but it is time again
to prepare for a new release, 13.0.0.

I propose Monday 10th of July as the release code freeze. This is ~3
weeks from now.

The GitHub milestone for 13.0.0 can be seen here [1]. There are
currently no Release blockers identified but they will be tracked here
[2]. If you identify any blocker please add the `Priority: Blocker`
label to the corresponding issue.

The plan is for me to be acting as Release Manager with support from
Kou to sign and upload artifacts.

Thank you,
Raúl

[1] https://github.com/apache/arrow/milestone/53
[2] 
https://github.com/apache/arrow/issues?q=is%3Aopen+is%3Aissue+label%3A%22Priority%3A+Blocker%22+milestone%3A13.0.0


Re: [VOTE] Release Apache Arrow ADBC 0.5.0 - RC0

2023-06-16 Thread Raúl Cumplido
+1 (non-binding)

I ran the following on Ubuntu 22.04:

USE_CONDA=1 dev/release/verify-release-candidate.sh 0.5.0 0

Thanks!
Raúl

El vie, 16 jun 2023 a las 9:10, Sutou Kouhei () escribió:
>
> +1
>
> I ran the following on Debian GNU/Linux sid:
>
>   JAVA_HOME=/usr/lib/jvm/default-java \
> dev/release/verify-release-candidate.sh 0.5.0 0
>
> with:
>
>   * Python 3.11.2
>   * g++ (Debian 12.2.0-14) 12.2.0
>   * go version go1.19.8 linux/amd64
>   * openjdk version "17.0.6" 2023-01-17
>   * ruby 3.1.2p20 (2022-04-12 revision 4491bb740a) [x86_64-linux-gnu]
>   * R version 4.3.0 (2023-04-21) -- "Already Tomorrow"
>
>
> Note: I needed https://github.com/apache/arrow-adbc/pull/810 .
>
>
> Thanks,
> --
> kou
>
>
> In <74f01cb9-5c76-4745-b357-4deca0bbd...@app.fastmail.com>
>   "[VOTE] Release Apache Arrow ADBC 0.5.0 - RC0" on Thu, 15 Jun 2023 20:06:46 
> -0400,
>   "David Li"  wrote:
>
> > Hello,
> >
> > I would like to propose the following release candidate (RC0) of Apache 
> > Arrow ADBC version 0.5.0. This is a release consisting of 36 resolved 
> > GitHub issues [1].
> >
> > This release candidate is based on commit: 
> > ac0e0ef8bd83787f65e53d421fce6ad490d9a37d [2]
> >
> > The source release rc0 is hosted at [3].
> > The binary artifacts are hosted at [4][5][6][7][8].
> > The changelog is located at [9].
> >
> > Please download, verify checksums and signatures, run the unit tests, and 
> > vote on the release. See [10] for how to validate a release candidate.
> >
> > See also a verification result on GitHub Actions [11].
> >
> > The vote will be open for at least 72 hours.
> >
> > [ ] +1 Release this as Apache Arrow ADBC 0.5.0
> > [ ] +0
> > [ ] -1 Do not release this as Apache Arrow ADBC 0.5.0 because...
> >
> > Note: to verify APT/YUM packages on macOS/AArch64, you must `export 
> > DOCKER_DEFAULT_ARCHITECTURE=linux/amd64`. (Or skip this step by `export 
> > TEST_APT=0 TEST_YUM=0`.)
> >
> > [1]: 
> > https://github.com/apache/arrow-adbc/issues?q=is%3Aissue+milestone%3A%22ADBC+Libraries+0.5.0%22+is%3Aclosed
> > [2]: 
> > https://github.com/apache/arrow-adbc/commit/ac0e0ef8bd83787f65e53d421fce6ad490d9a37d
> > [3]: 
> > https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-adbc-0.5.0-rc0/
> > [4]: https://apache.jfrog.io/artifactory/arrow/almalinux-rc/
> > [5]: https://apache.jfrog.io/artifactory/arrow/debian-rc/
> > [6]: https://apache.jfrog.io/artifactory/arrow/ubuntu-rc/
> > [7]: 
> > https://repository.apache.org/content/repositories/staging/org/apache/arrow/adbc/
> > [8]: 
> > https://github.com/apache/arrow-adbc/releases/tag/apache-arrow-adbc-0.5.0-rc0
> > [9]: 
> > https://github.com/apache/arrow-adbc/blob/apache-arrow-adbc-0.5.0-rc0/CHANGELOG.md
> > [10]: 
> > https://arrow.apache.org/adbc/main/development/releasing.html#how-to-verify-release-candidates
> > [11]: https://github.com/apache/arrow-adbc/actions/runs/5284608862


TAC Applications for Community Over Code North America and Asia now open

2023-06-16 Thread Gavin McDonald
Hi All,

(This email goes out to all our user and dev project mailing lists, so you
may receive this
email more than once.)

The Travel Assistance Committee has opened up applications to help get
people to the following events:


*Community Over Code Asia 2023 - *
*August 18th to August 20th in Beijing , China*

Applications for this event closes on the 6th July so time is short, please
apply as soon as possible. TAC is prioritising applications from the Asia
and Oceania regions.

More details on this event can be found at:
https://apachecon.com/acasia2023/

More information on how to apply please read: https://tac.apache.org/


*Community Over Code North America - *
*October 7th to October 10th in Halifax, Canada*

Applications for this event closes on the 22nd July. We expect many
applications so please do apply as soon as you can. TAC is prioritising
applications from the North and South America regions.

More details on this event can be found at: https://communityovercode.org/

More information on how to apply please read: https://tac.apache.org/


*Have you applied to be a Speaker?*

If you have applied or intend to apply as a Speaker at either of these
events, and think you
may require assistance for Travel and/or Accommodation - TAC advises that
you do not
wait until you have been notified of your speaker status and to apply
early. Should you
not be accepted as a speaker and still wish to attend you can amend you
application to
include Conference fees, or, you may withdraw your application.

The call for presentations for Halifax is here:
https://communityovercode.org/call-for-presentations/
and you have until the 13th of July to apply.

The call for presentations for Beijing is here:
https://apachecon.com/acasia2023/cfp.html
and you have until the 18th June to apply.

*IMPORTANT Note on Visas:*

It is important that you apply for a Visa as soon as possible - do not wait
until you know if you have been accepted for Travel Assistance or not, as
due to current wait times for Interviews in some Countries, waiting that
long may be too late, so please do apply for a Visa right away. Contact
tac-ap...@tac.apache.org if you need any more information or assistance in
this area.

*Spread the Word!!*

TAC encourages you to spread the word about Travel Assistance to get to
these events, so feel free to repost as you see fit on Social Media, at
work, schools, universities etc etc...

Thank You and hope to see you all soon

Gavin McDonald on behalf of the ASF Travel Assistance Committee.


Re: [DISCUSS][C++] Can we require CMake 3.16+ since 13.0.0?

2023-06-16 Thread Antoine Pitrou



Well, precompiled headers do not necessarily work well with ccache:
https://github.com/apache/arrow/issues/35641#issuecomment-1551775293

But agreed with the rest.


Le 16/06/2023 à 02:15, Sutou Kouhei a écrit :

Hi,

Ah, sorry. I should have written it in the original e-mail.

If we can require CMake 3.16+:

* We can always use the precompiled headers feature that
   reduces build time:
   
https://github.com/apache/arrow/pull/35921/files#diff-1bba462ab050e89360fd88110a689e85ee037749cea091a1848ab574381d3795L236

* We can remove our bundled FindPythonLibsNew.cmake and
   FindNumPy.cmake:
   
https://github.com/apache/arrow/pull/35921/files#diff-96c2eed475b13f84b0e0b4ce871fcf225d3d75d8c1ebd9d2b2d7183fa2a603f5L44

   Users don't need to specify both of -DPYTHON_EXECUTABLE
   and -DPython3_EXECUTABLE for old/new CMake:
   
https://github.com/apache/arrow/pull/35921/files#diff-91a09bc7b04c0122375fb44ec755065daccba993f95f96a26df129c4df9bb438L377

* We can remove a Boost::headers related shim code:
   
https://github.com/apache/arrow/pull/35921/files#diff-5cdc95f4e1b618f2f3ef10d370ce05a1ac05d9d401aecff3ccbb3d76bd366b6aL1203

* Jacob wants to use FetchContent related features:
   https://github.com/apache/arrow/pull/35921#issuecomment-1585428356


Thanks,


Re: [VOTE] Release Apache Arrow ADBC 0.5.0 - RC0

2023-06-16 Thread Sutou Kouhei
+1

I ran the following on Debian GNU/Linux sid:

  JAVA_HOME=/usr/lib/jvm/default-java \
dev/release/verify-release-candidate.sh 0.5.0 0

with:

  * Python 3.11.2
  * g++ (Debian 12.2.0-14) 12.2.0
  * go version go1.19.8 linux/amd64
  * openjdk version "17.0.6" 2023-01-17
  * ruby 3.1.2p20 (2022-04-12 revision 4491bb740a) [x86_64-linux-gnu]
  * R version 4.3.0 (2023-04-21) -- "Already Tomorrow"


Note: I needed https://github.com/apache/arrow-adbc/pull/810 .


Thanks,
-- 
kou


In <74f01cb9-5c76-4745-b357-4deca0bbd...@app.fastmail.com>
  "[VOTE] Release Apache Arrow ADBC 0.5.0 - RC0" on Thu, 15 Jun 2023 20:06:46 
-0400,
  "David Li"  wrote:

> Hello,
> 
> I would like to propose the following release candidate (RC0) of Apache Arrow 
> ADBC version 0.5.0. This is a release consisting of 36 resolved GitHub issues 
> [1].
> 
> This release candidate is based on commit: 
> ac0e0ef8bd83787f65e53d421fce6ad490d9a37d [2]
> 
> The source release rc0 is hosted at [3].
> The binary artifacts are hosted at [4][5][6][7][8].
> The changelog is located at [9].
> 
> Please download, verify checksums and signatures, run the unit tests, and 
> vote on the release. See [10] for how to validate a release candidate.
> 
> See also a verification result on GitHub Actions [11].
> 
> The vote will be open for at least 72 hours.
> 
> [ ] +1 Release this as Apache Arrow ADBC 0.5.0
> [ ] +0
> [ ] -1 Do not release this as Apache Arrow ADBC 0.5.0 because...
> 
> Note: to verify APT/YUM packages on macOS/AArch64, you must `export 
> DOCKER_DEFAULT_ARCHITECTURE=linux/amd64`. (Or skip this step by `export 
> TEST_APT=0 TEST_YUM=0`.)
> 
> [1]: 
> https://github.com/apache/arrow-adbc/issues?q=is%3Aissue+milestone%3A%22ADBC+Libraries+0.5.0%22+is%3Aclosed
> [2]: 
> https://github.com/apache/arrow-adbc/commit/ac0e0ef8bd83787f65e53d421fce6ad490d9a37d
> [3]: https://dist.apache.org/repos/dist/dev/arrow/apache-arrow-adbc-0.5.0-rc0/
> [4]: https://apache.jfrog.io/artifactory/arrow/almalinux-rc/
> [5]: https://apache.jfrog.io/artifactory/arrow/debian-rc/
> [6]: https://apache.jfrog.io/artifactory/arrow/ubuntu-rc/
> [7]: 
> https://repository.apache.org/content/repositories/staging/org/apache/arrow/adbc/
> [8]: 
> https://github.com/apache/arrow-adbc/releases/tag/apache-arrow-adbc-0.5.0-rc0
> [9]: 
> https://github.com/apache/arrow-adbc/blob/apache-arrow-adbc-0.5.0-rc0/CHANGELOG.md
> [10]: 
> https://arrow.apache.org/adbc/main/development/releasing.html#how-to-verify-release-candidates
> [11]: https://github.com/apache/arrow-adbc/actions/runs/5284608862