Re: C++ Migrate from Arrow 0.16.0

Micah Kornfield Wed, 02 Jun 2021 21:43:09 -0700

I think the one place where it might break is for Union types (I seem to
recall a breaking change just prior to 1.0).


On Wed, Jun 2, 2021 at 1:00 PM Antoine Pitrou <anto...@python.org> wrote:

>
> Le 02/06/2021 à 21:57, Rares Vernica a écrit :
> > Thanks for the pointers! The migration is going well.
> >
> > We have been using Arrow 0.16.0 RecordBatchStreamWriter
> > <
> https://github.com/Paradigm4/bridge/blob/master/src/PhysicalXSave.cpp#L450
> >
> > with & without CompressedOutputStream and wrote the resulting Arrow
> Buffer
> > data to S3
> > <https://github.com/Paradigm4/bridge/blob/master/src/S3Driver.cpp#L168>
> or file
> > system
> > <https://github.com/Paradigm4/bridge/blob/master/src/FSDriver.cpp#L156>.
> We
> > have a sizable amount of data saved this way.
> >
> > Once we upgrade our C++ code to use Arrow 3.0.0 or 4.0.0, will it be
> > possible to read the Arrow steam files written with Arrow 0.16.0?
>
> It definitely should, unless it's a bug.
> That said, for extra safety, I suggest you test loading the files before
> doing the final migration.
>
> By the way, for saving lots of data to S3, it may be more efficient to
> use Parquet. It will be more CPU-intensive but will result in
> significant space savings.
>
> Regards
>
> Antoine.
>

Re: C++ Migrate from Arrow 0.16.0

Reply via email to