I found a licensing issue

https://issues.apache.org/jira/browse/ARROW-6679

It might be worth examining third party code added to the project
since 0.14.x to make sure there are no other such issues.

On Tue, Sep 24, 2019 at 6:10 PM Wes McKinney <wesmck...@gmail.com> wrote:
>
> I have diagnosed the problem (Thrift "string" data must be UTF-8,
> cannot be arbitrary binary) and am working on a patch right now
>
> On Tue, Sep 24, 2019 at 6:02 PM Wes McKinney <wesmck...@gmail.com> wrote:
> >
> > I just opened
> >
> > https://issues.apache.org/jira/browse/ARROW-6678
> >
> > Please don't cut an RC until I have an opportunity to diagnose this,
> > will report back.
> >
> >
> > On Tue, Sep 24, 2019 at 5:51 PM Wes McKinney <wesmck...@gmail.com> wrote:
> > >
> > > I'm investigating a possible Parquet-related compatibility bug that I
> > > encountered through some routine testing / benchmarking. I'll report
> > > back once I figure out what is going on (if anything)
> > >
> > > On Sun, Sep 22, 2019 at 11:51 PM Micah Kornfield <emkornfi...@gmail.com> 
> > > wrote:
> > > >>
> > > >> It's ideal if your GPG key is in the web of trust (i.e. you can get it
> > > >> signed by another PMC member), but is not 100% essential.
> > > >
> > > > That won't be an option for me this week (it seems like I would need to 
> > > > meet one face-to-face).  I'll try to get the GPG checked in and the 
> > > > rest of the pre-requisites done tomorrow (Monday) to hopefully start 
> > > > the release on Tuesday (hopefully we can solve the last 
> > > > blocker/integration tests by then).
> > > >
> > > > On Sat, Sep 21, 2019 at 7:12 PM Wes McKinney <wesmck...@gmail.com> 
> > > > wrote:
> > > >>
> > > >> It's ideal if your GPG key is in the web of trust (i.e. you can get it
> > > >> signed by another PMC member), but is not 100% essential.
> > > >>
> > > >> Speaking of the release, there are at least 2 code changes I still
> > > >> want to get in
> > > >>
> > > >> ARROW-5717
> > > >> ARROW-6353
> > > >>
> > > >> I just pushed updates to ARROW-5717, will merge once the build is 
> > > >> green.
> > > >>
> > > >> There are a couple of Rust patches still marked for 0.15. The rest
> > > >> seems to be documentation and a couple of integration test failures we
> > > >> should see about fixing in time.
> > > >>
> > > >> On Fri, Sep 20, 2019 at 11:26 PM Micah Kornfield 
> > > >> <emkornfi...@gmail.com> wrote:
> > > >> >
> > > >> > Thanks Krisztián and Wes,
> > > >> > I've gone ahead and started registering myself on all the packaging 
> > > >> > sites.
> > > >> >
> > > >> > Is there any review process when adding my GPG key to the SVN file? 
> > > >> > [1]
> > > >> > doesn't seem to mention explicitly.
> > > >> >
> > > >> > Thanks,
> > > >> > Micah
> > > >> >
> > > >> > [1] https://www.apache.org/dev/version-control.html#https-svn
> > > >> >
> > > >> > On Fri, Sep 20, 2019 at 5:01 PM Krisztián Szűcs 
> > > >> > <szucs.kriszt...@gmail.com>
> > > >> > wrote:
> > > >> >
> > > >> > > On Thu, Sep 19, 2019 at 5:52 PM Wes McKinney <wesmck...@gmail.com> 
> > > >> > > wrote:
> > > >> > >
> > > >> > >> On Thu, Sep 19, 2019 at 12:13 AM Micah Kornfield 
> > > >> > >> <emkornfi...@gmail.com>
> > > >> > >> wrote:
> > > >> > >> >>
> > > >> > >> >> The process should be well documented at this point but there 
> > > >> > >> >> are a
> > > >> > >> >> number of steps.
> > > >> > >> >
> > > >> > >> > Is [1] the up-to-date documentation for the release?   Are there
> > > >> > >> instructions for the adding the code signing Key to SVN?
> > > >> > >> >
> > > >> > >> > I will make a go of it.  i will try to mitigate any internet 
> > > >> > >> > issues by
> > > >> > >> doing the process for a cloud instance (I assume that isn't a 
> > > >> > >> problem?).
> > > >> > >> >
> > > >> > >>
> > > >> > >> Setting up a new cloud environment suitable for producing an RC 
> > > >> > >> may be
> > > >> > >> time consuming, but you are welcome to try. Krisztian -- are you
> > > >> > >> available next week to help Micah and potentially take over 
> > > >> > >> producing
> > > >> > >> the RC if there are issues?
> > > >> > >>
> > > >> > > Sure, I'll be available next week. We can also grant access to
> > > >> > > https://github.com/ursa-labs/crossbow because configuring all
> > > >> > > the CI backends can be time consuming.
> > > >> > >
> > > >> > >>
> > > >> > >> > Thanks,
> > > >> > >> > Micah
> > > >> > >> >
> > > >> > >> > [1]
> > > >> > >> https://cwiki.apache.org/confluence/display/ARROW/Release+Management+Guide
> > > >> > >> >
> > > >> > >> > On Wed, Sep 18, 2019 at 8:29 AM Wes McKinney 
> > > >> > >> > <wesmck...@gmail.com>
> > > >> > >> wrote:
> > > >> > >> >>
> > > >> > >> >> The process should be well documented at this point but there 
> > > >> > >> >> are a
> > > >> > >> >> number of steps. Note that you need to add your code signing 
> > > >> > >> >> key to
> > > >> > >> >> the KEYS file in SVN (that's not very hard to do). I think 
> > > >> > >> >> it's fine
> > > >> > >> >> to hand off the process to others after the VOTE but it would 
> > > >> > >> >> be
> > > >> > >> >> tricky to have multiple RMs involved with producing the source 
> > > >> > >> >> and
> > > >> > >> >> binary artifacts for the vote
> > > >> > >> >>
> > > >> > >> >> On Tue, Sep 17, 2019 at 10:55 PM Micah Kornfield <
> > > >> > >> emkornfi...@gmail.com> wrote:
> > > >> > >> >> >
> > > >> > >> >> > SGTM, as well.
> > > >> > >> >> >
> > > >> > >> >> > I should have a little bit of time next week if I can help 
> > > >> > >> >> > as RM but
> > > >> > >> I have
> > > >> > >> >> > a couple of concerns:
> > > >> > >> >> > 1.  In the past I've had trouble downloading and validating
> > > >> > >> releases. I'm a
> > > >> > >> >> > bit worried, that I might have similar problems doing the 
> > > >> > >> >> > necessary
> > > >> > >> uploads.
> > > >> > >> >> > 2.  My internet connection will likely be not great, I don't 
> > > >> > >> >> > know if
> > > >> > >> this
> > > >> > >> >> > would make it even less likely to be successful.
> > > >> > >> >> >
> > > >> > >> >> > Does it become problematic if somehow I would have to 
> > > >> > >> >> > abandon the
> > > >> > >> process
> > > >> > >> >> > mid-release?  Is there anyone who could serve as a backup?  
> > > >> > >> >> > Are the
> > > >> > >> steps
> > > >> > >> >> > well documented?
> > > >> > >> >> >
> > > >> > >> >> > Thanks,
> > > >> > >> >> > Micah
> > > >> > >> >> >
> > > >> > >> >> > On Tue, Sep 17, 2019 at 4:25 PM Neal Richardson <
> > > >> > >> neal.p.richard...@gmail.com>
> > > >> > >> >> > wrote:
> > > >> > >> >> >
> > > >> > >> >> > > Sounds good to me.
> > > >> > >> >> > >
> > > >> > >> >> > > Do we have a release manager yet? Any volunteers?
> > > >> > >> >> > >
> > > >> > >> >> > > Neal
> > > >> > >> >> > >
> > > >> > >> >> > > On Tue, Sep 17, 2019 at 4:06 PM Wes McKinney 
> > > >> > >> >> > > <wesmck...@gmail.com>
> > > >> > >> wrote:
> > > >> > >> >> > >
> > > >> > >> >> > > > hi all,
> > > >> > >> >> > > >
> > > >> > >> >> > > > It looks like we're drawing close to be able to make the 
> > > >> > >> >> > > > 0.15.0
> > > >> > >> >> > > > release. I would suggest "pencils down" at the end of 
> > > >> > >> >> > > > this week
> > > >> > >> and
> > > >> > >> >> > > > see if a release candidate can be produced next Monday 
> > > >> > >> >> > > > September
> > > >> > >> 23.
> > > >> > >> >> > > > Any thoughts or objections?
> > > >> > >> >> > > >
> > > >> > >> >> > > > Thanks,
> > > >> > >> >> > > > Wes
> > > >> > >> >> > > >
> > > >> > >> >> > > > On Wed, Sep 11, 2019 at 11:23 AM Wes McKinney <
> > > >> > >> wesmck...@gmail.com>
> > > >> > >> >> > > wrote:
> > > >> > >> >> > > > >
> > > >> > >> >> > > > > hi Eric -- yes, that's correct. I'm planning to amend 
> > > >> > >> >> > > > > the
> > > >> > >> Format docs
> > > >> > >> >> > > > > today regarding the EOS issue and also update the C++ 
> > > >> > >> >> > > > > library
> > > >> > >> >> > > > >
> > > >> > >> >> > > > > On Wed, Sep 11, 2019 at 11:21 AM Eric Erhardt
> > > >> > >> >> > > > > <eric.erha...@microsoft.com> wrote:
> > > >> > >> >> > > > > >
> > > >> > >> >> > > > > > I assume the plan is to merge the
> > > >> > >> ARROW-6313-flatbuffer-alignment
> > > >> > >> >> > > > branch into master before the 0.15 release, correct?
> > > >> > >> >> > > > > >
> > > >> > >> >> > > > > > BTW - I believe the C# alignment changes are ready 
> > > >> > >> >> > > > > > to be
> > > >> > >> merged into
> > > >> > >> >> > > > the alignment branch -
> > > >> > >> https://github.com/apache/arrow/pull/5280/
> > > >> > >> >> > > > > >
> > > >> > >> >> > > > > > Eric
> > > >> > >> >> > > > > >
> > > >> > >> >> > > > > > -----Original Message-----
> > > >> > >> >> > > > > > From: Micah Kornfield <emkornfi...@gmail.com>
> > > >> > >> >> > > > > > Sent: Tuesday, September 10, 2019 10:24 PM
> > > >> > >> >> > > > > > To: Wes McKinney <wesmck...@gmail.com>
> > > >> > >> >> > > > > > Cc: dev <dev@arrow.apache.org>; niki.lj 
> > > >> > >> >> > > > > > <niki...@aliyun.com>
> > > >> > >> >> > > > > > Subject: Re: Timeline for 0.15.0 release
> > > >> > >> >> > > > > >
> > > >> > >> >> > > > > > I should have a little more bandwidth to help with 
> > > >> > >> >> > > > > > some of
> > > >> > >> the
> > > >> > >> >> > > > packaging starting tomorrow and going into the weekend.
> > > >> > >> >> > > > > >
> > > >> > >> >> > > > > > On Tuesday, September 10, 2019, Wes McKinney <
> > > >> > >> wesmck...@gmail.com>
> > > >> > >> >> > > > wrote:
> > > >> > >> >> > > > > >
> > > >> > >> >> > > > > > > Hi folks,
> > > >> > >> >> > > > > > >
> > > >> > >> >> > > > > > > With the state of nightly packaging and 
> > > >> > >> >> > > > > > > integration builds
> > > >> > >> things
> > > >> > >> >> > > > > > > aren't looking too good for being in release 
> > > >> > >> >> > > > > > > readiness by
> > > >> > >> the end
> > > >> > >> >> > > of
> > > >> > >> >> > > > > > > this week but maybe I'm wrong. I'm planning to be 
> > > >> > >> >> > > > > > > working
> > > >> > >> to close
> > > >> > >> >> > > as
> > > >> > >> >> > > > > > > many issues as I can and also to help with the 
> > > >> > >> >> > > > > > > ongoing
> > > >> > >> alignment
> > > >> > >> >> > > > fixes.
> > > >> > >> >> > > > > > >
> > > >> > >> >> > > > > > > Wes
> > > >> > >> >> > > > > > >
> > > >> > >> >> > > > > > > On Thu, Sep 5, 2019, 11:07 PM Micah Kornfield <
> > > >> > >> >> > > emkornfi...@gmail.com
> > > >> > >> >> > > > >
> > > >> > >> >> > > > > > > wrote:
> > > >> > >> >> > > > > > >
> > > >> > >> >> > > > > > >> Just for reference [1] has a dashboard of the 
> > > >> > >> >> > > > > > >> current
> > > >> > >> issues:
> > > >> > >> >> > > > > > >>
> > > >> > >> >> > > > > > >>
> > > >> > >> >> > > >
> > > >> > >> https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fcwi
> > > >> > >> >> > > > > > >> ki.apache.org
> > > >> > >> >> > > > %2Fconfluence%2Fdisplay%2FARROW%2FArrow%2B0.15.0%2BRelea
> > > >> > >> >> > > > > > >> se&amp;data=02%7C01%7CEric.Erhardt%40microsoft.com
> > > >> > >> >> > > > %7Ccbead81a42104034
> > > >> > >> >> > > > > > >>
> > > >> > >> >> > > >
> > > >> > >> a4f308d736678a45%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C6370376
> > > >> > >> >> > > > > > >>
> > > >> > >> >> > > >
> > > >> > >> 90648216338&amp;sdata=0Upux3i%2B9X6f8uanGKSGM5VYxR6c2ADWrxSPi1%2FgbH4
> > > >> > >> >> > > > > > >> %3D&amp;reserved=0
> > > >> > >> >> > > > > > >>
> > > >> > >> >> > > > > > >> On Thu, Sep 5, 2019 at 3:43 PM Wes McKinney <
> > > >> > >> wesmck...@gmail.com>
> > > >> > >> >> > > > wrote:
> > > >> > >> >> > > > > > >>
> > > >> > >> >> > > > > > >>> hi all,
> > > >> > >> >> > > > > > >>>
> > > >> > >> >> > > > > > >>> It doesn't seem like we're going to be in a 
> > > >> > >> >> > > > > > >>> position to
> > > >> > >> release
> > > >> > >> >> > > at
> > > >> > >> >> > > > > > >>> the beginning of next week. I hope that one more 
> > > >> > >> >> > > > > > >>> week of
> > > >> > >> work (or
> > > >> > >> >> > > > > > >>> less) will be enough to get us there. Aside from 
> > > >> > >> >> > > > > > >>> merging
> > > >> > >> the
> > > >> > >> >> > > > > > >>> alignment changes, we need to make sure that our
> > > >> > >> packaging jobs
> > > >> > >> >> > > > > > >>> required for the release candidate are all 
> > > >> > >> >> > > > > > >>> working.
> > > >> > >> >> > > > > > >>>
> > > >> > >> >> > > > > > >>> If folks could remove issues from the 0.15.0 
> > > >> > >> >> > > > > > >>> backlog
> > > >> > >> that they
> > > >> > >> >> > > > don't
> > > >> > >> >> > > > > > >>> think they will finish by end of next week that 
> > > >> > >> >> > > > > > >>> would
> > > >> > >> help focus
> > > >> > >> >> > > > > > >>> efforts (there are currently 78 issues in 0.15.0 
> > > >> > >> >> > > > > > >>> still).
> > > >> > >> I am
> > > >> > >> >> > > > > > >>> looking to tackle a few small features related to
> > > >> > >> dictionaries
> > > >> > >> >> > > > while
> > > >> > >> >> > > > > > >>> the release window is still open.
> > > >> > >> >> > > > > > >>>
> > > >> > >> >> > > > > > >>> - Wes
> > > >> > >> >> > > > > > >>>
> > > >> > >> >> > > > > > >>> On Tue, Aug 27, 2019 at 3:48 PM Wes McKinney <
> > > >> > >> >> > > wesmck...@gmail.com>
> > > >> > >> >> > > > > > >>> wrote:
> > > >> > >> >> > > > > > >>> >
> > > >> > >> >> > > > > > >>> > hi,
> > > >> > >> >> > > > > > >>> >
> > > >> > >> >> > > > > > >>> > I think we should try to release the week of 
> > > >> > >> >> > > > > > >>> > September
> > > >> > >> 9, so
> > > >> > >> >> > > > > > >>> > development work should be completed by end of 
> > > >> > >> >> > > > > > >>> > next
> > > >> > >> week.
> > > >> > >> >> > > > > > >>> >
> > > >> > >> >> > > > > > >>> > Does that seem reasonable?
> > > >> > >> >> > > > > > >>> >
> > > >> > >> >> > > > > > >>> > I plan to get up a patch for the protocol 
> > > >> > >> >> > > > > > >>> > alignment
> > > >> > >> changes for
> > > >> > >> >> > > > > > >>> > C++ in the next couple of days -- I think that 
> > > >> > >> >> > > > > > >>> > getting
> > > >> > >> the
> > > >> > >> >> > > > > > >>> > alignment work done is the main barrier to 
> > > >> > >> >> > > > > > >>> > releasing.
> > > >> > >> >> > > > > > >>> >
> > > >> > >> >> > > > > > >>> > Thanks
> > > >> > >> >> > > > > > >>> > Wes
> > > >> > >> >> > > > > > >>> >
> > > >> > >> >> > > > > > >>> > On Mon, Aug 19, 2019 at 12:25 PM Ji Liu
> > > >> > >> >> > > > > > >>> > <niki...@aliyun.com.invalid>
> > > >> > >> >> > > > > > >>> wrote:
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > > > >>> > > Hi, Wes, on the java side, I can think of 
> > > >> > >> >> > > > > > >>> > > several
> > > >> > >> bugs that
> > > >> > >> >> > > > need
> > > >> > >> >> > > > > > >>> > > to
> > > >> > >> >> > > > > > >>> be fixed or reminded.
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > > > >>> > > i. ARROW-6040: Dictionary entries are 
> > > >> > >> >> > > > > > >>> > > required in
> > > >> > >> IPC streams
> > > >> > >> >> > > > > > >>> > > even
> > > >> > >> >> > > > > > >>> when empty[1]
> > > >> > >> >> > > > > > >>> > > This one is under review now, however 
> > > >> > >> >> > > > > > >>> > > through this
> > > >> > >> PR we find
> > > >> > >> >> > > > > > >>> > > that
> > > >> > >> >> > > > > > >>> there seems a bug in java reading and writing
> > > >> > >> dictionaries in IPC
> > > >> > >> >> > > > > > >>> which is Inconsistent with spec[2] since it 
> > > >> > >> >> > > > > > >>> assumes all
> > > >> > >> >> > > > dictionaries
> > > >> > >> >> > > > > > >>> are at the start of stream (see details in PR 
> > > >> > >> >> > > > > > >>> comments,
> > > >> > >> and this
> > > >> > >> >> > > > > > >>> fix may not catch up with version 0.15). @Micah 
> > > >> > >> >> > > > > > >>> Kornfield
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > > > >>> > > ii. ARROW-1875: Write 64-bit ints as strings 
> > > >> > >> >> > > > > > >>> > > in
> > > >> > >> integration
> > > >> > >> >> > > > test
> > > >> > >> >> > > > > > >>> JSON files[3]
> > > >> > >> >> > > > > > >>> > > Java side code already checked in, other
> > > >> > >> implementations
> > > >> > >> >> > > seems
> > > >> > >> >> > > > not.
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > > > >>> > > iii. ARROW-6202: OutOfMemory in 
> > > >> > >> >> > > > > > >>> > > JdbcAdapter[4]
> > > >> > >> Caused by
> > > >> > >> >> > > trying
> > > >> > >> >> > > > > > >>> > > to load all records in one contiguous batch, 
> > > >> > >> >> > > > > > >>> > > fixed
> > > >> > >> >> > > > > > >>> by providing iterator API for iteratively 
> > > >> > >> >> > > > > > >>> reading in
> > > >> > >> >> > > ARROW-6219[5].
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > > > >>> > > Thanks,
> > > >> > >> >> > > > > > >>> > > Ji Liu
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > > > >>> > > [1]
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > 2Fgithub.com%2Fapache%2Farrow%2Fpull%2F4960&amp;data=02%7C01%7CE
> > > >> > >> >> > > > > > >>> > > ric.Erhardt%40microsoft.com
> > > >> > >> >> > > > %7Ccbead81a42104034a4f308d736678a45%7
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637037690648216338&a
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > mp;sdata=eDF%2FAsJmVs7WjfEuNBYo%2F1TypIN44xx1TTlK6kQHZVg%3D&amp;
> > > >> > >> >> > > > > > >>> > > reserved=0 [2]
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%
> > > >> > >> >> > > > > > >>> > > 2Farrow.apache.org
> > > >> > >> >> > > > %2Fdocs%2Fipc.html&amp;data=02%7C01%7CEric.Erh
> > > >> > >> >> > > > > > >>> > > ardt%40microsoft.com
> > > >> > >> >> > > > %7Ccbead81a42104034a4f308d736678a45%7C72f988
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > bf86f141af91ab2d7cd011db47%7C1%7C0%7C637037690648216338&amp;sdat
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > a=H0pM8bVKsOyeORDhHxLlS%2BpaS%2F5meT52wxTKmNssuMk%3D&amp;reserve
> > > >> > >> >> > > > > > >>> > > d=0 [3]
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%
> > > >> > >> >> > > > > > >>> > > 2Fissues.apache.org
> > > >> > >> >> > > > %2Fjira%2Fbrowse%2FARROW-1875&amp;data=02%7C0
> > > >> > >> >> > > > > > >>> > > 1%7CEric.Erhardt%40microsoft.com
> > > >> > >> >> > > > %7Ccbead81a42104034a4f308d736678
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > a45%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C637037690648216
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > 338&amp;sdata=coTpuoEGhfjyOSBTagdlohOTX24DQZmtbWC0gYsDmkM%3D&amp
> > > >> > >> >> > > > > > >>> > > ;reserved=0 [4]
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%
> > > >> > >> >> > > > > > >>> > > 2Fissues.apache.org
> > > >> > >> >> > > > %2Fjira%2Fbrowse%2FARROW-6202%5B5&amp;data=02
> > > >> > >> >> > > > > > >>> > > %7C01%7CEric.Erhardt%40microsoft.com
> > > >> > >> >> > > > %7Ccbead81a42104034a4f308d73
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > 6678a45%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C63703769064
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > 8216338&amp;sdata=gnyUMk8cUgwc802QBLF3eAp3mznYwonlbF0qmGyzgmY%3D
> > > >> > >> >> > > > > > >>> > > &amp;reserved=0]
> > > >> > >> >> > > > > > >>>
> > > >> > >> >> > > >
> > > >> > >> https://nam06.safelinks.protection.outlook.com/?url=https%3A%2F%2Fis
> > > >> > >> >> > > > > > >>> sues.apache.org
> > > >> > >> >> > > > %2Fjira%2Fbrowse%2FARROW-6219&amp;data=02%7C01%7CEric
> > > >> > >> >> > > > > > >>> .Erhardt%40microsoft.com
> > > >> > >> >> > > > %7Ccbead81a42104034a4f308d736678a45%7C72f988
> > > >> > >> >> > > > > > >>>
> > > >> > >> >> > > >
> > > >> > >> bf86f141af91ab2d7cd011db47%7C1%7C0%7C637037690648216338&amp;sdata=d3
> > > >> > >> >> > > > > > >>>
> > > >> > >> LF%2BTeWSprASqO%2ByE4LywlsULHGcb1Iq%2F2byHrEPkY%3D&amp;reserved=0
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > ----------------------------------------------------------------
> > > >> > >> >> > > > > > >>> > > -- From:Wes McKinney <wesmck...@gmail.com> 
> > > >> > >> >> > > > > > >>> > > Send
> > > >> > >> >> > > > > > >>> > > Time:2019年8月19日(星期一) 23:03 To:dev <
> > > >> > >> dev@arrow.apache.org>
> > > >> > >> >> > > > > > >>> > > Subject:Re: Timeline for 0.15.0 release
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > > > >>> > > I'm going to work some on organizing the 
> > > >> > >> >> > > > > > >>> > > 0.15.0
> > > >> > >> backlog some
> > > >> > >> >> > > > > > >>> > > this week, if anyone wants to help with 
> > > >> > >> >> > > > > > >>> > > grooming
> > > >> > >> >> > > (particularly
> > > >> > >> >> > > > > > >>> > > for languages other than C++/Python where I'm
> > > >> > >> focusing) that
> > > >> > >> >> > > > > > >>> > > would be helpful. There have been almost 500 
> > > >> > >> >> > > > > > >>> > > JIRA
> > > >> > >> issues
> > > >> > >> >> > > opened
> > > >> > >> >> > > > > > >>> > > since the
> > > >> > >> >> > > > > > >>> > > 0.14.0 release, so we should make sure to 
> > > >> > >> >> > > > > > >>> > > check
> > > >> > >> whether
> > > >> > >> >> > > there's
> > > >> > >> >> > > > > > >>> > > any regressions or other serious bugs that 
> > > >> > >> >> > > > > > >>> > > we should
> > > >> > >> try to
> > > >> > >> >> > > fix
> > > >> > >> >> > > > > > >>> > > for 0.15.0.
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > > > >>> > > On Thu, Aug 15, 2019 at 6:23 PM Wes McKinney
> > > >> > >> >> > > > > > >>> > > <wesmck...@gmail.com>
> > > >> > >> >> > > > > > >>> wrote:
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > > > >>> > > > The Windows wheel issue in 0.14.1 seems to 
> > > >> > >> >> > > > > > >>> > > > be
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2
> > > >> > >> >> > > > > > >>> > > > F%2Fissues.apache.org
> > > >> > >> >> > > > %2Fjira%2Fbrowse%2FARROW-6015&amp;data=02
> > > >> > >> >> > > > > > >>> > > > %7C01%7CEric.Erhardt%40microsoft.com
> > > >> > >> >> > > > %7Ccbead81a42104034a4f308d
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > 736678a45%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C6370376
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > 90648216338&amp;sdata=D9lqHR16oRAFlPaIrcXq3UtW%2BLuJQW1u0Gom2u
> > > >> > >> >> > > > > > >>> > > > WEWg0%3D&amp;reserved=0
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > > > >>> > > > I think the root cause could be the Windows
> > > >> > >> changes in
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > F%2Fgithub.com%2Fapache%2Farrow%2Fcommit%2F&amp;data=02%7C01%7
> > > >> > >> >> > > > > > >>> > > > CEric.Erhardt%40microsoft.com
> > > >> > >> >> > > > %7Ccbead81a42104034a4f308d736678a
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > 45%7C72f988bf86f141af91ab2d7cd011db47%7C1%7C0%7C63703769064821
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > 6338&amp;sdata=iPmFB%2BncIbmvp5D31vjB4A2KyuMP%2B83Vp7%2BDiOxvl
> > > >> > >> >> > > > > > >>> > > > bs%3D&amp;reserved=0
> > > >> > >> >> > > > > > >>> 223ae744cc2a12c60cecb5db593263a03c13f85a
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > > > >>> > > > I would be appreciative if a volunteer 
> > > >> > >> >> > > > > > >>> > > > would look
> > > >> > >> into what
> > > >> > >> >> > > > > > >>> > > > was
> > > >> > >> >> > > > > > >>> wrong
> > > >> > >> >> > > > > > >>> > > > with the 0.14.1 wheels on Windows. 
> > > >> > >> >> > > > > > >>> > > > Otherwise
> > > >> > >> 0.15.0 Windows
> > > >> > >> >> > > > > > >>> > > > wheels will be broken, too
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > > > >>> > > > The bad wheels can be found at
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > https://nam06.safelinks.protection.outlook.com/?url=https%3A%2
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > F%2Fbintray.com%2Fapache%2Farrow%2Fpython%23files%2Fpython%252
> > > >> > >> >> > > > > > >>> > > > F0.14.1&amp;data=02%7C01%7CEric.Erhardt%
> > > >> > >> 40microsoft.com
> > > >> > >> >> > > > %7Ccbea
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > d81a42104034a4f308d736678a45%7C72f988bf86f141af91ab2d7cd011db4
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > 7%7C1%7C0%7C637037690648216338&amp;sdata=vZzx4HNS9qp2UWhFagqfJ
> > > >> > >> >> > > > > > >>> > > > zbY%2BGzwspH1TO3wdfrbA6Y%3D&amp;reserved=0
> > > >> > >> >> > > > > > >>> > > >
> > > >> > >> >> > > > > > >>> > > > On Thu, Aug 15, 2019 at 1:28 PM Antoine 
> > > >> > >> >> > > > > > >>> > > > Pitrou <
> > > >> > >> >> > > > > > >>> solip...@pitrou.net> wrote:
> > > >> > >> >> > > > > > >>> > > > >
> > > >> > >> >> > > > > > >>> > > > > On Thu, 15 Aug 2019 11:17:07 -0700 Micah
> > > >> > >> Kornfield
> > > >> > >> >> > > > > > >>> > > > > <emkornfi...@gmail.com> wrote:
> > > >> > >> >> > > > > > >>> > > > > > >
> > > >> > >> >> > > > > > >>> > > > > > > In C++ they are
> > > >> > >> >> > > > > > >>> > > > > > > independent, we could have 32-bit 
> > > >> > >> >> > > > > > >>> > > > > > > array
> > > >> > >> lengths and
> > > >> > >> >> > > > > > >>> variable-length
> > > >> > >> >> > > > > > >>> > > > > > > types with 64-bit offsets if we 
> > > >> > >> >> > > > > > >>> > > > > > > wanted (we
> > > >> > >> just
> > > >> > >> >> > > > wouldn't
> > > >> > >> >> > > > > > >>> > > > > > > be
> > > >> > >> >> > > > > > >>> able to
> > > >> > >> >> > > > > > >>> > > > > > > have a List child with more than 
> > > >> > >> >> > > > > > >>> > > > > > > INT32_MAX
> > > >> > >> elements).
> > > >> > >> >> > > > > > >>> > > > > >
> > > >> > >> >> > > > > > >>> > > > > > I think the point is we could do this 
> > > >> > >> >> > > > > > >>> > > > > > in C++
> > > >> > >> but we
> > > >> > >> >> > > > don't.
> > > >> > >> >> > > > > > >>> I'm not sure we
> > > >> > >> >> > > > > > >>> > > > > > would have introduced the "Large" 
> > > >> > >> >> > > > > > >>> > > > > > types if we
> > > >> > >> did.
> > > >> > >> >> > > > > > >>> > > > >
> > > >> > >> >> > > > > > >>> > > > > 64-bit offsets take twice as much space 
> > > >> > >> >> > > > > > >>> > > > > as 32-bit
> > > >> > >> >> > > offsets,
> > > >> > >> >> > > > > > >>> > > > > so if
> > > >> > >> >> > > > > > >>> you're
> > > >> > >> >> > > > > > >>> > > > > storing lots of small-ish lists or 
> > > >> > >> >> > > > > > >>> > > > > strings,
> > > >> > >> 32-bit
> > > >> > >> >> > > offsets
> > > >> > >> >> > > > > > >>> > > > > are preferrable.  So even with 64-bit 
> > > >> > >> >> > > > > > >>> > > > > array
> > > >> > >> lengths from
> > > >> > >> >> > > > the
> > > >> > >> >> > > > > > >>> > > > > start
> > > >> > >> >> > > > > > >>> it would
> > > >> > >> >> > > > > > >>> > > > > still be beneficial to have types with 
> > > >> > >> >> > > > > > >>> > > > > 32-bit
> > > >> > >> offsets.
> > > >> > >> >> > > > > > >>> > > > >
> > > >> > >> >> > > > > > >>> > > > > > Going with the limited address space 
> > > >> > >> >> > > > > > >>> > > > > > in Java
> > > >> > >> and
> > > >> > >> >> > > calling
> > > >> > >> >> > > > > > >>> > > > > > it a
> > > >> > >> >> > > > > > >>> reference
> > > >> > >> >> > > > > > >>> > > > > > implementation seems suboptimal. If a 
> > > >> > >> >> > > > > > >>> > > > > > consumer
> > > >> > >> uses a
> > > >> > >> >> > > > "Large"
> > > >> > >> >> > > > > > >>> type
> > > >> > >> >> > > > > > >>> > > > > > presumably it is because they need the 
> > > >> > >> >> > > > > > >>> > > > > > ability
> > > >> > >> to store
> > > >> > >> >> > > > > > >>> > > > > > more
> > > >> > >> >> > > > > > >>> than INT32_MAX
> > > >> > >> >> > > > > > >>> > > > > > child elements in a column, otherwise 
> > > >> > >> >> > > > > > >>> > > > > > it is
> > > >> > >> just
> > > >> > >> >> > > wasting
> > > >> > >> >> > > > > > >>> > > > > > space
> > > >> > >> >> > > > > > >>> [1].
> > > >> > >> >> > > > > > >>> > > > >
> > > >> > >> >> > > > > > >>> > > > > Probably. Though if the individual 
> > > >> > >> >> > > > > > >>> > > > > elements
> > > >> > >> (lists or
> > > >> > >> >> > > > > > >>> > > > > strings)
> > > >> > >> >> > > > > > >>> are
> > > >> > >> >> > > > > > >>> > > > > large, not much space is wasted in 
> > > >> > >> >> > > > > > >>> > > > > proportion,
> > > >> > >> so it may
> > > >> > >> >> > > be
> > > >> > >> >> > > > > > >>> simpler in
> > > >> > >> >> > > > > > >>> > > > > such a case to always create a "Large" 
> > > >> > >> >> > > > > > >>> > > > > type
> > > >> > >> array.
> > > >> > >> >> > > > > > >>> > > > >
> > > >> > >> >> > > > > > >>> > > > > > [1] I suppose theoretically there 
> > > >> > >> >> > > > > > >>> > > > > > might be some
> > > >> > >> >> > > > > > >>> > > > > > performance
> > > >> > >> >> > > > > > >>> benefits on
> > > >> > >> >> > > > > > >>> > > > > > 64-bit architectures to using the 
> > > >> > >> >> > > > > > >>> > > > > > native word
> > > >> > >> sizes.
> > > >> > >> >> > > > > > >>> > > > >
> > > >> > >> >> > > > > > >>> > > > > Concretely, common 64-bit architectures 
> > > >> > >> >> > > > > > >>> > > > > don't do
> > > >> > >> that, as
> > > >> > >> >> > > > > > >>> > > > > 32-bit
> > > >> > >> >> > > > > > >>> is an
> > > >> > >> >> > > > > > >>> > > > > extremely common integer size even in
> > > >> > >> high-performance
> > > >> > >> >> > > > code.
> > > >> > >> >> > > > > > >>> > > > >
> > > >> > >> >> > > > > > >>> > > > > Regards
> > > >> > >> >> > > > > > >>> > > > >
> > > >> > >> >> > > > > > >>> > > > > Antoine.
> > > >> > >> >> > > > > > >>> > > > >
> > > >> > >> >> > > > > > >>> > > > >
> > > >> > >> >> > > > > > >>> > >
> > > >> > >> >> > > > > > >>>
> > > >> > >> >> > > > > > >>
> > > >> > >> >> > > >
> > > >> > >> >> > >
> > > >> > >>
> > > >> > >

Reply via email to