> first? Or are we going to use low-level physical types directly and add
> Spark-specific metadata to Parquet/Orc files?
>
> On Wed, Feb 20, 2019 at 10:57 PM Zoltan Ivanfi
> wrote:
>
> > Hi,
> >
> > Last december we shared a timestamp harmonization pro
Hi,
One of the feedbacks I got for the SQL timestamp type harmonization
proposal was that I should reach out the file format communities as
well. For this purpose I created a separate document from their
perspective and sent it to the Avro, ORC, Parquet, Arrow, Kudu and
Iceberg developer lists.
Dear All,
I would like to thank every reviewer of the consistent timestamps
proposal[1] for their time and valuable comments. Based on your
feedback, I have updated the proposal. The changes include
clarifications, fixes and other improvements as summarized at the end
of the document, in the
Hi,
I have seen this happening for other e-mail addresses and on other mailing
lists as well. I may be wrong, but I would suppose it is a deliberate
anti-spam measure.
Zoltan
On Tue, Jun 26, 2018 at 6:23 PM Michael Brown
wrote:
> Hi,
>
> For some reason mail to this list from users
that the
> whole row group can be included. The addition of NaNs doesn't change
> that.
>
> OTOH, if b <= a <= c, then we have to check the whole row group, and
> the addition of NaNs doesn't change that.
>
> On Tue, Feb 20, 2018 at 9:14 AM, Alexander Behm <alex.b...@cloude
handle similar
> > issues around NaNs/infinity (or infinities, in the case of IEEE-754).
> >
> > Thanks,
> >
> > - LaszloG
> >
> >
> > On Thu, Feb 15, 2018 at 5:10 PM, Zoltan Ivanfi <z...@cloudera.com> wrote:
> >
> > > Dea
Dear Parquet and Impala Developers,
We have exposed min/max statistics to extensive compatibility testing and
found troubling inconsistencies regarding float and double values. Under
certain (fortunately rather extreme) circumstances, this can lead to
predicate pushdown incorrectly discarding row