[GitHub] [spark] MaxGekk commented on pull request #28481: [SPARK-31665][SQL][TESTS] Check parquet dictionary encoding of random dates/timestamps

2020-05-11 Thread GitBox


MaxGekk commented on pull request #28481:
URL: https://github.com/apache/spark/pull/28481#issuecomment-626516747


   jenkins, retest this, please



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] MaxGekk commented on pull request #28481: [SPARK-31665][SQL][TESTS] Check parquet dictionary encoding of random dates/timestamps

2020-05-10 Thread GitBox


MaxGekk commented on pull request #28481:
URL: https://github.com/apache/spark/pull/28481#issuecomment-626464195


   jenkins, retest this, please 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] MaxGekk commented on pull request #28481: [SPARK-31665][SQL][TESTS] Check parquet dictionary encoding of random dates/timestamps

2020-05-09 Thread GitBox


MaxGekk commented on pull request #28481:
URL: https://github.com/apache/spark/pull/28481#issuecomment-626215784


   The PR https://github.com/apache/spark/pull/28486 fixes the build failure 
https://github.com/apache/spark/pull/28481#issuecomment-626034381



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] MaxGekk commented on pull request #28481: [SPARK-31665][SQL][TESTS] Check parquet dictionary encoding of random dates/timestamps

2020-05-09 Thread GitBox


MaxGekk commented on pull request #28481:
URL: https://github.com/apache/spark/pull/28481#issuecomment-626135117


   I will exclude the date 1000-02-29 because:
   1. The test RowEncoderSuite.encode/decode can fail before this PR
   2. The main goal of this PR is to test dictionary encoding that is triggered 
when there are duplicates in DATE/TIMESTAMP columns



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] MaxGekk commented on pull request #28481: [SPARK-31665][SQL][TESTS] Check parquet dictionary encoding of random dates/timestamps

2020-05-09 Thread GitBox


MaxGekk commented on pull request #28481:
URL: https://github.com/apache/spark/pull/28481#issuecomment-626134799


   Here is JIRA ticket for [the test 
failure](https://github.com/apache/spark/pull/28481#issuecomment-626034381) 
https://issues.apache.org/jira/browse/SPARK-31669



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] MaxGekk commented on pull request #28481: [SPARK-31665][SQL][TESTS] Check parquet dictionary encoding of random dates/timestamps

2020-05-08 Thread GitBox


MaxGekk commented on pull request #28481:
URL: https://github.com/apache/spark/pull/28481#issuecomment-626106419


   The roundtrip test RowEncoderSuite.encode/decode fails 
https://github.com/apache/spark/pull/28481#issuecomment-626034381 on 1000-02-29 
because the date doesn't exist in Proleptic Gregorian calendar, and Spark 
shifts it to the next valid date 1000-03-01. The test didn't fail before so 
often because the probability of generating this date was pretty low.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] MaxGekk commented on pull request #28481: [SPARK-31665][SQL][TESTS] Check parquet dictionary encoding of random dates/timestamps

2020-05-08 Thread GitBox


MaxGekk commented on pull request #28481:
URL: https://github.com/apache/spark/pull/28481#issuecomment-625967911


   This is some kind of companion checks for 
https://github.com/apache/spark/pull/28479 when date rebasing is off.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] MaxGekk commented on pull request #28481: [SPARK-31665][SQL][TESTS] Check parquet dictionary encoding of random dates/timestamps

2020-05-08 Thread GitBox


MaxGekk commented on pull request #28481:
URL: https://github.com/apache/spark/pull/28481#issuecomment-62594


   @cloud-fan @HyukjinKwon @srowen @dongjoon-hyun May I ask you to review this 
PR.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org