[GitHub] [spark] dongjoon-hyun commented on pull request #26804: [SPARK-26346][BUILD][SQL] Upgrade parquet to 1.11.0

2020-05-19 Thread GitBox


dongjoon-hyun commented on pull request #26804:
URL: https://github.com/apache/spark/pull/26804#issuecomment-630997965


   @h-vetinari . This is wrong, isn't it? Did someone (except you) say it's low 
priority here? We want that, but currently it looks infeasible technically. Do 
you think that all infeasible things are low priority?
   > I'm surprised (without criticism!) that this has a seemingly low priority



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun commented on pull request #26804: [SPARK-26346][BUILD][SQL] Upgrade parquet to 1.11.0

2020-05-19 Thread GitBox


dongjoon-hyun commented on pull request #26804:
URL: https://github.com/apache/spark/pull/26804#issuecomment-630904663


   @h-vetinari . Parquet is a de-facto standard in Apache Spark and is related 
to all the other module. That's the reason why Parquet should not break 
anything in all the other Spark modules. It's the same for the other libraries. 
Apache Spark uses Apache Hadoop 2.7.3/2.7.4 for a long time and still it's the 
default Hadoop. Apache Spark uses unofficial Hive 1.2.1 fork for a long time 
and still couldn't remove it.
   
   Please feel free to open a working PR. Then, the community will welcome.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] dongjoon-hyun commented on pull request #26804: [SPARK-26346][BUILD][SQL] Upgrade parquet to 1.11.0

2020-05-18 Thread GitBox


dongjoon-hyun commented on pull request #26804:
URL: https://github.com/apache/spark/pull/26804#issuecomment-630471244


   @iemejia 's Avro PR (#27609) didn't pass Apache Spark UTs. And, according to 
his report, this Parquet PR seems to be blocked by Avro dependency upgrade. If 
we have a clean PR for Avro to pass all UTs (including Hive 1.2/2.3 profile), 
we may restart to review it.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org