[GitHub] [spark] HyukjinKwon commented on pull request #30486: [SPARK-33530][CORE] Support --archives and spark.archives option natively

2020-11-30 Thread GitBox


HyukjinKwon commented on pull request #30486:
URL: https://github.com/apache/spark/pull/30486#issuecomment-736214769


   Oh, maybe I will use tar.gz and tgz in the integration test. That will 
address https://github.com/apache/spark/pull/30486#discussion_r532678330 
togeter.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #30486: [SPARK-33530][CORE] Support --archives and spark.archives option natively

2020-11-30 Thread GitBox


HyukjinKwon commented on pull request #30486:
URL: https://github.com/apache/spark/pull/30486#issuecomment-736213589


   Thanks all @dongjoon-hyun @maropu @Ngone51 @mridulm and @tgravescs. Let me 
merge this in.
   I will try to have some time to prepare an IT test with K8S which hopefully 
will be added before Spark 3.1.0 release.
   
   Merged to master.
   



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #30486: [SPARK-33530][CORE] Support --archives and spark.archives option natively

2020-11-30 Thread GitBox


HyukjinKwon commented on pull request #30486:
URL: https://github.com/apache/spark/pull/30486#issuecomment-736111892


   I haven't tested in K8S yet it would take me a while. I plan to add an 
integration test though.
   
   Hope I can proceed it separately given that the code freeze is coming and I 
would like to get this in for Spark 3.1.0.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #30486: [SPARK-33530][CORE] Support --archives and spark.archives option natively

2020-11-28 Thread GitBox


HyukjinKwon commented on pull request #30486:
URL: https://github.com/apache/spark/pull/30486#issuecomment-735225924


   Yeah, that's correct. One thing js though, if there's anything wrong in 
terms of conflict between Yarn distributed cach (spark.yarn.dist.* vs spark.* 
like spark.files), I would say this is a separate issue to handle since I am 
reusing the existing code path



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #30486: [SPARK-33530][CORE] Support --archives and spark.archives option natively

2020-11-26 Thread GitBox


HyukjinKwon commented on pull request #30486:
URL: https://github.com/apache/spark/pull/30486#issuecomment-734494224


   @mridulm dose it make sense? Ill go ahead if there are not other comments 
:-).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #30486: [SPARK-33530][CORE] Support --archives and spark.archives option natively

2020-11-25 Thread GitBox


HyukjinKwon commented on pull request #30486:
URL: https://github.com/apache/spark/pull/30486#issuecomment-734037334


   It will be exactly same as `spark.files` and `spark.yarn.dist.files`. To be 
honest, I am not exactly sure how they will conflictto each other but both work 
together as far as I know.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #30486: [SPARK-33530][CORE] Support --archives and spark.archives option natively

2020-11-25 Thread GitBox


HyukjinKwon commented on pull request #30486:
URL: https://github.com/apache/spark/pull/30486#issuecomment-733625455


   I pushed some more changes to fix some nits which are all virtually non-code 
change 
(https://github.com/apache/spark/pull/30486/commits/5b1d1c3db6c1ce68a2738dec9b54af519b026b42).



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #30486: [SPARK-33530][CORE] Support --archives and spark.archives option natively

2020-11-24 Thread GitBox


HyukjinKwon commented on pull request #30486:
URL: https://github.com/apache/spark/pull/30486#issuecomment-733519983


   Thank you @dongjoon-hyun!



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #30486: [SPARK-33530][CORE] Support --archives and spark.archives option natively

2020-11-24 Thread GitBox


HyukjinKwon commented on pull request #30486:
URL: https://github.com/apache/spark/pull/30486#issuecomment-733506294


   @mcg1969 too FYI [conda-pack](https://conda.github.io/conda-pack/). With 
this change, users can use conda-pack in other cluster modes not only Yarn.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #30486: [SPARK-33530][CORE] Support --archives and spark.archives option natively

2020-11-24 Thread GitBox


HyukjinKwon commented on pull request #30486:
URL: https://github.com/apache/spark/pull/30486#issuecomment-733477920


   Thanks @maropu and @dongjoon-hyun. I believe I addressed the comments.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #30486: [SPARK-33530][CORE] Support --archives and spark.archives option natively

2020-11-24 Thread GitBox


HyukjinKwon commented on pull request #30486:
URL: https://github.com/apache/spark/pull/30486#issuecomment-733312646


   I think it's fine to avoid removing `spark.yarn.dist.archives` out yet - 
maybe we could think about removing out once this feature becomes stable (?). 
Yarn also has `spark.yarn.dist.files` and `spark.files` can work together as 
far as I know.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #30486: [SPARK-33530][CORE] Support --archives and spark.archives option natively

2020-11-24 Thread GitBox


HyukjinKwon commented on pull request #30486:
URL: https://github.com/apache/spark/pull/30486#issuecomment-733009501


   cc @zero323 and @fhoering too FYI. This is related to the docs and shipping 
3rd party Python packages in PySpark apps.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org



[GitHub] [spark] HyukjinKwon commented on pull request #30486: [SPARK-33530][CORE] Support --archives and spark.archives option natively

2020-11-24 Thread GitBox


HyukjinKwon commented on pull request #30486:
URL: https://github.com/apache/spark/pull/30486#issuecomment-733008523


   @tgravescs, @mridulm, @Ngone51, can you take a look when you guys find some 
time?



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org



-
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org