this is seriously great news!  let's all take a moment and welcome apache
spark's python support to the present.  ;)

On Mon, Jul 13, 2020 at 7:26 PM Holden Karau <hol...@pigscanfly.ca> wrote:

> Awesome, thanks you for driving this forward :)
>
> On Mon, Jul 13, 2020 at 7:25 PM Hyukjin Kwon <gurwls...@gmail.com> wrote:
>
>> Thank you all. Python 2, 3.4 and 3.5 are dropped now in the master branch
>> at https://github.com/apache/spark/pull/28957
>>
>> 2020년 7월 3일 (금) 오전 10:01, Hyukjin Kwon <gurwls...@gmail.com>님이 작성:
>>
>>> Thanks Dongjoon. That makes much more sense now!
>>>
>>> 2020년 7월 3일 (금) 오전 12:11, Dongjoon Hyun <dongjoon.h...@gmail.com>님이 작성:
>>>
>>>> Thank you, Hyukjin.
>>>>
>>>> According to the Python community, Python 3.5 is also EOF at 2020-09-13
>>>> (only two months left).
>>>>
>>>> - https://www.python.org/downloads/
>>>>
>>>> So, targeting live Python versions at Apache Spark 3.1.0 (December
>>>> 2020) looks reasonable to me.
>>>>
>>>> For old Python versions, we still have Apache Spark 2.4 LTS and also
>>>> Apache Spark 3.0.x will work.
>>>>
>>>> Bests,
>>>> Dongjoon.
>>>>
>>>>
>>>> On Wed, Jul 1, 2020 at 10:50 PM Yuanjian Li <xyliyuanj...@gmail.com>
>>>> wrote:
>>>>
>>>>> +1, especially Python 2
>>>>>
>>>>> Holden Karau <hol...@pigscanfly.ca> 于2020年7月2日周四 上午10:20写道:
>>>>>
>>>>>> I’m ok with us dropping Python 2, 3.4, and 3.5 in Spark 3.1 forward.
>>>>>> It will be exciting to get to use more recent Python features. The most
>>>>>> recent Ubuntu LTS ships with 3.7, and while the previous LTS ships with
>>>>>> 3.5, if folks really can’t upgrade there’s conda.
>>>>>>
>>>>>> Is there anyone with a large Python 3.5 fleet who can’t use conda?
>>>>>>
>>>>>> On Wed, Jul 1, 2020 at 7:15 PM Hyukjin Kwon <gurwls...@gmail.com>
>>>>>> wrote:
>>>>>>
>>>>>>> Yeah, sure. It will be dropped at Spark 3.1 onwards. I don't think
>>>>>>> we should make such changes in maintenance releases
>>>>>>>
>>>>>>> 2020년 7월 2일 (목) 오전 11:13, Holden Karau <hol...@pigscanfly.ca>님이 작성:
>>>>>>>
>>>>>>>> To be clear the plan is to drop them in Spark 3.1 onwards, yes?
>>>>>>>>
>>>>>>>> On Wed, Jul 1, 2020 at 7:11 PM Hyukjin Kwon <gurwls...@gmail.com>
>>>>>>>> wrote:
>>>>>>>>
>>>>>>>>> Hi all,
>>>>>>>>>
>>>>>>>>> I would like to discuss dropping deprecated Python versions 2, 3.4
>>>>>>>>> and 3.5 at https://github.com/apache/spark/pull/28957. I assume
>>>>>>>>> people support it in general
>>>>>>>>> but I am writing this to make sure everybody is happy.
>>>>>>>>>
>>>>>>>>> Fokko made a very good investigation on it, see
>>>>>>>>> https://github.com/apache/spark/pull/28957#issuecomment-652022449.
>>>>>>>>> Assuming from the statistics, I think we're pretty safe to drop
>>>>>>>>> them.
>>>>>>>>> Also note that dropping Python 2 was actually declared at
>>>>>>>>> https://python3statement.org/
>>>>>>>>>
>>>>>>>>> Roughly speaking, there are many main advantages by dropping them:
>>>>>>>>>   1. It removes a bunch of hacks we added around 700 lines in
>>>>>>>>> PySpark.
>>>>>>>>>   2. PyPy2 has a critical bug that causes a flaky test,
>>>>>>>>> https://issues.apache.org/jira/browse/SPARK-28358 given my
>>>>>>>>> testing and investigation.
>>>>>>>>>   3. Users can use Python type hints with Pandas UDFs without
>>>>>>>>> thinking about Python version
>>>>>>>>>   4. Users can leverage one latest cloudpickle,
>>>>>>>>> https://github.com/apache/spark/pull/28950. With Python 3.8+ it
>>>>>>>>> can also leverage C pickle.
>>>>>>>>>   5. ...
>>>>>>>>>
>>>>>>>>> So it benefits both users and dev. WDYT guys?
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> --
>>>>>>>> Twitter: https://twitter.com/holdenkarau
>>>>>>>> Books (Learning Spark, High Performance Spark, etc.):
>>>>>>>> https://amzn.to/2MaRAG9  <https://amzn.to/2MaRAG9>
>>>>>>>> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
>>>>>>>>
>>>>>>> --
>>>>>> Twitter: https://twitter.com/holdenkarau
>>>>>> Books (Learning Spark, High Performance Spark, etc.):
>>>>>> https://amzn.to/2MaRAG9  <https://amzn.to/2MaRAG9>
>>>>>> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
>>>>>>
>>>>>
>
> --
> Twitter: https://twitter.com/holdenkarau
> Books (Learning Spark, High Performance Spark, etc.):
> https://amzn.to/2MaRAG9  <https://amzn.to/2MaRAG9>
> YouTube Live Streams: https://www.youtube.com/user/holdenkarau
>


-- 
Shane Knapp
Computer Guy / Voice of Reason
UC Berkeley EECS Research / RISELab Staff Technical Lead
https://rise.cs.berkeley.edu

Reply via email to