Re: Contributions and help needed in SPARK-40005

Khalid Mammadov Thu, 18 Aug 2022 23:56:59 -0700

I thought it was all finished (I checked a few). Do you have a list of the remaining 50%?
Happy to contribute 😊


On Fri, 19 Aug 2022, 05:54 Hyukjin Kwon, <[email protected]> wrote:

> We're halfway, roughly 50%. More contributions would be very helpful.
> If a file is too large, feel free to split it into multiple
> parts (e.g., https://github.com/apache/spark/pull/37575)
>
> On Tue, 9 Aug 2022 at 12:26, Qian SUN <[email protected]> wrote:
>
>> Sure, I will do it. SPARK-40010
>> <https://issues.apache.org/jira/browse/SPARK-40010> is built to track
>> progress.
>>
>> On Tue, 9 Aug 2022 at 10:58, Hyukjin Kwon <[email protected]> wrote:
>>
>>> Please go ahead. It would be much appreciated.
>>>
>>> On Tue, 9 Aug 2022 at 11:58, Qian SUN <[email protected]> wrote:
>>>
>>>> Hi Hyukjin
>>>>
>>>> I would like to do some work and pick up *Window.py* if possible.
>>>>
>>>> Thanks,
>>>> Qian
>>>>
>>>> On Tue, 9 Aug 2022 at 10:41, Hyukjin Kwon <[email protected]> wrote:
>>>>
>>>>> Thanks Khalid for taking a look.
>>>>>
>>>>> On Tue, 9 Aug 2022 at 00:37, Khalid Mammadov <
>>>>> [email protected]> wrote:
>>>>>
>>>>>> Hi Hyukjin
>>>>>> That's a great initiative. Here is a PR that addresses one of those
>>>>>> issues and is waiting for review:
>>>>>> https://github.com/apache/spark/pull/37408
>>>>>>
>>>>>> Perhaps it would also be good to track these pending issues
>>>>>> somewhere to avoid duplicated effort.
>>>>>>
>>>>>> For example, I would like to pick up *union* and *union all* if no
>>>>>> one has already.
>>>>>>
>>>>>> Thanks,
>>>>>> Khalid
>>>>>>
>>>>>>
>>>>>> On Mon, Aug 8, 2022 at 1:44 PM Hyukjin Kwon <[email protected]>
>>>>>> wrote:
>>>>>>
>>>>>>> Hi all,
>>>>>>>
>>>>>>> I am trying to improve the PySpark documentation, especially to:
>>>>>>>
>>>>>>>    - Make the examples self-contained, e.g.,
>>>>>>>    https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.pivot.html
>>>>>>>    - Document Parameters
>>>>>>>    https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.pivot.html#pandas.DataFrame.pivot
>>>>>>>    Many PySpark APIs are missing parameter documentation, e.g.,
>>>>>>>    DataFrame.union
>>>>>>>
>>>>>>> Here is one example PR I am working on:
>>>>>>> https://github.com/apache/spark/pull/37437
>>>>>>> I can't do it all by myself. Any help, review, and contributions
>>>>>>> would be welcome and appreciated.
>>>>>>>
>>>>>>> Thank you all in advance.
>>>>>>>
>>>>>>
>>>>
>>>> --
>>>> Best!
>>>> Qian SUN
>>>>
>> --
>> Best!
>> Qian SUN
>>
>
