Thanks Khalid for taking a look. On Tue, 9 Aug 2022 at 00:37, Khalid Mammadov <khalidmammad...@gmail.com> wrote:
> Hi Hyukjin > That's great initiative, here is a PR that address one of those issues > that's waiting for review: https://github.com/apache/spark/pull/37408 > > Perhaps, it would be also good to track these pending issues somewhere to > avoid effort duplication. > > For example, I would like to pick up *union* and *union all* if no > one has already. > > Thanks, > Khalid > > > On Mon, Aug 8, 2022 at 1:44 PM Hyukjin Kwon <gurwls...@gmail.com> wrote: > >> Hi all, >> >> I am trying to improve PySpark documentation especially: >> >> - Make the examples self-contained, e.g., >> https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.pivot.html >> - Document Parameters >> >> https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.pivot.html#pandas.DataFrame.pivot. >> There are many API that misses parameters in PySpark, e.g., >> DataFrame.union >> >> Here is one example PR I am working on: >> https://github.com/apache/spark/pull/37437 >> I can't do it all by myself. Any help, review, and contributions would be >> welcome and appreciated. >> >> Thank you all in advance. >> >