GitHub user holdenk opened a pull request: https://github.com/apache/spark/pull/20908

    [SPARK-23672][PYTHON] Document support for nested return types in scalar with arrow udfs

    ## What changes were proposed in this pull request?

    Clarify docstring for Scalar functions

    ## How was this patch tested?

    Adds a unit test showing a use similar to wordcount; there is an existing unit test for an array of floats as well.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/holdenk/spark SPARK-23672-document-support-for-nested-return-types-in-scalar-with-arrow-udfs

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/20908.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #20908

----
commit da8dbafcee753ebb2f7f3a205d0b378c240a3469
Author: Holden Karau <holden@...>
Date:   2018-03-26T18:18:34Z

    Demonstrate tokenize udf

commit 342d2228a5c68fd2c07bd8c1b518da6135ce1bf6
Author: Holden Karau <holden@...>
Date:   2018-03-26T18:20:51Z

    Long lines are bad, kthnx

----

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org