GitHub user BryanCutler opened a pull request:

    https://github.com/apache/spark/pull/18659

    [SPARK-21404][PYSPARK][WIP] Simple Python Vectorized UDFs 

    ## What changes were proposed in this pull request?
    
    WIP to demonstrate simple vectorized UDFs for Python as a preliminary step 
to an expanded API from the SPIP at 
https://issues.apache.org/jira/browse/SPARK-21190
    
    ## How was this patch tested?
    
    (Please explain how this patch was tested. E.g. unit tests, integration 
tests, manual tests)
    (If this patch involves UI changes, please attach a screenshot; otherwise, 
remove this)
    
    Please review http://spark.apache.org/contributing.html before opening a 
pull request.


You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/BryanCutler/spark 
arrow-vectorized-udfs-SPARK-21404

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/18659.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #18659
    
----
commit d2edd311b2f22d3cb9c3b2519835101727cbb127
Author: Bryan Cutler <[email protected]>
Date:   2017-07-14T00:29:23Z

    vectorized udfs working but hardcoded for ArrowPandasSerializer

commit 028a58f1ef0cad71f361094f272033dc1a606bfb
Author: Bryan Cutler <[email protected]>
Date:   2017-07-14T21:20:51Z

    Added conf for enabling vectorized UDFs, now working

commit c12f658d2df41d4cc66104a821954e5bc5976f8a
Author: Bryan Cutler <[email protected]>
Date:   2017-07-14T22:38:15Z

    Columns for ArrowPandasSerializer need unique name

commit 11a7a87c92825b0d458780586dd6181450bd8e21
Author: Bryan Cutler <[email protected]>
Date:   2017-07-14T22:57:24Z

    fixed style checks

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at [email protected] or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to