GitHub user abehrens opened a pull request:

    https://github.com/apache/spark/pull/10710

    [pyspark] adding disjunction and difference functions for rdds

    I was looking for a way to perform disjunction and difference operations, 
in other words:
    
    * disjunction: find all elements NOT in A and B 
    * difference: find all elements in A but NOT in B
    
    If there are better names for these functions, I'd be happy to change them. 
 I am a pyspark user so they are only in the python code.  I would love having 
them in Scala and Java too.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/abehrens/spark disjunction

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/10710.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #10710
    
----
commit e4cc54b0d22b11d4c902c05d25403d87080ff960
Author: Alex Behrens <abehr...@signal.co>
Date:   2016-01-11T21:44:58Z

    [pyspark] adding disjunction and difference functions for rdds

----


---
If your project is set up for it, you can reply to this email and have your
reply appear on GitHub as well. If your project does not have this feature
enabled and wishes so, or if the feature is enabled but not working, please
contact infrastructure at infrastruct...@apache.org or file a JIRA ticket
with INFRA.
---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to