Github user BryanCutler commented on a diff in the pull request:

    https://github.com/apache/spark/pull/20373#discussion_r163421005
  
    --- Diff: python/pyspark/cloudpickle.py ---
    @@ -754,64 +742,6 @@ def __getattribute__(self, item):
         if type(operator.attrgetter) is type:
             dispatch[operator.attrgetter] = save_attrgetter
     
    -    def save_reduce(self, func, args, state=None,
    -                    listitems=None, dictitems=None, obj=None):
    -        # Assert that args is a tuple or None
    -        if not isinstance(args, tuple):
    -            raise pickle.PicklingError("args from reduce() should be a tuple")
    -
    -        # Assert that func is callable
    -        if not hasattr(func, '__call__'):
    -            raise pickle.PicklingError("func from reduce should be callable")
    -
    -        save = self.save
    -        write = self.write
    -
    -        # Protocol 2 special case: if func's name is __newobj__, use NEWOBJ
    -        if self.proto >= 2 and getattr(func, "__name__", "") == "__newobj__":
    -            cls = args[0]
    -            if not hasattr(cls, "__new__"):
    -                raise pickle.PicklingError(
    -                    "args[0] from __newobj__ args has no __new__")
    -            if obj is not None and cls is not obj.__class__:
    -                raise pickle.PicklingError(
    -                    "args[0] from __newobj__ args has the wrong class")
    -            args = args[1:]
    -            save(cls)
    -
    -            save(args)
    -            write(pickle.NEWOBJ)
    -        else:
    -            save(func)
    -            save(args)
    -            write(pickle.REDUCE)
    -
    -        if obj is not None:
    -            self.memoize(obj)
    -
    -        # More new special cases (that work with older protocols as
    -        # well): when __reduce__ returns a tuple with 4 or 5 items,
    -        # the 4th and 5th item should be iterators that provide list
    -        # items and dict items (as (key, value) tuples), or None.
    -
    -        if listitems is not None:
    -            self._batch_appends(listitems)
    -
    -        if dictitems is not None:
    -            self._batch_setitems(dictitems)
    -
    -        if state is not None:
    -            save(state)
    -            write(pickle.BUILD)
    -
    -    def save_partial(self, obj):
    -        """Partial objects do not serialize correctly in python2.x -- this 
fixes the bugs"""
    -        self.save_reduce(_genpartial, (obj.func, obj.args, obj.keywords))
    -
    -    if sys.version_info < (2,7):  # 2.7 supports partial pickling
    -        dispatch[partial] = save_partial
    -
    -
    --- End diff --
    
    Remove the save_reduce() override: it is exactly the same code as in Python 2's Pickler class.
    
https://github.com/cloudpipe/cloudpickle/commit/2da4c243ceddebbc2febf116eb6e53035fed9b9a
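    
    For reference, a minimal sketch (hypothetical names, not part of this PR) of what the fallback looks like: a Pickler subclass that defines no save_reduce of its own still round-trips an object whose __reduce__ returns the full five-item (func, args, state, listitems, dictitems) tuple, because the inherited pickle.Pickler.save_reduce already implements every branch the removed override duplicated.
    
    ```python
    import io
    import pickle
    
    
    class Bag(object):
        """Toy container exercising the five-item __reduce__ form."""
    
        def __init__(self):
            self.items = []
            self.meta = {}
            self.label = None
    
        def __reduce__(self):
            return (Bag,                       # func: callable that rebuilds the object
                    (),                        # args: passed to func
                    {'label': self.label},     # state: applied via BUILD/__setstate__
                    iter(self.items),          # listitems: re-added on load
                    iter(self.meta.items()))   # dictitems: re-set on load
    
        # pickle feeds listitems back through extend()/append() ...
        def extend(self, items):
            self.items.extend(items)
    
        def append(self, item):
            self.items.append(item)
    
        # ... dictitems through __setitem__() ...
        def __setitem__(self, key, value):
            self.meta[key] = value
    
        # ... and state through __setstate__()
        def __setstate__(self, state):
            self.label = state['label']
    
    
    class NoOverridePickler(pickle.Pickler):
        pass  # no save_reduce here: the stock pickle.Pickler.save_reduce runs
    
    
    bag = Bag()
    bag.extend([1, 2, 3])
    bag['k'] = 'v'
    bag.label = 'demo'
    
    buf = io.BytesIO()
    NoOverridePickler(buf, protocol=2).dump(bag)
    out = pickle.loads(buf.getvalue())
    assert out.items == [1, 2, 3] and out.meta == {'k': 'v'} and out.label == 'demo'
    ```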
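    
    The hunk also drops the save_partial fallback; per the comment in the removed code, functools.partial pickles natively on Python 2.7+, so the workaround only mattered on older interpreters. A quick sanity check (add/add_five are made-up names, stdlib only):
    
    ```python
    import pickle
    from functools import partial
    
    
    def add(a, b):
        return a + b
    
    
    # partial gained pickling support in the stdlib as of 2.7, so plain
    # pickle handles it; no custom save_partial dispatch entry is needed.
    add_five = partial(add, 5)
    restored = pickle.loads(pickle.dumps(add_five, protocol=2))
    assert restored(3) == 8
    ```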

