[ https://issues.apache.org/jira/browse/SPARK-33188?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Hyukjin Kwon updated SPARK-33188:
---------------------------------
    Component/s: ML

> PipelineModel load resulting in error
> -------------------------------------
>
>                 Key: SPARK-33188
>                 URL: https://issues.apache.org/jira/browse/SPARK-33188
>             Project: Spark
>          Issue Type: Bug
>          Components: ML, PySpark
>    Affects Versions: 3.0.1
>         Environment: Spark 3.0.1
>                      Python 3.6.8
>                      numpy 1.18.5
>            Reporter: rohit agarwal
>            Priority: Minor
>
> Steps to reproduce:
> # Define pipeline : Pipeline(stages=[discretizer, one_hot_encoder, cv1, cv2, assembler])
> # Save PipelineModel : PipelineModel.write().save('/path')
> # Load PipelineModel : PipelineModel.load('/path')
> Getting following error:
> TypeError: array() argument 1 must be a unicode character, not bytes
> Error is in PickleSerializer class in park-3.0.1-bin-hadoop2.7/python/pyspark/serializers.py
> Changing pickle.loads(obj, encoding=encoding) to pickle.loads(obj) resolves it.
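
For context on the error quoted above, here is a minimal sketch of how the `encoding` argument of `pickle.loads` changes what a legacy string opcode unpickles to, and why a bytes typecode is then rejected by `array.array`. The hand-written pickle payload and the names `LEGACY_TYPECODE`, `load_typecode`, and `try_make_array` are illustrative assumptions, not taken from the report. Whether this is the exact code path hit inside PySpark's PickleSerializer is not confirmed here.

```python
import array
import pickle

# Hand-written protocol-0 pickle (an assumption for illustration): the
# legacy STRING opcode 'S' is how Python 2 style 8-bit strings are stored.
LEGACY_TYPECODE = b"S'd'\n."


def load_typecode(encoding=None):
    """Unpickle the typecode, optionally forcing an encoding."""
    if encoding is None:
        # Default behaviour: legacy strings are decoded as ASCII text.
        return pickle.loads(LEGACY_TYPECODE)
    return pickle.loads(LEGACY_TYPECODE, encoding=encoding)


def try_make_array(typecode):
    """Build an array with the given typecode, returning the error if any."""
    try:
        return array.array(typecode, [1.0, 2.0])
    except TypeError as exc:
        return exc


default_code = load_typecode()        # a str typecode
bytes_code = load_typecode("bytes")   # legacy string kept as raw bytes

print(repr(default_code), try_make_array(default_code))
print(repr(bytes_code), repr(try_make_array(bytes_code)))
```

With the default encoding the typecode comes back as text and the array is built normally. With `encoding="bytes"` the same payload yields a bytes object, which `array.array` refuses with a `TypeError`. This is consistent with the reporter's observation that dropping the `encoding` argument avoids the failure.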