logging pickle files on local run of spark.ml Pipeline model

Mnisi, Caleb Wed, 15 Mar 2023 07:13:44 -0700

Good Day

I am having trouble saving a spark.ml Pipeline model to a pickle file when
running locally on my PC.
I've tried a few ways to save the model:


  1.  mlflow.spark.log_model(artifact_path=experiment.artifact_location,
      spark_model=model, registered_model_name="myModel")
     *   with error that the spark model is multiple files
  2.  pickle.dump(model, file): with error - TypeError: cannot pickle '_thread.RLock' object
  3.  model.save('path'): with Java errors:
     *   at org.apache.hadoop.mapred.OutputCommitter.commitJob(OutputCommitter.java:291)
     *   at org.apache.spark.internal.io.HadoopMapReduceCommitProtocol.commitJob(HadoopMapReduceCommitProtocol.scala:182)
     *   at org.apache.spark.internal.io.SparkHadoopWriter$.write(SparkHadoopWriter.scala:99)
     *   ... 67 more
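
The pickle error from attempt 2 appears to be reproducible without Spark. The following is only a minimal sketch, assuming (not confirmed) that the model object holds a thread lock somewhere among its attributes, as the error message suggests. HoldsLock and try_pickle are hypothetical names for illustration and are not part of Spark or MLflow:

```python
import pickle
import threading


class HoldsLock:
    """Hypothetical stand-in for an object that keeps a lock as an attribute."""

    def __init__(self):
        self.params = {"maxIter": 10}
        # Assumption: something like this lock lives inside the real model object.
        self._lock = threading.RLock()


def try_pickle(obj):
    """Return None if pickling succeeds, otherwise the TypeError message."""
    try:
        pickle.dumps(obj)
        return None
    except TypeError as exc:
        return str(exc)


# Pickling fails as soon as any attribute is an RLock.
error_message = try_pickle(HoldsLock())
# The same data without the lock pickles without complaint.
plain_error = try_pickle({"maxIter": 10})

print(error_message)
print(plain_error)
```

If that assumption holds, the failure would come from the lock inside the object rather than from the file being written to.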

Your assistance on this would be much appreciated.
Regards,

Caleb Mnisi
Consultant | Deloitte Analytics | Cognitive Advantage
Deloitte & Touche
5th floor, 5 Magwa Crescent, Waterfall City, 2090
M: +27 72 170 8779
[email protected] | www2.deloitte.com/za

