Hi, I am using Apache Spark 2.2 and mllib library in Python. I have to pass a Hive context to FPGrowth algorithm. For that, I converted a Df to RDD. I am struggling with some pickling errors. After going through stack overflow. It seems we need to convert an RDD to pipelineRDD. Could anyone suggest how that could be done ?
Thanks.