Maciej Szymkiewicz created SPARK-19467: ------------------------------------------
Summary: PySpark ML shouldn't use circular imports Key: SPARK-19467 URL: https://issues.apache.org/jira/browse/SPARK-19467 Project: Spark Issue Type: Improvement Components: ML, PySpark Affects Versions: 2.1.0, 2.0.0, 2.2.0 Reporter: Maciej Szymkiewicz Priority: Minor {{pyspark.ml}} and {{pyspark.ml.pipeline}} contain circular imports with the [former one|https://github.com/apache/spark/blob/39f328ba3519b01940a7d1cdee851ba4e75ef31f/python/pyspark/ml/__init__.py#L23]: {code} from pyspark.ml.pipeline import Pipeline, PipelineModel {code} and the [latter one|https://github.com/apache/spark/blob/39f328ba3519b01940a7d1cdee851ba4e75ef31f/python/pyspark/ml/pipeline.py#L24]: {code} from pyspark.ml import Estimator, Model, Transformer {code} This is unnecessary and can cause failures when working with external tools. -- This message was sent by Atlassian JIRA (v6.3.15#6346) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org