Kengo Seki created BIGTOP-4424: ---------------------------------- Summary: Rewrite BPS Spark with the DataFrame API and spark.ml Key: BIGTOP-4424 URL: https://issues.apache.org/jira/browse/BIGTOP-4424 Project: Bigtop Issue Type: Improvement Components: blueprints Reporter: Kengo Seki
Currently, BPS Spark is mainly based on the RDD API and the old version of MLlib (spark.mllib), [which is not recommended to use directly nowadays|https://spark.apache.org/docs/latest/ml-guide.html#announcement-dataframe-based-api-is-primary-api]. We should rewrite them with the DataFrame API and the new version of Spark ML (spark.ml) for simplicity and efficiency. -- This message was sent by Atlassian Jira (v8.20.10#820010)