Fernando Pereira created SPARK-22649: ----------------------------------------
Summary: localCheckpoint support in Dataset API Key: SPARK-22649 URL: https://issues.apache.org/jira/browse/SPARK-22649 Project: Spark Issue Type: Improvement Components: PySpark, SQL Affects Versions: 2.2.0 Reporter: Fernando Pereira Priority: Minor localCheckpoints were recently introduced in RDDs as a way of quickly truncate the execution plan by using the caching subsystem (and therefore can not be considered reliable). I believe it would be of major interest to offer this API in the Dataset API, and similarly in Python Dataframes. A possible implementation is at PR19805 -- This message was sent by Atlassian JIRA (v6.4.14#64029) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org