Sandy Ryza created SPARK-4911: --------------------------------- Summary: Report the inputs and outputs of Spark jobs so that external systems can track data lineage Key: SPARK-4911 URL: https://issues.apache.org/jira/browse/SPARK-4911 Project: Spark Issue Type: New Feature Components: Spark Core Affects Versions: 1.2.0 Reporter: Sandy Ryza
When Spark runs a job, it would be useful to log its filesystem inputs and outputs somewhere. This allows external tools to track which persisted datasets are derived from other persisted datasets. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org