[ https://issues.apache.org/jira/browse/SPARK-9397?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Apache Spark reassigned SPARK-9397: ----------------------------------- Assignee: Aaron Davidson (was: Apache Spark) > DataFrame should provide an API to find source data files if applicable > ----------------------------------------------------------------------- > > Key: SPARK-9397 > URL: https://issues.apache.org/jira/browse/SPARK-9397 > Project: Spark > Issue Type: Improvement > Components: SQL > Reporter: Aaron Davidson > Assignee: Aaron Davidson > Priority: Critical > > Certain applications would benefit from being able to inspect DataFrames that > are straightforwardly produced by data sources that stem from files, and find > out their source data. For example, one might want to display to a user the > size of the data underlying a table, or to copy or mutate it. > Currently, there is not a good way to get this information in a public API. -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org