[ https://issues.apache.org/jira/browse/SPARK-2028?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=14018474#comment-14018474 ]
Patrick Wendell commented on SPARK-2028:
----------------------------------------

I wantonly changed this from a "Bug" to a "New Feature". We just never supported this before, but it would be nice to support in the future.

> Let users of HadoopRDD access the partition InputSplits
> -------------------------------------------------------
>
>                 Key: SPARK-2028
>                 URL: https://issues.apache.org/jira/browse/SPARK-2028
>             Project: Spark
>          Issue Type: New Feature
>            Reporter: Aaron Davidson
>            Assignee: Aaron Davidson
>
> If a user creates a HadoopRDD (e.g., via textFile), there is no way to find
> out which file it came from, though this information is contained in the
> InputSplit within the RDD. We should find a way to expose this publicly.

--
This message was sent by Atlassian JIRA
(v6.2#6252)