Tathagata Das created SPARK-7385:
------------------------------------

             Summary: Add RDD.foreachPartitionWithIndex to allow the index to 
be used in transactions in streaming apps
                 Key: SPARK-7385
                 URL: https://issues.apache.org/jira/browse/SPARK-7385
             Project: Spark
          Issue Type: Improvement
          Components: Spark Core, Streaming
            Reporter: Tathagata Das
            Assignee: Tathagata Das


Spark Streaming apps often update external stores transactionally, which 
requires it to have an id that uniquely identifies the partition of data to be 
inserted. This can be the (time, partition index). 

Current work around is to use mapPartitionsWithIndex().count() which is awkward 
as it is not an action. This jira is to add foreachPartitionWithIndex(). 



--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]

Reply via email to