I'm just circling back to this now. Is the commit protocol an acceptable way
of making this configurable? I could make the temp path (currently
"_temporary") configurable if that is what you are referring to.


Michael Armbrust wrote
> We didn't go this way initially because it doesn't work on storage systems
> that have weaker guarantees than HDFS with respect to rename.  That said,
> I'm happy to look at other options if we want to make this configurable.
> 
>> After hesitating for a while, I wrote a custom commit protocol to solve
>> the problem. It combines HadoopMapReduceCommitProtocol's behavior of
>> writing to a temp file first, with ManifestFileCommitProtocol. From what
>> I can tell ManifestFileCommitProtocol is required for the normal Structured
>> Streaming behavior of being able to resume streams from a known point.
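
For anyone following along, here is a rough sketch of the idea described in
the quoted text: stage each file under "_temporary", rename it into place on
commit, then append the committed file names to a manifest so a reader (or a
restarted stream) only trusts what the manifest lists. This is plain Python
against a local filesystem, not Spark code. The manifest name, its JSON-lines
layout, and all function names are made up for illustration and do not
reflect what ManifestFileCommitProtocol actually writes. It also relies on
rename being atomic, which is exactly the guarantee that weaker storage
systems may not give.

```python
import json
import os
import uuid

TEMP_DIR = "_temporary"        # staging directory name mentioned in the thread
MANIFEST = "_manifest.jsonl"   # hypothetical manifest file, not Spark's format


def write_task_file(out_dir, name, data):
    """Write data to a unique staging dir under out_dir/_temporary."""
    staging = os.path.join(out_dir, TEMP_DIR, uuid.uuid4().hex)
    os.makedirs(staging)
    staged = os.path.join(staging, name)
    with open(staged, "w") as f:
        f.write(data)
    return staged


def commit_batch(out_dir, batch_id, staged_paths):
    """Rename staged files into out_dir, then record them in the manifest."""
    names = []
    for staged in staged_paths:
        name = os.path.basename(staged)
        # Atomic on a POSIX filesystem when source and target share a volume;
        # an assumption that may not hold on other storage systems.
        os.replace(staged, os.path.join(out_dir, name))
        names.append(name)
    entry = {"batch": batch_id, "files": sorted(names)}
    # The manifest append is the commit point: files not listed here are
    # treated as uncommitted by committed_files().
    with open(os.path.join(out_dir, MANIFEST), "a") as f:
        f.write(json.dumps(entry) + "\n")


def committed_files(out_dir):
    """Return the file names listed in the manifest, in commit order."""
    path = os.path.join(out_dir, MANIFEST)
    if not os.path.exists(path):
        return []
    files = []
    with open(path) as f:
        for line in f:
            files.extend(json.loads(line)["files"])
    return files
```

A writer would call write_task_file() once per output file and commit_batch()
once per batch. Making the temp path configurable would amount to turning
TEMP_DIR into a parameter.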




