[jira] [Commented] (SPARK-20448) Document how FileInputDStream works with object storage
[ https://issues.apache.org/jira/browse/SPARK-20448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178236#comment-16178236 ] Steve Loughran commented on SPARK-20448: thanks! > Document how FileInputDStream works with object storage > --- > > Key: SPARK-20448 > URL: https://issues.apache.org/jira/browse/SPARK-20448 > Project: Spark > Issue Type: Documentation > Components: Documentation >Affects Versions: 2.1.0 >Reporter: Steve Loughran >Assignee: Steve Loughran >Priority: Minor > Fix For: 2.3.0 > > > Object stores work differently from filesystems: intermediate writes not > visible, renames are really O(data) copies, not O(1) transactions. > This makes working with them as DStreams fundamentally different: you can > write straight into the destination. > 1. Document how FileinputDStreams scan directories for changes > 2. Document how object stores behave differently, and the implications > for users. -- This message was sent by Atlassian JIRA (v6.4.14#64029) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org
[jira] [Commented] (SPARK-20448) Document how FileInputDStream works with object storage
[ https://issues.apache.org/jira/browse/SPARK-20448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15981094#comment-15981094 ] Apache Spark commented on SPARK-20448: -- User 'steveloughran' has created a pull request for this issue: https://github.com/apache/spark/pull/17743 > Document how FileInputDStream works with object storage > --- > > Key: SPARK-20448 > URL: https://issues.apache.org/jira/browse/SPARK-20448 > Project: Spark > Issue Type: Documentation > Components: Documentation >Affects Versions: 2.1.0 >Reporter: Steve Loughran >Priority: Minor > > Object stores work differently from filesystems: intermediate writes not > visible, renames are really O(data) copies, not O(1) transactions. > This makes working with them as DStreams fundamentally different: you can > write straight into the destination. > 1. Document how FileinputDStreams scan directories for changes > 2. Document how object stores behave differently, and the implications > for users. -- This message was sent by Atlassian JIRA (v6.3.15#6346) - To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org