[jira] [Commented] (SPARK-20448) Document how FileInputDStream works with object storage

2017-09-24 Thread Steve Loughran (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-20448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16178236#comment-16178236
 ] 

Steve Loughran commented on SPARK-20448:


thanks!

> Document how FileInputDStream works with object storage
> ---
>
> Key: SPARK-20448
> URL: https://issues.apache.org/jira/browse/SPARK-20448
> Project: Spark
> Issue Type: Documentation
> Components: Documentation
> Affects Versions: 2.1.0
> Reporter: Steve Loughran
> Assignee: Steve Loughran
> Priority: Minor
> Fix For: 2.3.0
>
>
> Object stores work differently from filesystems: intermediate writes are not
> visible, and renames are really O(data) copies, not O(1) transactions.
> This makes working with them as DStreams fundamentally different: you can
> write straight into the destination.
> 1. Document how FileInputDStreams scan directories for changes.
> 2. Document how object stores behave differently, and the implications
> for users.



--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

-
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org



[jira] [Commented] (SPARK-20448) Document how FileInputDStream works with object storage

2017-04-24 Thread Apache Spark (JIRA)

[ 
https://issues.apache.org/jira/browse/SPARK-20448?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15981094#comment-15981094
 ] 

Apache Spark commented on SPARK-20448:
--

User 'steveloughran' has created a pull request for this issue:
https://github.com/apache/spark/pull/17743
