R: Tungsten and Spark Streaming

Paolo Platter Wed, 09 Sep 2015 23:59:31 -0700

Did you plan to modify dstream interface in order to work with dataframe ? It 
would be nice handle dstreams without generics

Paolo

Inviata dal mio Windows Phone
________________________________
Da: Tathagata Das<mailto:t...@databricks.com>
Inviato: ‎10/‎09/‎2015 07:42
A: N B<mailto:nb.nos...@gmail.com>
Cc: user<mailto:user@spark.apache.org>
Oggetto: Re: Tungsten and Spark Streaming

Rewriting is necessary. You will have to convert RDD/DStream operations to 
DataFrame operations. So get the RDDs in DStream, using transform/foreachRDD, 
convert to DataFrames and then do DataFrame operations.

On Wed, Sep 9, 2015 at 9:23 PM, N B 
<nb.nos...@gmail.com<mailto:nb.nos...@gmail.com>> wrote:
Hello,

How can we start taking advantage of the performance gains made under Project 
Tungsten in Spark 1.5 for a Spark Streaming program?

>From what I understand, this is available by default for Dataframes. But for a 
>program written using Spark Streaming, would we see any potential gains "out 
>of the box" in 1.5 or will we have to rewrite some portions of the application 
>code to realize that benefit?

Any insight/documentation links etc in this regard will be appreciated.

Thanks
Nikunj

R: Tungsten and Spark Streaming

Reply via email to