subject:"Reading parquet files into Spark Streaming"

Re: Reading parquet files into Spark Streaming

2016-08-27 Thread Sebastian Piu

Forgot to paste the link... http://ramblings.azurewebsites.net/2016/01/26/save-parquet-rdds-in-apache-spark/ On Sat, 27 Aug 2016, 19:18 Sebastian Piu, wrote: > Hi Renato, > > Check here on how to do it, it is in Java but you can translate it to > Scala if that is what

Re: Reading parquet files into Spark Streaming

2016-08-27 Thread Sebastian Piu

Hi Renato, Check here on how to do it, it is in Java but you can translate it to Scala if that is what you need. Cheers On Sat, 27 Aug 2016, 14:24 Renato Marroquín Mogrovejo, < renatoj.marroq...@gmail.com> wrote: > Hi Akhilesh, > > Thanks for your response. > I am using Spark 1.6.1 and what I

Re: Reading parquet files into Spark Streaming

2016-08-27 Thread Renato Marroquín Mogrovejo

Hi Akhilesh, Thanks for your response. I am using Spark 1.6.1 and what I am trying to do is to ingest parquet files into the Spark Streaming, not in batch operations. val ssc = new StreamingContext(sc, Seconds(5)) ssc.sparkContext.hadoopConfiguration.set("parquet.read.support.class",

Re: Reading parquet files into Spark Streaming

2016-08-27 Thread Akhilesh Pathodia

Hi Renato, Which version of Spark are you using? If spark version is 1.3.0 or more then you can use SqlContext to read the parquet file which will give you DataFrame. Please follow the below link: https://spark.apache.org/docs/1.5.0/sql-programming-guide.html#loading-data-programmatically

Re: Reading parquet files into Spark Streaming

2016-08-26 Thread Akhilesh Pathodia

Hi Renato, Which version of Spark are you using? If spark version is 1.3.0 or more then you can use SqlContext to read the parquet file which will give you DataFrame. Please follow the below link: https://spark.apache.org/docs/1.5.0/sql-programming-guide.html#loading-data-programmatically

Re: Reading parquet files into Spark Streaming

2016-08-26 Thread Renato Marroquín Mogrovejo

Anybody? I think Rory also didn't get an answer from the list ... https://mail-archives.apache.org/mod_mbox/spark-user/201602.mbox/%3ccac+fre14pv5nvqhtbvqdc+6dkxo73odazfqslbso8f94ozo...@mail.gmail.com%3E 2016-08-26 17:42 GMT+02:00 Renato Marroquín Mogrovejo < renatoj.marroq...@gmail.com>: >

Reading parquet files into Spark Streaming

2016-08-26 Thread Renato Marroquín Mogrovejo

Hi all, I am trying to use parquet files as input for DStream operations, but I can't find any documentation or example. The only thing I found was [1] but I also get the same error as in the post (Class parquet.avro.AvroReadSupport not found). Ideally I would like to do have something like this:

Re: Reading parquet files into Spark Streaming

Re: Reading parquet files into Spark Streaming

Re: Reading parquet files into Spark Streaming

Re: Reading parquet files into Spark Streaming

Re: Reading parquet files into Spark Streaming

Re: Reading parquet files into Spark Streaming

Reading parquet files into Spark Streaming

7 matches

Site Navigation

Mail list logo

Footer information