How do I stream in Parquet files using fileStream() and ParquetInputFormat

roryofbyrne Thu, 18 Feb 2016 02:42:05 -0800

Hi, 

I'm trying to understand how to stream Parquet files into Spark using
StreamingContext.fileStream[Key, Value, Format]().


I am struggling to understand a) what should be passed as Key and Value
(assuming ParquetInputFormat - is this the correct format?), and b) how - if
at all - to configure the ParquetInputFormat with a ReadSupport class, 
RecordMaterializer etc.. 

Any help is appreciated as I have almost no knowledge of Hadoop so this low
level use of Hadoop is very confusing for me.



--
View this message in context: 
http://apache-spark-user-list.1001560.n3.nabble.com/How-do-I-stream-in-Parquet-files-using-fileStream-and-ParquetInputFormat-tp26262.html
Sent from the Apache Spark User List mailing list archive at Nabble.com.

---------------------------------------------------------------------
To unsubscribe, e-mail: user-unsubscr...@spark.apache.org
For additional commands, e-mail: user-h...@spark.apache.org

How do I stream in Parquet files using fileStream() and ParquetInputFormat

Reply via email to