Re: binary file deserialization

2016-03-09 Thread Saurabh Bajaj
You can load that binary up as a String RDD, then map over that RDD and convert each row to your case class representing the data. In the map stage you could also map the input string into an RDD of JSON values and use the following function to convert it into a DF

Re: Output the data to external database at particular time in spark streaming

2016-03-08 Thread Saurabh Bajaj
You can call *foreachRDD*(*func*) on the output from the final stage, then check the time if it's the 15th min of an hour then you flush the output to DB else you don't. Let me know if that approach works. On Tue, Mar 8, 2016 at 2:10 PM, ayan guha wrote: > Yes if it falls

Re: pyspark spark-cassandra-connector java.io.IOException: Failed to open native connection to Cassandra at {192.168.1.126}:9042

2016-03-08 Thread Saurabh Bajaj
config is not being set and it's falling back to default. Let me know if that helps. Saurabh Bajaj On Tue, Mar 8, 2016 at 6:25 PM, Andy Davidson <a...@santacruzintegration.com > wrote: > Hi Ted > > I believe by default cassandra listens on 9042 > > From: Ted Yu <yuzhih...@gm