Spark not working with Hadoop 4mc compression

2018-12-19 Thread Abhijeet Kumar
not found at org.apache.hadoop.conf.Configuration.getClassByName(Configuration.java:2101) at org.apache.hadoop.io.compress.CompressionCodecFactory.getCodecClasses(CompressionCodecFactory.java:132) ... 70 more Thank you, Abhijeet Kumar

Re: Why does join use rows that were sent after watermark of 20 seconds?

2018-12-10 Thread Abhijeet Kumar
ely, > If you want the intended behaviour you should add timestamp conditions or > a window operator in the join condition > > > > On Mon, 10 Dec 2018 at 5:23 PM, Abhijeet Kumar <abhijeet.ku...@sentienz.com> wrote: > Hello, > > I’m using watermark to
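
The advice in the reply above (add a timestamp condition to the join) can be sketched roughly as follows in PySpark. This is only an illustration under stated assumptions, not the poster's actual code: tstamp_trans comes from the thread, while trans_id, resp_id, tstamp_resp and both function names are hypothetical. As far as I understand the Structured Streaming documentation, an inner stream-stream join only discards old rows when both sides carry a watermark and the join condition bounds the event-time gap; a watermark alone does not stop late rows from matching.

```python
def time_bounded_condition(left_key, right_key, left_ts, right_ts, max_lag_seconds):
    """Build a SQL join condition: equal keys plus a bounded event-time gap."""
    return (
        f"{left_key} = {right_key} AND "
        f"{right_ts} >= {left_ts} AND "
        f"{right_ts} <= {left_ts} + interval {max_lag_seconds} seconds"
    )


def join_within(left_df, right_df, max_lag_seconds=20):
    """Inner-join two streaming DataFrames using watermarks and a time bound.

    Assumes left_df has columns trans_id and tstamp_trans, and right_df has
    resp_id and tstamp_resp (hypothetical names apart from tstamp_trans).
    """
    # Imported lazily so the helper above can be used without PySpark installed.
    from pyspark.sql.functions import expr

    delay = f"{max_lag_seconds} seconds"
    left = left_df.withWatermark("tstamp_trans", delay)
    right = right_df.withWatermark("tstamp_resp", delay)
    condition = time_bounded_condition(
        "trans_id", "resp_id", "tstamp_trans", "tstamp_resp", max_lag_seconds
    )
    return left.join(right, expr(condition))
```

With the time bound in the condition, a row arriving 45 seconds after its counterpart would fall outside the 20-second range and should not be joined.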

Why does join use rows that were sent after watermark of 20 seconds?

2018-12-10 Thread Abhijeet Kumar
even after the watermark has passed, Spark is holding the data in memory. I even tried after 45 seconds, and those rows were joined too. I’m sending streams from two Kafka queues, and I’m creating tstamp_trans from the current timestamp. This is confusing me about how the watermark works. Thank you, Abhijeet Kumar

Join happening after watermark time

2018-12-06 Thread Abhijeet Kumar
t seems like even after the watermark has passed, it’s holding the data in memory. I even tried after 45 seconds, and those rows were joined too. This is confusing me about how the watermark works. Thank you, Abhijeet Kumar

Re: Spark Streaming join taking long to process

2018-11-27 Thread Abhijeet Kumar
are cores on the machine." > > Just check whether this reduces the processing time, since with > local[*] we are going to use all available cores, not just one core. > > Regards, > Sriram Srikanth > > On Tue, Nov 27, 2018 at 1:46 PM Abhijeet Kumar

Not able to overwrite cassandra table using Spark

2018-06-27 Thread Abhijeet Kumar
ne, but when I check the Cassandra table afterwards, there is no content in it. Everything has been deleted. I’m really worried about this behaviour because it may delete useful content (I do intend to overwrite the content and fully understand the consequences). Thanks, Abhijeet Kumar

Re: Spark / Scala code not recognising the path?

2018-06-09 Thread Abhijeet Kumar
named. > > On 9. Jun 2018, at 09:44, Abhijeet Kumar > wrote: > > I need to rename the file. I can write a separate program for this, I > think. > > Thanks, > Abhijeet Kumar > > On 09-Jun-2018, at 1:10 PM, Jörn Franke wrote: > > That would be an anti pattern and

Re: Spark / Scala code not recognising the path?

2018-06-09 Thread Abhijeet Kumar
I need to rename the file. I can write a separate program for this, I think. Thanks, Abhijeet Kumar > On 09-Jun-2018, at 1:10 PM, Jörn Franke wrote: > > That would be an anti pattern and would lead to bad software. > Please don’t do it for the sake of the people that use your soft

Re: Spark / Scala code not recognising the path?

2018-06-09 Thread Abhijeet Kumar
Can you please tell me the estimated time, so that my program can wait for that period? Thanks, Abhijeet Kumar > On 09-Jun-2018, at 12:01 PM, Jörn Franke wrote: > > You need some time until the information of the file creation is propagated. > > On 9. Jun 2018, at 08:07,

Spark / Scala code not recognising the path?

2018-06-09 Thread Abhijeet Kumar
rify whether it is using the HDFS path or not, I have added one more println statement to my code, giving it a path that already exists in HDFS. It shows true in that case. So, what could be the reason? Thanks, Abhijeet Kumar