Re: Spark Streaming Design Suggestion

2017-06-14 Thread Shashi Vishwakarma
use case? >> >> Especially applications deleting/creating topics dynamically can be a >> nightmare to operate >> >> > On 13. Jun 2017, at 22:03, Shashi Vishwakarma <shashi.vish...@gmail.com> >> wrote: >> > >> > Hi >> > >> > I have

Spark Streaming Design Suggestion

2017-06-13 Thread Shashi Vishwakarma
Hi, I have to design a Spark Streaming application for the use case below, and I am looking for the best possible approach. I have an application which is pushing data into 1000+ different topics, each with a different purpose. Spark Streaming will receive data from each topic and after processing it will

Spark Shuffle Encryption

2017-05-12 Thread Shashi Vishwakarma
Hi, I was researching how to encrypt Spark shuffle data and found that Spark 2.1 has that feature. https://issues.apache.org/jira/browse/SPARK-5682 Does anyone have more documentation on it? How should I use this feature in a real production environment, keeping in mind that I need to

Re: Securing Spark Job on Cluster

2017-04-28 Thread Shashi Vishwakarma
u using HDFS? > > On 28. Apr 2017, at 16:57, Shashi Vishwakarma <shashi.vish...@gmail.com> > wrote: > > Agreed Jorn. Disk encryption is one option that will help to secure data > but how do I know at which location Spark is spilling temp file, shuffle > data and application data ?

Re: Securing Spark Job on Cluster

2017-04-28 Thread Shashi Vishwakarma
ncryption as provided by the operating system. > Additionally, you may think about shredding disks after they are not used > anymore. > > > On 28. Apr 2017, at 14:45, Shashi Vishwakarma <shashi.vish...@gmail.com> > wrote: > > > > Hi All > > > > I was

Re: Securing Spark Job on Cluster

2017-04-28 Thread Shashi Vishwakarma
e Apache Kerberos, or > we could use the security feature in Spark. > > > > On Apr 28, 2017, at 8:45 AM, Shashi Vishwakarma < > shashi.vish...@gmail.com> wrote: > > > > Hi All > > > > I was dealing with one the spark requirement here where Client (like

Securing Spark Job on Cluster

2017-04-28 Thread Shashi Vishwakarma
Hi All, I am dealing with a Spark requirement where the client (a banking client, for whom security is a major concern) needs all Spark processing to happen securely. For example, all communication between the Spark client and server (driver and executor communication) should be on

Re: Spark Job failing with exit status 15

2015-11-08 Thread Shashi Vishwakarma
wrote: > Which release of Spark were you using ? > > Can you post the command you used to run WordCount ? > > Cheers > > On Sat, Nov 7, 2015 at 7:59 AM, Shashi Vishwakarma < > shashi.vish...@gmail.com> wrote: > >> I am trying to run simple word count job in spark bu

Spark Job failing with exit status 15

2015-11-07 Thread Shashi Vishwakarma
I am trying to run a simple word count job in Spark, but I am getting an exception while running the job. For more detailed output, check application tracking page: http://quickstart.cloudera:8088/proxy/application_1446699275562_0006/ Then, click on links to logs of each attempt. Diagnostics: Exception from

Realtime Data Visualization Tool for Spark

2015-09-11 Thread Shashi Vishwakarma
Hi, I have streaming data which needs to be processed and sent for visualization. I am planning to use Spark Streaming for this, but I am a little confused about choosing a visualization tool. I read somewhere that D3.js can be used, but I wanted to know which is the best tool for visualization while dealing