Re: My notes on Spark Performance & Tuning Guide

2016-05-17 Thread Vinayak Agrawal
Please include me too. Vinayak Agrawal Big Data Analytics IBM "To Strive, To Seek, To Find and Not to Yield!" ~Lord Alfred Tennyson > On May 17, 2016, at 2:15 PM, Mich Talebzadeh <mich.talebza...@gmail.com> > wrote: > > Hi all, > > Many thanks for your trem

Do I need to install Cassandra node on Spark Master node to work with Cassandra?

2016-05-04 Thread Vinayak Agrawal
need to install a Cassandra node on my Spark Master node so that Spark can connect with Cassandra, or does Cassandra only need to be on the Spark worker nodes? It seems logical considering data locality. Thanks -- Vinayak Agrawal "To Strive, To Seek, To Find and Not to Yield!" ~Lord Alfred Tennyson

Saving a pipeline model?

2016-01-27 Thread Vinayak Agrawal
are the Spark users currently working around this? Is there a way to convert a PipelineModel to an MLlib model and save it? Thanks - Vinayak Agrawal "To Strive, To Seek, To Find and Not to Yield!" ~Lord Alfred Tennyson

Getting Coefficients of a logistic regression model for a PipelineModel in the Spark ML library

2016-01-21 Thread Vinayak Agrawal
+model+statistics%22=newest=1 Any suggestions? -- Vinayak Agrawal "To Strive, To Seek, To Find and Not to Yield!" ~Lord Alfred Tennyson

Re: can we create dummy variables from categorical variables, using sparkR

2016-01-19 Thread Vinayak Agrawal
for categorical variables in sparkR like we > do using "dummies" package in R > > -- > Warm regards, > Devesh. > -- Vinayak Agrawal Big Data Analytics IBM "To Strive, To Seek, To Find and Not to Yield!" ~Lord Alfred Tennyson