Hi Devesh,
RFormula will encode category variables(column of string type) as dummy
variables automatically. You do not need to do dummy transform explicitly
if you want to train machine learning model using SparkR. Although SparkR
only supports a limited ML algorithms(GLM) currently.
Thanks
Yanbo
Yes, you can use Rformula library. Please see
https://databricks.com/blog/2015/10/05/generalized-linear-models-in-sparkr-and-r-formula-support-in-mllib.html
On Tue, Jan 19, 2016 at 10:34 AM, Devesh Raj Singh
wrote:
> Hi,
>
> Can we create dummy variables for categorical variables in sparkR like
Hi,
Can we create dummy variables for categorical variables in sparkR like we
do using "dummies" package in R
--
Warm regards,
Devesh.