Load csv file:
df <- read.df(sqlContext, "file-path", source = "com.databricks.spark.csv",
header = "true")
Calculate covariance:
cov <- cov(df, "col1", "col2")

Cheers
Yanbo


2015-12-28 17:21 GMT+08:00 zhangjp <592426...@qq.com>:

> hi  all,
>     I want  to use sparkR or spark MLlib  load csv file on hdfs then
> calculate  covariance, how to do it .
>     thks.
>

Reply via email to