Hi,

Use DataFrame-based API (aka spark.ml) first and if your ml algorithm
doesn't support it switch to a RDD-based API (spark.mllib). What algorithm
are you going to use?

Jacek
On 9 Jun 2016 9:12 a.m., "pseudo oduesp" <pseudo20...@gmail.com> wrote:

> Hi,
> after spark 1.3 we have dataframe ( thanks good )  ,  instead rdd  :
>
>  in machine learning algorithmes we should  give him an RDD or dataframe?
>
> i mean when i build modele :
>
>
>    Model  = algoritme(rdd)
> or
>    Model =  algorithme(df)
>
>
>
> if you have an  exemple with data frame i prefer work with it.
>
> thanks .
>
>

Reply via email to