Hi, Use DataFrame-based API (aka spark.ml) first and if your ml algorithm doesn't support it switch to a RDD-based API (spark.mllib). What algorithm are you going to use?
Jacek On 9 Jun 2016 9:12 a.m., "pseudo oduesp" <pseudo20...@gmail.com> wrote: > Hi, > after spark 1.3 we have dataframe ( thanks good ) , instead rdd : > > in machine learning algorithmes we should give him an RDD or dataframe? > > i mean when i build modele : > > > Model = algoritme(rdd) > or > Model = algorithme(df) > > > > if you have an exemple with data frame i prefer work with it. > > thanks . > >