Re: Tips for Spark's Random Forest slow performance

2016-01-28 Thread Alexander Ratnikov
t]]. > * Must be >= 1. > * (default = 10) > * @group expertSetParam > */ > def setCheckpointInterval(value: Int): this.type = > set(checkpointInterval, value) > i'm not actually sure how this will affect training performance with the > new ml.RandomForest impl, but i'm cur

Re: Tips for Spark's Random Forest slow performance

2015-12-25 Thread Alexander Ratnikov
I/O, etc? > > you may need more Workers. > > On Tue, Dec 22, 2015 at 8:57 AM, Alexander Ratnikov > <ratnikov.alexan...@gmail.com> wrote: >> >> Hi All, >> >> It would be good to get some tips on tuning Apache Spark for Random >&

Tips for Spark's Random Forest slow performance

2015-12-22 Thread Alexander Ratnikov
Hi All, It would be good to get some tips on tuning Apache Spark for Random Forest classification. Currently, we have a model that looks like: featureSubsetStrategy all impurity gini maxBins 32 maxDepth 11 numberOfClasses 2 numberOfTrees 100 We are running Spark 1.5.1 as a standalone cluster.