Re: Anybody built the branch for Adaptive Boosting, extension to MLlib by Manish Amde?

2014-09-24 Thread Aris
Hi Manish!

Thanks for the reply and the explication on why the branches won't compile
-- that makes perfect sense.

You mentioned making the branch compatible with the latest master; could
you share some more details? Which branch do you mean -- is it on your
GitHub? And would I just be able to do a normal ./sbt/sbt assembly build,
or do I need to do something else?

Thank you and take care
Aris

On Thu, Sep 18, 2014 at 3:50 PM, Manish Amde manish...@gmail.com wrote:

 Hi Aris,

 Thanks for the interest. First and foremost, tree ensembles are a top
 priority for the 1.2 release and we are working hard towards it. A random
 forests PR is already under review and AdaBoost and gradient boosting will
 be added soon after.

 Unfortunately, the GBDT branch I shared is way off master. There has been
 a lot of under-the-hood optimizations for decision trees and I am not
 surprised that the branch doesn't compile. It will be best if you could
 wait for a few days till I make the branch compatible with the latest
 master.

 Again, thanks for your interest in boosting algos. We are eager to add
 them to MLlib ASAP.


 On Thu, Sep 18, 2014 at 7:27 PM, Aris arisofala...@gmail.com wrote:

 Thank you Spark community you make life much more lovely - suffering in
 silence is not fun!

 I am trying to build the Spark Git branch from Manish Amde, available
 here:

  https://github.com/manishamde/spark/tree/ada_boost

 I am trying to build the non-master branch 'ada_boost' (in the link
 above), but './sbt/sbt assembly' does not work, as it sees all kinds of new
 code that doesn't build. I saw another script at the top-level called
 'make-distribution.sh' which requires maven and specifically Java 6 (does
 not allow javac version 7), but that also fails.

 Does anybody have any pointers for building this developmental build of
 Spark with support for adaptive boosting (adaboost ensemble decision tree
 method) in MLlib?

 Thanks!





Anybody built the branch for Adaptive Boosting, extension to MLlib by Manish Amde?

2014-09-18 Thread Aris
Thank you Spark community you make life much more lovely - suffering in
silence is not fun!

I am trying to build the Spark Git branch from Manish Amde, available here:

https://github.com/manishamde/spark/tree/ada_boost

I am trying to build the non-master branch 'ada_boost' (in the link above),
but './sbt/sbt assembly' does not work, as it sees all kinds of new code
that doesn't build. I saw another script at the top-level called
'make-distribution.sh' which requires maven and specifically Java 6 (does
not allow javac version 7), but that also fails.

Does anybody have any pointers for building this developmental build of
Spark with support for adaptive boosting (adaboost ensemble decision tree
method) in MLlib?

Thanks!


Re: Anybody built the branch for Adaptive Boosting, extension to MLlib by Manish Amde?

2014-09-18 Thread Manish Amde
Hi Aris,


Thanks for the interest. First and foremost, tree ensembles are a top priority 
for the 1.2 release and we are working hard towards it. A random forests PR is 
already under review and AdaBoost and gradient boosting will be added soon 
after. 




Unfortunately, the GBDT branch I shared is way off master. There has been a lot 
of under-the-hood optimizations for decision trees and I am not surprised that 
the branch doesn't compile. It will be best if you could wait for a few days 
till I make the branch compatible with the latest master.





Again, thanks for your interest in boosting algos. We are eager to add them to 
MLlib ASAP.

On Thu, Sep 18, 2014 at 7:27 PM, Aris arisofala...@gmail.com wrote:

 Thank you Spark community you make life much more lovely - suffering in
 silence is not fun!
 I am trying to build the Spark Git branch from Manish Amde, available here:
 https://github.com/manishamde/spark/tree/ada_boost
 I am trying to build the non-master branch 'ada_boost' (in the link above),
 but './sbt/sbt assembly' does not work, as it sees all kinds of new code
 that doesn't build. I saw another script at the top-level called
 'make-distribution.sh' which requires maven and specifically Java 6 (does
 not allow javac version 7), but that also fails.
 Does anybody have any pointers for building this developmental build of
 Spark with support for adaptive boosting (adaboost ensemble decision tree
 method) in MLlib?
 Thanks!