Hi Flink Community,

I am sending this email to let you know that we have just released XGBoost4J, which also runs on Flink. In short, XGBoost is a machine learning package that is used by more than half of the machine learning challenge winning solutions and is already widely used in industry. The distributed version scales to billions of examples (10x faster than spark.mllib in the experiment) with fewer resources (see http://arxiv.org/abs/1603.02754).
See our blog post for more details: http://dmlc.ml/2016/03/14/xgboost4j-portable-distributed-xgboost-in-spark-flink-and-dataflow.html

We would love to have you try it out and help us make it better.

Cheers