Dear all

I'm trying to run Flink ALS against Yahoo-R2 data set[1] on HDFS. The
program is running without showing any errors, but it does not finish. The
operators running indefinitely are:

CoGroup (CoGroup at
org.apache.flink.ml.recommendation.ALS$.updateFactors(ALS.scala:606))(11/240)

Join(Join at
org.apache.flink.ml.recommendation.ALS$.updateFactors(ALS.scala:576))(15/240)


I was using the below parameters to run:

val als = ALS().setIterations(10).setNumFactors(10).setBlocks(100)

And I didn't set the hdfs temporary path. Can someone tell me the
parameters to set to run ALS on such large data sets? Why are these
operators running indefinitely?

[1] https://webscope.sandbox.yahoo.com/catalog.php?datatype=r

Best
Ziyad

Reply via email to