[ https://issues.apache.org/jira/browse/SPARK-12192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15046278#comment-15046278 ]
jinglei.chen edited comment on SPARK-12192 at 12/8/15 3:33 AM: --------------------------------------------------------------- recommendProductsForUsers(num: Int) can run OK when the dataset is small,but run wrong when large amount of data(about 1517459 users and 1190671 products, Every time an error, the error is User class threw exception: org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory hdfs://recomm-30-1.liepin.inc:9000/recommend/result/job/newb already exists) my QQ:604586220 was (Author: cjldl7119): recommendProductsForUsers(num: Int) can run OK when the dataset is small,but run wrong when large amount of data(about 1517459 users and 1190671 products, Every time an error, the error is User class threw exception: org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory hdfs://recomm-30-1.liepin.inc:9000/recommend/result/job/newb already exists) > recommendProductsForUsers(num: Int) can run OK when the dataset is small,but > run wrong when large amount of data > ----------------------------------------------------------------------------------------------------------------- > > Key: SPARK-12192 > URL: https://issues.apache.org/jira/browse/SPARK-12192 > Project: Spark > Issue Type: Bug > Components: MLlib > Affects Versions: 1.5.0 > Reporter: jinglei.chen > -- This message was sent by Atlassian JIRA (v6.3.4#6332) --------------------------------------------------------------------- To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org For additional commands, e-mail: issues-h...@spark.apache.org