[jira] [Comment Edited] (SPARK-12192) recommendProductsForUsers(num: Int) can run OK when the dataset is small,but run wrong when large amount of data

jinglei.chen (JIRA) Mon, 07 Dec 2015 19:34:11 -0800

    [ 
https://issues.apache.org/jira/browse/SPARK-12192?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=15046278#comment-15046278
 ]


jinglei.chen edited comment on SPARK-12192 at 12/8/15 3:33 AM:
---------------------------------------------------------------

recommendProductsForUsers(num: Int) runs OK when the dataset is small, but 
fails when the dataset is large (about 1517459 users and 1190671 products). 
It fails every time with the same error: User class threw exception: 
org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory 
hdfs://recomm-30-1.liepin.inc:9000/recommend/result/job/newb already exists
My QQ: 604586220


was (Author: cjldl7119):
recommendProductsForUsers(num: Int) runs OK when the dataset is small, but 
fails when the dataset is large (about 1517459 users and 1190671 products). 
It fails every time with the same error: User class threw exception: 
org.apache.hadoop.mapred.FileAlreadyExistsException: Output directory 
hdfs://recomm-30-1.liepin.inc:9000/recommend/result/job/newb already exists

> recommendProductsForUsers(num: Int) can run OK when the dataset is small,but 
> run wrong when  large amount of data
> -----------------------------------------------------------------------------------------------------------------
>
>                 Key: SPARK-12192
>                 URL: https://issues.apache.org/jira/browse/SPARK-12192
>             Project: Spark
>          Issue Type: Bug
>          Components: MLlib
>    Affects Versions: 1.5.0
>            Reporter: jinglei.chen
>
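
For context on the error quoted above: Hadoop's output check refuses to write
into a directory that already exists, and that refusal surfaces as
FileAlreadyExistsException. The report does not show how the results are
written, so the following is only a minimal sketch, assuming the job saves the
recommendations with saveAsTextFile and that the directory is left over from an
earlier run or attempt. The object name, the helper saveTopN, and the
tab-separated output format are hypothetical and not taken from the report.

```scala
import org.apache.hadoop.fs.Path
import org.apache.spark.SparkContext
import org.apache.spark.mllib.recommendation.MatrixFactorizationModel

object SaveRecommendations {
  // Hypothetical helper (not from the report): removes an existing output
  // directory, then writes the top-`num` recommendations per user as text.
  def saveTopN(sc: SparkContext,
               model: MatrixFactorizationModel,
               num: Int,
               outputDir: String): Unit = {
    val path = new Path(outputDir)
    // Resolve the file system (HDFS, local, ...) that owns this path.
    val fs = path.getFileSystem(sc.hadoopConfiguration)
    if (fs.exists(path)) {
      // Recursive delete. Assumption: the old output is safe to discard.
      fs.delete(path, true)
    }
    model.recommendProductsForUsers(num)
      .map { case (user, recs) =>
        // One line per user: userId <TAB> product:rating,product:rating,...
        user + "\t" + recs.map(r => r.product + ":" + r.rating).mkString(",")
      }
      .saveAsTextFile(outputDir)
  }
}
```

Deleting the directory only addresses the "already exists" symptom. If the
directory was created by an earlier attempt of the same job that failed on the
large dataset, the cause of that first failure still has to be found in the
logs of that attempt.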

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)