[ 
https://issues.apache.org/jira/browse/MAHOUT-172?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Isabel Drost updated MAHOUT-172:
--------------------------------

    Attachment: lda.patch

The patch extends the url pattern to not match everything in the output 
directory but only stuff that starts with part* - since the lda job seems to 
run fine for me.

> When running on a Hadoop cluster LDA fails with Caused by: 
> java.io.IOException: Cannot open filename /user/*/output/state-*/_logs
> ---------------------------------------------------------------------------------------------------------------------------------
>
>                 Key: MAHOUT-172
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-172
>             Project: Mahout
>          Issue Type: Bug
>          Components: Clustering
>    Affects Versions: 0.1
>            Reporter: Isabel Drost
>             Fix For: 0.2
>
>         Attachments: lda.patch
>
>
> I tried running the reuters example of lda on a hadoop cluster today. Seems 
> like the implementation tries to read all files in output/state-* which fails 
> if in that directory "_logs" is found.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to