[ 
https://issues.apache.org/jira/browse/MAHOUT-167?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12887624#action_12887624
 ] 

Drew Farris commented on MAHOUT-167:
------------------------------------

In WikipediaDatasetCreatorDriver in r962391 line 194, the wikipedia.categories 
property is added to the configuration after the job is created. Since the 
config is cloned at job creation time this property never makes it to the job 
members and thus breaks the code. 

Not sure if this anti-pattern is repeated elsewhere in the hadoop-0.20 
migration code, but it might be worth further investigation.



> Convert code to Hadoop 0.20 API
> -------------------------------
>
>                 Key: MAHOUT-167
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-167
>             Project: Mahout
>          Issue Type: Improvement
>          Components: Classification
>    Affects Versions: 0.3
>            Reporter: Jeff Eastman
>             Fix For: 0.4
>
>         Attachments: MAHOUT-167.patch, MAHOUT-167.patch, MAHOUT-167e.patch, 
> MAHOUT-167e.patch, MAHOUT-167u.patch
>
>
> We need to update the various implementations to remove the deprecated Hadoop 
> API calls.

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to