[ https://issues.apache.org/jira/browse/MAHOUT-384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12859731#action_12859731 ]
Sean Owen commented on MAHOUT-384:
----------------------------------
What do others think of 'outlier'? Is this a concept on the level of
'clustering' or 'classification', or can we taxonomize it better?
You can use Hadoop 0.20.2 (I do), but for consistency with the existing code,
compatibility with AWS, and to avoid bugs, I suggest you not use the newer
Hadoop APIs.
> Implement of AVF algorithm
> --------------------------
>
> Key: MAHOUT-384
> URL: https://issues.apache.org/jira/browse/MAHOUT-384
> Project: Mahout
> Issue Type: New Feature
> Components: Collaborative Filtering
> Reporter: tony cui
> Attachments: mahout-384.patch
>
>
> This program implements an outlier detection algorithm called AVF, a fast
> parallel outlier detection method for categorical datasets using MapReduce,
> introduced in this paper:
> http://thepublicgrid.org/papers/koufakou_wcci_08.pdf
> The following is an example of how to run this program under Hadoop:
> $ hadoop jar programName.jar avfDriver inputData interTempData outputData
> The output data contains the ordered avfValue in the first column, followed by
> the original input data.
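
For readers unfamiliar with AVF, here is a minimal single-machine sketch of the
scoring idea. It is not the code in mahout-384.patch. It assumes the usual
Attribute Value Frequency definition: a record's score is the mean, over its
attributes, of how often each of its values occurs in that column across the
whole dataset, so low scores suggest outliers. The function name avf_scores,
the sample records, and the ascending sort order are illustrative assumptions;
the issue only says the output is "ordered".

```python
from collections import Counter


def avf_scores(records):
    """Return (score, record) pairs sorted by ascending AVF score.

    Assumes every record is a non-empty tuple of categorical values
    and that all records have the same number of attributes.
    """
    # Pass 1 (the counting job in a MapReduce setting): count how often
    # each (column index, value) pair occurs across all records.
    counts = Counter(
        (col, value) for rec in records for col, value in enumerate(rec)
    )
    # Pass 2: score each record as the mean frequency of its values.
    scored = [
        (sum(counts[(col, value)] for col, value in enumerate(rec)) / len(rec), rec)
        for rec in records
    ]
    # Assumption: lowest score first, so likely outliers lead the output.
    return sorted(scored, key=lambda pair: pair[0])


# Hypothetical sample data: two categorical attributes per record.
records = [
    ("red", "small"),
    ("red", "small"),
    ("red", "large"),
    ("blue", "tiny"),
]
ranked = avf_scores(records)
for score, rec in ranked:
    print(score, *rec)
```

The two passes mirror the three paths in the command above (input, intermediate
counts, output), although how the patch actually splits the work between jobs
is not stated in the issue.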