[
https://issues.apache.org/jira/browse/MAHOUT-384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12859865#action_12859865
]
Ted Dunning commented on MAHOUT-384:
------------------------------------
Outlier detection is (normally) unsupervised exploratory learning.
Occasionally it is used to generate a feature for supervised learning, much as
clustering algorithms can be used.
As such, I would group it as a clustering into "normal" and "outlier" clusters.
It won't evaluate the same way, but it definitely has the same workflow.
> Implement of AVF algorithm
> --------------------------
>
> Key: MAHOUT-384
> URL: https://issues.apache.org/jira/browse/MAHOUT-384
> Project: Mahout
> Issue Type: New Feature
> Components: Collaborative Filtering
> Reporter: tony cui
> Attachments: mahout-384.patch
>
>
> This program realize a outlier detection algorithm called avf, which is kind
> of
> Fast Parallel Outlier Detection for Categorical Datasets using Mapreduce and
> introduced by this paper :
> http://thepublicgrid.org/papers/koufakou_wcci_08.pdf
> Following is an example how to run this program under haodoop:
> $hadoop jar programName.jar avfDriver inputData interTempData outputData
> The output data contains ordered avfValue in the first column, followed by
> original input data.
--
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.