[ 
https://issues.apache.org/jira/browse/MAHOUT-384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12859865#action_12859865
 ] 

Ted Dunning commented on MAHOUT-384:
------------------------------------


Outlier detection is (normally) unsupervised exploratory learning.  
Occasionally it is used to generate a feature for supervised learning, much as 
clustering algorithms can be used.

As such, I would group it as a clustering into "normal" and "outlier" clusters. 
 It won't evaluate the same way, but it definitely has the same workflow.

  

> Implement of AVF algorithm
> --------------------------
>
>                 Key: MAHOUT-384
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-384
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Collaborative Filtering
>            Reporter: tony cui
>         Attachments: mahout-384.patch
>
>
> This program realize a outlier detection algorithm called avf, which is kind 
> of 
> Fast Parallel Outlier Detection for Categorical Datasets using Mapreduce and 
> introduced by this paper : 
>     http://thepublicgrid.org/papers/koufakou_wcci_08.pdf
> Following is an example how to run this program under haodoop:
> $hadoop jar programName.jar avfDriver inputData interTempData outputData
> The output data contains ordered avfValue in the first column, followed by 
> original input data. 

-- 
This message is automatically generated by JIRA.
-
You can reply to this email to add a comment to the issue online.

Reply via email to