[ https://issues.apache.org/jira/browse/MAHOUT-384?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12859722#action_12859722 ]
tony cui commented on MAHOUT-384:
---------------------------------

Thanks, Robin. I will go through the suggestion list item by item as soon as possible.

Thanks, Sean. I think outlier detection is a category of data mining algorithm, like classification or clustering, and it can contain a number of methods; AVF is just a simple one of them. That is why I created an "outlier" folder at the same level as classification.

One other question that may be significant for me: must I use Hadoop 0.19.x? I have not used this version before.

> Implement of AVF algorithm
> --------------------------
>
>                 Key: MAHOUT-384
>                 URL: https://issues.apache.org/jira/browse/MAHOUT-384
>             Project: Mahout
>          Issue Type: New Feature
>          Components: Collaborative Filtering
>            Reporter: tony cui
>         Attachments: mahout-384.patch
>
>
> This program implements an outlier detection algorithm called AVF, a kind of
> fast parallel outlier detection for categorical datasets using MapReduce,
> introduced in this paper:
> http://thepublicgrid.org/papers/koufakou_wcci_08.pdf
> Following is an example of how to run this program under Hadoop:
> $hadoop jar programName.jar avfDriver inputData interTempData outputData
> The output data contains the ordered avfValue in the first column, followed by
> the original input data.
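
For readers unfamiliar with AVF, here is a minimal single-machine sketch of the scoring idea. It is not the code in mahout-384.patch. It assumes the usual definition of the AVF (Attribute Value Frequency) score: the mean, over a record's attributes, of how often each of its values occurs in that column across the dataset, with low scores suggesting outliers. The function name avf_scores and the sample data are hypothetical. The two passes correspond roughly to what a MapReduce version would do in a counting job followed by a scoring job.

```python
from collections import Counter


def avf_scores(records):
    """Return (score, record) pairs sorted by ascending AVF score.

    Assumes every record is a non-empty tuple of categorical values
    and that all records have the same number of attributes.
    """
    # Pass 1: count how often each (column index, value) pair occurs.
    counts = Counter(
        (col, value)
        for record in records
        for col, value in enumerate(record)
    )

    # Pass 2: score each record by the mean frequency of its values.
    scored = []
    for record in records:
        total = sum(counts[(col, value)] for col, value in enumerate(record))
        scored.append((total / len(record), record))

    # Lowest scores first: records made of rare values are outlier candidates.
    return sorted(scored, key=lambda pair: pair[0])


if __name__ == "__main__":
    # Hypothetical toy dataset with two categorical attributes.
    data = [
        ("red", "small"),
        ("red", "small"),
        ("red", "large"),
        ("blue", "large"),
    ]
    # Mirrors the described output layout: score first, then the input row.
    for score, record in avf_scores(data):
        print(score, ",".join(record))
```

In this toy dataset the record ("blue", "large") gets the lowest score because "blue" occurs only once in its column, so it sorts first, matching the "ordered avfValue in the first column" layout described in the issue.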