Sandy Moens created MAHOUT-1355:
-----------------------------------

             Summary: Frequent Pattern Mining algorithms for Mahout
                 Key: MAHOUT-1355
                 URL: https://issues.apache.org/jira/browse/MAHOUT-1355
             Project: Mahout
          Issue Type: New Feature
          Components: Frequent Itemset/Association Rule Mining
    Affects Versions: 0.9
            Reporter: Sandy Moens
            Priority: Minor
             Fix For: 1.0


We implemented frequent pattern mining algorithms for Hadoop and adapted them 
to Mahout. Benchmarked against "PFP" (now deprecated), these implementations 
perform better in terms of both speed and memory footprint. The details of 
the implementations can be found in the paper "Frequent Pattern Mining for 
BigData" ( http://adrem.ua.ac.be/bigfim ).

We have been maintaining the project on GitLab for a while ( 
https://gitlab.com/adrem/bigfim ). Documentation for the adaptation ( 
Readme-Mahout.md ) and for usage in Mahout ( Mahout-wiki.md ) can be found 
there.

We are open to any modification and/or improvement requests that would make 
it more worthwhile for the Mahout project. As a research group, we also 
volunteer to maintain the FPM algorithms.



--
This message was sent by Atlassian JIRA
(v6.1#6144)
