Each key is a feature, and each value is the list of top-K frequent patterns in which that feature occurs.
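To make that key/value structure concrete, here is a minimal Python sketch that splits one dumped Value string into (pattern, count) pairs. It is not part of Mahout: the `parse_patterns` name and the regular expression are illustrative assumptions based only on the text format shown in the seqdump output quoted below, and the number after each pattern is read as its support count.

```python
import re

# Assumption: each entry is printed as "([item, item, ...],count)",
# which is how the seqdump output quoted below appears.
ENTRY_RE = re.compile(r"\(\[([^\]]*)\],\s*(\d+)\)")

def parse_patterns(value):
    """Return a list of (items, support) pairs from one dumped Value string."""
    patterns = []
    for items_text, support_text in ENTRY_RE.findall(value):
        # Items are kept as strings, since features are Text keys in the dump.
        items = [item.strip() for item in items_text.split(",") if item.strip()]
        patterns.append((items, int(support_text)))
    return patterns

# The first three entries of the "Key: 68" line from the quoted output.
sample = "([68],90692), ([17, 68],90683), ([12, 68],90490)"
parsed = parse_patterns(sample)
for items, support in parsed:
    print(items, support)
```

The regular expression tolerates the mid-entry line wraps seen in the mail only if the `>` quote markers are stripped first.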
From here, one can use this information for pattern recommendation (query recommendation, as in the original PFP-Growth paper), or write a MapReduce job that computes support and confidence and generates association rules (yet to be done) of the form

f1, f2, f3, f4 => f5 (support, confidence)

http://publib.boulder.ibm.com/infocenter/db2luw/v8/index.jsp?topic=/com.ibm.im.model.doc/c_confidence_in_an_association_rule.html

Robin

On Mon, Feb 15, 2010 at 4:53 AM, Grant Ingersoll <[email protected]> wrote:

> I ran:
>
> ./mahout fpg -i <PATH>/content/freqitemset/accidents.dat -o patterns -k 50
> -method mapreduce -g 10 -regex [\ ]
>
> Per
> http://cwiki.apache.org/confluence/display/MAHOUT/ParallelFrequentPatternMining
>
> And now I see
>
> ls patterns/
> fpgrowth/ frequentPatterns/ parallelcounting/
> sortedoutput/
>
> Looking in: ./mahout seqdump --seqFile patterns/fpgrowth/part-r-00000
>
> I see:
> Input Path: patterns/fpgrowth/part-r-00000
> Key class: class org.apache.hadoop.io.Text Value Class: class
> org.apache.mahout.fpm.pfpgrowth.convertors.string.TopKStringPatterns
> Key: 68: Value: ([68],90692), ([17, 68],90683), ([12, 68],90490), ([17, 12,
> 68],90481), ([18, 68],90291), ([17, 18, 68],90282), ([12, 18, 68],90229),
> ([17, 12, 18, 68],90220), ([31, 68],89071), ([17, 31, 68],89062), ([12, 31,
> 68],88874), ([17, 12, 31, 68],88865), ([18, 31, 68],88681), ([17, 18, 31,
> 68],88672), ([12, 18, 31, 68],88619), ([17, 12, 18, 31, 68],88610), ([16,
> 68],87933), ([17, 16, 68],87924), ([12, 16, 68],87847), ([17, 12, 16,
> 68],87838), ([18, 16, 68],87644), ([17, 18, 16, 68],87635), ([12, 18, 16,
> 68],87589), ([17, 12, 18, 16, 68],87580), ([16, 31, 68],86362), ([17, 16,
> 31, 68],86353), ([12, 16, 31, 68],86279), ([17, 12, 16, 31, 68],86270),
> ([18, 16, 31, 68],86082), ([17, 18, 16, 31, 68],86073), ([12, 18, 16, 31,
> 68],86027), ([17, 12, 18, 16, 31, 68],86018), ([31, 21, 68],85090), ([17,
> 31, 21, 68],85081), ([12, 31, 21, 68],84903), ([17, 12, 31, 21, 68],84894),
> ([17, 12, 18, 31, 21, 68],84653), ([16, 21, 68],83908), ([12, 16, 21,
> 68],83829), ([18, 16, 21, 68],83639), ([17, 18, 16, 21, 68],83630), ([12,
> 18, 16, 21, 68],83587), ([17, 12, 18, 16, 21, 68],83578), ([16, 31, 21,
> 68],82495), ([17, 16, 31, 21, 68],82486), ([12, 16, 31, 21, 68],82418),
> ([17, 12, 16, 31, 21, 68],82409), ([18, 16, 31, 21, 68],82232), ([17, 18,
> 16, 31, 21, 68],82223), ([12, 18, 16, 31, 21, 68],82180)
> Key: 335: Value: ([335],90909), ([17, 335],90903), ([12, 335],90869), ([17,
> 12, 335],90863), ([18, 335],90754), ([17, 18, 335],90748), ([12, 18,
> 335],90718), ([17, 12, 18, 335],90712), ([16, 335],89080), ([17, 16,
> 335],89074), ([12, 16, 335],89049), ([17, 12, 16, 335],89043), ([18, 16,
> 335],88932), ([17, 18, 16, 335],88926), ([12, 18, 16, 335],88901), ([17, 12,
> 18, 16, 335],88895), ([31, 335],84776), ([17, 31, 335],84771), ([12, 31,
> 335],84744), ([17, 12, 31, 335],84739), ([18, 31, 335],84647), ([17, 18, 31,
> 335],84642), ([12, 18, 31, 335],84618), ([17, 12, 18, 31, 335],84613), ([16,
> 31, 335],83373), ([17, 16, 31, 335],83368), ([12, 16, 31, 335],83348), ([17,
> 12, 16, 31, 335],83343), ([18, 16, 31, 335],83249), ([17, 18, 16, 31,
> 335],83244), ([12, 18, 16, 31, 335],83224), ([17, 12, 18, 16, 31,
> 335],83219), ([17, 18, 16, 21, 335],78117), ([12, 18, 16, 21, 335],78093),
> ([17, 12, 18, 16, 21, 335],78087), ([31, 21, 335],74945), ([17, 31, 21,
> 335],74940), ([12, 31, 21, 335],74915), ([17, 12, 31, 21, 335],74910), ([18,
> 31, 21, 335],74828), ([17, 18, 31, 21, 335],74823), ([12, 18, 31, 21,
> 335],74800), ([17, 12, 18, 31, 21, 335],74795), ([16, 31, 21, 335],73641),
> ([17, 16, 31, 21, 335],73636), ([12, 16, 31, 21, 335],73617), ([17, 12, 16,
> 31, 21, 335],73612), ([18, 16, 31, 21, 335],73528), ([17, 18, 16, 31, 21,
> 335],73523), ([12, 18, 16, 31, 21, 335],73504)
> Key: 64: Value: ([64],95673), ([17, 64],95662), ([12, 64],95501), ([17, 12,
> 64],95490), ([18, 64],95407), ([17, 18, 64],95396), ([12, 18, 64],95352),
> ([17, 12, 18, 64],95341), ([16, 64],94511), ([17, 16, 64],94500), ([12, 16,
> 64],94439), ([17, 12, 16, 64],94428), ([18, 16, 64],94343), ([17, 18, 16,
> 64],94332), ([12, 18, 16, 64],94290), ([17, 12, 18, 16, 64],94279), ([31,
> 64],91275), ([17, 31, 64],91265), ([12, 31, 64],91124), ([17, 12, 31,
> 64],91114), ([18, 31, 64],91030), ([17, 18, 31, 64],91020), ([12, 18, 31,
> 64],90987), ([17, 12, 18, 31, 64],90977), ([16, 31, 64],90304), ([17, 16,
> 31, 64],90294), ([12, 16, 31, 64],90246), ([17, 12, 16, 31, 64],90236),
> ([18, 16, 31, 64],90150), ([17, 18, 16, 31, 64],90140), ([12, 18, 16, 31,
> 64],90109), ([17, 12, 18, 16, 31, 64],90099), ([17, 18, 16, 21, 64],82484),
> ([12, 18, 16, 21, 64],82445), ([17, 12, 18, 16, 21, 64],82435), ([31, 21,
> 64],80204), ([17, 31, 21, 64],80195), ([12, 31, 21, 64],80072), ([17, 12,
> 31, 21, 64],80063), ([18, 31, 21, 64],79989), ([17, 18, 31, 21, 64],79980),
> ([12, 18, 31, 21, 64],79949), ([17, 12, 18, 31, 21, 64],79940), ([16, 31,
> 21, 64],79344), ([17, 16, 31, 21, 64],79335), ([12, 16, 31, 21, 64],79291),
> ([17, 12, 16, 31, 21, 64],79282), ([18, 16, 31, 21, 64],79206), ([17, 18,
> 16, 31, 21, 64],79197), ([12, 18, 16, 31, 21, 64],79168)
> Key: 5: Value: ([5],96818), ([17, 5],96815), ([12, 5],96711), ([17, 12,
> 5],96708), ([18, 5],96613), ([17, 18, 5],96610), ([12, 18, 5],96582), ([17,
> 12, 18, 5],96579), ([16, 5],95797), ([17, 16, 5],95794), ([12, 16,
> 5],95752), ([17, 12, 16, 5],95749), ([18, 16, 5],95655), ([17, 18, 16,
> 5],95652), ([12, 18, 16, 5],95625), ([17, 12, 18, 16, 5],95622), ([31,
> 5],94517), ([17, 31, 5],94514), ([12, 31, 5],94415), ([17, 12, 31,
> 5],94412), ([18, 31, 5],94320), ([17, 18, 31, 5],94317), ([12, 18, 31,
> 5],94292), ([17, 12, 18, 31, 5],94289), ([16, 31, 5],93587), ([17, 16, 31,
> 5],93584), ([12, 16, 31, 5],93544), ([17, 12, 16, 31, 5],93541), ([18, 16,
> 31, 5],93451), ([17, 18, 16, 31, 5],93448), ([12, 18, 16, 31, 5],93423),
> ([17, 12, 18, 16, 31, 5],93420), ([17, 18, 16, 21, 5],90130), ([12, 18, 16,
> 21, 5],90104), ([17, 12, 18, 16, 21, 5],90101), ([31, 21, 5],89273), ([17,
> 31, 21, 5],89270), ([12, 31, 21, 5],89179), ([17, 12, 31, 21, 5],89176),
> ([18, 31, 21, 5],89089), ([17, 18, 31, 21, 5],89086), ([12, 18, 31, 21,
> 5],89062), ([17, 12, 18, 31, 21, 5],89059), ([16, 31, 21, 5],88402), ([17,
> 16, 31, 21, 5],88399), ([12, 16, 31, 21, 5],88360), ([17, 12, 16, 31, 21,
> 5],88357), ([18, 16, 31, 21, 5],88272), ([17, 18, 16, 31, 21, 5],88269),
> ([12, 18, 16, 31, 21, 5],88245)
>
> What's the interpretation of this output? Is this the right place to look?
> What about the other directories?
>
> -Grant
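As a rough illustration of the association-rule step mentioned in the reply (which is not yet implemented in Mahout), the confidence of a rule A => C can be computed from two pattern counts as count(A and C together) / count(A). The sketch below is plain Python rather than a MapReduce job; the `confidence` helper and the `supports` dictionary are hypothetical, and the counts are copied from the "Key: 68" line quoted above, read as absolute support counts.

```python
# Support counts copied from the "Key: 68" line of the quoted output.
# Keys are frozensets so that item order within a pattern does not matter.
supports = {
    frozenset(["68"]): 90692,
    frozenset(["17", "68"]): 90683,
    frozenset(["12", "68"]): 90490,
    frozenset(["17", "12", "68"]): 90481,
}

def confidence(antecedent, consequent, supports):
    """confidence(A => C) = support(A union C) / support(A)."""
    a = frozenset(antecedent)
    both = a | frozenset(consequent)
    return supports[both] / supports[a]

# Rule "12, 68 => 17", in the f1, f2 => f3 (support, confidence) form.
rule_support = supports[frozenset(["17", "12", "68"])]
rule_conf = confidence(["12", "68"], ["17"], supports)
print(f"12, 68 => 17 (support={rule_support}, confidence={rule_conf:.4f})")
```

Only rules whose antecedent also appears as a pattern in the output can be scored this way. Turning the absolute counts into relative support would additionally need the total number of transactions, which the dump does not show.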
