Re: FP Growth Understanding

Robin Anil Mon, 15 Feb 2010 05:50:37 -0800

Ok.. A bit more background..

An Itemset is a subset I1, I2, I3... In


so [I2, I4, I7] is an itemset and the support(no of times its visible in the
dataset) is say Y

A Pattern is Pair<Itemset, support>

Take a look at in this format

68:
     ([68],90692),
     ([17, 68],90683),
     ([12, 68],90490),
     ([17, 12, 68],90481),
     ([18, 68],90291)

these are top patterns containing 68 and their support in descending order
68 occurs with 12,  90490 times

Robin


On Mon, Feb 15, 2010 at 6:27 PM, Grant Ingersoll <[email protected]>wrote:

>
> On Feb 14, 2010, at 11:37 PM, Robin Anil wrote:
>
> > Each key is a feature and each attribute is the topK frequent patterns
> where
> > the feature exist
>
> Still a bit confused.
> Given:
> Key: 68: Value: ([68],90692), ([17, 68],90683), ([12, 68],90490), ([17, 12,
> 68],90481), ([18, 68],90291), ([17, 18, 68],90282), ([12, 18, 68],90229),
> ([17, 12, 18, 68],90220), ([31, 68],89071), ([17, 31, 68],89062), ([12, 31,
> 68],88874), ([17, 12, 31, 68],88865), ([18, 31, 68],88681), ([17, 18, 31,
> 68],88672), ([12, 18, 31, 68],88619), ([17, 12, 18, 31, 68],88610), ([16,
> 68],87933),
>
> So, 68 is the feature in question.  That makes sense.  Then, what is the
> significance of the [] areas, as in [68],90692 or [17,12,68], 90481.  Why
> all the repetition?
>
> -Grant

Re: FP Growth Understanding

Reply via email to