Re: Mahout fpg missing patterns

2011-12-19 Thread Tom Pierce
One possible explanation is that Mahout's FPG avoids reporting patterns that are subsumed by others. For example, if you have pattern [a, b, c] with support 3, you clearly must also have [a, b], [b, c] and [a, c] with support >= 3. Mahout will not report any of those unless the support is

Re: Mahout fpg missing patterns

2011-12-19 Thread gaurav singh
That seems to make sense. What do you mean by "Mahout will not report any of those unless the support is strictly greater than 3"? Is there a way for me to get all the patterns with support strictly greater than a particular value?

Thanks,
Gaurav

On Mon, Dec 19, 2011 at 4:58 PM, Tom Pierce

Re: Mahout fpg missing patterns

2011-12-19 Thread Tom Pierce
Maybe it's easiest to give an example. If you have input:

a b c
a b c d
a c d
a b c

You should expect Mahout to output (say, for support 2):

[a, b, c],3
[a, c, d],2
[a, c],4

You might also expect to see [a],4 or [a, b],3 but these are implied by the other patterns. Note that [a, b] and
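For anyone who wants to check the example by hand, here is a minimal brute-force sketch in Python. It is not Mahout's implementation. It assumes the rule described above is "drop a pattern when some longer pattern containing it has the same support" (often called closed itemset mining). The function names are made up for illustration, and the last helper assumes the reported list is complete for the chosen minimum support.

```python
from itertools import combinations

# Transactions from the example above (one basket per line).
transactions = [
    {"a", "b", "c"},
    {"a", "b", "c", "d"},
    {"a", "c", "d"},
    {"a", "b", "c"},
]


def support(itemset, transactions):
    """Count the transactions that contain every item in `itemset`."""
    return sum(1 for t in transactions if itemset <= t)


def frequent_itemsets(transactions, min_support):
    """Brute-force enumeration; fine for toy data, exponential in general."""
    items = sorted(set().union(*transactions))
    result = {}
    for size in range(1, len(items) + 1):
        for combo in combinations(items, size):
            count = support(frozenset(combo), transactions)
            if count >= min_support:
                result[frozenset(combo)] = count
    return result


def closed_only(frequent):
    """Keep a pattern only if no proper superset has the same support."""
    return {
        pattern: count
        for pattern, count in frequent.items()
        if not any(pattern < other and count == frequent[other] for other in frequent)
    }


def implied_support(itemset, closed):
    """Support of an unreported pattern: the largest support among reported supersets.

    Raises ValueError if no reported pattern contains `itemset`.
    """
    return max(count for pattern, count in closed.items() if itemset <= pattern)


frequent = frequent_itemsets(transactions, min_support=2)
closed = closed_only(frequent)
for pattern, count in sorted(closed.items(), key=lambda kv: (-len(kv[0]), sorted(kv[0]))):
    print(sorted(pattern), count)
```

On the four transactions above this prints [a, b, c] with 3, [a, c, d] with 2 and [a, c] with 4, the same three patterns listed in the message. Under the same assumption, implied_support(frozenset("ab"), closed) gives 3 and implied_support(frozenset("a"), closed) gives 4, so the unreported patterns and their supports can be recovered from the reported ones.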

Re: Mahout fpg missing patterns

2011-12-19 Thread gaurav singh
You were a real help, Tom!

Thanks,
Gaurav