Re: FPGrowth and Recommendations

2015-03-03 Thread Jeff Isenhart
I am actually modeling charges. The main use case is when we have charges a,b and c then we have charge d with a confidence, support, lift, etc...Then when I get an account with some charges, I look this up against the model to see if any charge may be missing (based on confidence)+ I was un

Re: FPGrowth and Recommendations

2015-03-02 Thread Pat Ferrel
Jeff, are you trying to build a general recommender? Or a shopping cart recommender? FP was used to find things often bought together, which means recommendations based on some partial group of items (watchlist, wishlist, shopping cart). FPG has been deprecated in favor of newer methods. There

Re: FPGrowth and Recommendations

2015-03-02 Thread Andrew Musselman
Hi Jeff, as I recall the map-reduce-based fp-growth solution was problematic, and it's been either deprecated or removed. There are better solutions under the "recommendations" tab at http://mahout.apache.org And I would encourage your updating your version of Mahout to 0.9 or to the master branc

FPGrowth and Recommendations

2015-03-02 Thread Jeff Isenhart
Hi, New to mahout and fp growth. I havefollowed this example:https://chimpler.wordpress.com/2013/05/02/finding-association-rules-with-mahout-frequent-pattern-mining/ I generated nice output informationlike this (as an example): [abc,def,ghi] => klm,confidence:0.597, support:0.01, lift: 57.415, con

RE: algorithms Apriori, FPgrowth

2014-11-25 Thread Martin, Nick
certainly be happy about it! Best, Nick From: Jakub Stransky [stransky...@gmail.com] Sent: Tuesday, November 25, 2014 8:31 AM To: user@mahout.apache.org Subject: algorithms Apriori, FPgrowth Hello experienced mahout users, I am new to mahout library

algorithms Apriori, FPgrowth

2014-11-25 Thread Jakub Stransky
Hello experienced mahout users, I am new to mahout library and I have a bit trouble to find a starting point for "associative rule mining" as I don't see neither Apriori not FPgrowth algorithm on the list of implemented algorithms. Contrary I found several blog posts with re

Re: Fpgrowth

2014-07-24 Thread Martin, Nick
Marthi" wrote: > fpgrowth was initially removed and added again for 0.9 because one specific > user stepped up to support it (and was never heard from again). Mahout 0.9 > should have fpgrowth IIRC. > > > On Thu, Jul 24, 2014 at 1:27 AM, Martin, Nick wrote: > >> So I kno

Re: Fpgrowth

2014-07-23 Thread Suneel Marthi
fpgrowth was initially removed and added again for 0.9 because one specific user stepped up to support it (and was never heard from again). Mahout 0.9 should have fpgrowth IIRC. On Thu, Jul 24, 2014 at 1:27 AM, Martin, Nick wrote: > So I know fpgrowth was sent out to pasture a few months

Fpgrowth

2014-07-23 Thread Martin, Nick
So I know fpgrowth was sent out to pasture a few months ago. As luck would have it I need to do this kind of thing now. Would my only option now be to pull the source (per Sebastian's note in the JIRA)? Could I roll back from 0.9 to a prev version to pick it back up? Any other options? I

mahout_structure and FPGrowth

2013-03-28 Thread Vikrant Saxena
Hello, I am new to mahout, I wanted information about the mahout project structure (all the directories info, what they contain, how i can use them) , basically I am interested in frequent item mining stuff. Besides this, I have executed the command #mahout fpg -i accidents.dat -o patterns -k 50

mahout_structure and FPGrowth

2013-03-28 Thread vsaxena
50 -method mapreduce -regex ´[\ ]´ -s 2 kindly tell me the what is the meaning of -k in this command, and how the output varies if i change he value of k. . thanks in advance -- View this message in context: http://lucene.472066.n3.nabble.com/mahout-structure-and-FPGrowth-tp4052016.html Sent

setting the number of reduce jobs for FPGrowth

2013-03-26 Thread ricky lee
Hi, I saw some similar questions in this mailing list but could not find a clear answer yet. With fairly large dataset (330G), the FPGrowth takes most of time in the parallel-fpgrowth Reduce tasks, can I set the number of Reduce jobs automatically? In my default Hadoop installation, the number of

Re: a bug of fpgrowth?

2012-08-22 Thread tom pierce
Hello, Could you try re-running FP-Growth with the '-2' flag, and let us know if you have more success? This uses an alternate implementation of the FPGrowth algorithm; I have had problems similar to what you are seeing when using the default implementation. I am skeptical of

a bug of fpgrowth?

2012-08-21 Thread 林泽桢
hello, when i use fpgrowth to get association rules, but it always come to wrong, so confused. Then i read the source code, i think i found a bug in line #102 of FrequentPatternMaxHeap.java, which " least.compareTo(frequentPattern) < 0 " should change to " least.compareTo(fr

AW: mahout FPGrowth problem

2012-05-30 Thread Ungerer, Jens
Hi, thank you for your response. I removed the multiple items and know I don't get an exception. >> Is it neccessary to use itemsets with equal length? >No - fixed size itemsets are not required. >> Is it possible to use itemsets with duplicates in mahout FPGrowth? >No

Re: mahout FPGrowth problem

2012-05-29 Thread tom pierce
Hi Jens, Is it neccessary to use itemsets with equal length? No - fixed size itemsets are not required. Is it possible to use itemsets with duplicates in mahout FPGrowth? Not reliably. This crash looks like it caused by having more items in one particular itemset than in the set of items

mahout FPGrowth problem

2012-05-29 Thread Ungerer, Jens
I am using mahout-distribution 0.6. My first test programm of mahout FPGrowth with a small data set worked well (example1.txt). In my second test programm I get this exception. "Exception in thread "main" java.lang.ArrayIndexOutOfBoundsExceptio

Re: Parallel FPGrowth driver - what is a good demo?

2011-07-31 Thread Lance Norskog
I've rewritten the FPGrowth wiki page. Is still a bit ragged. Please critique for content. https://cwiki.apache.org/confluence/display/MAHOUT/Parallel+Frequent+Pattern+Mining On Thu, Jul 28, 2011 at 12:59 AM, Lance Norskog wrote: > Ok, now I've succeeded in running fpgrowth, both s

Re: Parallel FPGrowth driver - doc problem?

2011-07-28 Thread Lance Norskog
launcher, so some of the > sections uses the mvn exec plugin directly. > > On Tue, Jul 26, 2011 at 10:11 PM, Lance Norskog wrote: > >> The FPGrowth driver page: >> >> https://cwiki.apache.org/confluence/display/MAHOUT/Parallel+Frequent+Pattern+Mining >> >&

Re: Parallel FPGrowth driver - what is a good demo?

2011-07-28 Thread Lance Norskog
Ok, now I've succeeded in running fpgrowth, both sequential and mapreduce, from the 'fpg' job and the flag that chooses 'sequential' from 'mapreduce'. I've done this from two different datasets, accidents.dat and retail.dat. I only ran the first thousand

Re: Parallel FPGrowth driver - what is a good demo?

2011-07-27 Thread Lance Norskog
lso, what is the magic to get maven to pass JDK options to an exec'd >> class? > > Did you try using the bin/mahout script. the memory size is configurable > inside it. > > >> FPGrowth sequential needs the memory size bumped up. > > >> Cheers, >> >> -- >> Lance Norskog >> goks...@gmail.com >> > -- Lance Norskog goks...@gmail.com

Re: Parallel FPGrowth driver - what is a good demo?

2011-07-27 Thread Robin Anil
ave 'accidents.dat.gz' from the given site, or 'retail.dat.gz' from > the same site, what parameters should find some frequent patterns? > Also, what is the magic to get maven to pass JDK options to an exec'd > class? Did you try using the bin/mahout script.

Re: Parallel FPGrowth driver - doc problem?

2011-07-27 Thread Robin Anil
Its outdated. This page predates the bin/mahout fpg launcher, so some of the sections uses the mvn exec plugin directly. On Tue, Jul 26, 2011 at 10:11 PM, Lance Norskog wrote: > The FPGrowth driver page: > > https://cwiki.apache.org/confluence/display/MAHOUT/Parallel+Frequent+Patte

Parallel FPGrowth driver - what is a good demo?

2011-07-26 Thread Lance Norskog
d find some frequent patterns? Also, what is the magic to get maven to pass JDK options to an exec'd class? FPGrowth sequential needs the memory size bumped up. Cheers, -- Lance Norskog goks...@gmail.com

Parallel FPGrowth driver - doc problem?

2011-07-26 Thread Lance Norskog
The FPGrowth driver page: https://cwiki.apache.org/confluence/display/MAHOUT/Parallel+Frequent+Pattern+Mining gives a command line that only works in mahout/core, rather than mahout/. Is this drift, or a document bug? -- Lance Norskog goks...@gmail.com

Re: how to get input in parallel FPGrowth

2011-05-24 Thread Stanley Xu
1. A job is killed is a normal behavior. Since by default, hadoop will enable the speculative executions, which means it will create two attempts for the same mapper and once one of the attempt is done, it will just kill the one is not finished. 2. There are lots of possibilities that a mapper tak

Re: how to get input in parallel FPGrowth

2011-05-05 Thread Stanley Xu
at 9:35 AM, hustnn wrote: > I see a topic of you about "the convert data in databases (Flatfiles, > XMLdumps, MySQL,Cassandra, Different formats on HDFS, Hbase) into > intermediate form(say vector)". > > I Know the parallel FPGrowth can use the hadoop to distribute compution

Re: Error: Java heap space when running FPGrowth

2010-08-05 Thread tanweiguo 00124421
597) at org.apache.hadoop.util.RunJar.main(RunJar.java:156) -邮件原件- 发件人: Ankur C. Goel [mailto:gan...@yahoo-inc.com] 发送时间: 2010年8月4日 18:02 收件人: user@mahout.apache.org 主题: Re: Error: Java heap space when running FPGrowth Hi tanweiguo, Which version of hadoop are you us

Re: Error: Java heap space when running FPGrowth

2010-08-04 Thread Ankur C. Goel
single node setup, mapred.child.java.opts="-server -Xmx768m -Djava.net.preferIPv4Stack=true" Not sure if there is a way exposed to control the parallelism. Robin ? -...@nkur On 8/4/10 1:18 PM, "tanweiguo" wrote: I just followed the wiki to test FPGrowth: https://cwiki.apache

Error: Java heap space when running FPGrowth

2010-08-04 Thread tanweiguo
I just followed the wiki to test FPGrowth: https://cwiki.apache.org/MAHOUT/parallel-frequent-pattern-mining.html 1.unzip and put the accidents.dat.gz to HDFS accidents folder 2.run on a hadoop cluster(1 master and 3 slaves) hadoop jar mahout-examples-0.3.job

Error: Java heap space when running FPGrowth

2010-08-04 Thread tanweiguo
I just followed the wiki to test FPGrowth: https://cwiki.apache.org/MAHOUT/parallel-frequent-pattern-mining.html 1.unzip and put the accidents.dat.gz to HDFS accidents folder 2.run on a hadoop cluster(1 master and 3 slaves) hadoop jar mahout-examples-0.3.job