[jira] [Updated] (MAHOUT-980) Patch to make PFPGrowth run on Amazon MapReduce (also shows possible pattern to make other algorithms work in Amazon MapReduce)

2012-03-12 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-980?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-980: -- Resolution: Fixed Assignee: tom pierce Status: Resolved (was: Patch Available) > Pat

[jira] [Updated] (MAHOUT-822) Mahout needs to be made compatible with Hadoop .23 releases

2012-03-12 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-822: -- Resolution: Fixed Assignee: tom pierce Status: Resolved (was: Patch Available) Thanks al

[jira] [Updated] (MAHOUT-987) Our build is unstable - this should reduce our style warnings by >200

2012-03-07 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-987: -- Status: Patch Available (was: Open) > Our build is unstable - this should reduce our style warning

[jira] [Updated] (MAHOUT-987) Our build is unstable - this should reduce our style warnings by >200

2012-03-07 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-987?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-987: -- Attachment: MAHOUT-987.patch > Our build is unstable - this should reduce our style warnings by >20

[jira] [Updated] (MAHOUT-822) Mahout needs to be made compatible with Hadoop .23 releases

2012-03-02 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-822: -- Attachment: MAHOUT-822.patch Updated patch to cover a new test failure under Hadoop 0.23.1-SNAPSHOT due

[jira] [Updated] (MAHOUT-947) Improvements to seqdumper

2012-01-28 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-947: -- Attachment: MAHOUT-947.patch Dropped the cluster dumping addition to VectorDumper. > I

[jira] [Updated] (MAHOUT-946) Map-reduce job status often left unchecked

2012-01-27 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-946: -- Attachment: MAHOUT-946.patch I thought about this some more and realized in many cases you might want

[jira] [Updated] (MAHOUT-822) Mahout needs to be made compatible with Hadoop .23 releases

2012-01-26 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-822?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-822: -- Attachment: MAHOUT-822.patch I addressed the clustering test failures. Mostly that involved not assumi

[jira] [Updated] (MAHOUT-947) Improvements to seqdumper

2012-01-16 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-947: -- Attachment: MAHOUT-947-2.patch Adjusted to put vector options in VectorDumper. Also add ability to dum

[jira] [Updated] (MAHOUT-947) Improvements to seqdumper

2012-01-15 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-947: -- Attachment: MAHOUT-947.patch > Improvements to seqdumper > - > >

[jira] [Updated] (MAHOUT-947) Improvements to seqdumper

2012-01-15 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-947?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-947: -- Status: Patch Available (was: Open) > Improvements to seqdumper > - > >

[jira] [Updated] (MAHOUT-946) Map-reduce job status often left unchecked

2012-01-15 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-946: -- Attachment: MAHOUT-946.patch > Map-reduce job status often left unchecked > ---

[jira] [Updated] (MAHOUT-946) Map-reduce job status often left unchecked

2012-01-15 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-946: -- Affects Version/s: 0.6 Status: Patch Available (was: Open) > Map-reduce job status

[jira] [Updated] (MAHOUT-890) Performance issue in FPGrowth

2011-12-30 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-890: -- Attachment: MAHOUT-890-3.patch I decided to add a couple tests based on the synthetic data, and found t

[jira] [Updated] (MAHOUT-890) Performance issue in FPGrowth

2011-12-30 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-890: -- Attachment: MAHOUT-890-2.patch This patch (MAHOUT-890-2) adds the new implementation (under fpgrowth2)

[jira] [Updated] (MAHOUT-920) Remove a mapreduce job from parallel FPGrowth workflow

2011-12-27 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-920: -- Attachment: MAHOUT-920.patch This is the correct patch for this issue- please disregard previous MAHOUT

[jira] [Updated] (MAHOUT-927) FPG saves a mapping from feature to mining group, when this can be calculated on the fly

2011-12-15 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-927: -- Status: Patch Available (was: Open) > FPG saves a mapping from feature to mining group, when

[jira] [Updated] (MAHOUT-927) FPG saves a mapping from feature to mining group, when this can be calculated on the fly

2011-12-15 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-927?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-927: -- Attachment: MAHOUT-927.patch This patch assumes MAHOUT-920 and MAHOUT-921 have already been applied.

[jira] [Updated] (MAHOUT-921) FPG uses a lot of boxed primitives - this patch eliminates a bunch of List

2011-12-11 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-921?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-921: -- Attachment: MAHOUT-921.patch Note patch assumes MAHOUT-920 has been applied! > FPG use

[jira] [Updated] (MAHOUT-920) Remove a mapreduce job from parallel FPGrowth workflow

2011-12-10 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-920: -- Status: Patch Available (was: Open) > Remove a mapreduce job from parallel FPGrowth workflow > ---

[jira] [Updated] (MAHOUT-920) Remove a mapreduce job from parallel FPGrowth workflow

2011-12-10 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-920?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-920: -- Attachment: MAHOUT-890.patch > Remove a mapreduce job from parallel FPGrowth workflow > ---

[jira] [Updated] (MAHOUT-890) Performance issue in FPGrowth

2011-12-03 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-890: -- Status: Patch Available (was: Open) > Performance issue in FPGrowth >

[jira] [Updated] (MAHOUT-890) Performance issue in FPGrowth

2011-12-03 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-890: -- Attachment: MAHOUT-890.patch > Performance issue in FPGrowth > - > >

[jira] [Updated] (MAHOUT-911) Naive Bayes trains models that are too large to apply

2011-12-02 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-911?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-911: -- Attachment: example.wiki.categories.txt > Naive Bayes trains models that are too large to apply > -

[jira] [Updated] (MAHOUT-895) Make Wikipedia example set maker easier to mod

2011-11-23 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-895: -- Status: Patch Available (was: Open) > Make Wikipedia example set maker easier to mod > ---

[jira] [Updated] (MAHOUT-895) Make Wikipedia example set maker easier to mod

2011-11-23 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-895: -- Attachment: MAHOUT-895.patch > Make Wikipedia example set maker easier to mod > ---

[jira] [Updated] (MAHOUT-894) NB testclassifier runs in sequential mode by default

2011-11-23 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-894: -- Status: Patch Available (was: Open) > NB testclassifier runs in sequential mode by default > -

[jira] [Updated] (MAHOUT-894) NB testclassifier runs in sequential mode by default

2011-11-23 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-894: -- Attachment: MAHOUT-894.patch > NB testclassifier runs in sequential mode by default > -

[jira] [Updated] (MAHOUT-890) Performance issue in FPGrowth

2011-11-22 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-890: -- Attachment: simpleFPG.patch > Performance issue in FPGrowth > - > >

[jira] [Updated] (MAHOUT-890) Performance issue in FPGrowth

2011-11-19 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-890?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-890: -- Attachment: logtrees.patch smallexample.dat addSynth.patch > Perfor

[jira] [Updated] (MAHOUT-886) FPtree nodes multiply-added (becoming siblings in tree)

2011-11-14 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-886: -- Attachment: MAHOUT-886.patch Keep nodes from getting multiply added (becoming own siblings). There's a

[jira] [Updated] (MAHOUT-886) FPtree nodes multiply-added (becoming siblings in tree)

2011-11-14 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-886: -- Status: Patch Available (was: Open) > FPtree nodes multiply-added (becoming siblings in tree) > --

[jira] [Updated] (MAHOUT-885) Freq pattern growth advertises wrong value for default

2011-11-14 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-885: -- Labels: patch (was: ) Status: Patch Available (was: Open) Changes the default to be 1000, generat

[jira] [Updated] (MAHOUT-885) Freq pattern growth advertises wrong value for default

2011-11-14 Thread tom pierce (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] tom pierce updated MAHOUT-885: -- Attachment: MAHOUT-885.patch > Freq pattern growth advertises wrong value for default > ---