Actively working on this and should be ready for 0.6: Baum-Welch Algorithm on Map-Reduce for Parallel Hidden Markov Model Training. <https://issues.apache.org/jira/browse/MAHOUT-627>
On Fri, Jun 3, 2011 at 5:20 AM, Sean Owen <[email protected]> wrote: > 0.5 is out the door, and we already have 40 issues tagged for 0.6 -- quite > a > bit of work to do! I see Sebastian and I are already cracking through > several of them. > Committer activity is still not quite keeping up with the backlog. It's an > important time, and a time for those who can move issues to resolution or > close them to do so. Stay involved, stay relevant, especially those of you > who have been quiet. Because we need more "bandwidth" to begin dealing with > the meta-issues: docs, packaging, roadmap, etc. > > Here is the current list, to inspire you: > > > KeySummaryStatusCreated <https://issues.apache.org/jira/browse/MAHOUT-609> > MAHOUT-609 <https://issues.apache.org/jira/browse/MAHOUT-609> > > Add an option to make RecommenderJob write out it's computed item > similarities <https://issues.apache.org/jira/browse/MAHOUT-609> > [image: Open] Open08/Feb/11< > https://issues.apache.org/jira/browse/MAHOUT-627> > MAHOUT-627 <https://issues.apache.org/jira/browse/MAHOUT-627> > > Baum-Welch Algorithm on Map-Reduce for Parallel Hidden Markov Model > Training. <https://issues.apache.org/jira/browse/MAHOUT-627> > [image: Open] Open17/Mar/11< > https://issues.apache.org/jira/browse/MAHOUT-537> > MAHOUT-537 <https://issues.apache.org/jira/browse/MAHOUT-537> > > Bring DistributedRowMatrix into compliance with Hadoop > 0.20.2<https://issues.apache.org/jira/browse/MAHOUT-537> > [image: Open] Open03/Nov/10< > https://issues.apache.org/jira/browse/MAHOUT-546> > MAHOUT-546 <https://issues.apache.org/jira/browse/MAHOUT-546> > > Bug creating vector from Solr index with > TrieFields<https://issues.apache.org/jira/browse/MAHOUT-546> > [image: Open] Open16/Nov/10< > https://issues.apache.org/jira/browse/MAHOUT-714> > MAHOUT-714 <https://issues.apache.org/jira/browse/MAHOUT-714> > > CollocDriver not runnable with ToolRunner due to private > Constructor<https://issues.apache.org/jira/browse/MAHOUT-714> > [image: Patch Available] Patch > Available29/May/11<https://issues.apache.org/jira/browse/MAHOUT-696> > MAHOUT-696 <https://issues.apache.org/jira/browse/MAHOUT-696> > > Command line program for > AdaptiveLogiscticRegression< > https://issues.apache.org/jira/browse/MAHOUT-696> > [image: Open] Open15/May/11< > https://issues.apache.org/jira/browse/MAHOUT-712> > MAHOUT-712 <https://issues.apache.org/jira/browse/MAHOUT-712> > > DisplaySpectralKMeans Example Surfaces FileNotFoundException in > DistributedRowMatrix.times() > Usage/Implementation<https://issues.apache.org/jira/browse/MAHOUT-712> > [image: Open] Open25/May/11< > https://issues.apache.org/jira/browse/MAHOUT-524> > MAHOUT-524 <https://issues.apache.org/jira/browse/MAHOUT-524> > > DisplaySpectralKMeans example > fails<https://issues.apache.org/jira/browse/MAHOUT-524> > [image: Open] Open12/Oct/10< > https://issues.apache.org/jira/browse/MAHOUT-598> > MAHOUT-598 <https://issues.apache.org/jira/browse/MAHOUT-598> > > Downstream steps in the seq2sparse job flow looking in wrong location for > output from previous steps when running in Elastic MapReduce (EMR) > cluster<https://issues.apache.org/jira/browse/MAHOUT-598> > [image: Open] Open27/Jan/11< > https://issues.apache.org/jira/browse/MAHOUT-629> > MAHOUT-629 <https://issues.apache.org/jira/browse/MAHOUT-629> > > FP Growth performance > improvement<https://issues.apache.org/jira/browse/MAHOUT-629> > [image: Open] Open21/Mar/11< > https://issues.apache.org/jira/browse/MAHOUT-709> > MAHOUT-709 <https://issues.apache.org/jira/browse/MAHOUT-709> > > FP-Growth Redundant patterns< > https://issues.apache.org/jira/browse/MAHOUT-709> > [image: Open] Open22/May/11< > https://issues.apache.org/jira/browse/MAHOUT-688> > MAHOUT-688 <https://issues.apache.org/jira/browse/MAHOUT-688> > > High Document Frequency pruning for > seq2sparse<https://issues.apache.org/jira/browse/MAHOUT-688> > [image: Open] Open05/May/11< > https://issues.apache.org/jira/browse/MAHOUT-716> > MAHOUT-716 <https://issues.apache.org/jira/browse/MAHOUT-716> > > Implement Boosting <https://issues.apache.org/jira/browse/MAHOUT-716> > [image: Patch Available] Patch > Available01/Jun/11<https://issues.apache.org/jira/browse/MAHOUT-703> > MAHOUT-703 <https://issues.apache.org/jira/browse/MAHOUT-703> > > Implement Gradient machine< > https://issues.apache.org/jira/browse/MAHOUT-703> > [image: Patch Available] Patch > Available19/May/11<https://issues.apache.org/jira/browse/MAHOUT-499> > MAHOUT-499 <https://issues.apache.org/jira/browse/MAHOUT-499> > > Implement LSMR in-memory <https://issues.apache.org/jira/browse/MAHOUT-499 > > > [image: Open] Open09/Sep/10< > https://issues.apache.org/jira/browse/MAHOUT-525> > MAHOUT-525 <https://issues.apache.org/jira/browse/MAHOUT-525> > > Implement LatentFactorLogLinear > models<https://issues.apache.org/jira/browse/MAHOUT-525> > [image: Open] Open14/Oct/10< > https://issues.apache.org/jira/browse/MAHOUT-702> > MAHOUT-702 <https://issues.apache.org/jira/browse/MAHOUT-702> > > Implement Online Passive Aggressive > learner<https://issues.apache.org/jira/browse/MAHOUT-702> > [image: Patch Available] Patch > Available18/May/11<https://issues.apache.org/jira/browse/MAHOUT-384> > MAHOUT-384 <https://issues.apache.org/jira/browse/MAHOUT-384> > > Implement of AVF algorithm< > https://issues.apache.org/jira/browse/MAHOUT-384> > [image: Open] Open22/Apr/10< > https://issues.apache.org/jira/browse/MAHOUT-672> > MAHOUT-672 <https://issues.apache.org/jira/browse/MAHOUT-672> > > Implementation of Conjugate Gradient for solving large linear > systems<https://issues.apache.org/jira/browse/MAHOUT-672> > [image: Patch Available] Patch > Available16/Apr/11<https://issues.apache.org/jira/browse/MAHOUT-487> > MAHOUT-487 <https://issues.apache.org/jira/browse/MAHOUT-487> > > Issues with memory use and inconsistent or state-influenced results when > using CBayesAlgorithm <https://issues.apache.org/jira/browse/MAHOUT-487> > [image: Open] Open24/Aug/10< > https://issues.apache.org/jira/browse/MAHOUT-597> > MAHOUT-597 <https://issues.apache.org/jira/browse/MAHOUT-597> > > Kernels in Mean Shift <https://issues.apache.org/jira/browse/MAHOUT-597> > [image: Open] Open27/Jan/11< > https://issues.apache.org/jira/browse/MAHOUT-399> > MAHOUT-399 <https://issues.apache.org/jira/browse/MAHOUT-399> > > LDA on Mahout 0.3 does not converge to correct solution for overlapping > pyramids toy problem. <https://issues.apache.org/jira/browse/MAHOUT-399> > [image: Open] Open24/May/10< > https://issues.apache.org/jira/browse/MAHOUT-690> > MAHOUT-690 <https://issues.apache.org/jira/browse/MAHOUT-690> > > LanczosSolver tests take forever. No > fun.<https://issues.apache.org/jira/browse/MAHOUT-690> > [image: Open] Open06/May/11< > https://issues.apache.org/jira/browse/MAHOUT-415> > MAHOUT-415 <https://issues.apache.org/jira/browse/MAHOUT-415> > > Lucene filter for Collocations< > https://issues.apache.org/jira/browse/MAHOUT-415> > [image: Open] Open14/Jun/10< > https://issues.apache.org/jira/browse/MAHOUT-705> > MAHOUT-705 <https://issues.apache.org/jira/browse/MAHOUT-705> > > MongoDB DataModel support < > https://issues.apache.org/jira/browse/MAHOUT-705> > [image: Open] Open20/May/11< > https://issues.apache.org/jira/browse/MAHOUT-678> > MAHOUT-678 <https://issues.apache.org/jira/browse/MAHOUT-678> > > NullPointerException while using MixedGradient with SGD > algorithm<https://issues.apache.org/jira/browse/MAHOUT-678> > [image: Open] Open22/Apr/11< > https://issues.apache.org/jira/browse/MAHOUT-692> > MAHOUT-692 <https://issues.apache.org/jira/browse/MAHOUT-692> > > OnlineSummarizer does not tolerate fewer than 100 > samples<https://issues.apache.org/jira/browse/MAHOUT-692> > [image: Open] Open10/May/11< > https://issues.apache.org/jira/browse/MAHOUT-695> > MAHOUT-695 <https://issues.apache.org/jira/browse/MAHOUT-695> > > Option to determine number of words for LDADriver from a specified > dictionary <https://issues.apache.org/jira/browse/MAHOUT-695> > [image: Open] Open13/May/11< > https://issues.apache.org/jira/browse/MAHOUT-632> > MAHOUT-632 <https://issues.apache.org/jira/browse/MAHOUT-632> > > PFPGrowth : Exceeded max jobconf > size<https://issues.apache.org/jira/browse/MAHOUT-632> > [image: Patch Available] Patch > Available22/Mar/11<https://issues.apache.org/jira/browse/MAHOUT-663> > MAHOUT-663 <https://issues.apache.org/jira/browse/MAHOUT-663> > > Rationalize hadoop job creation with respect to > setJarByClass<https://issues.apache.org/jira/browse/MAHOUT-663> > [image: Open] Open08/Apr/11< > https://issues.apache.org/jira/browse/MAHOUT-664> > MAHOUT-664 <https://issues.apache.org/jira/browse/MAHOUT-664> > > Remove usage of XStream string serialization > too?<https://issues.apache.org/jira/browse/MAHOUT-664> > [image: Open] Open10/Apr/11< > https://issues.apache.org/jira/browse/MAHOUT-719> > MAHOUT-719 <https://issues.apache.org/jira/browse/MAHOUT-719> > > Rename current runLogistic command line program to validateLogistic and let > runLogistic do predicting against new production > data<https://issues.apache.org/jira/browse/MAHOUT-719> > [image: Open] Open02/Jun/11< > https://issues.apache.org/jira/browse/MAHOUT-699> > MAHOUT-699 <https://issues.apache.org/jira/browse/MAHOUT-699> > > Rename taste-webapp module to integration; move integration code there from > examples <https://issues.apache.org/jira/browse/MAHOUT-699> > [image: Open] Open18/May/11< > https://issues.apache.org/jira/browse/MAHOUT-707> > MAHOUT-707 <https://issues.apache.org/jira/browse/MAHOUT-707> > > Setup Jenkins Jobs to validate our Examples/bin > Scripts<https://issues.apache.org/jira/browse/MAHOUT-707> > [image: Open] Open20/May/11< > https://issues.apache.org/jira/browse/MAHOUT-626> > MAHOUT-626 <https://issues.apache.org/jira/browse/MAHOUT-626> > > T1 and T2 Values in Canopy (& > MeanShift)<https://issues.apache.org/jira/browse/MAHOUT-626> > [image: Reopened] > Reopened13/Mar/11<https://issues.apache.org/jira/browse/MAHOUT-596> > MAHOUT-596 <https://issues.apache.org/jira/browse/MAHOUT-596> > > Testing if the weight assigned to points when calling the observe method in > AbstractCluster incorrectly affect the number of points in a > cluster<https://issues.apache.org/jira/browse/MAHOUT-596> > [image: Open] Open27/Jan/11< > https://issues.apache.org/jira/browse/MAHOUT-294> > MAHOUT-294 <https://issues.apache.org/jira/browse/MAHOUT-294> > > Uniform API behavior for Jobs< > https://issues.apache.org/jira/browse/MAHOUT-294> > [image: Open] Open16/Feb/10< > https://issues.apache.org/jira/browse/MAHOUT-652> > MAHOUT-652 <https://issues.apache.org/jira/browse/MAHOUT-652> > > [GSoC Proposal] Parallel Viterbi algorithm for > HMM<https://issues.apache.org/jira/browse/MAHOUT-652> > [image: Open] Open06/Apr/11< > https://issues.apache.org/jira/browse/MAHOUT-711> > MAHOUT-711 <https://issues.apache.org/jira/browse/MAHOUT-711> > > outputs miss some right frequent > itemsets<https://issues.apache.org/jira/browse/MAHOUT-711> > [image: Open] Open25/May/11< > https://issues.apache.org/jira/browse/MAHOUT-706> > MAHOUT-706 <https://issues.apache.org/jira/browse/MAHOUT-706> > > reuse lucene tokenstreams < > https://issues.apache.org/jira/browse/MAHOUT-706> > [image: Open] Open20/May/11 >
