I'm ready too, only thing I want to do is ensure that MAHOUT-493 works on ElasticMapReduce but that should be done until next week.
--sebastian Am 21.09.2010 17:08, schrieb Sean Owen: > Agree, I'm ready. It's time to simply find a sensible point to draw a > line and put out 0.4. Anything else is simply moved along to 0.5. > > Can I suggest we leave until Friday to update all issues? Anything > left as 0.4 means you expect a patch is imminent next week to close > the issue. Anything else, simply mark as 0.5. Here's what's open: > > > > T Key Summary Pr Status Updated Created > MAHOUT-227 Parallel SVM Open 10/Feb/10 > 20/Dec/09 > MAHOUT-279 Make RandomSeedGenerator a M/R Job Open > 14/Feb/10 07/Feb/10 > MAHOUT-293 Add more tunable parameters to PFPGrowth implementation > > Open 15/Feb/10 15/Feb/10 > MAHOUT-303 Exhaustive Tests for Vector implementations > Open > 20/Feb/10 20/Feb/10 > MAHOUT-306 Profile and improve performance of algorithms based on > vectors Open 22/Feb/10 22/Feb/10 > MAHOUT-309 Implement Stochastic Decomposition Open > 24/Feb/10 24/Feb/10 > MAHOUT-319 SVD solvers should be gracefully stoppable/restartable > > Open 01/Mar/10 01/Mar/10 > MAHOUT-369 Issues with DistributedLanczosSolver output > Open > 25/Apr/10 07/Apr/10 > MAHOUT-397 SparseVectorsFromSequenceFiles only outputs a single > vector file Patch Available 19/May/10 19/May/10 > MAHOUT-153 Implement kmeans++ for initial cluster selection in > kmeans Open 27/May/10 27/Jul/09 > MAHOUT-376 Implement Map-reduce version of stochastic SVD > Open > 08/May/10 11/Apr/10 > MAHOUT-414 Usability: Mahout applications need a consistent API to > allow users to specify desired map/reduce concurrency Open > 13/Jun/10 13/Jun/10 > MAHOUT-419 Convert decomposer code to Hadoop 0.20 API > Open > 20/Jun/10 20/Jun/10 > MAHOUT-308 Improve Lanczos to handle extremely large feature sets > (without hashing) Patch Available 30/Jun/10 > 24/Feb/10 > MAHOUT-401 Use NamedVector in seq2sparse Reopened > 02/Jul/10 27/May/10 > MAHOUT-232 Implementation of sequential SVM solver based on > Pegasos Patch Available 06/Jul/10 27/Dec/09 > MAHOUT-287 Bayes Classifier should use Vector as input > Open > 10/Feb/10 10/Feb/10 > MAHOUT-167 Convert code to Hadoop 0.20 API Open > 24/Jul/10 28/Aug/09 > MAHOUT-458 The LDA output does not include the topic-probability > distribution per document (p(z|d)). It outputs only the topics and > corresponding words. Open 09/Aug/10 06/Aug/10 > MAHOUT-344 Minhash based clustering Patch > Available 10/Aug/10 22/Mar/10 > MAHOUT-334 Proposal for GSoC2010 (Linear SVM for Mahout) > Patch > Available 14/Aug/10 12/Mar/10 > MAHOUT-467 Change Iterable<Cooccurrence> in > org.apache.mahout.math.hadoop.similarity.RowSimilarityJob.SimilarityReducer > to list or array to improve the performance Open 18/Aug/10 > 12/Aug/10 > MAHOUT-483 Job RowSimilarityJob-Mapper-EntriesToVectorsReducer > improvement Open 18/Aug/10 18/Aug/10 > MAHOUT-495 Undeprecate Normal and Exponential distributions > Open > 04/Sep/10 31/Aug/10 > MAHOUT-294 Uniform API behavior for Jobs Open > 14/Sep/10 16/Feb/10 > MAHOUT-214 Implement Stacked RBM Open 08/Feb/10 > 08/Dec/09 > MAHOUT-155 ARFF VectorIterable Open 07/Feb/10 > 01/Aug/09 > MAHOUT-274 Use avro for serialization of structured documents. > > Open 30/Mar/10 05/Feb/10 > MAHOUT-379 SequentialAccessSparseVector.equals does not agree with > AbstractVector.equivalent Reopened 01/Aug/10 > 14/Apr/10 > MAHOUT-459 Reading an Index from Lucene/Solr 4.0-dev > Open > 06/Aug/10 06/Aug/10 > MAHOUT-471 RowSimilarityJob-Mapper-EntriesToVectorsReducer failure > > Open 17/Aug/10 12/Aug/10 > MAHOUT-480 Replace manual precondition checking with Precondition > utility class from Guava Open 18/Aug/10 13/Aug/10 > MAHOUT-396 Proposal for Implementing Hidden Markov Model > Patch > Available 17/Sep/10 16/May/10 > MAHOUT-271 Make WikipediaDatasetCreatorMapper fuzzy category match > respect word boundaries Open 08/Feb/10 28/Jan/10 > > > > On Tue, Sep 21, 2010 at 3:50 PM, Jeff Eastman > <[email protected]> wrote: > >> We've been thinking of a September-October timeframe for the 0.4 release >> but I still see some major work items in the 34 Jira issues targeted at this >> release. As an Agile practitioner it seems to me we need to push most of >> these into 0.5 if we are going to hold this schedule. How about we triage >> this list again and shoot for feature freeze at the end of the month? >> >>
