Agreed, I'm ready. It's time to find a sensible point to draw a
line and put out 0.4. Anything that misses it moves along to 0.5.
Can I suggest we give ourselves until Friday to update all issues?
Anything left as 0.4 means you expect a patch to close the issue
within the next week. Mark anything else as 0.5. Here's what's open:
Key         Status           Updated    Created    Summary
MAHOUT-227  Open             10/Feb/10  20/Dec/09  Parallel SVM
MAHOUT-279  Open             14/Feb/10  07/Feb/10  Make RandomSeedGenerator a M/R Job
MAHOUT-293  Open             15/Feb/10  15/Feb/10  Add more tunable parameters to PFPGrowth implementation
MAHOUT-303  Open             20/Feb/10  20/Feb/10  Exhaustive Tests for Vector implementations
MAHOUT-306  Open             22/Feb/10  22/Feb/10  Profile and improve performance of algorithms based on vectors
MAHOUT-309  Open             24/Feb/10  24/Feb/10  Implement Stochastic Decomposition
MAHOUT-319  Open             01/Mar/10  01/Mar/10  SVD solvers should be gracefully stoppable/restartable
MAHOUT-369  Open             25/Apr/10  07/Apr/10  Issues with DistributedLanczosSolver output
MAHOUT-397  Patch Available  19/May/10  19/May/10  SparseVectorsFromSequenceFiles only outputs a single vector file
MAHOUT-153  Open             27/May/10  27/Jul/09  Implement kmeans++ for initial cluster selection in kmeans
MAHOUT-376  Open             08/May/10  11/Apr/10  Implement Map-reduce version of stochastic SVD
MAHOUT-414  Open             13/Jun/10  13/Jun/10  Usability: Mahout applications need a consistent API to allow users to specify desired map/reduce concurrency
MAHOUT-419  Open             20/Jun/10  20/Jun/10  Convert decomposer code to Hadoop 0.20 API
MAHOUT-308  Patch Available  30/Jun/10  24/Feb/10  Improve Lanczos to handle extremely large feature sets (without hashing)
MAHOUT-401  Reopened         02/Jul/10  27/May/10  Use NamedVector in seq2sparse
MAHOUT-232  Patch Available  06/Jul/10  27/Dec/09  Implementation of sequential SVM solver based on Pegasos
MAHOUT-287  Open             10/Feb/10  10/Feb/10  Bayes Classifier should use Vector as input
MAHOUT-167  Open             24/Jul/10  28/Aug/09  Convert code to Hadoop 0.20 API
MAHOUT-458  Open             09/Aug/10  06/Aug/10  The LDA output does not include the topic-probability distribution per document (p(z|d)). It outputs only the topics and corresponding words.
MAHOUT-344  Patch Available  10/Aug/10  22/Mar/10  Minhash based clustering
MAHOUT-334  Patch Available  14/Aug/10  12/Mar/10  Proposal for GSoC2010 (Linear SVM for Mahout)
MAHOUT-467  Open             18/Aug/10  12/Aug/10  Change Iterable<Cooccurrence> in org.apache.mahout.math.hadoop.similarity.RowSimilarityJob.SimilarityReducer to list or array to improve the performance
MAHOUT-483  Open             18/Aug/10  18/Aug/10  Job RowSimilarityJob-Mapper-EntriesToVectorsReducer improvement
MAHOUT-495  Open             04/Sep/10  31/Aug/10  Undeprecate Normal and Exponential distributions
MAHOUT-294  Open             14/Sep/10  16/Feb/10  Uniform API behavior for Jobs
MAHOUT-214  Open             08/Feb/10  08/Dec/09  Implement Stacked RBM
MAHOUT-155  Open             07/Feb/10  01/Aug/09  ARFF VectorIterable
MAHOUT-274  Open             30/Mar/10  05/Feb/10  Use avro for serialization of structured documents.
MAHOUT-379  Reopened         01/Aug/10  14/Apr/10  SequentialAccessSparseVector.equals does not agree with AbstractVector.equivalent
MAHOUT-459  Open             06/Aug/10  06/Aug/10  Reading an Index from Lucene/Solr 4.0-dev
MAHOUT-471  Open             17/Aug/10  12/Aug/10  RowSimilarityJob-Mapper-EntriesToVectorsReducer failure
MAHOUT-480  Open             18/Aug/10  13/Aug/10  Replace manual precondition checking with Precondition utility class from Guava
MAHOUT-396  Patch Available  17/Sep/10  16/May/10  Proposal for Implementing Hidden Markov Model
MAHOUT-271  Open             08/Feb/10  28/Jan/10  Make WikipediaDatasetCreatorMapper fuzzy category match respect word boundaries

On Tue, Sep 21, 2010 at 3:50 PM, Jeff Eastman
<[email protected]> wrote:
> We've been thinking of a September-October timeframe for the 0.4 release
> but I still see some major work items in the 34 Jira issues targeted at this
> release. As an Agile practitioner it seems to me we need to push most of
> these into 0.5 if we are going to hold this schedule. How about we triage
> this list again and shoot for feature freeze at the end of the month?
>