[jira] [Updated] (MAHOUT-997) Make splitData smart enough to not consider a CSV header to be part of the data

2012-04-06 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-997?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-997: - Fix Version/s: (was: 0.6) > Make splitData smart enough to not consider a CSV header to be part o

[jira] [Updated] (MAHOUT-977) Thread-safe version of PlusAnonymousUserDataModel with multiple concurrent users

2012-02-17 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-977: - Resolution: Fixed Fix Version/s: 0.7 Status: Resolved (was: Patch Available) Looks goo

[jira] [Updated] (MAHOUT-977) Thread-safe version of PlusAnonymousUserDataModel with multiple concurrent users

2012-02-15 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-977?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-977: - Affects Version/s: (was: 0.7) 0.6 Fix Version/s: (was: 0.7) Sounds

[jira] [Updated] (MAHOUT-973) SparseVectorsFromSequenceFiles will not create a proper TFIDF (bug in TFIDFPartialVectorReducer)

2012-02-09 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-973?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-973: - Affects Version/s: (was: 0.7) Fix Version/s: 0.7 Assignee: Grant Ingersoll Grant

[jira] [Updated] (MAHOUT-704) Refactor PredictionJob to use MultipleInputs for reduce side joins

2012-02-09 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-704?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-704: - Fix Version/s: (was: 1.0) > Refactor PredictionJob to use MultipleInputs for reduce side joins >

[jira] [Updated] (MAHOUT-845) Make cluster top terms code more reusable

2012-02-09 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-845?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-845: - Resolution: Fixed Assignee: Jake Mannix Status: Resolved (was: Patch Available) > Make

[jira] [Updated] (MAHOUT-946) Map-reduce job status often left unchecked

2012-02-08 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-946?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-946: - Resolution: Fixed Fix Version/s: 0.7 Assignee: Sean Owen Status: Resolved (was

[jira] [Updated] (MAHOUT-967) SequenceFileFromMailArchive missing from driver.classes.props

2012-02-07 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-967: - Resolution: Fixed Fix Version/s: 0.7 Assignee: Sean Owen Status: Resolved (was

[jira] [Updated] (MAHOUT-970) Make hadoop version overridable

2012-02-07 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-970?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-970: - Resolution: Fixed Fix Version/s: 0.7 Assignee: Sean Owen Status: Resolved (was

[jira] [Updated] (MAHOUT-972) Implement Taste DynamoDBDataModel

2012-02-07 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-972: - Affects Version/s: 0.6 Assignee: (was: Sean Owen) Happy to review whenever you have some

[jira] [Updated] (MAHOUT-963) GenericUserPreferenceArray and GenericItemPreferenceArray use selection sorts

2012-02-07 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-963: - Resolution: Fixed Fix Version/s: 0.7 Status: Resolved (was: Patch Available) > Gen

[jira] [Updated] (MAHOUT-967) SequenceFileFromMailArchive missing from driver.classes.props

2012-01-31 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-967?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-967: - Fix Version/s: (was: 0.6) > SequenceFileFromMailArchive missing from driver.classes.props > -

[jira] [Updated] (MAHOUT-966) Mismantch in the number of points given by the clusterDumper and ClusterOutputPostProcessor

2012-01-31 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-966?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-966: - Priority: Minor (was: Major) Fix Version/s: (was: 0.6) > Mismantch in the number of poi

[jira] [Updated] (MAHOUT-960) Reduce memory usage of ImplicitFeedbackAlternatingLeastSquaresSolver

2012-01-28 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-960?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-960: - Fix Version/s: (was: 0.6) Assignee: Sebastian Schelter (was: Sean Owen) > Reduce memory

[jira] [Updated] (MAHOUT-963) GenericUserPreferenceArray and GenericItemPreferenceArray use selection sorts

2012-01-28 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-963: - Priority: Minor (was: Major) Fix Version/s: (was: 0.6) Issue Type: Improvement (was:

[jira] [Updated] (MAHOUT-959) VectorWritable does not preserve the laxPrecision flag

2012-01-25 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-959?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-959: - Resolution: Not A Problem Fix Version/s: (was: 0.7) (was: 0.6)

[jira] [Updated] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-29 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-906: - Resolution: Fixed Fix Version/s: 0.6 Assignee: Sean Owen Status: Resolved (was

[jira] [Updated] (MAHOUT-937) Collocations Job Partitioner not being configured properly

2011-12-28 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-937: - Resolution: Fixed Status: Resolved (was: Patch Available) > Collocations Job Partitioner not

[jira] [Updated] (MAHOUT-937) Collocations Job Partitioner not being configured properly

2011-12-28 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-937: - Fix Version/s: 0.6 Assignee: Sean Owen Affects Version/s: 0.5 Status:

[jira] [Updated] (MAHOUT-937) Collocations Job Partitioner not being configured properly

2011-12-28 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-937?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-937: - Attachment: MAHOUT-937.patch > Collocations Job Partitioner not being configured properly > -

[jira] [Updated] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-15 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-906: - Attachment: MAHOUT-906.patch This is a sketch of what I had in mind. It is lacking the implementation but

[jira] [Updated] (MAHOUT-925) Evaluate the reach of recommender algorithms

2011-12-13 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-925?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-925: - Resolution: Fixed Fix Version/s: 0.6 Status: Resolved (was: Patch Available) Committed

[jira] [Updated] (MAHOUT-913) Style changes / discussion

2011-12-11 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-913: - Attachment: Sean.xml My personal IJ inspections config preferences > Style changes / dis

[jira] [Updated] (MAHOUT-919) WeightedRunningAverage does not initialize correctly.

2011-12-09 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-919?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-919: - Resolution: Fixed Fix Version/s: 0.6 Status: Resolved (was: Patch Available) Ah of cou

[jira] [Updated] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-07 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-910: - Resolution: Fixed Status: Resolved (was: Patch Available) > Improve sampling in SamplingCand

[jira] [Updated] (MAHOUT-898) Error in formula for preference estimation in GenericItemBasedRecommender

2011-12-05 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-898?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-898: - Fix Version/s: (was: 0.6) Happy to move this back into 0.6 when there's a patch > Er

[jira] [Updated] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-05 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-910: - Attachment: SamplingCandidateItemsStrategy.java It changed so much it might be easier to read the new sou

[jira] [Updated] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-05 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-910: - Attachment: MAHOUT-910.patch See comments on next file. > Improve sampling in SamplingCa

[jira] [Updated] (MAHOUT-902) TanimotoCoefficientSimilarity should return Double.NaN for two items that have zero overlap

2011-12-05 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-902?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-902: - Resolution: Fixed Fix Version/s: 0.6 Assignee: Sean Owen Status: Resolved (was

[jira] [Updated] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-05 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-910: - Attachment: MAHOUT-910.patch Now, samples all three things: user's item, those items' users, and those u

[jira] [Updated] (MAHOUT-913) Style changes / discussion

2011-12-05 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-913: - Resolution: Fixed Status: Resolved (was: Patch Available) > Style changes / discussion > ---

[jira] [Updated] (MAHOUT-913) Style changes / discussion

2011-12-03 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-913: - Status: Patch Available (was: Open) > Style changes / discussion > -- > >

[jira] [Updated] (MAHOUT-913) Style changes / discussion

2011-12-03 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-913?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-913: - Attachment: MAHOUT-913.patch > Style changes / discussion > -- > >

[jira] [Updated] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-03 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-910: - Status: Patch Available (was: Open) > Improve sampling in SamplingCandidateItemStrategy, optimize in

[jira] [Updated] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-03 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-910: - Attachment: MAHOUT-910.patch This is what I'm proposing to increase sample-ability. Now sampling applies

[jira] [Updated] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-02 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-906: - Fix Version/s: (was: 0.6) Assignee: (was: Sean Owen) Yes, I think a clean refactoring of

[jira] [Updated] (MAHOUT-612) Simplify configuring and running Mahout MapReduce jobs from Java using Java bean configuration

2011-12-01 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-612: - Assignee: (was: Sean Owen) > Simplify configuring and running Mahout MapReduce jobs from Java usi

[jira] [Updated] (MAHOUT-903) Slope one doesn't write, read diff counts resulting in no recs

2011-12-01 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-903: - Resolution: Fixed Status: Resolved (was: Patch Available) > Slope one doesn't write, read di

[jira] [Updated] (MAHOUT-903) Slope one doesn't write, read diff counts resulting in no recs

2011-12-01 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-903: - Attachment: MAHOUT-903.patch One more go-round; need to handle inverse objects correctly

[jira] [Updated] (MAHOUT-905) CachingUserSimilarity and CachingItemSimilarity have wrong (far to small) default maxSizes

2011-11-30 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-905: - Resolution: Not A Problem Status: Resolved (was: Patch Available) > CachingUserSimilarity an

[jira] [Updated] (MAHOUT-905) CachingUserSimilarity and CachingItemSimilarity have wrong (far to small) default maxSizes

2011-11-30 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-905?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-905: - Priority: Minor (was: Major) Issue Type: Improvement (was: Bug) (This is hardly a bug!) The cach

[jira] [Updated] (MAHOUT-903) Slope one doesn't write, read diff counts resulting in no recs

2011-11-30 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-903: - Attachment: MAHOUT-903.patch Different take: allow stdev output and thus use of weighting

[jira] [Updated] (MAHOUT-903) Slope one doesn't write, read diff counts resulting in no recs

2011-11-29 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-903: - Attachment: MAHOUT-903.patch > Slope one doesn't write, read diff counts resulting in no recs > -

[jira] [Updated] (MAHOUT-903) Slope one doesn't write, read diff counts resulting in no recs

2011-11-29 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-903: - Status: Patch Available (was: Open) > Slope one doesn't write, read diff counts resulting in no recs

[jira] [Updated] (MAHOUT-900) RandomSeedGenerator samples / output k texts incorrectly

2011-11-28 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-900: - Resolution: Fixed Assignee: Sean Owen (was: Robin Anil) Status: Resolved (was: Patch Avail

[jira] [Updated] (MAHOUT-901) KnnItemBasedRecommender is not working properly

2011-11-28 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-901?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-901: - Fix Version/s: (was: 0.5) 0.6 > KnnItemBasedRecommender is not working properl

[jira] [Updated] (MAHOUT-894) NB testclassifier runs in sequential mode by default

2011-11-28 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-894: - Priority: Minor (was: Major) Affects Version/s: (was: 0.6) 0.5

[jira] [Updated] (MAHOUT-894) NB testclassifier runs in sequential mode by default

2011-11-28 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-894?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-894: - Resolution: Fixed Fix Version/s: 0.6 Assignee: Sean Owen Status: Resolved (was

[jira] [Updated] (MAHOUT-895) Make Wikipedia example set maker easier to mod

2011-11-28 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-895?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-895: - Resolution: Fixed Fix Version/s: 0.6 Assignee: Sean Owen Status: Resolved (was

[jira] [Updated] (MAHOUT-900) RandomSeedGenerator samples / output k texts incorrectly

2011-11-28 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-900: - Description: {code} int currentSize = chosenTexts.size(); if (currentSize < k) {

[jira] [Updated] (MAHOUT-900) RandomSeedGenerator samples / output k texts incorrectly

2011-11-28 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-900: - Attachment: MAHOUT-900.patch > RandomSeedGenerator samples / output k texts incorrectly > ---

[jira] [Updated] (MAHOUT-900) RandomSeedGenerator samples / output k texts incorrectly

2011-11-28 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-900?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-900: - Status: Patch Available (was: Open) > RandomSeedGenerator samples / output k texts incorrectly > ---

[jira] [Updated] (MAHOUT-893) Dependency Clash : Google Collections and Guava

2011-11-23 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-893: - Resolution: Fixed Status: Resolved (was: Patch Available) > Dependency Clash : Google Collec

[jira] [Updated] (MAHOUT-893) Dependency Clash : Google Collections and Guava

2011-11-23 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-893?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-893: - Priority: Minor (was: Major) Affects Version/s: (was: 0.6) 0.5

[jira] [Updated] (MAHOUT-891) LoadEvaluationRunner and Recommender stats

2011-11-21 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-891: - Resolution: Fixed Status: Resolved (was: Patch Available) > LoadEvaluationRunner and Recomme

[jira] [Updated] (MAHOUT-891) LoadEvaluationRunner and Recommender stats

2011-11-20 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-891: - Fix Version/s: 0.6 Assignee: Sean Owen Labels: collaborative-filtering (w

[jira] [Updated] (MAHOUT-891) LoadEvaluationRunner and Recommender stats

2011-11-20 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-891?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-891: - Attachment: MAHOUT-891.patch How's this? In effect it's the same thing just adding more formal hooks to a

[jira] [Updated] (MAHOUT-886) FPtree nodes multiply-added (becoming siblings in tree)

2011-11-15 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-886?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-886: - Resolution: Fixed Fix Version/s: 0.6 Assignee: Sean Owen Status: Resolved (was

[jira] [Updated] (MAHOUT-885) Freq pattern growth advertises wrong value for default

2011-11-14 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-885?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-885: - Resolution: Fixed Fix Version/s: 0.6 Assignee: Sean Owen Status: Resolved (was

[jira] [Updated] (MAHOUT-881) Refactor TopItems to use Lucene's PriorityQueue and remove excessive sorting

2011-11-13 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-881: - Attachment: Call_Tree_2.html Call_Tree.html > Refactor TopItems to use Lucene's Prior

[jira] [Updated] (MAHOUT-155) ARFF VectorIterable

2011-11-06 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-155?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-155: - Resolution: Fixed Fix Version/s: 0.6 Assignee: Sean Owen (was: Grant Ingersoll)

[jira] [Updated] (MAHOUT-838) Make the confusion matrix writable to a file when testing classifiers

2011-11-04 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-838: - Resolution: Fixed Fix Version/s: 0.6 Assignee: Sean Owen Status: Resolved (was

[jira] [Updated] (MAHOUT-838) Make the confusion matrix writable to a file when testing classifiers

2011-11-01 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-838: - Affects Version/s: (was: 0.6) This patch still doesn't work for me -- tests still fail. If this patch

[jira] [Updated] (MAHOUT-838) Make the confusion matrix writable to a file when testing classifiers

2011-11-01 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-838?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-838: - Attachment: MAHOUT-838.patch OK, um, somehow when I applied the patch, the IDE showed me the change as of

[jira] [Updated] (MAHOUT-847) Improve Euclidean distance similarity calculation

2011-10-21 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-847: - Resolution: Fixed Status: Resolved (was: Patch Available) > Improve Euclidean distance simil

[jira] [Updated] (MAHOUT-847) Improve Euclidean distance similarity calculation

2011-10-20 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-847: - Status: Patch Available (was: Open) > Improve Euclidean distance similarity calculation > --

[jira] [Updated] (MAHOUT-847) Improve Euclidean distance similarity calculation

2011-10-20 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-847?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-847: - Attachment: MAHOUT-847.patch > Improve Euclidean distance similarity calculation > --

[jira] [Updated] (MAHOUT-710) Implementing K-Trusses

2011-10-15 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-710?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-710: - Resolution: Fixed Status: Resolved (was: Patch Available) Well some good code was already submit

[jira] [Updated] (MAHOUT-842) Inconsistent and conflicting use of '-i' flag

2011-10-15 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-842: - Resolution: Fixed Status: Resolved (was: Patch Available) > Inconsistent and conflicting use

[jira] [Updated] (MAHOUT-612) Simplify configuring and running Mahout MapReduce jobs from Java using Java bean configuration

2011-10-15 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-612?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-612: - Fix Version/s: (was: 0.6) > Simplify configuring and running Mahout MapReduce jobs from Java usin

[jira] [Updated] (MAHOUT-842) Inconsistent and conflicting use of '-i' flag

2011-10-15 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-842: - Attachment: MAHOUT-842.patch > Inconsistent and conflicting use of '-i' flag > --

[jira] [Updated] (MAHOUT-842) Inconsistent and conflicting use of '-i' flag

2011-10-15 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-842?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-842: - Status: Patch Available (was: Open) > Inconsistent and conflicting use of '-i' flag > --

[jira] [Updated] (MAHOUT-826) Bayes/CBayes classification on a non-existing feature

2011-10-15 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-826: - Affects Version/s: 0.5 Fix Version/s: 0.6 Assignee: Robin Anil > Bayes/CBayes cl

[jira] [Updated] (MAHOUT-826) Bayes/CBayes classification on a non-existing feature

2011-10-05 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-826?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-826: - Comment: was deleted (was: Great is there a patch available? Or can you describe the simple change speci

[jira] [Updated] (MAHOUT-812) Allow ConfusionMatrix to be Writable (via MatrixWritable)

2011-10-03 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-812: - Resolution: Fixed Status: Resolved (was: Patch Available) > Allow ConfusionMatrix to be Writ

[jira] [Updated] (MAHOUT-812) Allow ConfusionMatrix to be Writable (via MatrixWritable)

2011-10-02 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-812?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-812: - Attachment: MAHOUT-812.patch OK, this is looking reasonable. Some of the formatting needs to be cleaned

[jira] [Updated] (MAHOUT-823) RandomAccessSparseVector.dot with another non-sequential vector can be extremely non-symmetric in its performance

2011-10-01 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-823: - Resolution: Fixed Status: Resolved (was: Patch Available) Hearing essential consensus on this ap

[jira] [Updated] (MAHOUT-778) Mark folder name of final clustering iteration with pattern such as 'cluster-n-last'

2011-09-30 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-778: - Resolution: Fixed Assignee: Sean Owen (was: Robin Anil) Status: Resolved (was: Patch Avail

[jira] [Updated] (MAHOUT-823) RandomAccessSparseVector.dot with another non-sequential vector can be extremely non-symmetric in its performance

2011-09-30 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-823: - Fix Version/s: 0.6 Assignee: Sean Owen Labels: dot dot-product vector (wa

[jira] [Updated] (MAHOUT-823) RandomAccessSparseVector.dot with another non-sequential vector can be extremely non-symmetric in its performance

2011-09-30 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-823?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-823: - Attachment: MAHOUT-823.patch > RandomAccessSparseVector.dot with another non-sequential vector can be

[jira] [Updated] (MAHOUT-778) Mark folder name of final clustering iteration with pattern such as 'cluster-n-last'

2011-09-29 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-778: - Attachment: MAHOUT-778.patch OK here's an omnibus patch, including Jeff's idea. It shows the extent of th

[jira] [Updated] (MAHOUT-799) Cannot run SequenceFilesFromCsvFilter, ever

2011-09-28 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-799: - Resolution: Fixed Status: Resolved (was: Patch Available) > Cannot run SequenceFilesFromCsvF

[jira] [Updated] (MAHOUT-799) Cannot run SequenceFilesFromCsvFilter, ever

2011-09-27 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-799: - Attachment: MAHOUT-799.patch OK, different answer: I don't think the CSV filter can be 'saved'. I'm unabl

[jira] [Updated] (MAHOUT-799) Cannot run SequenceFilesFromCsvFilter, ever

2011-09-27 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-799: - Attachment: MAHOUT-799.patch Hmm, the author didn't follow up. As far as I can tell, the -filter option

[jira] [Updated] (MAHOUT-799) Cannot run SequenceFilesFromCsvFilter, ever

2011-09-27 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-799?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-799: - Fix Version/s: 0.6 Assignee: Sean Owen Affects Version/s: (was: 0.6)

[jira] [Updated] (MAHOUT-778) Mark folder name of final clustering iteration with pattern such as 'cluster-n-last'

2011-09-27 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-778: - Attachment: MAHOUT-778.patch > Mark folder name of final clustering iteration with pattern such as >

[jira] [Updated] (MAHOUT-778) Mark folder name of final clustering iteration with pattern such as 'cluster-n-last'

2011-09-27 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-778?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-778: - Assignee: Robin Anil Status: Patch Available (was: Open) > Mark folder name of final clusterin

[jira] [Updated] (MAHOUT-814) SSVD local tests should use their own tmp space to avoid collisions

2011-09-27 Thread Sean Owen (Updated) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-814?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-814: - Resolution: Fixed Status: Resolved (was: Patch Available) > SSVD local tests should use thei