[jira] [Commented] (MAHOUT-977) Thread-safe version of PlusAnonymousUserDataModel with multiple concurrent users

2012-03-01 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13220061#comment-13220061 ] Sean Owen commented on MAHOUT-977: -- That's OK. The book is specifically for version 0.5.

[jira] [Commented] (MAHOUT-977) Thread-safe version of PlusAnonymousUserDataModel with multiple concurrent users

2012-02-20 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-977?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13212007#comment-13212007 ] Sean Owen commented on MAHOUT-977: -- If you are willing and interested you are welcome to

[jira] [Commented] (MAHOUT-978) spectralkmeans utility fails when input filename begins with leading underscore

2012-02-19 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-978?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13211381#comment-13211381 ] Sean Owen commented on MAHOUT-978: -- It is making the argument into a qualified version of

[jira] [Commented] (MAHOUT-504) Kmeans clustering error

2012-02-14 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-504?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13207791#comment-13207791 ] Sean Owen commented on MAHOUT-504: -- Is this valid as a path to clusters? Shouldn't it be

[jira] [Commented] (MAHOUT-784) Exception at 20 Newsgroups examples

2012-02-12 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-784?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13206377#comment-13206377 ] Sean Owen commented on MAHOUT-784: -- PS guys I had already committed the patch. I think th

[jira] [Commented] (MAHOUT-947) Improvements to seqdumper

2012-02-09 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-947?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13205262#comment-13205262 ] Sean Owen commented on MAHOUT-947: -- My only issue with this is that this has brought in a

[jira] [Commented] (MAHOUT-972) Implement Taste DynamoDBDataModel

2012-02-09 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13204538#comment-13204538 ] Sean Owen commented on MAHOUT-972: -- Ok, good start. This will go in integration/ and it w

[jira] [Commented] (MAHOUT-946) Map-reduce job status often left unchecked

2012-02-07 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-946?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13202848#comment-13202848 ] Sean Owen commented on MAHOUT-946: -- I like it all, except I'm not sure about the cleanup

[jira] [Commented] (MAHOUT-963) GenericUserPreferenceArray and GenericItemPreferenceArray use selection sorts

2012-01-28 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-963?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13195599#comment-13195599 ] Sean Owen commented on MAHOUT-963: -- I think something else must be at work... I just don'

[jira] [Commented] (MAHOUT-959) VectorWritable does not preserve the laxPrecision flag

2012-01-26 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13193682#comment-13193682 ] Sean Owen commented on MAHOUT-959: -- Writable is an agent for serialization, not data itse

[jira] [Commented] (MAHOUT-959) VectorWritable does not preserve the laxPrecision flag

2012-01-25 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-959?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13193449#comment-13193449 ] Sean Owen commented on MAHOUT-959: -- No, this is wrong. The vectors are most certainly rea

[jira] [Commented] (MAHOUT-945) The variance calculation of Random forest regression tree

2012-01-14 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-945?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13186184#comment-13186184 ] Sean Owen commented on MAHOUT-945: -- That's good, but the new implementation just duplicat

[jira] [Commented] (MAHOUT-943) Improbe the way to make the split point on DF.

2012-01-11 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-943?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13184032#comment-13184032 ] Sean Owen commented on MAHOUT-943: -- Or RunningAverageAndStdDev does this too

[jira] [Commented] (MAHOUT-826) Bayes/CBayes classification on a non-existing feature

2012-01-10 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-826?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13183163#comment-13183163 ] Sean Owen commented on MAHOUT-826: -- I have no idea, was just trying to pitch in. This is

[jira] [Commented] (MAHOUT-768) Duplicated DoubleFunction in mahout and mahout-collections (mahout.math package).

2012-01-08 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-768?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13182263#comment-13182263 ] Sean Owen commented on MAHOUT-768: -- I agree with merging back. > Duplic

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-28 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13176697#comment-13176697 ] Sean Owen commented on MAHOUT-906: -- OK. I'm ready to commit the hook, with minor changes.

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-26 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13175941#comment-13175941 ] Sean Owen commented on MAHOUT-906: -- Old data can and should just be excluded from the tes

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-25 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13175838#comment-13175838 ] Sean Owen commented on MAHOUT-906: -- After looking at this more I'm not sure this is the r

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-25 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13175832#comment-13175832 ] Sean Owen commented on MAHOUT-906: -- Oh I see, this actually doesn't implement test/traini

[jira] [Commented] (MAHOUT-904) SplitInput should support randomizing the input

2011-12-23 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13175408#comment-13175408 ] Sean Owen commented on MAHOUT-904: -- (I don't know if this is a relevant comment, but we o

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-22 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13174802#comment-13174802 ] Sean Owen commented on MAHOUT-906: -- Do you mind if I end up splitting these interfaces? T

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-20 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13173218#comment-13173218 ] Sean Owen commented on MAHOUT-906: -- OK the problem I'm still having with it, which is a s

[jira] [Commented] (MAHOUT-874) Extract Writables into a separate module to allow smaller dependencies

2011-12-20 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13173069#comment-13173069 ] Sean Owen commented on MAHOUT-874: -- Ah, there's a 'provided' scope? That would be great s

[jira] [Commented] (MAHOUT-874) Extract Writables into a separate module to allow smaller dependencies

2011-12-19 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13172829#comment-13172829 ] Sean Owen commented on MAHOUT-874: -- That's not what I meant -- you were drawing a compari

[jira] [Commented] (MAHOUT-874) Extract Writables into a separate module to allow smaller dependencies

2011-12-19 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13172792#comment-13172792 ] Sean Owen commented on MAHOUT-874: -- What all classes in core depend on doesn't matter, if

[jira] [Commented] (MAHOUT-874) Extract Writables into a separate module to allow smaller dependencies

2011-12-19 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13172736#comment-13172736 ] Sean Owen commented on MAHOUT-874: -- Separating out a few classes won't change what they d

[jira] [Commented] (MAHOUT-874) Extract Writables into a separate module to allow smaller dependencies

2011-12-19 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-874?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13172489#comment-13172489 ] Sean Owen commented on MAHOUT-874: -- Is this purely an issue of the size of your resulting

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-16 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13170835#comment-13170835 ] Sean Owen commented on MAHOUT-906: -- Sorry did not mean to make you re-upload, just checki

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-16 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13170822#comment-13170822 ] Sean Owen commented on MAHOUT-906: -- (The patch isn't marked for inclusion in the project

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-15 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13170243#comment-13170243 ] Sean Owen commented on MAHOUT-906: -- OK shall I wait for a complete patch?

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-15 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13170154#comment-13170154 ] Sean Owen commented on MAHOUT-906: -- In both cases you have data to put into a model and s

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-15 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13170141#comment-13170141 ] Sean Owen commented on MAHOUT-906: -- Yes that's a good start. Do you think it's possible a

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-15 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13170119#comment-13170119 ] Sean Owen commented on MAHOUT-906: -- OK, sounds like you want to replace more logic, but t

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169485#comment-13169485 ] Sean Owen commented on MAHOUT-906: -- No I think it's as simple as factoring out this secti

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169361#comment-13169361 ] Sean Owen commented on MAHOUT-906: -- Sure, you can do that. I am not sure that gives you a

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169332#comment-13169332 ] Sean Owen commented on MAHOUT-906: -- For the IR precision/recall evaluation, if you *do* h

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169280#comment-13169280 ] Sean Owen commented on MAHOUT-906: -- OK. I think we're speaking about the estimation test,

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169249#comment-13169249 ] Sean Owen commented on MAHOUT-906: -- Are we talking about the IR tests, estimation test or

[jira] [Commented] (MAHOUT-906) Allow collaborative filtering evaluators to use custom logic in splitting data set

2011-12-14 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-906?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13169241#comment-13169241 ] Sean Owen commented on MAHOUT-906: -- Yes, the lightest-touch approach is to pull them out

[jira] [Commented] (MAHOUT-923) Row mean job for PCA

2011-12-13 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13168414#comment-13168414 ] Sean Owen commented on MAHOUT-923: -- clone() can return what it likes, though it is intend

[jira] [Commented] (MAHOUT-925) Evaluate the reach of recommender algorithms

2011-12-13 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13168412#comment-13168412 ] Sean Owen commented on MAHOUT-925: -- @Anatoliy how would the recommender decide a relevanc

[jira] [Commented] (MAHOUT-925) Evaluate the reach of recommender algorithms

2011-12-13 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13168299#comment-13168299 ] Sean Owen commented on MAHOUT-925: -- Yes you could create a different kind of test that do

[jira] [Commented] (MAHOUT-923) Row mean job for PCA

2011-12-13 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-923?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13168289#comment-13168289 ] Sean Owen commented on MAHOUT-923: -- Lance, what are the clone() "flaws" you're talking ab

[jira] [Commented] (MAHOUT-925) Evaluate the reach of recommender algorithms

2011-12-12 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-925?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13167573#comment-13167573 ] Sean Owen commented on MAHOUT-925: -- This is fine, though, don't you want to count like so

[jira] [Commented] (MAHOUT-913) Style changes / discussion

2011-12-11 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13167136#comment-13167136 ] Sean Owen commented on MAHOUT-913: -- You mean it has a parameter "a"? I would not write an

[jira] [Commented] (MAHOUT-916) Make Mahout's tests run in parallel

2011-12-09 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13166325#comment-13166325 ] Sean Owen commented on MAHOUT-916: -- That is what I see. I am not even sure this is forkin

[jira] [Commented] (MAHOUT-916) Make Mahout's tests run in parallel

2011-12-09 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-916?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13166043#comment-13166043 ] Sean Owen commented on MAHOUT-916: -- The patch works for me. However it takes just about t

[jira] [Commented] (MAHOUT-913) Style changes / discussion

2011-12-08 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13165496#comment-13165496 ] Sean Owen commented on MAHOUT-913: -- I write: 1 (int), 1.0 (double), 1L (long), 1.0f (floa

[jira] [Commented] (MAHOUT-913) Style changes / discussion

2011-12-08 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13165466#comment-13165466 ] Sean Owen commented on MAHOUT-913: -- 1d and 1f look like hex literals to me -- at the leas

[jira] [Commented] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-07 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13164352#comment-13164352 ] Sean Owen commented on MAHOUT-910: -- Yes, you would just set a very high value for the fir

[jira] [Commented] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-07 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13164250#comment-13164250 ] Sean Owen commented on MAHOUT-910: -- Isn't this just a matter of setting the limits as you

[jira] [Commented] (MAHOUT-917) Build takes too long

2011-12-06 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-917?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13163866#comment-13163866 ] Sean Owen commented on MAHOUT-917: -- I don't know if it's so hard... a few tests have been

[jira] [Commented] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-06 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13163743#comment-13163743 ] Sean Owen commented on MAHOUT-910: -- If I've understood Ted right then I'm not hearing obj

[jira] [Commented] (MAHOUT-915) OutOfMemoryError in EigenVerificationJob

2011-12-06 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-915?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13163643#comment-13163643 ] Sean Owen commented on MAHOUT-915: -- toString() on Vectors writes out everything. I'm almo

[jira] [Commented] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-05 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13163085#comment-13163085 ] Sean Owen commented on MAHOUT-910: -- It's still computing some maximum (for each of three

[jira] [Commented] (MAHOUT-902) TanimotoCoefficientSimilarity should return Double.NaN for two items that have zero overlap

2011-12-05 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162999#comment-13162999 ] Sean Owen commented on MAHOUT-902: -- Ah I think I misunderstood what was to change from th

[jira] [Commented] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-05 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162901#comment-13162901 ] Sean Owen commented on MAHOUT-910: -- I agree. Since we have three samplings here, the simp

[jira] [Commented] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-05 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162899#comment-13162899 ] Sean Owen commented on MAHOUT-910: -- Daniel says: Hi Sean, I have been playing around wit

[jira] [Commented] (MAHOUT-902) TanimotoCoefficientSimilarity should return Double.NaN for two items that have zero overlap

2011-12-05 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162885#comment-13162885 ] Sean Owen commented on MAHOUT-902: -- That's fine, I think we also need to change the distr

[jira] [Commented] (MAHOUT-902) TanimotoCoefficientSimilarity should return Double.NaN for two items that have zero overlap

2011-12-05 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162760#comment-13162760 ] Sean Owen commented on MAHOUT-902: -- Ah, right it affects the item-item computation in the

[jira] [Commented] (MAHOUT-902) TanimotoCoefficientSimilarity should return Double.NaN for two items that have zero overlap

2011-12-04 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-902?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162398#comment-13162398 ] Sean Owen commented on MAHOUT-902: -- Sebastian is this turned around? yes the non-distribu

[jira] [Commented] (MAHOUT-913) Style changes / discussion

2011-12-04 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162333#comment-13162333 ] Sean Owen commented on MAHOUT-913: -- I'm not suggesting standardizing on a tool, no. I do

[jira] [Commented] (MAHOUT-913) Style changes / discussion

2011-12-03 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162251#comment-13162251 ] Sean Owen commented on MAHOUT-913: -- We do have mvn checkstyle available; I think it still

[jira] [Commented] (MAHOUT-913) Style changes / discussion

2011-12-03 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162236#comment-13162236 ] Sean Owen commented on MAHOUT-913: -- - private transient static Logger log = LoggerFacto

[jira] [Commented] (MAHOUT-913) Style changes / discussion

2011-12-03 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162232#comment-13162232 ] Sean Owen commented on MAHOUT-913: -- I don't know it's a level of dedication thing -- it's

[jira] [Commented] (MAHOUT-913) Style changes / discussion

2011-12-03 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-913?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162228#comment-13162228 ] Sean Owen commented on MAHOUT-913: -- That's right. The only usage I saw this time was on a

[jira] [Commented] (MAHOUT-910) Improve sampling in SamplingCandidateItemStrategy, optimize intersection computations

2011-12-03 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-910?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162086#comment-13162086 ] Sean Owen commented on MAHOUT-910: -- Yeah you could; in a few cases the caller already kno

[jira] [Commented] (MAHOUT-912) InMemoryCollapsedVariationalBayes0 should ignore _SUCCESS files

2011-12-03 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-912?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13162085#comment-13162085 ] Sean Owen commented on MAHOUT-912: -- (listStatus() will also accept a PathFilter directly)

[jira] [Commented] (MAHOUT-908) Example shell scripts don't run properly on Ubuntu

2011-12-02 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-908?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13161639#comment-13161639 ] Sean Owen commented on MAHOUT-908: -- Yeah safe to assume bash, I think. >

[jira] [Commented] (MAHOUT-901) KnnItemBasedRecommender is not working properly

2011-11-29 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13159200#comment-13159200 ] Sean Owen commented on MAHOUT-901: -- Thanks, though this is identical to the current test

[jira] [Commented] (MAHOUT-898) Error in formula for preference estimation in GenericItemBasedRecommender

2011-11-28 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13158657#comment-13158657 ] Sean Owen commented on MAHOUT-898: -- Yes I could imagine this improves metrics in some cas

[jira] [Commented] (MAHOUT-901) KnnItemBasedRecommender is not working properly

2011-11-28 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13158478#comment-13158478 ] Sean Owen commented on MAHOUT-901: -- Sounds good, I generally trust you've investigated th

[jira] [Commented] (MAHOUT-898) Error in formula for preference estimation in GenericItemBasedRecommender

2011-11-27 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13158101#comment-13158101 ] Sean Owen commented on MAHOUT-898: -- (Pearson is often mentioned in early literature but i

[jira] [Commented] (MAHOUT-898) Error in formula for preference estimation in GenericItemBasedRecommender

2011-11-27 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13158028#comment-13158028 ] Sean Owen commented on MAHOUT-898: -- I understand the issue, but this doesn't fix it. Say

[jira] [Commented] (MAHOUT-896) Improve readability of AbstractDifferenceRecommenderEvaluator class

2011-11-25 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13157244#comment-13157244 ] Sean Owen commented on MAHOUT-896: -- I am ex-Google too, and agree. This change does not m

[jira] [Commented] (MAHOUT-896) Improve readability of AbstractDifferenceRecommenderEvaluator class

2011-11-25 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13157224#comment-13157224 ] Sean Owen commented on MAHOUT-896: -- OK. I think this is fairly trivial, renaming things l

[jira] [Commented] (MAHOUT-896) Improve readability of AbstractRecommender class

2011-11-25 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-896?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13157160#comment-13157160 ] Sean Owen commented on MAHOUT-896: -- OK, do you have any specific suggestions? The fields

[jira] [Commented] (MAHOUT-891) LoadEvaluationRunner and Recommender stats

2011-11-21 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-891?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13154543#comment-13154543 ] Sean Owen commented on MAHOUT-891: -- Yeah, it was for symmetry with the IRStatistics I sup

[jira] [Commented] (MAHOUT-881) Refactor TopItems to use Lucene's PriorityQueue and remove excessive sorting

2011-11-20 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13153791#comment-13153791 ] Sean Owen commented on MAHOUT-881: -- I'm referring to TopItemsTest and anything already co

[jira] [Commented] (MAHOUT-890) Performance issue in FPGrowth

2011-11-20 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-890?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13153770#comment-13153770 ] Sean Owen commented on MAHOUT-890: -- Do you have a suggested fix, or is it more of an obse

[jira] [Commented] (MAHOUT-886) FPtree nodes multiply-added (becoming siblings in tree)

2011-11-14 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-886?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13150011#comment-13150011 ] Sean Owen commented on MAHOUT-886: -- It looks OK to me, and passes tests, and I trust that

[jira] [Commented] (MAHOUT-881) Refactor TopItems to use Lucene's PriorityQueue and remove excessive sorting

2011-11-13 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149381#comment-13149381 ] Sean Owen commented on MAHOUT-881: -- I think the tests should at least be committed. I'd a

[jira] [Commented] (MAHOUT-881) Refactor TopItems to use Lucene's PriorityQueue and remove excessive sorting

2011-11-13 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149270#comment-13149270 ] Sean Owen commented on MAHOUT-881: -- Since it's easy, I just used jprofiler to observe the

[jira] [Commented] (MAHOUT-881) Refactor TopItems to use Lucene's PriorityQueue and remove excessive sorting

2011-11-12 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149113#comment-13149113 ] Sean Owen commented on MAHOUT-881: -- Both good points, I get you now. For me, these last f

[jira] [Commented] (MAHOUT-881) Refactor TopItems to use Lucene's PriorityQueue and remove excessive sorting

2011-11-12 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149109#comment-13149109 ] Sean Owen commented on MAHOUT-881: -- (See my comments on dev@ too) Why are there fewer op

[jira] [Commented] (MAHOUT-882) TopItems.getTopUsers ignores rescoring

2011-11-12 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149039#comment-13149039 ] Sean Owen commented on MAHOUT-882: -- Ugh, that's been there for ages. I'll fix it and Simi

[jira] [Commented] (MAHOUT-881) Refactor TopItems to use Lucene's PriorityQueue and remove excessive sorting

2011-11-12 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13149034#comment-13149034 ] Sean Owen commented on MAHOUT-881: -- -1 Grant I thought we discussed this on the mailing l

[jira] [Commented] (MAHOUT-155) ARFF VectorIterable

2011-11-02 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-155?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13142003#comment-13142003 ] Sean Owen commented on MAHOUT-155: -- Committed, with a small change to spilt out your test

[jira] [Commented] (MAHOUT-838) Make the confusion matrix writable to a file when testing classifiers

2011-11-01 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13141114#comment-13141114 ] Sean Owen commented on MAHOUT-838: -- Lance, there are still unresolved issues in this patc

[jira] [Commented] (MAHOUT-838) Make the confusion matrix writable to a file when testing classifiers

2011-10-30 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13139691#comment-13139691 ] Sean Owen commented on MAHOUT-838: -- I'm still seeing minor issues, like funny indentation

[jira] [Commented] (MAHOUT-834) rowsimilarityjob doesn't clean it's temp dir, and fails when seeing it again

2011-10-24 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13134120#comment-13134120 ] Sean Owen commented on MAHOUT-834: -- On the one hand I'm reluctant to mix output and inter

[jira] [Commented] (MAHOUT-834) rowsimilarityjob doesn't clean it's temp dir, and fails when seeing it again

2011-10-24 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13133989#comment-13133989 ] Sean Owen commented on MAHOUT-834: -- Oh, probably so. I didn't know about this option. Any

[jira] [Commented] (MAHOUT-834) rowsimilarityjob doesn't clean it's temp dir, and fails when seeing it again

2011-10-24 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-834?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13133981#comment-13133981 ] Sean Owen commented on MAHOUT-834: -- Out of interest, I tried making all jobs delete their

[jira] [Commented] (MAHOUT-838) Make the confusion matrix writable to a file when testing classifiers

2011-10-24 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13133866#comment-13133866 ] Sean Owen commented on MAHOUT-838: -- OK, better. Now we're to small items. Javadoc must st

[jira] [Commented] (MAHOUT-838) Make the confusion matrix writable to a file when testing classifiers

2011-10-22 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-838?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13133355#comment-13133355 ] Sean Owen commented on MAHOUT-838: -- Lance I tried just this, but I still get compile erro

[jira] [Commented] (MAHOUT-847) Improve Euclidean distance similarity calculation

2011-10-21 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13133279#comment-13133279 ] Sean Owen commented on MAHOUT-847: -- No, EuclideanDistanceMeasure is a distance measure ra

[jira] [Commented] (MAHOUT-847) Improve Euclidean distance similarity calculation

2011-10-21 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-847?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13132507#comment-13132507 ] Sean Owen commented on MAHOUT-847: -- The problem with caching sqrt(n) is that every pair o

[jira] [Commented] (MAHOUT-828) bin/mahout should only print classpath on request, not all the time

2011-10-20 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-828?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13131506#comment-13131506 ] Sean Owen commented on MAHOUT-828: -- Ted I don't think you actually committed this to SVN.

[jira] [Commented] (MAHOUT-829) bin/mahout doesn't match the way the packaged forms of Mahout are arranged

2011-10-20 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-829?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13131507#comment-13131507 ] Sean Owen commented on MAHOUT-829: -- Same here it did not seem to be in trunk.

[jira] [Commented] (MAHOUT-672) Implementation of Conjugate Gradient for solving large linear systems

2011-10-20 Thread Sean Owen (Commented) (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-672?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13131505#comment-13131505 ] Sean Owen commented on MAHOUT-672: -- Folks -- what's the status on this? It's been sitting

  1   2   >