[jira] Commented: (MAHOUT-141) Cluster Centroid improperly reported when no points associated w/ the cluster

2009-12-11 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-141?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789665#action_12789665 ] Drew Farris commented on MAHOUT-141: For the case of computeCentriod() in the kmeans pa

[jira] Commented: (MAHOUT-219) Unit test for GenericSorting

2009-12-11 Thread Benson Margulies (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789663#action_12789663 ] Benson Margulies commented on MAHOUT-219: - Comments would have avoided all this, I

[jira] Commented: (MAHOUT-219) Unit test for GenericSorting

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789655#action_12789655 ] Sean Owen commented on MAHOUT-219: -- Yeah that's where it ended up, back where it started.

The next patch

2009-12-11 Thread Benson Margulies
The next patch will be a bigger. It will remove a bunch of deprecates from interfaces (since you can't unit test an interface). It will add some new pieces to the collections code.

[jira] Commented: (MAHOUT-219) Unit test for GenericSorting

2009-12-11 Thread Benson Margulies (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789652#action_12789652 ] Benson Margulies commented on MAHOUT-219: - Gentlemen, I stand by the original patch

Re: Welcome Jake Mannix

2009-12-11 Thread Sean Owen
On Sat, Dec 12, 2009 at 12:28 AM, Jake Mannix wrote: > Ok, this kind of hook is good, but it leaves all of the work to the user - > it would > be nice to extend it along the lines I described, whereby developers can > define how to pull out various features of their items (or users), and then > gi

Re: Welcome Jake Mannix

2009-12-11 Thread Jake Mannix
On Fri, Dec 11, 2009 at 3:01 PM, Sean Owen wrote: > On Fri, Dec 11, 2009 at 10:23 PM, Jake Mannix > wrote: > > Where are these hooks you're describing here? The kind of general > > framework I would imagine would be nice to have is something like this: > > users and items themselves live as (se

Re: Welcome Jake Mannix

2009-12-11 Thread Sean Owen
Yes the editor says it's OK to share drafts with a handful of project people. I think their main concern is that you'd make good reviewers, and they'd want to be involved in recording and managing your feedback. But there seems to be benefit, and little harm, to showing you the relatively-finished

Re: Welcome Jake Mannix

2009-12-11 Thread Ted Dunning
Sean, It is clear also that I am woefully uninformed about Taste other than what I imagine to be how it works based on what you say and my estimate that you have generally good sense. Can you send chapters of your book as you write them to me and Jake so we can know your frame of reference better

Re: Welcome Jake Mannix

2009-12-11 Thread Sean Owen
On Fri, Dec 11, 2009 at 10:23 PM, Jake Mannix wrote: > Where are these hooks you're describing here?  The kind of general > framework I would imagine would be nice to have is something like this: > users and items themselves live as (semi-structured) documents (e.g. like > a Lucene Document, or mo

Re: [jira] Updated: (MAHOUT-168) Need integer compression routines

2009-12-11 Thread Sean Owen
0) Best thing is not to get to this point in the first place! It was necessary from the outset to just let 100 things proceed and see what sticks. Now I think we can gently move towards more focus. So I'd hope someone doesn't make up a big patch without it being clear there's a path to commit it qu

Re: [jira] Updated: (MAHOUT-168) Need integer compression routines

2009-12-11 Thread Ted Dunning
I am also unclear on how to do this. Anybody have good suggestions? On Fri, Dec 11, 2009 at 1:33 PM, Jake Mannix wrote: > I'm not sure of the right way to avoid redoing work again and again, yet > still avoid cluttering our codebase with a bunch of unsupported, unfinished > code. > -- Ted D

Re: [jira] Updated: (MAHOUT-168) Need integer compression routines

2009-12-11 Thread Jake Mannix
So I have a question about the whole "mothballing" process. We're don't have infinite time, and there's a limited number of us, so I understand wanting to keep some focus and not have half-finished work all over the place. But when we archive something as "mothballed", how will we ever find it ag

[jira] Resolved: (MAHOUT-45) Matrix QR decomposition

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-45?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-45. - Resolution: Won't Fix Sounds like the patch at hand is obsoleted by the recent matrix changes, and, large

[jira] Updated: (MAHOUT-156) Documentation and Code cleanup for all Bayesian Classes

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-156?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-156: - Priority: Minor (was: Major) Fix Version/s: 0.3 Assignee: Robin Anil > Documentation a

[jira] Updated: (MAHOUT-168) Need integer compression routines

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-168: - Resolution: Won't Fix Fix Version/s: (was: 0.3) Status: Resolved (was: Patch Availa

[jira] Commented: (MAHOUT-168) Need integer compression routines

2009-12-11 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789539#action_12789539 ] Ted Dunning commented on MAHOUT-168: Then intent for this was for improving the storage

[jira] Commented: (MAHOUT-219) Unit test for GenericSorting

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789534#action_12789534 ] Sean Owen commented on MAHOUT-219: -- ... would this be easier to grok if it were using Obje

[jira] Commented: (MAHOUT-219) Unit test for GenericSorting

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789533#action_12789533 ] Sean Owen commented on MAHOUT-219: -- OK yeah my head's on straight now. I see that too. >

[jira] Commented: (MAHOUT-61) Text problem matrix builder

2009-12-11 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789529#action_12789529 ] Ted Dunning commented on MAHOUT-61: --- I take it back... this looks slightly useful. Gra

[jira] Commented: (MAHOUT-61) Text problem matrix builder

2009-12-11 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789528#action_12789528 ] Ted Dunning commented on MAHOUT-61: --- I think superseded. > Text problem matrix builder

[jira] Commented: (MAHOUT-219) Unit test for GenericSorting

2009-12-11 Thread Drew Farris (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789521#action_12789521 ] Drew Farris commented on MAHOUT-219: It is still not quite right in r889812 because sd[

[jira] Updated: (MAHOUT-168) Need integer compression routines

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-168?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-168: - Priority: Minor (was: Major) Affects Version/s: 0.1 Fix Version/s: 0.3 Question, is

[jira] Updated: (MAHOUT-141) Cluster Centroid improperly reported when no points associated w/ the cluster

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-141?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen updated MAHOUT-141: - Priority: Minor (was: Major) Affects Version/s: 0.1 Fix Version/s: 0.3 Marking for

[jira] Commented: (MAHOUT-61) Text problem matrix builder

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-61?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789516#action_12789516 ] Sean Owen commented on MAHOUT-61: - Same, is this still relevant? Looks kind of MAHOUT-116 wh

[jira] Commented: (MAHOUT-219) Unit test for GenericSorting

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789509#action_12789509 ] Sean Owen commented on MAHOUT-219: -- I made a mistake here but I think it's not quite that

[jira] Commented: (MAHOUT-219) Unit test for GenericSorting

2009-12-11 Thread Benson Margulies (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789503#action_12789503 ] Benson Margulies commented on MAHOUT-219: - Sean, The String constructor is NOT red

[jira] Resolved: (MAHOUT-219) Unit test for GenericSorting

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-219. -- Resolution: Fixed Fix Version/s: 0.3 Assignee: Benson Margulies Committed, with style c

[jira] Resolved: (MAHOUT-218) Update to Junit 4.5

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-218. -- Resolution: Fixed Fix Version/s: 0.3 Assignee: Benson Margulies Took the liberty of com

Re: colt-collections

2009-12-11 Thread Ted Dunning
Shucks The Spotted Pony is one of my favorite fiddle tunes. I would have (for unworthy, unprofessional reasons) love to work on a pony package. But I think Sean has the right of it... we should just promote matrix and move the collections stuff to a collections package. On Fri, Dec 11, 2009

Re: One more colt question

2009-12-11 Thread Ted Dunning
Our thought was that we would remove the deprecations as we add the tests and in the meantime would have a continual nudge to do more testing. Colt came with no tests. On Fri, Dec 11, 2009 at 7:51 AM, Sean Owen wrote: > They were put in place to mark that which doesn't have a unit test. > > On

Re: [jira] Commented: (MAHOUT-218) Update to Junit 4.5

2009-12-11 Thread Ted Dunning
I have largely switched to the new style, but Sean is right that there is nearly no difference. The only positive differences that I have seen so far are: a) inheritance is more flexible since you don't have to explicitly inherit from TestCase b) I can remember how to do class level setup versus

[jira] Commented: (MAHOUT-116) Decode matrix methods

2009-12-11 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789438#action_12789438 ] Ted Dunning commented on MAHOUT-116: Hasn't this been subsumed by other work? > Deco

[jira] Commented: (MAHOUT-219) Unit test for GenericSorting

2009-12-11 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789437#action_12789437 ] Ted Dunning commented on MAHOUT-219: This looks like a fine patch (just a test and a s

[jira] Commented: (MAHOUT-219) Unit test for GenericSorting

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789404#action_12789404 ] Sean Owen commented on MAHOUT-219: -- OK well I can assist to keep your work moving. Let's p

[jira] Commented: (MAHOUT-219) Unit test for GenericSorting

2009-12-11 Thread Benson Margulies (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789402#action_12789402 ] Benson Margulies commented on MAHOUT-219: - no. I have always depended on the kindn

[jira] Commented: (MAHOUT-116) Decode matrix methods

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-116?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789401#action_12789401 ] Sean Owen commented on MAHOUT-116: -- Pinging this issue -- still live? > Decode matrix met

[jira] Commented: (MAHOUT-219) Unit test for GenericSorting

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789399#action_12789399 ] Sean Owen commented on MAHOUT-219: -- Looks good to me. Dumb question but are you a committe

[jira] Commented: (MAHOUT-218) Update to Junit 4.5

2009-12-11 Thread Benson Margulies (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789395#action_12789395 ] Benson Margulies commented on MAHOUT-218: - I'm setting out to help fill the large g

[jira] Updated: (MAHOUT-219) Unit test for GenericSorting

2009-12-11 Thread Benson Margulies (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-219?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benson Margulies updated MAHOUT-219: Attachment: generic-sorting-test.patch > Unit test for GenericSorting > ---

[jira] Created: (MAHOUT-219) Unit test for GenericSorting

2009-12-11 Thread Benson Margulies (JIRA)
Unit test for GenericSorting Key: MAHOUT-219 URL: https://issues.apache.org/jira/browse/MAHOUT-219 Project: Mahout Issue Type: Improvement Components: Matrix Affects Versions: 0.4 Report

[jira] Commented: (MAHOUT-217) Tidy up generated data after unit tests are run

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789389#action_12789389 ] Sean Owen commented on MAHOUT-217: -- Agree with this. The right approach is a combination o

[jira] Commented: (MAHOUT-218) Update to Junit 4.5

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789385#action_12789385 ] Sean Owen commented on MAHOUT-218: -- I'm OK with it. Maybe I'm a luddite or just not getti

Re: SVM algo, code, etc.

2009-12-11 Thread Sean Owen
Sure is there a mailing list or something for this? I'd like to be looped into talking about issues like this. On Fri, Dec 11, 2009 at 3:59 PM, Isabel Drost wrote: > Attracting new committers and integrating contributors is a topic that > is not only relevant for Mahout but for many, if not all A

[jira] Updated: (MAHOUT-218) Update to Junit 4.5

2009-12-11 Thread Benson Margulies (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-218?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Benson Margulies updated MAHOUT-218: Attachment: up-junit.patch > Update to Junit 4.5 > --- > >

[jira] Created: (MAHOUT-218) Update to Junit 4.5

2009-12-11 Thread Benson Margulies (JIRA)
Update to Junit 4.5 --- Key: MAHOUT-218 URL: https://issues.apache.org/jira/browse/MAHOUT-218 Project: Mahout Issue Type: Task Components: Utils Affects Versions: 0.4 Reporter: Benson Margulies Jun

Re: SVM algo, code, etc.

2009-12-11 Thread Isabel Drost
On Fri Sean Owen wrote: > On Fri Isabel Drost wrote: > > If you are interested in a broader discussion, it might make sense > > to include the people over at the newly founded community > > development project in the discussion? > > What's this? Attracting new committers and integrating contribu

Re: One more colt question

2009-12-11 Thread Sean Owen
They were put in place to mark that which doesn't have a unit test. On Fri, Dec 11, 2009 at 3:48 PM, Benson Margulies wrote: > Did all those deprecations come from Colt, or were they put in place > as part of the fork/cleanup process? >

Re: colt-collections

2009-12-11 Thread Sean Owen
On Fri, Dec 11, 2009 at 3:43 PM, Benson Margulies wrote: > One proposal I want to make first: the token 'matrix' in the package > names is confusing. I know you don't want to use 'colt'. How about > 'pony' for bitvector, buffer, and functions, and 'collections' for > list and map? A less silly alt

One more colt question

2009-12-11 Thread Benson Margulies
Did all those deprecations come from Colt, or were they put in place as part of the fork/cleanup process?

Re: SVM algo, code, etc.

2009-12-11 Thread Sean Owen
On Fri, Dec 11, 2009 at 2:08 PM, Isabel Drost wrote: > I would guess that your question is not really restricted to Mahout but > the same question appears on other projects as well. Basically the > question is on "how to mentor developers to become new project members". The first question is how

[jira] Updated: (MAHOUT-210) Publish code quality reports through maven

2009-12-11 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-210?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Isabel Drost updated MAHOUT-210: Attachment: MAHOUT-210.patch The patch adds clover, findbugs, pmd, cpd and maven dependency reports

colt-collections

2009-12-11 Thread Benson Margulies
Folks, I have rather slowly and densely assimilated the messages from Ted and others about what you've been up to with COLT. It seems to me that my ambitions to get an Apache library to compete with Trove would be most easily tackled by improving your fork than by any other strategy, so, with your

Re: SVM algo, code, etc.

2009-12-11 Thread Isabel Drost
On Fri Sean Owen wrote: > 1) Is SVM in scope for Mahout? (I am guessing so.) Yes. > 2) Who is nominally committing to shepherd the code into the code base > and fix bugs and answer questions? (Jake?) > > I'm not really bothered about this particular patch, but the more > general question. I

[jira] Commented: (MAHOUT-85) Perceptron/Winnow Trainer

2009-12-11 Thread Isabel Drost (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-85?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789312#action_12789312 ] Isabel Drost commented on MAHOUT-85: I am about to add tests currently. I guess, I will

[jira] Created: (MAHOUT-217) Tidy up generated data after unit tests are run

2009-12-11 Thread Isabel Drost (JIRA)
Tidy up generated data after unit tests are run --- Key: MAHOUT-217 URL: https://issues.apache.org/jira/browse/MAHOUT-217 Project: Mahout Issue Type: Improvement Affects Versions: 0.3

Re: SVM algo, code, etc.

2009-12-11 Thread Jake Mannix
I really feel like I should respond to this, but seeing as I live on the west coast of the US, going to bed might be more advisable. On a very specific topic of SVMs, I can certainly look into this, but David, were you interested in helping bring this into Mahout and help maintain it? You are ofte

[jira] Resolved: (MAHOUT-208) Vector.getLengthSquared() is dangerously optimized

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-208?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-208. -- Resolution: Fixed Fixed by adding ' lengthSquared = -1.0; ' to every place the values are mutated. >

Re: SVM algo, code, etc.

2009-12-11 Thread Sean Owen
This is a timely message, since I'm currently presuming to close some old Mahout issues at the moment and it raises a related concern. There's lots of old JIRA issues of the form: 1) somebody submits a patch implementing part of something 2) some comments happen, maybe 3) nothing happens for a yea

[jira] Commented: (MAHOUT-208) Vector.getLengthSquared() is dangerously optimized

2009-12-11 Thread Jake Mannix (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789274#action_12789274 ] Jake Mannix commented on MAHOUT-208: bq. Actually there are a number of issues I'd like

Re: SVM algo, code, etc.

2009-12-11 Thread Jake Mannix
Hi Zhao, I would certainly love to see a nice parallel SVM on hadoop. Submit a patch, let's get it in Mahout! -jake On Fri, Dec 11, 2009 at 3:52 AM, zhao zhendong wrote: > True, I am still wondering about whether it is valuable to implement a > parallel SVM on hadoop? I really wanna join i

[jira] Resolved: (MAHOUT-54) parallelize k-means sharing the predominance of canopies

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-54?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-54. - Resolution: Won't Fix It appears this one has been inactive for over a year and the patch as submitted wa

[jira] Resolved: (MAHOUT-24) Skeletal LWLR implementation

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-24?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-24. - Resolution: Later Archiving due to inactivity > Skeletal LWLR implementation > --

[jira] Commented: (MAHOUT-19) Hierarchial clusterer

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-19?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789264#action_12789264 ] Sean Owen commented on MAHOUT-19: - Sounds like this is something that should be marked "Late

[jira] Commented: (MAHOUT-208) Vector.getLengthSquared() is dangerously optimized

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789262#action_12789262 ] Sean Owen commented on MAHOUT-208: -- Actually I think you are right that this is the ultima

Re: SVM algo, code, etc.

2009-12-11 Thread zhao zhendong
True, I am still wondering about whether it is valuable to implement a parallel SVM on hadoop? I really wanna join in mike's group. Just like Olivier concerned, some linear version of SVM solvers can handle large-scale data sets ( several seconds for 100K-level samples). It's true that the linear

[jira] Commented: (MAHOUT-208) Vector.getLengthSquared() is dangerously optimized

2009-12-11 Thread Jake Mannix (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789217#action_12789217 ] Jake Mannix commented on MAHOUT-208: So the strategy is just "be careful" (what I did i

[jira] Commented: (MAHOUT-208) Vector.getLengthSquared() is dangerously optimized

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789207#action_12789207 ] Sean Owen commented on MAHOUT-208: -- Agree with Jake, there isn't any special handling of h

[jira] Resolved: (MAHOUT-117) Add Javadocs to site

2009-12-11 Thread Sean Owen (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-117?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sean Owen resolved MAHOUT-117. -- Resolution: Duplicate Really subsumed by MAHOUT-210 > Add Javadocs to site > > >

[jira] Commented: (MAHOUT-208) Vector.getLengthSquared() is dangerously optimized

2009-12-11 Thread Jake Mannix (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789201#action_12789201 ] Jake Mannix commented on MAHOUT-208: bq. Alternative to maintaining caching flag is to

[jira] Commented: (MAHOUT-208) Vector.getLengthSquared() is dangerously optimized

2009-12-11 Thread Shashikant Kore (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-208?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=12789171#action_12789171 ] Shashikant Kore commented on MAHOUT-208: It is important to have this caching to k