[jira] [Commented] (MAHOUT-1214) Improve the accuracy of the Spectral KMeans Method

2013-05-17 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13661305#comment-13661305 ] Ted Dunning commented on MAHOUT-1214: - Shannon, Wouldn't it be better to drop the L

[jira] [Commented] (MAHOUT-1214) Improve the accuracy of the Spectral KMeans Method

2013-05-17 Thread Yiqun Hu (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1214?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13661212#comment-13661212 ] Yiqun Hu commented on MAHOUT-1214: -- For the orthogonality check, our proposal is sprcifi

Jenkins build is back to normal : Mahout-Examples-Cluster-Reuters-II #483

2013-05-17 Thread Apache Jenkins Server
See

[jira] [Commented] (MAHOUT-1219) LSHSearcher not always faster than BruteSearcher

2013-05-17 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13660836#comment-13660836 ] Suneel Marthi commented on MAHOUT-1219: --- Here's the error from StreamingKMeansTest:

[jira] [Commented] (MAHOUT-1219) LSHSearcher not always faster than BruteSearcher

2013-05-17 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13660823#comment-13660823 ] Dan Filimon commented on MAHOUT-1219: - The StreamingKMeansTest is now also wonky beca

[jira] [Commented] (MAHOUT-1219) LSHSearcher not always faster than BruteSearcher

2013-05-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1219?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13660752#comment-13660752 ] Hudson commented on MAHOUT-1219: Integrated in Mahout-Quality #2002 (See [https://builds

Jenkins build is back to normal : Mahout-Quality #2002

2013-05-17 Thread Apache Jenkins Server
See

[jira] [Created] (MAHOUT-1219) LSHSearcher not always faster than BruteSearcher

2013-05-17 Thread Dan Filimon (JIRA)
Dan Filimon created MAHOUT-1219: --- Summary: LSHSearcher not always faster than BruteSearcher Key: MAHOUT-1219 URL: https://issues.apache.org/jira/browse/MAHOUT-1219 Project: Mahout Issue Type: T

Build failed in Jenkins: Mahout-Quality #2001

2013-05-17 Thread Apache Jenkins Server
See Changes: [dfilimon] MAHOUT-1217: Nearest neighbor searchers sometimes fail to remove points This fixes FastProjectionSearch's searchFirst() which was not also searching through pendingAdditions. I think I replicated the bug in the

[jira] [Commented] (MAHOUT-1217) Nearest neighbor searchers sometimes fail to remove points

2013-05-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13660705#comment-13660705 ] Hudson commented on MAHOUT-1217: Integrated in Mahout-Quality #2001 (See [https://builds

[jira] [Commented] (MAHOUT-1217) Nearest neighbor searchers sometimes fail to remove points

2013-05-17 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13660675#comment-13660675 ] Suneel Marthi commented on MAHOUT-1217: --- Yes, Projection Search works fine. We were

[jira] [Commented] (MAHOUT-1216) Add locality sensitive hashing and a LocalitySensitiveHash searcher

2013-05-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13660542#comment-13660542 ] Hudson commented on MAHOUT-1216: Integrated in Mahout-Quality #2000 (See [https://builds

[jira] [Commented] (MAHOUT-1156) Adding nearest neighbor Searchers

2013-05-17 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1156?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13660543#comment-13660543 ] Hudson commented on MAHOUT-1156: Integrated in Mahout-Quality #2000 (See [https://builds

Jenkins build is back to normal : Mahout-Examples-Cluster-Reuters #307

2013-05-17 Thread Apache Jenkins Server
See

[jira] [Commented] (MAHOUT-1217) Nearest neighbor searchers sometimes fail to remove points

2013-05-17 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13660533#comment-13660533 ] Dan Filimon commented on MAHOUT-1217: - Possible fix: https://reviews.apache.org/r/112

[jira] [Resolved] (MAHOUT-1216) Add locality sensitive hashing and a LocalitySensitiveHash searcher

2013-05-17 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1216?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dan Filimon resolved MAHOUT-1216. - Resolution: Fixed Committed revision 1483702. > Add locality sensitive hashing

[jira] [Commented] (MAHOUT-1216) Add locality sensitive hashing and a LocalitySensitiveHash searcher

2013-05-17 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1216?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13660498#comment-13660498 ] Dan Filimon commented on MAHOUT-1216: - Committed revision 1483702. >

[jira] [Commented] (MAHOUT-1218) Streamimg k-means fails when the number of clusters specified is <= estimated map clusters

2013-05-17 Thread Ted Dunning (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13660478#comment-13660478 ] Ted Dunning commented on MAHOUT-1218: - Each mapper has to do a good sketch of its own

[jira] [Commented] (MAHOUT-1217) Nearest neighbor searchers sometimes fail to remove points

2013-05-17 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1217?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13660454#comment-13660454 ] Dan Filimon commented on MAHOUT-1217: - So ProjectionSearch is indeed working properly

Re: [jira] [Commented] (MAHOUT-1179) GSOC 2013: Refactor and improve the classification APIs

2013-05-17 Thread Ted Dunning
Please lay out a plan before coding. The key questions will be a) can you serialize a model efficiently? b) can you deal with the random forest and SGD models? c) what are the real changes to the API needed? On Thu, May 16, 2013 at 10:51 AM, Angel Martinez Gonzalez (JIRA) < j...@apache.org>

[jira] [Commented] (MAHOUT-1218) Streamimg k-means fails when the number of clusters specified is <= estimated map clusters

2013-05-17 Thread Dan Filimon (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1218?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13660434#comment-13660434 ] Dan Filimon commented on MAHOUT-1218: - This is still an issue I think. If k is the n