Re: Random Errors

2013-06-08 Thread Dawid Weiss
In fact the core of it is that people assume method order will be that of declaration within the class and this is not guaranteed anywhere. Java7 returns methods from reflection api in an undefined order and this propagates to junit. I believe surefire can be configured to use a junit runner that

[jira] [Updated] (MAHOUT-974) org.apache.mahout.cf.taste.hadoop.als.ParallelALSFactorizationJob use integer as userId and itemId

2013-06-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-974?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-974: -- Resolution: Fixed Status: Resolved (was: Patch Available) test issue is

Build failed in Jenkins: Mahout-Quality #2056

2013-06-08 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mahout-Quality/2056/changes Changes: [ssc] MAHOUT-974 org.apache.mahout.cf.taste.hadoop.als.ParallelALSFactorizationJob use integer as userId and itemId -- [...truncated 7177 lines...] at

[jira] [Commented] (MAHOUT-974) org.apache.mahout.cf.taste.hadoop.als.ParallelALSFactorizationJob use integer as userId and itemId

2013-06-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-974?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13678706#comment-13678706 ] Hudson commented on MAHOUT-974: --- Integrated in Mahout-Quality #2056 (See

[jira] [Resolved] (MAHOUT-1243) Dictionary file format in Lucene-Mahout integration is not in SequenceFileFormat

2013-06-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1243?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-1243. Resolution: Fixed Added new option seqDictOut that trigger writing of the

Jenkins build is back to normal : Mahout-Quality #2057

2013-06-08 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mahout-Quality/2057/changes

[jira] [Commented] (MAHOUT-1243) Dictionary file format in Lucene-Mahout integration is not in SequenceFileFormat

2013-06-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13678713#comment-13678713 ] Hudson commented on MAHOUT-1243: Integrated in Mahout-Quality #2057 (See

[DRAFT] 0.8 Release Announcement + Future Plans Discussion

2013-06-08 Thread Grant Ingersoll
Hi Mahouts, A full copy of proposed draft release notes are up at https://cwiki.apache.org/confluence/display/MAHOUT/Release+0.8. Please add/edit as appropriate. IN PARTICULAR, PLEASE PAY CLOSE ATTENTION TO THE SECTION LABELLED __FUTURE PLANS__, which I have included below. This is purely

[jira] [Created] (MAHOUT-1245) Move Website(s) to ASF CMS

2013-06-08 Thread Grant Ingersoll (JIRA)
Grant Ingersoll created MAHOUT-1245: --- Summary: Move Website(s) to ASF CMS Key: MAHOUT-1245 URL: https://issues.apache.org/jira/browse/MAHOUT-1245 Project: Mahout Issue Type: Task

[jira] [Commented] (MAHOUT-1241) Mailing list archives not available

2013-06-08 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13678725#comment-13678725 ] Grant Ingersoll commented on MAHOUT-1241: - I get: {quote} patch -p 0 -i

[jira] [Commented] (MAHOUT-1233) Problem in processing datasets as a single chunk vs many chunks in HADOOP mode in mostly all the clustering algos

2013-06-08 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13678726#comment-13678726 ] Grant Ingersoll commented on MAHOUT-1233: - Yannis, any chance you have a small

[jira] [Updated] (MAHOUT-1103) clusterpp is not writing directories for all clusters

2013-06-08 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll updated MAHOUT-1103: Attachment: MAHOUT-1103.patch Matt, can you check this iteration on your patch? That

[jira] [Work started] (MAHOUT-1084) Kmeans for synthetic control example--there are 12 cluster during iterations.

2013-06-08 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on MAHOUT-1084 started by Grant Ingersoll. Kmeans for synthetic control example--there are 12 cluster during iterations.

Re: 0.8 progress

2013-06-08 Thread Grant Ingersoll
We are down to 18 issues! Let's keep cranking. I'm working on 1103 and 1084 at the moment. On Jun 6, 2013, at 12:00 PM, Grant Ingersoll gsing...@apache.org wrote: On Jun 6, 2013, at 12:12 PM, Sebastian Schelter ssc.o...@googlemail.com wrote: Hi Grant, Here's my take: Will/Must be

[jira] [Commented] (MAHOUT-1084) Kmeans for synthetic control example--there are 12 cluster during iterations.

2013-06-08 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13678728#comment-13678728 ] Grant Ingersoll commented on MAHOUT-1084: - I confirm there is something wrong

[jira] [Commented] (MAHOUT-1084) Kmeans for synthetic control example--there are 12 cluster during iterations.

2013-06-08 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13678733#comment-13678733 ] Grant Ingersoll commented on MAHOUT-1084: - liutengfei is right in that it is

[jira] [Resolved] (MAHOUT-1084) Kmeans for synthetic control example--there are 12 cluster during iterations.

2013-06-08 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Grant Ingersoll resolved MAHOUT-1084. - Resolution: Fixed Thanks liutengfei! Kmeans for synthetic control

[jira] [Commented] (MAHOUT-1233) Problem in processing datasets as a single chunk vs many chunks in HADOOP mode in mostly all the clustering algos

2013-06-08 Thread yannis ats (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1233?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13678738#comment-13678738 ] yannis ats commented on MAHOUT-1233: for some strange reason i obtain errors in

Build failed in Jenkins: Mahout-Examples-Cluster-Reuters #348

2013-06-08 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mahout-Examples-Cluster-Reuters/348/changes Changes: [gsingers] MAHOUT-1084: fix random seed issue in synthetic control k-means clustering -- [...truncated 1928 lines...] INFO: Task:attempt_local_0009_m_00_0 is done.

Re: Random Errors

2013-06-08 Thread Sean Owen
I would more readily expect that the dependency is due to the random number generators. Where a test depends on a particular sequence, and somewhere an RNG doesn't use the RandomUtils trick, it may have a different state if other tests ran before. The surefire parameter just controls what order

Re: Random Errors

2013-06-08 Thread Dawid Weiss
number generators. Where a test depends on a particular sequence, and somewhere an RNG doesn't use the RandomUtils trick, it may have a different state if other tests ran before. I have a different solution for this in randomizedtesting framework (a Random instance cannot be shared from test

Re: [DRAFT] 0.8 Release Announcement + Future Plans Discussion

2013-06-08 Thread Ravi Mummulla
Hi Grant, Regarding 1.0 plans, do we also want to include a note on adding tests where they don't exist or improving them where needed or is that implicit? Thanks. On Sat, Jun 8, 2013 at 3:55 AM, Grant Ingersoll gsing...@apache.org wrote: Hi Mahouts, A full copy of proposed draft release

Re: [DRAFT] 0.8 Release Announcement + Future Plans Discussion

2013-06-08 Thread Grant Ingersoll
More tests are always welcome. On Jun 8, 2013, at 10:29 AM, Ravi Mummulla ravi.mummu...@gmail.com wrote: Hi Grant, Regarding 1.0 plans, do we also want to include a note on adding tests where they don't exist or improving them where needed or is that implicit? Thanks. On Sat, Jun 8,

Re: 0.8 progress

2013-06-08 Thread Grant Ingersoll
Hmm, JIRA seems to be down... 1084 is in. I'm pretty close to being done on 1103. I'm on #mahout on Freenode if anyone wants to coordinate, and will be there for the next 1 hour or so. On Jun 8, 2013, at 7:21 AM, Grant Ingersoll gsing...@apache.org wrote: We are down to 18 issues! Let's

Maven and Mac Jars

2013-06-08 Thread Grant Ingersoll
Can someone who is a whiz in Maven take a look at https://issues.apache.org/jira/browse/MAHOUT-1126 ? It is definitely reproducible. -Grant

[jira] [Commented] (MAHOUT-1126) Mac builds won't unjar

2013-06-08 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13678751#comment-13678751 ] Grant Ingersoll commented on MAHOUT-1126: - OK, I see the issue more clearly now

[jira] [Resolved] (MAHOUT-1241) Mailing list archives not available

2013-06-08 Thread Robin Anil (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1241?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robin Anil resolved MAHOUT-1241. Resolution: Fixed Mailing list archives not available ---

[jira] [Resolved] (MAHOUT-1245) Move Website(s) to ASF CMS

2013-06-08 Thread Robin Anil (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1245?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robin Anil resolved MAHOUT-1245. Resolution: Not A Problem Assignee: Robin Anil Move Website(s) to ASF CMS

Re: Maven and Mac Jars

2013-06-08 Thread Grant Ingersoll
I think I figured it out. On Jun 8, 2013, at 11:22 AM, Grant Ingersoll gsing...@apache.org wrote: Can someone who is a whiz in Maven take a look at https://issues.apache.org/jira/browse/MAHOUT-1126 ? It is definitely reproducible. -Grant

Re: Maven and Mac Jars

2013-06-08 Thread Robin Anil
Good job! Robin Anil | Software Engineer | +1 312 869 2602 | Google Inc. On Sat, Jun 8, 2013 at 10:36 AM, Grant Ingersoll gsing...@apache.orgwrote: I think I figured it out. On Jun 8, 2013, at 11:22 AM, Grant Ingersoll gsing...@apache.org wrote: Can someone who is a whiz in Maven take a

[jira] [Work started] (MAHOUT-1103) clusterpp is not writing directories for all clusters

2013-06-08 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1103?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on MAHOUT-1103 started by Grant Ingersoll. clusterpp is not writing directories for all clusters -

[jira] [Work started] (MAHOUT-1126) Mac builds won't unjar

2013-06-08 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1126?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Work on MAHOUT-1126 started by Grant Ingersoll. Mac builds won't unjar -- Key: MAHOUT-1126 URL:

[jira] [Commented] (MAHOUT-1103) clusterpp is not writing directories for all clusters

2013-06-08 Thread Grant Ingersoll (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1103?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13678753#comment-13678753 ] Grant Ingersoll commented on MAHOUT-1103: - The MapReduce portion of this will

[jira] [Updated] (MAHOUT-1026) Add LDA (CVB implementation) to the cluster_reuters.sh example script

2013-06-08 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1026?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi updated MAHOUT-1026: -- Resolution: Fixed Status: Resolved (was: Patch Available) Add LDA (CVB

Re: Work on ALS for future releases

2013-06-08 Thread Sebastian Schelter
Hi Saikat, Great that you want to work on the ALS code. I think it is very important to make it easier to use, ideally no knowledge of the papers and formulas should be necessary. As you know, the ALS code has a hyperparameter lambda that needs to be tuned in order to get a good factorization.

[jira] [Commented] (MAHOUT-1026) Add LDA (CVB implementation) to the cluster_reuters.sh example script

2013-06-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1026?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13678778#comment-13678778 ] Hudson commented on MAHOUT-1026: Integrated in Mahout-Quality #2059 (See

Jenkins build is back to normal : Mahout-Examples-Cluster-Reuters #349

2013-06-08 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mahout-Examples-Cluster-Reuters/349/changes

Build failed in Jenkins: Mahout-Quality #2059

2013-06-08 Thread Apache Jenkins Server
See https://builds.apache.org/job/Mahout-Quality/2059/changes Changes: [smarthi] MAHOUT-1243: Dictionary file format in Lucene-Mahout integration is not in SequenceFileFormat - removed deprecated calls to Closeables.closeQuietly() [smarthi] MAHOUT-1026: Add LDA (CVB implementation) to the

[jira] [Commented] (MAHOUT-1243) Dictionary file format in Lucene-Mahout integration is not in SequenceFileFormat

2013-06-08 Thread Hudson (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1243?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13678777#comment-13678777 ] Hudson commented on MAHOUT-1243: Integrated in Mahout-Quality #2059 (See

[jira] [Created] (MAHOUT-1246) Bring Mahout website from the 2000s to 2010s.

2013-06-08 Thread Robin Anil (JIRA)
Robin Anil created MAHOUT-1246: -- Summary: Bring Mahout website from the 2000s to 2010s. Key: MAHOUT-1246 URL: https://issues.apache.org/jira/browse/MAHOUT-1246 Project: Mahout Issue Type: Bug

Re: Work on ALS for future releases

2013-06-08 Thread Sean Owen
PS you might find it useful to pinch a bit of this code to implement convergence checking and grid search... not quite the same code base but maps quite directly:

[jira] [Assigned] (MAHOUT-833) Make conversion to sequence files map-reduce

2013-06-08 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-833?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi reassigned MAHOUT-833: Assignee: Suneel Marthi (was: Josh Patterson) Make conversion to sequence files

Re: [DRAFT] 0.8 Release Announcement + Future Plans Discussion

2013-06-08 Thread Sebastian Schelter
Hi Grant, Very good release announcement. I propose that we deprecate a lot more, I think we should be aggressive here to pave the way for a clean and slim 1.0 release. I propose to additionally deprecate the following algorithms, as to my state of knowledge, they are not actively used:

Adding twitter widget to the home page

2013-06-08 Thread Robin Anil
Anyone oppose to me adding the twitter widget to out home page, seems like lot of action over there and could feel more welcoming to users?

[jira] [Updated] (MAHOUT-884) Matrix Concatenate utility

2013-06-08 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi updated MAHOUT-884: - Affects Version/s: 0.7 Status: Patch Available (was: Open) Matrix

[jira] [Updated] (MAHOUT-884) Matrix Concatenate utility

2013-06-08 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-884?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi updated MAHOUT-884: - Attachment: MAHOUT-884.patch Matrix Concatenate utility --

Re: Adding twitter widget to the home page

2013-06-08 Thread Sebastian Schelter
I think its a great idea. On 08.06.2013 19:26, Robin Anil wrote: Anyone oppose to me adding the twitter widget to out home page, seems like lot of action over there and could feel more welcoming to users?

Re: Adding twitter widget to the home page

2013-06-08 Thread Grant Ingersoll
Awesome! On Jun 8, 2013, at 1:26 PM, Robin Anil robin.a...@gmail.com wrote: Anyone oppose to me adding the twitter widget to out home page, seems like lot of action over there and could feel more welcoming to users? Grant Ingersoll | @gsingers

Re: 0.8 progress

2013-06-08 Thread Suneel Marthi
I am done with M-1026. From: Grant Ingersoll gsing...@apache.org To: dev@mahout.apache.org Sent: Saturday, June 8, 2013 10:42 AM Subject: Re: 0.8 progress Hmm, JIRA seems to be down... 1084 is in.  I'm pretty close to being done on 1103. I'm on #mahout

[jira] [Commented] (MAHOUT-884) Matrix Concatenate utility

2013-06-08 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-884?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13678783#comment-13678783 ] Suneel Marthi commented on MAHOUT-884: -- Modified patch to be compatible with present

Re: [DRAFT] 0.8 Release Announcement + Future Plans Discussion

2013-06-08 Thread Grant Ingersoll
On Jun 8, 2013, at 1:26 PM, Sebastian Schelter s...@apache.org wrote: Hi Grant, Very good release announcement. I propose that we deprecate a lot more, I think we should be aggressive here to pave the way for a clean and slim 1.0 release. I propose to additionally deprecate the

Re: [DRAFT] 0.8 Release Announcement + Future Plans Discussion

2013-06-08 Thread Shannon Quinn
Clustering - Fuzzy k-Means o.a.m.clustering.fuzzykmeans - Spectral k-Means in o.a.m.clustering.spectral -1 on spectral being dropped as that seems to receive decent traction. Agreed, given recent activity in particular. However I would put forth deprecating Eigencuts

Re: [DRAFT] 0.8 Release Announcement + Future Plans Discussion

2013-06-08 Thread Shannon Quinn
Sorry, that's o.a.m.clustering.spectral.eigencuts. Then move the .kmeans package to simply be o.a.m.clustering.spectral . On 6/8/13 1:37 PM, Shannon Quinn wrote: Clustering - Fuzzy k-Means o.a.m.clustering.fuzzykmeans - Spectral k-Means in o.a.m.clustering.spectral -1 on spectral being

[jira] [Commented] (MAHOUT-1030) Regression: Clustered Points Should be WeightedPropertyVectorWritable not WeightedVectorWritable

2013-06-08 Thread Suneel Marthi (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13678785#comment-13678785 ] Suneel Marthi commented on MAHOUT-1030: --- [~pferrel]Did you have a chance to try the

Re: [DRAFT] 0.8 Release Announcement + Future Plans Discussion

2013-06-08 Thread Sean Owen
I agree with deprecating all of that FWIW. On Sat, Jun 8, 2013 at 6:33 PM, Grant Ingersoll gsing...@apache.org wrote: Collaborative Filtering: - all recommenders in o.a.m.cf.taste.impl.recommender.knn - the TreeClusteringRecommender in o.a.m.cf.taste.impl.recommender - the SlopeOne

Re: Adding twitter widget to the home page

2013-06-08 Thread Robin Anil
Alright its done. If there is opposition I can remove it. its all in svn. Robin Anil | Software Engineer | +1 312 869 2602 | Google Inc. On Sat, Jun 8, 2013 at 12:29 PM, Grant Ingersoll gsing...@apache.orgwrote: Awesome! On Jun 8, 2013, at 1:26 PM, Robin Anil robin.a...@gmail.com wrote:

[jira] [Resolved] (MAHOUT-1246) Bring Mahout website from the 2000s to 2010s.

2013-06-08 Thread Robin Anil (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robin Anil resolved MAHOUT-1246. Resolution: Fixed Done! Humanist fonts, widescreen, twitter, CSS. Bring Mahout

Re: Work on ALS for future releases

2013-06-08 Thread Saikat Kanjilal
Sebastian/Sean, Thanks for your responses, first of I am more familiar with cross validation than grid search, regardless I'll go ahead and jira up tasks for building tools to support both grid search and training error checking. What about the clustering sub-section of the code, is there any

Re: [DRAFT] 0.8 Release Announcement + Future Plans Discussion

2013-06-08 Thread Suneel Marthi
Under Release Highlights, please also add: a) Dan's Streaming kmeans clustering. b) Mahout upgrade to be Lucene 4.3.0 compatible (both of the above deserve special mentions along with lucene2seq and vector/matrix performance improvements). From: Grant

[jira] [Updated] (MAHOUT-1192) Speed up Vector Operations

2013-06-08 Thread Robin Anil (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1192?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robin Anil updated MAHOUT-1192: --- Fix Version/s: 0.8 Speed up Vector Operations --

[jira] [Updated] (MAHOUT-1191) Cleanup Vector Benchmarks make it less variable

2013-06-08 Thread Robin Anil (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1191?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robin Anil updated MAHOUT-1191: --- Fix Version/s: 0.8 Cleanup Vector Benchmarks make it less variable

[jira] [Updated] (MAHOUT-1246) Bring Mahout website from the 2000s to 2010s.

2013-06-08 Thread Robin Anil (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1246?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Robin Anil updated MAHOUT-1246: --- Fix Version/s: 0.8 Bring Mahout website from the 2000s to 2010s.

Re: 0.8 progress

2013-06-08 Thread Robin Anil
Down to 15. Robin Anil | Software Engineer | +1 312 869 2602 | Google Inc. On Sat, Jun 8, 2013 at 12:30 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: I am done with M-1026. From: Grant Ingersoll gsing...@apache.org To: dev@mahout.apache.org Sent:

[jira] [Commented] (MAHOUT-1030) Regression: Clustered Points Should be WeightedPropertyVectorWritable not WeightedVectorWritable

2013-06-08 Thread Pat Ferrel (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13678847#comment-13678847 ] Pat Ferrel commented on MAHOUT-1030: +1 to Lance's point. My point of 01/Jul/12 was

Re: Work on ALS for future releases

2013-06-08 Thread Sean Owen
Grid search to me is just trying all combinations of values for different parameters, and you try them with a cross-validation set. They aren't alternatives. I don't have any knowledge of clustering-related items. On Jun 8, 2013 7:42 PM, Saikat Kanjilal sxk1...@hotmail.com wrote: Sebastian/Sean,

Re: [DRAFT] 0.8 Release Announcement + Future Plans Discussion

2013-06-08 Thread Grant Ingersoll
Yes, please edit the Wiki directly with the highlights! On Jun 8, 2013, at 3:21 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Under Release Highlights, please also add: a) Dan's Streaming kmeans clustering. b) Mahout upgrade to be Lucene 4.3.0 compatible (both of the above deserve

Re: 0.8 progress

2013-06-08 Thread Grant Ingersoll
I've got 1103 and 1126 close to done. Should be in by tomorrow. On Jun 8, 2013, at 4:18 PM, Robin Anil robin.a...@gmail.com wrote: Down to 15. Robin Anil | Software Engineer | +1 312 869 2602 | Google Inc. On Sat, Jun 8, 2013 at 12:30 PM, Suneel Marthi suneel_mar...@yahoo.comwrote:

Re: 0.8 progress

2013-06-08 Thread Suneel Marthi
I will be looking at M-833 and M-1030 tonight. I can get the initial limited functionality for M-884 as part of 0.8 release by tomorrow. Thanks to Robin for reviewing. From: Grant Ingersoll gsing...@apache.org To: dev@mahout.apache.org Sent: Saturday,

Re: Work on ALS for future releases

2013-06-08 Thread Ted Dunning
If there is only one parameter to optimize, then line search is an easy answer. If there are more than one parameter, then the EvolutionarySearch that we already have can work or any of the many optimization methods from commons.math would apply. ES is better if the parameter space is

Re: [DRAFT] 0.8 Release Announcement + Future Plans Discussion

2013-06-08 Thread Ted Dunning
Actually this stats stuff is definitely used in application code (of mine if not others). The OnlineSummarizer has 20 usages throughout Mahout. On Sat, Jun 8, 2013 at 11:08 PM, Grant Ingersoll gsing...@apache.orgwrote: Yes, please edit the Wiki directly with the highlights! On Jun 8, 2013,

Jenkins build is back to normal : mahout-nightly » Mahout Integration #1256

2013-06-08 Thread Apache Jenkins Server
See https://builds.apache.org/job/mahout-nightly/org.apache.mahout$mahout-integration/1256/changes

Jenkins build is back to normal : mahout-nightly #1256

2013-06-08 Thread Apache Jenkins Server
See https://builds.apache.org/job/mahout-nightly/1256/changes

Re: 0.8 progress

2013-06-08 Thread Suneel Marthi
Working on M-833. From: Suneel Marthi suneel_mar...@yahoo.com To: dev@mahout.apache.org dev@mahout.apache.org Sent: Saturday, June 8, 2013 6:09 PM Subject: Re: 0.8 progress I will be looking at M-833 and M-1030 tonight. I can get the initial limited