[jira] [Created] (MAHOUT-1482) Rework quickstart website

2014-03-22 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1482: -- Summary: Rework quickstart website Key: MAHOUT-1482 URL: https://issues.apache.org/jira/browse/MAHOUT-1482 Project: Mahout Issue Type

[jira] [Created] (MAHOUT-1481) Clean up website on breiman example

2014-03-22 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1481: -- Summary: Clean up website on breiman example Key: MAHOUT-1481 URL: https://issues.apache.org/jira/browse/MAHOUT-1481 Project: Mahout Issue Type

[jira] [Created] (MAHOUT-1480) Clean up website on 20 newsgroups

2014-03-22 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1480: -- Summary: Clean up website on 20 newsgroups Key: MAHOUT-1480 URL: https://issues.apache.org/jira/browse/MAHOUT-1480 Project: Mahout Issue Type

[jira] [Created] (MAHOUT-1479) Cleanup website on wikipedia example

2014-03-22 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1479: -- Summary: Cleanup website on wikipedia example Key: MAHOUT-1479 URL: https://issues.apache.org/jira/browse/MAHOUT-1479 Project: Mahout Issue Type

[jira] [Created] (MAHOUT-1478) Clean up website on Random Forests

2014-03-22 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1478: -- Summary: Clean up website on Random Forests Key: MAHOUT-1478 URL: https://issues.apache.org/jira/browse/MAHOUT-1478 Project: Mahout Issue Type

[jira] [Created] (MAHOUT-1477) Clean up website on Logistic Regression

2014-03-22 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1477: -- Summary: Clean up website on Logistic Regression Key: MAHOUT-1477 URL: https://issues.apache.org/jira/browse/MAHOUT-1477 Project: Mahout Issue

[jira] [Created] (MAHOUT-1475) Clean up website on Naive Bayes

2014-03-22 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1475: -- Summary: Clean up website on Naive Bayes Key: MAHOUT-1475 URL: https://issues.apache.org/jira/browse/MAHOUT-1475 Project: Mahout Issue Type

[jira] [Created] (MAHOUT-1476) Cleanup website on Hidden Markov Models

2014-03-22 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1476: -- Summary: Cleanup website on Hidden Markov Models Key: MAHOUT-1476 URL: https://issues.apache.org/jira/browse/MAHOUT-1476 Project: Mahout Issue

[jira] [Created] (MAHOUT-1474) Add Seinfeld clustering example

2014-03-22 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1474: -- Summary: Add Seinfeld clustering example Key: MAHOUT-1474 URL: https://issues.apache.org/jira/browse/MAHOUT-1474 Project: Mahout Issue Type

[jira] [Updated] (MAHOUT-1472) Cleanup website on Fuzzy k-Means

2014-03-22 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-1472: --- Description: The website on fuzzy k-Means needs clean up. We need to go through the

[jira] [Updated] (MAHOUT-1471) Cleanup website on Canopy Clustering

2014-03-22 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1471?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-1471: --- Description: The websites on Canopy clustering needs clean up. We need to go

[jira] [Updated] (MAHOUT-1472) Cleanup website on Fuzzy k-Means

2014-03-22 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1472?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-1472: --- Description: The websites on fuzzy k-Means needs clean up. We need to go through

[jira] [Created] (MAHOUT-1472) Cleanup website on Fuzzy k-Means

2014-03-22 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1472: -- Summary: Cleanup website on Fuzzy k-Means Key: MAHOUT-1472 URL: https://issues.apache.org/jira/browse/MAHOUT-1472 Project: Mahout Issue Type

[jira] [Created] (MAHOUT-1473) Cleanup website on Spectral Clustering

2014-03-22 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1473: -- Summary: Cleanup website on Spectral Clustering Key: MAHOUT-1473 URL: https://issues.apache.org/jira/browse/MAHOUT-1473 Project: Mahout Issue

[jira] [Created] (MAHOUT-1471) Cleanup website on Canopy Clustering

2014-03-22 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1471: -- Summary: Cleanup website on Canopy Clustering Key: MAHOUT-1471 URL: https://issues.apache.org/jira/browse/MAHOUT-1471 Project: Mahout Issue Type

[jira] [Resolved] (MAHOUT-1248) Build tools around mahout to use grid search with cross validation to tune hyperparameter lambda

2014-03-21 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1248?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-1248. Resolution: Won't Fix Assignee: Sebastian Schelter Dismissing th

[jira] [Commented] (MAHOUT-1425) SGD classifier example with bank marketing dataset

2014-03-21 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1425?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13943672#comment-13943672 ] Sebastian Schelter commented on MAHOUT-1425: Would be awesome to a

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-20 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13941521#comment-13941521 ] Sebastian Schelter commented on MAHOUT-1464: one possibility would b

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-20 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13941516#comment-13941516 ] Sebastian Schelter commented on MAHOUT-1464: In a SparseRowMatrix, thi

[jira] [Comment Edited] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-20 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13941500#comment-13941500 ] Sebastian Schelter edited comment on MAHOUT-1464 at 3/20/14 7:5

[jira] [Updated] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-20 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-1464: --- Attachment: MAHOUT-1464.patch replaced viewRow(i) calls with nicer looking (i

Re: Plan for 1.0

2014-03-19 Thread Sebastian Schelter
13 AM, "Dmitriy Lyubimov" wrote: i am on vacation, so most of the pacific daylight ranges on any day should work for me. On Wed, Mar 19, 2014 at 12:07 AM, Sebastian Schelter wrote: Friday would also work for me. On 03/19/2014 08:05 AM, Suneel Marthi wrote: Same here, travel next week a

Re: Plan for 1.0

2014-03-19 Thread Sebastian Schelter
day should work for me. On Wed, Mar 19, 2014 at 12:07 AM, Sebastian Schelter wrote: Friday would also work for me. On 03/19/2014 08:05 AM, Suneel Marthi wrote: Same here, travel next week and in Amsterdam the first week of April. I avoid Sundays or weekends for obvious reasons. How bout thi

Re: [GSOC 2014] Uniform API for Mahout Clustering

2014-03-19 Thread Sebastian Schelter
It's not about directly porting algorithms to Spark, its about porting them to a DSL that executes on top of Spark. This page has information about it: https://mahout.apache.org/users/sparkbindings/home.html --sebastian On 03/19/2014 08:43 AM, chalitha udara Perera wrote: Thanks a lot everyo

Re: Plan for 1.0

2014-03-19 Thread Sebastian Schelter
Friday would also work for me. On 03/19/2014 08:05 AM, Suneel Marthi wrote: Same here, travel next week and in Amsterdam the first week of April. I avoid Sundays or weekends for obvious reasons. How bout this Friday? Sent from my iPhone On Mar 19, 2014, at 3:02 AM, Sebastian Schelter

Re: Plan for 1.0

2014-03-19 Thread Sebastian Schelter
ve it? Mondays and Wednesdays don't work for me. Would Tuesdays 6pm Eastern Time work ? On Wednesday, March 19, 2014 2:45 AM, Sebastian Schelter wrote: Hi Saikat, 1) I think that Mahout-1248 and 1249 are still very important features that I would love to see in the codebase as they wo

Re: [GSOC 2014] Uniform API for Mahout Clustering

2014-03-19 Thread Sebastian Schelter
I think it would be great to port our kMeans implementation to Spark. It should be done by using Dmitriy's DSL similar to what I'm trying in https://issues.apache.org/jira/browse/MAHOUT-1464 On 03/19/2014 07:56 AM, chalitha udara Perera wrote: Hi Dmitriy, I agree with you that i need to be m

Re: Plan for 1.0

2014-03-18 Thread Sebastian Schelter
Hi Saikat, 1) I think that Mahout-1248 and 1249 are still very important features that I would love to see in the codebase as they would highly improve the usability of our ALS code. 2) I think the last discussion item regarding h2o was to find a way to compare it against existing or spark r

[jira] [Commented] (MAHOUT-1365) Weighted ALS-WR iterator for Spark

2014-03-18 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1365?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13939019#comment-13939019 ] Sebastian Schelter commented on MAHOUT-1365: would be awesome to have

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13938958#comment-13938958 ] Sebastian Schelter commented on MAHOUT-1464: The physical operator for

[jira] [Updated] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-18 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-1464: --- Attachment: MAHOUT-1464.patch Updated patch to match the coding conventions and use

[jira] [Updated] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-1464: --- Attachment: MAHOUT-1464.patch Luckily, Dmitriy's latest commit solved most

[jira] [Updated] (MAHOUT-1466) Cluster visualization fails to execute

2014-03-17 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1466?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-1466: --- Resolution: Fixed Status: Resolved (was: Patch Available) > Clus

[jira] [Comment Edited] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13938259#comment-13938259 ] Sebastian Schelter edited comment on MAHOUT-1464 at 3/17/14 7:3

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13938259#comment-13938259 ] Sebastian Schelter commented on MAHOUT-1464: I'd like to rework my

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13938254#comment-13938254 ] Sebastian Schelter commented on MAHOUT-1464: I havent tested Spark on Ha

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-17 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13937987#comment-13937987 ] Sebastian Schelter commented on MAHOUT-1464: @Pat I'm pretty busy

[jira] [Reopened] (MAHOUT-1461) The tour

2014-03-17 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter reopened MAHOUT-1461: > The tour > > > Key: MAHOUT-1461 >

[jira] [Commented] (MAHOUT-1461) The tour

2014-03-17 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13937979#comment-13937979 ] Sebastian Schelter commented on MAHOUT-1461: Could you create a versio

[jira] [Created] (MAHOUT-1466) Cluster visualization fails to execute

2014-03-16 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1466: -- Summary: Cluster visualization fails to execute Key: MAHOUT-1466 URL: https://issues.apache.org/jira/browse/MAHOUT-1466 Project: Mahout Issue

[jira] [Commented] (MAHOUT-1464) RowSimilarityJob on Spark

2014-03-16 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1464?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13937490#comment-13937490 ] Sebastian Schelter commented on MAHOUT-1464: I've started to wor

[jira] [Resolved] (MAHOUT-1436) Missing pages need to be migrated over from old CMS site

2014-03-16 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-1436. Resolution: Won't Fix > Missing pages need to be migrated over from old

[jira] [Commented] (MAHOUT-1461) The tour

2014-03-16 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13937488#comment-13937488 ] Sebastian Schelter commented on MAHOUT-1461: Ok, but we can ignore this

[jira] [Resolved] (MAHOUT-1437) Remove all links to wiki pages from the website

2014-03-16 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1437?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-1437. Resolution: Fixed Took a tour of the site and couldn't find any more links t

[jira] [Resolved] (MAHOUT-1461) The tour

2014-03-16 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1461?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-1461. Resolution: Won't Fix Assignee: Sebastian Schelter Hi Scott, Please

Re: 0xdata interested in contributing

2014-03-14 Thread Sebastian Schelter
ngle digit factor anyway. On Fri, Mar 14, 2014 at 10:14 AM, Pat Ferrel wrote: Isn't there some work on RSJ on Spark? Can we compare that to something 0xdata can "knock off"? On Mar 14, 2014, at 10:08 AM, Sebastian Schelter wrote: Dmitriy, I share a lot your concerns ex

Re: 0xdata interested in contributing

2014-03-14 Thread Sebastian Schelter
100x faster then that’s compelling and there is a concrete reason to reset. I’d still want the Spark stuff, who knows when the alternative would be ready even if it is faster. On Mar 14, 2014, at 10:16 AM, Sebastian Schelter wrote: I did a port recently. It doesn't use Dmitriy's DSL

Re: 0xdata interested in contributing

2014-03-14 Thread Sebastian Schelter
faster then that’s compelling and there is a concrete reason to reset. I’d still want the Spark stuff, who knows when the alternative would be ready even if it is faster. On Mar 14, 2014, at 10:16 AM, Sebastian Schelter wrote: I did a port recently. It doesn't use Dmitriy's DSL, it

Re: 0xdata interested in contributing

2014-03-14 Thread Sebastian Schelter
014, at 10:08 AM, Sebastian Schelter wrote: Dmitriy, I share a lot your concerns expressed here. I hear more complaints about Mahout being too inaccessible and too hard to customize for use cases and inputs more than complaints about it being too slow. I also concur with your analysis that the

Re: 0xdata interested in contributing

2014-03-14 Thread Sebastian Schelter
Dmitriy, I share a lot your concerns expressed here. I hear more complaints about Mahout being too inaccessible and too hard to customize for use cases and inputs more than complaints about it being too slow. I also concur with your analysis that the clear and accessible programming model is

Re: 0xdata interested in contributing

2014-03-14 Thread Sebastian Schelter
Hi, to me one problem is that a couldn't find documentation that gives a comprehensive picture of the programming and execution model of h2o. I'd like to get answers to the following questions: Which operators does it offer, how those are combined to create programs and how are those program

Re: 0xdata interested in contributing

2014-03-13 Thread Sebastian Schelter
On 03/13/2014 09:49 PM, Ted Dunning wrote: > >(4) Couple days of work to throw in Stratosphere primitives. > Likewise. If the stratosphere community would like to step up to help with this, I would champion that contribution as well. I'm sure this is well received in the Stratosphere commun

[jira] [Commented] (MAHOUT-1453) ImplicitFeedbackAlternatingLeastSquaresSolver add support for user supplied confidence functions

2014-03-13 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13933512#comment-13933512 ] Sebastian Schelter commented on MAHOUT-1453: Oh, I meant the first one

[jira] [Commented] (MAHOUT-1453) ImplicitFeedbackAlternatingLeastSquaresSolver add support for user supplied confidence functions

2014-03-13 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13933497#comment-13933497 ] Sebastian Schelter commented on MAHOUT-1453: I think your idea is

[jira] [Commented] (MAHOUT-1450) Cleaning up k-means documentation on mahout website

2014-03-13 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13933419#comment-13933419 ] Sebastian Schelter commented on MAHOUT-1450: Whats the status

[jira] [Commented] (MAHOUT-1454) Remove the seinfeld clustering example from the website

2014-03-13 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1454?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13933408#comment-13933408 ] Sebastian Schelter commented on MAHOUT-1454: [~frankscholten] do you

[jira] [Commented] (MAHOUT-1453) ImplicitFeedbackAlternatingLeastSquaresSolver add support for user supplied confidence functions

2014-03-13 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1453?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13933394#comment-13933394 ] Sebastian Schelter commented on MAHOUT-1453: Good point, Adam. Loo

[jira] [Resolved] (MAHOUT-1451) Cleaning up the examples for clustering on the website

2014-03-13 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1451?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-1451. Resolution: Fixed Assignee: Sebastian Schelter > Cleaning up the examp

[jira] [Commented] (MAHOUT-1451) Cleaning up the examples for clustering on the website

2014-03-13 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1451?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13933148#comment-13933148 ] Sebastian Schelter commented on MAHOUT-1451: Thank you. I simplied

Re: 0xdata interested in contributing

2014-03-13 Thread Sebastian Schelter
Hi, Lots of questions from my side here. @Ted At first a comment about your point that dataflow systems cannot do efficient in-memory mutable storage: Stratosphere's delta-iterate operator [1] supports iterative dataflows where the solution is held in an in-memory index and updated in every

[jira] [Updated] (MAHOUT-1450) Cleaning up k-means documentation on mahout website

2014-03-12 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1450?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-1450: --- Fix Version/s: (was: collections-1.0) 1.0 > Cleaning u

Re: Website, urgent help needed

2014-03-12 Thread Sebastian Schelter
can start looking on the page myself. Manoj On Wed, Mar 12, 2014 at 12:33 PM, Sebastian Schelter wrote: Hi, As you've probably noticed, I've put in a lot of effort over the last days to kickstart cleaning up our website. I've thrown out a lot of stuff and have been startled by the

Website, urgent help needed

2014-03-12 Thread Sebastian Schelter
Hi, As you've probably noticed, I've put in a lot of effort over the last days to kickstart cleaning up our website. I've thrown out a lot of stuff and have been startled by the amout of outdated and incorrect information on our website, as well as links pointing to nowhere. I think our lack

[jira] [Updated] (MAHOUT-1448) In Random Forest, the training does not support multiple input files. The input dataset must be one single file.

2014-03-11 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1448?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-1448: --- Resolution: Fixed Assignee: Sebastian Schelter Status: Resolved (was

Re: [jira] [Commented] (MAHOUT-1447) ImplicitFeedbackAlternatingLeastSquaresSolver tests and features

2014-03-11 Thread Sebastian Schelter
HOUT-1447 Project: Mahout Issue Type: Improvement Components: Collaborative Filtering Affects Versions: 0.9 Reporter: Adam Ilardi Assignee: Sebastian Schelter Priority: Minor Labels: newbie, patch, performance

[jira] [Commented] (MAHOUT-1447) ImplicitFeedbackAlternatingLeastSquaresSolver tests and features

2014-03-11 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13930725#comment-13930725 ] Sebastian Schelter commented on MAHOUT-1447: Its clear know, I didn&#

[jira] [Resolved] (MAHOUT-1447) ImplicitFeedbackAlternatingLeastSquaresSolver tests and features

2014-03-11 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-1447. Resolution: Fixed > ImplicitFeedbackAlternatingLeastSquaresSolver tests

[jira] [Commented] (MAHOUT-1447) ImplicitFeedbackAlternatingLeastSquaresSolver tests and features

2014-03-11 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1447?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13930503#comment-13930503 ] Sebastian Schelter commented on MAHOUT-1447: Hi Adam, yes the order of

Re: How to start

2014-03-10 Thread Sebastian Schelter
? Would appreciate any help. Thanks! *Gaurav Misra* On Sat, Mar 8, 2014 at 6:51 AM, Sebastian Schelter wrote: Great, you could create a ticket at https://issues.apache.org/ jira/browse/MAHOUT to track this? Best, Sebastian On 03/08/2014 12:41 PM, Maciej Mazur wrote: OK, thanks. I'll

[jira] [Created] (MAHOUT-1443) Update "How to release page"

2014-03-09 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1443: -- Summary: Update "How to release page" Key: MAHOUT-1443 URL: https://issues.apache.org/jira/browse/MAHOUT-1443 Project: Mahout Issue

[jira] [Commented] (MAHOUT-1441) Add documentation for Spectral KMeans to Mahout Website

2014-03-09 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1441?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13925234#comment-13925234 ] Sebastian Schelter commented on MAHOUT-1441: Yexi, could you crea

[jira] [Resolved] (MAHOUT-1438) "quickstart" tutorial for building a simple recommender

2014-03-09 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-1438. Resolution: Fixed Thank you. The article is now online at https

[jira] [Commented] (MAHOUT-1438) "quickstart" tutorial for building a simple recommender

2014-03-09 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1438?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13925207#comment-13925207 ] Sebastian Schelter commented on MAHOUT-1438: The tutorial looks very

How To Release page

2014-03-09 Thread Sebastian Schelter
Hi Suneel, I have a favor to ask. Could you have a look at the "How To Release" page and tell if the information there is still correct? I'm asking you this because you have done the latest release. After your OK, I'll go and improve formatting and readability of that page. Best, Sebastian

[jira] [Reopened] (MAHOUT-1438) "quickstart" tutorial for building a simple recommender

2014-03-09 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter reopened MAHOUT-1438: reopening this, because it was opened by Maciej not Steve, so I guess Maciej also

[jira] [Resolved] (MAHOUT-1438) "quickstart" tutorial for building a simple recommender

2014-03-09 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1438?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-1438. Resolution: Fixed Fix Version/s: 1.0 Great tutorial, I put it onto a new

[jira] [Created] (MAHOUT-1439) Update talks on Mahout

2014-03-08 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1439: -- Summary: Update talks on Mahout Key: MAHOUT-1439 URL: https://issues.apache.org/jira/browse/MAHOUT-1439 Project: Mahout Issue Type: Bug

[jira] [Created] (MAHOUT-1437) Remove all links to wiki pages from the website

2014-03-08 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1437: -- Summary: Remove all links to wiki pages from the website Key: MAHOUT-1437 URL: https://issues.apache.org/jira/browse/MAHOUT-1437 Project: Mahout

[jira] [Assigned] (MAHOUT-1436) Missing pages need to be migrated over from old CMS site

2014-03-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1436?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter reassigned MAHOUT-1436: -- Assignee: Sebastian Schelter > Missing pages need to be migrated over f

Re: Cleaning up the backlog

2014-03-08 Thread Sebastian Schelter
As promised, I removed the mentioned backlog issues. I think we should return to what Sean did when he was still PMC chair: if there is no work on an issue for quite some time, close it. Keeping it lingering around and waiting for people come back for months justs "pollutes" our jira and makes

[jira] [Resolved] (MAHOUT-953) ArffVectorIterable does not gracefully handle duplicate attribute name

2014-03-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-953?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-953. --- Resolution: Won't Fix > ArffVectorIterable does not gracefully handle d

[jira] [Updated] (MAHOUT-880) Add some matrix method(like addition, subtraction, norm ... etc) to DistributedRowMatrix

2014-03-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-880?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-880: -- Resolution: Won't Fix Status: Resolved (was: Patch Available) >

[jira] [Resolved] (MAHOUT-968) Classifier based on restricted boltzmann machines

2014-03-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-968?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-968. --- Resolution: Won't Fix > Classifier based on restricted boltzmann

[jira] [Updated] (MAHOUT-716) Implement Boosting

2014-03-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-716?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-716: -- Resolution: Won't Fix Status: Resolved (was: Patch Available) > I

[jira] [Updated] (MAHOUT-1153) Implement streaming random forests

2014-03-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1153?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-1153: --- Fix Version/s: (was: Backlog) 1.0 > Implement stream

[jira] [Updated] (MAHOUT-874) Extract Writables into a separate module to allow smaller dependencies

2014-03-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-874?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-874: -- Fix Version/s: (was: Backlog) 1.0 > Extract Writables int

[jira] [Resolved] (MAHOUT-668) Adding knn support to Mahout classifiers

2014-03-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-668?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-668. --- Resolution: Won't Fix > Adding knn support to Mahout cla

[jira] [Resolved] (MAHOUT-928) Add the ARFF data loader/converter on DF

2014-03-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-928?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-928. --- Resolution: Won't Fix > Add the ARFF data loader/convert

[jira] [Resolved] (MAHOUT-1193) We may want a BlockSparseMatrix

2014-03-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1193?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-1193. Resolution: Won't Fix > We may want a BlockSpar

[jira] [Updated] (MAHOUT-627) Baum-Welch Algorithm on Map-Reduce for Parallel Hidden Markov Model Training.

2014-03-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-627?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-627: -- Resolution: Won't Fix Status: Resolved (was: Patch Available) > Ba

[jira] [Resolved] (MAHOUT-1204) Rewrite Benchmarks using Caliper

2014-03-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1204?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter resolved MAHOUT-1204. Resolution: Won't Fix > Rewrite Benchmarks using

[jira] [Updated] (MAHOUT-732) Implement ranking autoencoder on top of gradient machine

2014-03-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-732?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-732: -- Resolution: Won't Fix Status: Resolved (was: Patch Available) > I

Re: How to start

2014-03-08 Thread Sebastian Schelter
Great, you could create a ticket at https://issues.apache.org/jira/browse/MAHOUT to track this? Best, Sebastian On 03/08/2014 12:41 PM, Maciej Mazur wrote: OK, thanks. I'll start with it. On Sat, Mar 8, 2014 at 12:21 PM, Sebastian Schelter wrote: Hi Maciej, A nice todo would

Website

2014-03-08 Thread Sebastian Schelter
I'm currently updating the website, there might be some quirks over the next days, but I'll try to get everything fixed. --sebastian

Re: How to start

2014-03-08 Thread Sebastian Schelter
Hi Maciej, A nice todo would be to create a "quickstart" tutorial for building a simple recommender with mahout which we can publish our website then. --sebastian On 03/08/2014 12:03 PM, Maciej Mazur wrote: Hi, I am new in this project. I am on mailing list since couple of weeks, but frankl

[jira] [Created] (MAHOUT-1435) Website Redesign

2014-03-08 Thread Sebastian Schelter (JIRA)
Sebastian Schelter created MAHOUT-1435: -- Summary: Website Redesign Key: MAHOUT-1435 URL: https://issues.apache.org/jira/browse/MAHOUT-1435 Project: Mahout Issue Type: Bug Affects

[jira] [Updated] (MAHOUT-1435) Website Redesign

2014-03-08 Thread Sebastian Schelter (JIRA)
[ https://issues.apache.org/jira/browse/MAHOUT-1435?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Sebastian Schelter updated MAHOUT-1435: --- Attachment: mahout2.jpg Here is the draft. > Website Redes

Re: Mahout 1.0 goals

2014-03-08 Thread Sebastian Schelter
2:24 PM, Sebastian Schelter wrote: - AFAIK its also a problem to ship it license-wise as the required libraries would not be Apache licensed See this discussion from the Spark community for details: https://github.com/apache/incubator-spark/pull/575 This is a real issue and getting a lot of

Re: Mahout 1.0 goals

2014-03-07 Thread Sebastian Schelter
Mar 2014 19:32:40 -0800 Subject: Re: Mahout 1.0 goals To: dev@mahout.apache.org; s...@apache.org On Tue, Mar 4, 2014 at 2:24 PM, Sebastian Schelter wrote: - AFAIK its also a problem to ship it license-wise as the required libraries would not be Apache licensed See this discussion from the

Welcome Andrew Musselman as new comitter

2014-03-07 Thread Sebastian Schelter
Hi, this is to announce that the Project Management Committee (PMC) for Apache Mahout has asked Andrew Musselman to become committer and we are pleased to announce that he has accepted. Being a committer enables easier contribution to the project since in addition to posting patches on JIRA

<    1   2   3   4   5   6   7   8   9   10   >