Re: Mahout 0.9 Release
Mahout 0.9 has been pushed to the mirrors and is available for download at http://www.apache.org/dyn/closer.cgi/mahout/ On Friday, January 31, 2014 11:21 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: The release has passed with the required votes from PMC, will be pushing 0.9 to the mirrors and updating the release notes over the next day or two. On Thursday, January 30, 2014 2:16 AM, Stevo Slavić ssla...@gmail.com wrote: +1 On Wed, Jan 29, 2014 at 10:56 PM, Shannon Quinn squ...@gatech.edu wrote: LGTM On 1/29/14, 4:27 PM, peng wrote: +1, can't see a bad side. On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote: +1 from me On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org wrote: +1 On 01/29/2014 05:25 AM, Andrew Musselman wrote: Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue, 28 Jan 2014 16:45:42 -0800 From: suneel_mar...@yahoo.com Subject: Mahout 0.9 Release To: u...@mahout.apache.org; dev@mahout.apache.org Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/ orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc Please:- a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Need a minimum of 3 '+1' votes from PMC for the release to be finalized.
Re: Mahout 0.9 Release
Yes, thanks Suneel! On Feb 2, 2014, at 1:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: Mahout 0.9 has been pushed to the mirrors and is available for download at http://www.apache.org/dyn/closer.cgi/mahout/ On Friday, January 31, 2014 11:21 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: The release has passed with the required votes from PMC, will be pushing 0.9 to the mirrors and updating the release notes over the next day or two. On Thursday, January 30, 2014 2:16 AM, Stevo Slavić ssla...@gmail.com wrote: +1 On Wed, Jan 29, 2014 at 10:56 PM, Shannon Quinn squ...@gatech.edu wrote: LGTM On 1/29/14, 4:27 PM, peng wrote: +1, can't see a bad side. On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote: +1 from me On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter s...@apache.org wrote: +1 On 01/29/2014 05:25 AM, Andrew Musselman wrote: Looks good. +1 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com wrote: a), b), c), d) all passed here. CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were within the range [0,1]. Date: Tue, 28 Jan 2014 16:45:42 -0800 From: suneel_mar...@yahoo.com Subject: Mahout 0.9 Release To: u...@mahout.apache.org; dev@mahout.apache.org Fixed the issues that were reported with Clustering code this past week, upgraded codebase to Lucene 4.6.1 that was released today. Here's the URL for the 0.9 release in staging:- https://repository.apache.org/content/repositories/ orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/ The artifacts have been signed with the following key: https://people.apache.org/keys/committer/smarthi.asc Please:- a) Verify that u can unpack the release (tar or zip) b) Verify u r able to compile the distro c) Run through the unit tests: mvn clean test d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run through all the different options in each script. Need a minimum of 3 '+1' votes from PMC for the release to be finalized.
[jira] [Commented] (MAHOUT-1388) Add command line support and logging for MLP
[ https://issues.apache.org/jira/browse/MAHOUT-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13889097#comment-13889097 ] Yexi Jiang commented on MAHOUT-1388: [~smarthi] I have revised the code, could you please have a look at the code at the review board? https://reviews.apache.org/r/16700/ Add command line support and logging for MLP Key: MAHOUT-1388 URL: https://issues.apache.org/jira/browse/MAHOUT-1388 Project: Mahout Issue Type: Improvement Components: Classification Affects Versions: 1.0 Reporter: Yexi Jiang Labels: mlp, sgd Fix For: 1.0 Attachments: Mahout-1388.patch, Mahout-1388.patch The user should have the ability to run the Perceptron from the command line. There are two programs to execute MLP, the training and labeling. The first one takes the data as input and outputs the model, the second one takes the model and unlabeled data as input and outputs the results. The parameters for training are as follows: --input -i (input data) --skipHeader -sk // whether to skip the first row, this parameter is optional --labels -labels // the labels of the instances, separated by whitespace. Take the iris dataset for example, the labels are 'setosa versicolor virginica'. --model -mo // in training mode, this is the location to store the model (if the specified location has an existing model, it will update the model through incremental learning), in labeling mode, this is the location to store the result --update -u // whether to incremental update the model, if this parameter is not given, train the model from scratch --output -o // this is only useful in labeling mode --layersize -ls (no. of units per hidden layer) // use whitespace separated number to indicate the number of neurons in each layer (including input layer and output layer), e.g. '5 3 2'. --squashingFunction -sf // currently only supports Sigmoid --momentum -m --learningrate -l --regularizationweight -r --costfunction -cf // the type of cost function, For example, train a 3-layer (including input, hidden, and output) MLP with 0.1 learning rate, 0.1 momentum rate, and 0.01 regularization weight, the parameter would be: mlp -i /tmp/training-data.csv -labels setosa versicolor virginica -o /tmp/model.model -ls 5,3,1 -l 0.1 -m 0.1 -r 0.01 This command would read the training data from /tmp/training-data.csv and write the trained model to /tmp/model.model. The parameters for labeling is as follows: - --input -i // input file path --columnRange -cr // the range of column used for feature, start from 0 and separated by whitespace, e.g. 0 5 --format -f // the format of input file, currently only supports csv --model -mo // the file path of the model --output -o // the output path for the results - If a user need to use an existing model, it will use the following command: mlp -i /tmp/unlabel-data.csv -m /tmp/model.model -o /tmp/label-result Moreover, we should be providing default values if the user does not specify any. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
Re: Mahout 0.9 Release
I just checked and the release has propagated to French mirrors. On Sun, Feb 2, 2014 at 1:22 PM, Suneel Marthi suneel_mar...@yahoo.comwrote: Mahout 0.9 has been pushed to the mirrors and is available for download at http://www.apache.org/dyn/closer.cgi/mahout/ On Friday, January 31, 2014 11:21 PM, Suneel Marthi suneel_mar...@yahoo.com wrote: The release has passed with the required votes from PMC, will be pushing 0.9 to the mirrors and updating the release notes over the next day or two.
[jira] [Commented] (MAHOUT-1366) Please delete old releases from mirroring system
[ https://issues.apache.org/jira/browse/MAHOUT-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13889260#comment-13889260 ] Suneel Marthi commented on MAHOUT-1366: --- Stevo, could u please delete the old releases 0.8 (where possible)? Lets escalate to INFRA about deleting the 0.7 folder. It would also be good to have svnpubsub setup for Mahout, lets raise that with INFRA. Please delete old releases from mirroring system Key: MAHOUT-1366 URL: https://issues.apache.org/jira/browse/MAHOUT-1366 Project: Mahout Issue Type: Bug Affects Versions: 0.4, 0.5, 0.6, 0.7 Environment: http://www.apache.org/dist/mahout/ Reporter: Sebb Assignee: Stevo Slavic Fix For: 0.9 To reduce the load on the ASF mirrors, projects are required to delete old releases [1] Please can you remove all non-current releases? Thanks! [Note that older releases are always available from the ASF archive server] [1] http://www.apache.org/dev/release.html#when-to-archive -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Comment Edited] (MAHOUT-1366) Please delete old releases from mirroring system
[ https://issues.apache.org/jira/browse/MAHOUT-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13889260#comment-13889260 ] Suneel Marthi edited comment on MAHOUT-1366 at 2/3/14 6:42 AM: --- Stevo, I removed the directories for 0.4 - 0.6. Lets escalate to INFRA about deleting the 0.7 folder. It would also be good to have svnpubsub setup for Mahout, lets raise that with INFRA. was (Author: smarthi): Stevo, could u please delete the old releases 0.8 (where possible)? Lets escalate to INFRA about deleting the 0.7 folder. It would also be good to have svnpubsub setup for Mahout, lets raise that with INFRA. Please delete old releases from mirroring system Key: MAHOUT-1366 URL: https://issues.apache.org/jira/browse/MAHOUT-1366 Project: Mahout Issue Type: Bug Affects Versions: 0.4, 0.5, 0.6, 0.7 Environment: http://www.apache.org/dist/mahout/ Reporter: Sebb Assignee: Stevo Slavic Fix For: 0.9 To reduce the load on the ASF mirrors, projects are required to delete old releases [1] Please can you remove all non-current releases? Thanks! [Note that older releases are always available from the ASF archive server] [1] http://www.apache.org/dev/release.html#when-to-archive -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Resolved] (MAHOUT-1366) Please delete old releases from mirroring system
[ https://issues.apache.org/jira/browse/MAHOUT-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi resolved MAHOUT-1366. --- Resolution: Fixed Deleted old archive releases 0.4 to 0.6 Please delete old releases from mirroring system Key: MAHOUT-1366 URL: https://issues.apache.org/jira/browse/MAHOUT-1366 Project: Mahout Issue Type: Bug Affects Versions: 0.4, 0.5, 0.6, 0.7 Environment: http://www.apache.org/dist/mahout/ Reporter: Sebb Assignee: Stevo Slavic Fix For: 0.9 To reduce the load on the ASF mirrors, projects are required to delete old releases [1] Please can you remove all non-current releases? Thanks! [Note that older releases are always available from the ASF archive server] [1] http://www.apache.org/dev/release.html#when-to-archive -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Commented] (MAHOUT-1305) Rework the wiki
[ https://issues.apache.org/jira/browse/MAHOUT-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13889267#comment-13889267 ] Suneel Marthi commented on MAHOUT-1305: --- Can this be marked as Resolved now that 0.9 is officially out-the-door? Rework the wiki --- Key: MAHOUT-1305 URL: https://issues.apache.org/jira/browse/MAHOUT-1305 Project: Mahout Issue Type: Bug Components: Documentation Affects Versions: 0.8 Reporter: Sebastian Schelter Priority: Blocker Labels: Documentation Fix For: 0.9 Attachments: MAHOUT-1304and1305.patch, MAHOUT-221213-1315-15716.pdf We should think about completely redoing our wiki. At the moment, we're listing lots of algorithms that we either never implemented or already removed. I also have the impression that a lot of stuff is outdated. It would be awesome if we had an up-to-date documentation of the code with instructions on how to get into using mahout quickly. We should also have examples for all our 3 C's. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Closed] (MAHOUT-1401) Resurrect Frequent Pattern mining
[ https://issues.apache.org/jira/browse/MAHOUT-1401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi closed MAHOUT-1401. - Resurrect Frequent Pattern mining - Key: MAHOUT-1401 URL: https://issues.apache.org/jira/browse/MAHOUT-1401 Project: Mahout Issue Type: Bug Reporter: Suneel Marthi Assignee: Suneel Marthi Priority: Critical Fix For: 0.9 -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Closed] (MAHOUT-1364) Upgrade Mahout codebase to Lucene 4.6
[ https://issues.apache.org/jira/browse/MAHOUT-1364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi closed MAHOUT-1364. - Mahout 0.9 has been released supporting Lucene 4.6.1. Upgrade Mahout codebase to Lucene 4.6 - Key: MAHOUT-1364 URL: https://issues.apache.org/jira/browse/MAHOUT-1364 Project: Mahout Issue Type: Improvement Components: Classification, CLI, Clustering, Examples, Integration Affects Versions: 0.8 Reporter: Suneel Marthi Assignee: Frank Scholten Fix For: 0.9 Attachments: LuceneIterableTest.diff, MAHOUT-1364.patch Parallel Randomized tests (using Carrot RandomizedRunner) fail on Mac OS for code that invokes Lucene API, see the discussion in M-1345. The fix is to upgrade to a Lucene version 4.3.1 (which is the present Lucene version in Mahout trunk). -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Closed] (MAHOUT-1395) Mahout CMS 404 Pages
[ https://issues.apache.org/jira/browse/MAHOUT-1395?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi closed MAHOUT-1395. - Assignee: Suneel Marthi Patch committed to website. Mahout CMS 404 Pages Key: MAHOUT-1395 URL: https://issues.apache.org/jira/browse/MAHOUT-1395 Project: Mahout Issue Type: Bug Components: Documentation Affects Versions: 0.9 Reporter: Sotiris Salloumis Assignee: Suneel Marthi Priority: Blocker Labels: Documentation Fix For: 0.9 Attachments: MAHOUT-1395.patch Following pages currently are 404, please provide me the correct link/content to update it in CMS 1) Developes - Code Quality reports: Broken Link: https://builds.apache.org/hudson/job/Mahout-Quality/clover/ 2) Classification - Design complimentary bayes : http://mahout.apache.org/users/classification/complementary-naive-bayes.html -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Closed] (MAHOUT-1279) The test TestMinHashClustering fails in Mahout core
[ https://issues.apache.org/jira/browse/MAHOUT-1279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi closed MAHOUT-1279. - The test TestMinHashClustering fails in Mahout core - Key: MAHOUT-1279 URL: https://issues.apache.org/jira/browse/MAHOUT-1279 Project: Mahout Issue Type: Bug Components: Clustering Affects Versions: 0.8 Environment: MacOS X 1,6+, Java7, Maven 3+ Reporter: Erhan Bagdemir Assignee: Suneel Marthi Fix For: 0.9 Hey folks, i tried building the core component, but unfortunately the test TestMinHashClustering fails. Failed tests: TestMinHashClustering.testMurmurMinHashMRJobHashIndex:218-Assert.assertEquals:555-Assert.assertEquals:118-Assert.failNotEquals:743-Assert.fail:88 MinHash MR Job Hash Index failed for MURMUR expected:0 but was:-1 Tests run: 834, Failures: 1, Errors: 0, Skipped: 0 -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Closed] (MAHOUT-1326) Fix broken links to quickstart tutorials
[ https://issues.apache.org/jira/browse/MAHOUT-1326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi closed MAHOUT-1326. - Fix broken links to quickstart tutorials Key: MAHOUT-1326 URL: https://issues.apache.org/jira/browse/MAHOUT-1326 Project: Mahout Issue Type: Bug Components: Documentation Reporter: Ravi Mummulla Fix For: 0.9 All links are broken in https://cwiki.apache.org/MAHOUT/quickstart.html. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Closed] (MAHOUT-1375) Apache Mahout
[ https://issues.apache.org/jira/browse/MAHOUT-1375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi closed MAHOUT-1375. - Apache Mahout - Key: MAHOUT-1375 URL: https://issues.apache.org/jira/browse/MAHOUT-1375 Project: Mahout Issue Type: Bug Reporter: kaan can Assignee: Suneel Marthi Fix For: 0.9 Hello, Firstly, thank you for spending time in read my letter! well,my question is : 1) Which tools are used in Carrot2? 2) Carrot2 is provide suitable for supervised learning or unsupervised? 3) Which preprocessing methods tools in Carrot2? Kind regards -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Closed] (MAHOUT-1304) Website doesn't fit on 1280 px
[ https://issues.apache.org/jira/browse/MAHOUT-1304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi closed MAHOUT-1304. - Assignee: Isabel Drost-Fromm Website doesn't fit on 1280 px -- Key: MAHOUT-1304 URL: https://issues.apache.org/jira/browse/MAHOUT-1304 Project: Mahout Issue Type: Bug Components: Documentation Affects Versions: 0.8 Reporter: Sebastian Schelter Assignee: Isabel Drost-Fromm Labels: documentation Fix For: 0.9 Attachments: MAHOUT-304.patch, home-wrapper-v3.png, screen.png Hi, since the latest changes, our website doesn't fit onto 1280 anymore it seems. -- This message was sent by Atlassian JIRA (v6.1.5#6160)
[jira] [Closed] (MAHOUT-1377) Exclude JUnit.jar from tarball
[ https://issues.apache.org/jira/browse/MAHOUT-1377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Suneel Marthi closed MAHOUT-1377. - Exclude JUnit.jar from tarball -- Key: MAHOUT-1377 URL: https://issues.apache.org/jira/browse/MAHOUT-1377 Project: Mahout Issue Type: Improvement Affects Versions: 0.9 Reporter: Sergey Svinarchuk Assignee: Suneel Marthi Fix For: 0.9 Attachments: PATHC-1377.patch -- This message was sent by Atlassian JIRA (v6.1.5#6160)