Re: Mahout 0.9 Release

2014-02-02 Thread Suneel Marthi
Mahout 0.9 has been pushed to the mirrors and is available for download at 
http://www.apache.org/dyn/closer.cgi/mahout/




On Friday, January 31, 2014 11:21 PM, Suneel Marthi suneel_mar...@yahoo.com 
wrote:
 
The release has passed with the required votes from the PMC. I will be pushing 0.9 to 
the mirrors and updating the release notes over the next day or two.




On Thursday, January 30, 2014 2:16 AM, Stevo Slavić ssla...@gmail.com wrote:

+1



On Wed, Jan 29, 2014 at 10:56 PM, Shannon Quinn squ...@gatech.edu wrote:

 LGTM


 On 1/29/14, 4:27 PM, peng wrote:

 +1, can't see a bad side.

 On Wed 29 Jan 2014 11:33:02 AM EST, Suneel Marthi wrote:

 +1 from me





 On Wednesday, January 29, 2014 8:58 AM, Sebastian Schelter 
 s...@apache.org wrote:

 +1


 On 01/29/2014 05:25 AM, Andrew Musselman wrote:

 Looks good.

 +1


 On Tue, Jan 28, 2014 at 8:07 PM, Andrew Palumbo ap@outlook.com
 wrote:

  a), b), c), d) all passed here.

 CosineDistance of clustered points from cluster-reuters.sh -1 kmeans were
 within the range [0,1].
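
 For reference, a minimal sketch of the range check mentioned above, assuming the usual definition of cosine distance as 1 minus cosine similarity. The function name is illustrative and is not Mahout's API. For vectors with non-negative components (term-weight vectors, for example) the value stays within [0,1]; for arbitrary vectors it can reach 2.

```python
import math


def cosine_distance(a, b):
    """Return 1 - cos(angle between a and b); illustrative helper, not Mahout code."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        # The angle is undefined for a zero vector, so refuse rather than guess.
        raise ValueError("cosine distance is undefined for zero vectors")
    return 1.0 - dot / (norm_a * norm_b)


if __name__ == "__main__":
    # Orthogonal non-negative vectors are at the far end of the [0,1] range.
    print(cosine_distance([1.0, 0.0], [0.0, 1.0]))
    # Parallel vectors are at (approximately) distance 0.
    print(cosine_distance([1.0, 2.0], [2.0, 4.0]))
```

 A check like the one reported would then amount to asserting 0 <= d <= 1 for every clustered point.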

  Date: Tue, 28 Jan 2014 16:45:42 -0800
 From: suneel_mar...@yahoo.com
 Subject: Mahout 0.9 Release
 To: u...@mahout.apache.org; dev@mahout.apache.org

 Fixed the issues that were reported with the Clustering code this past week and
 upgraded the codebase to Lucene 4.6.1, which was released today.


 Here's the URL for the 0.9 release in staging:-

  https://repository.apache.org/content/repositories/orgapachemahout-1004/org/apache/mahout/mahout-distribution/0.9/


 The artifacts have been signed with the following key:
 https://people.apache.org/keys/committer/smarthi.asc

 Please:-
 a) Verify that you can unpack the release (tar or zip)
 b) Verify that you are able to compile the distro
 c) Run through the unit tests: mvn clean test
 d) Run the example scripts under $MAHOUT_HOME/examples/bin. Please run
 through all the different options in each script.


 Need a minimum of 3 '+1' votes from the PMC for the release to be finalized.
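
 Step a) above could be scripted roughly as follows. This is only a sketch: the helper name and the paths passed to it are hypothetical, it uses nothing beyond the Python standard library, and it does not cover signature verification or steps b) to d).

```python
import tarfile
import zipfile
from pathlib import Path


def unpack_release(archive, dest):
    """Unpack a .zip or .tar(.gz) archive into dest and list the top-level entries.

    Hypothetical helper for release step a); not part of Mahout.
    """
    archive = Path(archive)
    dest = Path(dest)
    dest.mkdir(parents=True, exist_ok=True)
    if zipfile.is_zipfile(archive):
        with zipfile.ZipFile(archive) as zf:
            zf.extractall(dest)
    elif tarfile.is_tarfile(archive):
        with tarfile.open(archive) as tf:
            tf.extractall(dest)
    else:
        raise ValueError(f"not a zip or tar archive: {archive}")
    # A successful unpack should leave at least one entry behind.
    return sorted(p.name for p in dest.iterdir())
```

 Calling unpack_release on the downloaded distribution archive and checking that the returned list is non-empty gives a quick pass/fail for the unpack step.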







Re: Mahout 0.9 Release

2014-02-02 Thread Andrew Musselman
Yes, thanks Suneel!



[jira] [Commented] (MAHOUT-1388) Add command line support and logging for MLP

2014-02-02 Thread Yexi Jiang (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1388?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13889097#comment-13889097
 ] 

Yexi Jiang commented on MAHOUT-1388:


[~smarthi] I have revised the code. Could you please have a look at it on the 
review board? https://reviews.apache.org/r/16700/

 Add command line support and logging for MLP
 

 Key: MAHOUT-1388
 URL: https://issues.apache.org/jira/browse/MAHOUT-1388
 Project: Mahout
  Issue Type: Improvement
  Components: Classification
Affects Versions: 1.0
Reporter: Yexi Jiang
  Labels: mlp, sgd
 Fix For: 1.0

 Attachments: Mahout-1388.patch, Mahout-1388.patch


 The user should have the ability to run the Perceptron from the command line.
 There are two programs to execute MLP, the training and labeling. The first 
 one takes the data as input and outputs the model, the second one takes the 
 model and unlabeled data as input and outputs the results.
 The parameters for training are as follows:
 
 --input -i (input data)
 --skipHeader -sk // whether to skip the first row, this parameter is optional
 --labels -labels // the labels of the instances, separated by whitespace. 
 Take the iris dataset for example, the labels are 'setosa versicolor 
 virginica'.
 --model -mo  // in training mode, this is the location to store the model (if 
 the specified location has an existing model, it will update the model 
 through incremental learning), in labeling mode, this is the location to 
 store the result
 --update -u // whether to incrementally update the model; if this parameter is 
 not given, train the model from scratch
 --output -o   // this is only useful in labeling mode
 --layersize -ls (no. of units per hidden layer) // use whitespace separated 
 number to indicate the number of neurons in each layer (including input layer 
 and output layer), e.g. '5 3 2'.
 --squashingFunction -sf // currently only supports Sigmoid
 --momentum -m 
 --learningrate -l
 --regularizationweight -r
 --costfunction -cf   // the type of cost function
 
 For example, to train a 3-layer (input, hidden, and output) MLP with a 
 learning rate of 0.1, a momentum of 0.1, and a regularization weight of 0.01, 
 the command would be:
 mlp -i /tmp/training-data.csv -labels setosa versicolor virginica -mo 
 /tmp/model.model -ls 5 3 1 -l 0.1 -m 0.1 -r 0.01
 This command would read the training data from /tmp/training-data.csv and 
 write the trained model to /tmp/model.model.
 The parameters for labeling are as follows:
 -
 --input -i // input file path
 --columnRange -cr // the range of column used for feature, start from 0 and 
 separated by whitespace, e.g. 0 5
 --format -f // the format of input file, currently only supports csv
 --model -mo // the file path of the model
 --output -o // the output path for the results
 -
 If a user needs to use an existing model, they would use the following command:
 mlp -i /tmp/unlabel-data.csv -mo /tmp/model.model -o /tmp/label-result
 Moreover, we should provide default values for any parameters the user does 
 not specify. 
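
 To make the proposed training options concrete, here is a small sketch of how they could be parsed. It is written in Python with argparse purely for illustration; it is not the Mahout implementation, and the default values are placeholders chosen for the example rather than values taken from the issue. It follows the parameter list above, where --model/-mo names the model location and layer sizes are whitespace separated.

```python
import argparse


def build_training_parser():
    """Illustrative parser for the proposed MLP training options (not Mahout code)."""
    p = argparse.ArgumentParser(prog="mlp", allow_abbrev=False)
    p.add_argument("--input", "-i", required=True, help="input data")
    p.add_argument("--skipHeader", "-sk", action="store_true",
                   help="skip the first row of the input")
    p.add_argument("--labels", "-labels", nargs="+", required=True,
                   help="whitespace-separated class labels")
    p.add_argument("--model", "-mo", required=True,
                   help="location of the model")
    p.add_argument("--update", "-u", action="store_true",
                   help="incrementally update an existing model")
    p.add_argument("--layersize", "-ls", nargs="+", type=int, required=True,
                   help="neurons per layer, including input and output layers")
    # Defaults below are placeholders for this sketch only.
    p.add_argument("--squashingFunction", "-sf", default="Sigmoid")
    p.add_argument("--momentum", "-m", type=float, default=0.1)
    p.add_argument("--learningrate", "-l", type=float, default=0.1)
    p.add_argument("--regularizationweight", "-r", type=float, default=0.01)
    p.add_argument("--costfunction", "-cf", default=None)
    return p


if __name__ == "__main__":
    argv = ("-i /tmp/training-data.csv -labels setosa versicolor virginica "
            "-mo /tmp/model.model -ls 5 3 1 -l 0.1 -m 0.1 -r 0.01").split()
    print(build_training_parser().parse_args(argv))
```

 Giving every optional parameter a default, as in the sketch, is one way to meet the last requirement: only the input, labels, model location, and layer sizes have to be supplied.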



--
This message was sent by Atlassian JIRA
(v6.1.5#6160)


Re: Mahout 0.9 Release

2014-02-02 Thread Ted Dunning
I just checked and the release has propagated to French mirrors.

On Sun, Feb 2, 2014 at 1:22 PM, Suneel Marthi suneel_mar...@yahoo.com wrote:

 Mahout 0.9 has been pushed to the mirrors and is available for download at
 http://www.apache.org/dyn/closer.cgi/mahout/




 On Friday, January 31, 2014 11:21 PM, Suneel Marthi 
 suneel_mar...@yahoo.com wrote:

 The release has passed with the required votes from the PMC. I will be pushing
 0.9 to the mirrors and updating the release notes over the next day or two.



[jira] [Commented] (MAHOUT-1366) Please delete old releases from mirroring system

2014-02-02 Thread Suneel Marthi (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13889260#comment-13889260
 ] 

Suneel Marthi commented on MAHOUT-1366:
---

Stevo, could you please delete the old releases < 0.8 (where possible)? Let's 
escalate to INFRA about deleting the 0.7 folder.

It would also be good to have svnpubsub set up for Mahout; let's raise that with 
INFRA.

 Please delete old releases from mirroring system
 

 Key: MAHOUT-1366
 URL: https://issues.apache.org/jira/browse/MAHOUT-1366
 Project: Mahout
  Issue Type: Bug
Affects Versions: 0.4, 0.5, 0.6, 0.7
 Environment: http://www.apache.org/dist/mahout/
Reporter: Sebb
Assignee: Stevo Slavic
 Fix For: 0.9


 To reduce the load on the ASF mirrors, projects are required to delete old 
 releases [1]
 Please can you remove all non-current releases?
 Thanks!
 [Note that older releases are always available from the ASF archive server]
 [1] http://www.apache.org/dev/release.html#when-to-archive





[jira] [Comment Edited] (MAHOUT-1366) Please delete old releases from mirroring system

2014-02-02 Thread Suneel Marthi (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13889260#comment-13889260
 ] 

Suneel Marthi edited comment on MAHOUT-1366 at 2/3/14 6:42 AM:
---

Stevo, I removed the directories for 0.4 - 0.6. Let's escalate to INFRA about 
deleting the 0.7 folder.

It would also be good to have svnpubsub set up for Mahout; let's raise that with 
INFRA.


was (Author: smarthi):
Stevo, could you please delete the old releases < 0.8 (where possible)? Let's 
escalate to INFRA about deleting the 0.7 folder.

It would also be good to have svnpubsub set up for Mahout; let's raise that with 
INFRA.



[jira] [Resolved] (MAHOUT-1366) Please delete old releases from mirroring system

2014-02-02 Thread Suneel Marthi (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1366?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suneel Marthi resolved MAHOUT-1366.
---

Resolution: Fixed

Deleted old archive releases 0.4 to 0.6



[jira] [Commented] (MAHOUT-1305) Rework the wiki

2014-02-02 Thread Suneel Marthi (JIRA)

[ 
https://issues.apache.org/jira/browse/MAHOUT-1305?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanelfocusedCommentId=13889267#comment-13889267
 ] 

Suneel Marthi commented on MAHOUT-1305:
---

Can this be marked as Resolved now that 0.9 is officially out-the-door?

 Rework the wiki
 ---

 Key: MAHOUT-1305
 URL: https://issues.apache.org/jira/browse/MAHOUT-1305
 Project: Mahout
  Issue Type: Bug
  Components: Documentation
Affects Versions: 0.8
Reporter: Sebastian Schelter
Priority: Blocker
  Labels: Documentation
 Fix For: 0.9

 Attachments: MAHOUT-1304and1305.patch, MAHOUT-221213-1315-15716.pdf


 We should think about completely redoing our wiki. At the moment, we're 
 listing lots of algorithms that we either never implemented or already 
 removed. I also have the impression that a lot of stuff is outdated.
 It would be awesome if we had an up-to-date documentation of the code with 
 instructions on how to get into using mahout quickly.
 We should also have examples for all our 3 C's.





[jira] [Closed] (MAHOUT-1401) Resurrect Frequent Pattern mining

2014-02-02 Thread Suneel Marthi (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1401?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suneel Marthi closed MAHOUT-1401.
-


 Resurrect Frequent Pattern mining
 -

 Key: MAHOUT-1401
 URL: https://issues.apache.org/jira/browse/MAHOUT-1401
 Project: Mahout
  Issue Type: Bug
Reporter: Suneel Marthi
Assignee: Suneel Marthi
Priority: Critical
 Fix For: 0.9








[jira] [Closed] (MAHOUT-1364) Upgrade Mahout codebase to Lucene 4.6

2014-02-02 Thread Suneel Marthi (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suneel Marthi closed MAHOUT-1364.
-


Mahout 0.9 has been released supporting Lucene 4.6.1.

 Upgrade Mahout codebase to Lucene 4.6
 -

 Key: MAHOUT-1364
 URL: https://issues.apache.org/jira/browse/MAHOUT-1364
 Project: Mahout
  Issue Type: Improvement
  Components: Classification, CLI, Clustering, Examples, Integration
Affects Versions: 0.8
Reporter: Suneel Marthi
Assignee: Frank Scholten
 Fix For: 0.9

 Attachments: LuceneIterableTest.diff, MAHOUT-1364.patch


 Parallel Randomized tests (using Carrot RandomizedRunner) fail on Mac OS for 
 code that invokes the Lucene API; see the discussion in M-1345.  The fix is to 
 upgrade to a Lucene version > 4.3.1 (which is the present Lucene version in 
 Mahout trunk).  





[jira] [Closed] (MAHOUT-1395) Mahout CMS 404 Pages

2014-02-02 Thread Suneel Marthi (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1395?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suneel Marthi closed MAHOUT-1395.
-

Assignee: Suneel Marthi

Patch committed to website.

 Mahout CMS 404 Pages
 

 Key: MAHOUT-1395
 URL: https://issues.apache.org/jira/browse/MAHOUT-1395
 Project: Mahout
  Issue Type: Bug
  Components: Documentation
Affects Versions: 0.9
Reporter: Sotiris Salloumis
Assignee: Suneel Marthi
Priority: Blocker
  Labels: Documentation
 Fix For: 0.9

 Attachments: MAHOUT-1395.patch


 The following pages currently return 404. Please provide me with the correct 
 link/content so that I can update them in the CMS:
 1) Developers - Code Quality reports: Broken Link: 
 https://builds.apache.org/hudson/job/Mahout-Quality/clover/
 2) Classification - Design complementary Bayes: 
 http://mahout.apache.org/users/classification/complementary-naive-bayes.html





[jira] [Closed] (MAHOUT-1279) The test TestMinHashClustering fails in Mahout core

2014-02-02 Thread Suneel Marthi (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1279?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suneel Marthi closed MAHOUT-1279.
-


 The test TestMinHashClustering fails in Mahout core
 -

 Key: MAHOUT-1279
 URL: https://issues.apache.org/jira/browse/MAHOUT-1279
 Project: Mahout
  Issue Type: Bug
  Components: Clustering
Affects Versions: 0.8
 Environment: MacOS X 1,6+, Java7, Maven 3+
Reporter: Erhan Bagdemir
Assignee: Suneel Marthi
 Fix For: 0.9


 Hey folks, 
 I tried building the core component, but unfortunately the test 
 TestMinHashClustering fails. 
 Failed tests: 
   
 TestMinHashClustering.testMurmurMinHashMRJobHashIndex:218-Assert.assertEquals:555-Assert.assertEquals:118-Assert.failNotEquals:743-Assert.fail:88
  MinHash MR Job Hash Index failed for MURMUR expected:<0> but was:<-1>
 Tests run: 834, Failures: 1, Errors: 0, Skipped: 0





[jira] [Closed] (MAHOUT-1326) Fix broken links to quickstart tutorials

2014-02-02 Thread Suneel Marthi (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1326?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suneel Marthi closed MAHOUT-1326.
-


 Fix broken links to quickstart tutorials
 

 Key: MAHOUT-1326
 URL: https://issues.apache.org/jira/browse/MAHOUT-1326
 Project: Mahout
  Issue Type: Bug
  Components: Documentation
Reporter: Ravi Mummulla
 Fix For: 0.9


 All links are broken in https://cwiki.apache.org/MAHOUT/quickstart.html.





[jira] [Closed] (MAHOUT-1375) Apache Mahout

2014-02-02 Thread Suneel Marthi (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1375?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suneel Marthi closed MAHOUT-1375.
-


 Apache Mahout
 -

 Key: MAHOUT-1375
 URL: https://issues.apache.org/jira/browse/MAHOUT-1375
 Project: Mahout
  Issue Type: Bug
Reporter: kaan can
Assignee: Suneel Marthi
 Fix For: 0.9


 Hello,
  Firstly, thank you for taking the time to read my letter.
  My questions are:
 1) Which tools are used in Carrot2?
 2) Is Carrot2 suitable for supervised or unsupervised learning?
 3) Which preprocessing methods and tools are available in Carrot2?
 Kind regards





[jira] [Closed] (MAHOUT-1304) Website doesn't fit on 1280 px

2014-02-02 Thread Suneel Marthi (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1304?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suneel Marthi closed MAHOUT-1304.
-

Assignee: Isabel Drost-Fromm

 Website doesn't fit on 1280 px
 --

 Key: MAHOUT-1304
 URL: https://issues.apache.org/jira/browse/MAHOUT-1304
 Project: Mahout
  Issue Type: Bug
  Components: Documentation
Affects Versions: 0.8
Reporter: Sebastian Schelter
Assignee: Isabel Drost-Fromm
  Labels: documentation
 Fix For: 0.9

 Attachments: MAHOUT-304.patch, home-wrapper-v3.png, screen.png


 Hi,
 since the latest changes, it seems our website no longer fits onto 1280 px.





[jira] [Closed] (MAHOUT-1377) Exclude JUnit.jar from tarball

2014-02-02 Thread Suneel Marthi (JIRA)

 [ 
https://issues.apache.org/jira/browse/MAHOUT-1377?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Suneel Marthi closed MAHOUT-1377.
-


 Exclude JUnit.jar from tarball
 --

 Key: MAHOUT-1377
 URL: https://issues.apache.org/jira/browse/MAHOUT-1377
 Project: Mahout
  Issue Type: Improvement
Affects Versions: 0.9
Reporter: Sergey Svinarchuk
Assignee: Suneel Marthi
 Fix For: 0.9

 Attachments: PATHC-1377.patch





