Space: Apache Mahout (https://cwiki.apache.org/confluence/display/MAHOUT)
Page: Data Processing 
(https://cwiki.apache.org/confluence/display/MAHOUT/Data+Processing)


Edited by Lance Norskog:
---------------------------------------------------------------------
Mahout has several programs which do useful operations on formats used in 
Mahout. These are Mahout jobs, mostly map/reduce capable.
h3. Math operations
* bin/mahout transpose
** transpose a matrix
* bin/mahout matrixmult
** multiply two or more matrices
* bin/mahout svd
** Lanzcos algorithm for SVD
* bin/mahout ssvd
** Stochastic SVD
* bin/mahout cleansvd
** Something useful 

*Note:* many of these have options to only create some outputs, or limit the 
accuracy of outputs in the name of efficiency & tractability
h3. Other operations
* bin/mahout split
** Split an input set into training and test parts
** Can take text inputs or SequenceFiles
* bin/mahout splitDataset
** Split a recommender rating dataset into training and test parts


Change your notification preferences: 
https://cwiki.apache.org/confluence/users/viewnotifications.action    

Reply via email to