Space: Apache Mahout (https://cwiki.apache.org/confluence/display/MAHOUT) Page: Data Processing (https://cwiki.apache.org/confluence/display/MAHOUT/Data+Processing)
Edited by Lance Norskog: --------------------------------------------------------------------- Mahout has several programs which do useful operations on formats used in Mahout. These are Mahout jobs, mostly map/reduce capable. h3. Math operations * bin/mahout transpose ** transpose a matrix * bin/mahout matrixmult ** multiply two or more matrices * bin/mahout svd ** Lanzcos algorithm for SVD * bin/mahout ssvd ** Stochastic SVD * bin/mahout cleansvd ** Something useful *Note:* many of these have options to only create some outputs, or limit the accuracy of outputs in the name of efficiency & tractability h3. Other operations * bin/mahout split ** Split an input set into training and test parts ** Can take text inputs or SequenceFiles * bin/mahout splitDataset ** Split a recommender rating dataset into training and test parts Change your notification preferences: https://cwiki.apache.org/confluence/users/viewnotifications.action
