Matrix and Vector Needs (MAHOUT) edited by Grant Ingersoll Page: http://cwiki.apache.org/confluence/display/MAHOUT/Matrix+and+Vector+Needs Changes: http://cwiki.apache.org/confluence/pages/diffpagesbyversion.action?pageId=75990&originalVersion=1&revisedVersion=2
Content: --------------------------------------------------------------------- h1. Intro We need some basic Matrix operations to start, and will definitely need more as we go. h2. Background See [http://mail-archives.apache.org/mod_mbox/lucene-mahout-dev/200802.mbox/browser] h2. Needs We will more than likely need all the basic Matrix operations, plus some more advanced ones: * Addition, Subtraction, Multiplication, Transpose, Inverse, Scaling For ideas like PageRank/TextRank, iterative approaches that, essentially, calculate eigenvectors are also useful. Similarly, for vectors, things like dot/cross product will be useful h2. Ideas Use HBase (BigTable) in Hadoop to represent the Matrix. Batching row/column operations can be useful. h2. References Have a look at the old parallel computing libraries like [ScalaPACK|http://www.netlib.org/scalapack/], others --------------------------------------------------------------------- CONFLUENCE INFORMATION This message is automatically generated by Confluence Unsubscribe or edit your notifications preferences http://cwiki.apache.org/confluence/users/viewnotifications.action If you think it was sent incorrectly contact one of the administrators http://cwiki.apache.org/confluence/administrators.action If you want more information on Confluence, or have a bug to report see http://www.atlassian.com/software/confluence