Brian Dolan created MADLIB-1159:
-----------------------------------
Summary: Provide examples for common sparse matrix cases
Key: MADLIB-1159
URL: https://issues.apache.org/jira/browse/MADLIB-1159
Project: Apache MADlib
Issue Type: Documentation
Reporter: Brian Dolan
A fairly common table structure is of the form `key1, key2, value` like a
triples in a graph. These are often not normalized.
It would be useful to provide an example of transforming this class of tables
into a sparse matrix. Perhaps an example dataset could be a term-document
matrix.
TABLE doc_term;
document, term, freq
"do androids dream of electric sheep", "rachel", 75
"do androids dream of electric sheep", "andy", 56
"do androids dream of electric sheep", "hands", 128
"da vinci code book review", "vapid",1326
"da vinci code book review", "uninspired",265
"da vinci code book review", "nauseating",879293
"da vinci code book review", "inane",471
Into a sparse matrix table of documents by features.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)