Brian Dolan created MADLIB-1159:
-----------------------------------

             Summary: Provide examples for common sparse matrix cases
                 Key: MADLIB-1159
                 URL: https://issues.apache.org/jira/browse/MADLIB-1159
             Project: Apache MADlib
          Issue Type: Documentation
            Reporter: Brian Dolan


A fairly common table structure is of the form  `key1, key2, value` like a 
triples in a graph.  These are often not normalized.

It would be useful to provide an example of transforming this class of tables 
into a sparse matrix.  Perhaps an example dataset could be a term-document 
matrix.

TABLE doc_term;

document, term, freq
"do androids dream of electric sheep", "rachel", 75
"do androids dream of electric sheep", "andy", 56
"do androids dream of electric sheep", "hands", 128
"da vinci code book review", "vapid",1326
"da vinci code book review", "uninspired",265
"da vinci code book review", "nauseating",879293
"da vinci code book review", "inane",471

Into a sparse matrix table of documents by features.





--
This message was sent by Atlassian JIRA
(v6.4.14#64029)

Reply via email to