[
https://issues.apache.org/jira/browse/MADLIB-1159?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16168699#comment-16168699
]
Brian Dolan edited comment on MADLIB-1159 at 9/15/17 11:56 PM:
---------------------------------------------------------------
Thanks, Frank! I was suggesting this should go into a "Tutorials" section or
something as it is a rather common use case. The current examples assume you
have already enumerated the features and have arrays.
Also, the `INSERT` lines with pre-defined feature ids is what I'm trying to
avoid. We should be able to assign the feature id easily and programmatically.
It doesn't make sense to make the user do it.
was (Author: buddha_314):
Thanks, Frank! I was suggesting this should go into a "Tutorials" section or
something as it is a rather common use case. The current examples assume you
have already enumerated the features and have arrays.
> Provide examples for common sparse matrix cases
> -----------------------------------------------
>
> Key: MADLIB-1159
> URL: https://issues.apache.org/jira/browse/MADLIB-1159
> Project: Apache MADlib
> Issue Type: Documentation
> Reporter: Brian Dolan
>
> A fairly common table structure is of the form `key1, key2, value` like a
> triples in a graph. These are often not normalized.
> It would be useful to provide an example of transforming this class of tables
> into a sparse matrix. Perhaps an example dataset could be a term-document
> matrix.
> TABLE doc_term;
> document, term, freq
> "do androids dream of electric sheep", "rachel", 75
> "do androids dream of electric sheep", "andy", 56
> "do androids dream of electric sheep", "hands", 128
> "da vinci code book review", "vapid",1326
> "da vinci code book review", "uninspired",265
> "da vinci code book review", "nauseating",879293
> "da vinci code book review", "inane",471
> Into a sparse matrix table of documents by features.
--
This message was sent by Atlassian JIRA
(v6.4.14#64029)