GitHub user kaknikhil opened a pull request: https://github.com/apache/madlib/pull/195

    Feature: Add grouping support to HITS

    JIRA: MADLIB-1151

    Changes to support grouping column in HITS. Update queries to use
    group by and other necessary sql constructs.

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/kaknikhil/madlib hits_grouping

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/madlib/pull/195.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #195

----
commit 9b47a4345f9efd60be65f5adf3a884e6c660050c
Author: Jingyi Mei <j...@pivotal.io>
Date:   2017-10-05T19:18:32Z

    Feature: Add grouping support to HITS

    JIRA: MADLIB-1151

    Changes to support grouping column in HITS. Update queries to use
    group by and other necessary sql constructs.

commit 5f88d0553d85251575eb4919d2d69b89b2fec162
Author: Nikhil Kak <n...@pivotal.io>
Date:   2017-10-30T23:01:34Z

    Add install check test for hits grouping

    - Add tests for calling hits with grouping cols
    - Improve error messages if the result from hits_out does not meet
      expectations

commit c7ebe372fe3c0038ee47cfe9245d39d32c1ab95b
Author: Nikhil Kak <n...@pivotal.io>
Date:   2017-11-02T17:20:02Z

    Update HITS examples and usage with grouping columns

commit 81b03cf88598627a2df1bad3b57302f6b2a0aa9d
Author: Nikhil Kak <n...@pivotal.io>
Date:   2017-11-02T22:41:18Z

    Add user docs for grouping column

    Also adds a few more examples for threshold and max_iter.

commit 5267d272150fb6071f2604a6516d1bf4a3db05d1
Author: Nikhil Kak <n...@pivotal.io>
Date:   2017-11-03T18:31:38Z

    Refactor code, add comments, change var names.

    - Refactor logic into functions
    - This changes some code in pagerank, since we pulled out some
      functions out of HITS code which can be used in pagerank too.
    - Refactor indentation with PEP8 standards

----
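For readers unfamiliar with what a grouping column means for HITS, the idea can be sketched in plain Python: the edge table is partitioned by the grouping value, and authority and hub scores are computed independently within each partition. This is only an illustration, not the code in this pull request, which builds SQL queries with GROUP BY inside MADlib. The function names `hits` and `hits_by_group`, the L2 normalization, and the exact convergence check are assumptions made for the sketch; `max_iter` and `threshold` are named after the parameters mentioned in the commit messages, but their defaults here are arbitrary.

```python
import math
from collections import defaultdict


def _normalize(scores):
    """Scale scores to unit L2 norm (assumed normalization for this sketch)."""
    norm = math.sqrt(sum(s * s for s in scores.values()))
    if norm == 0.0:
        return dict(scores)
    return {v: s / norm for v, s in scores.items()}


def hits(edges, max_iter=100, threshold=1e-6):
    """Run HITS on (src, dest) edges; return {vertex: (authority, hub)}."""
    vertices = sorted({v for edge in edges for v in edge})
    if not vertices:
        return {}
    init = 1.0 / math.sqrt(len(vertices))
    auth = {v: init for v in vertices}
    hub = {v: init for v in vertices}
    for _ in range(max_iter):
        # Authority of a vertex: sum of hub scores of vertices pointing to it.
        new_auth = {v: 0.0 for v in vertices}
        for src, dest in edges:
            new_auth[dest] += hub[src]
        new_auth = _normalize(new_auth)
        # Hub of a vertex: sum of authority scores of vertices it points to.
        new_hub = {v: 0.0 for v in vertices}
        for src, dest in edges:
            new_hub[src] += new_auth[dest]
        new_hub = _normalize(new_hub)
        delta = max(
            max(abs(new_auth[v] - auth[v]), abs(new_hub[v] - hub[v]))
            for v in vertices
        )
        auth, hub = new_auth, new_hub
        if delta < threshold:
            break
    return {v: (auth[v], hub[v]) for v in vertices}


def hits_by_group(rows, max_iter=100, threshold=1e-6):
    """rows are (group, src, dest); each group is scored as its own graph."""
    edges_by_group = defaultdict(list)
    for group, src, dest in rows:
        edges_by_group[group].append((src, dest))
    return {
        group: hits(edges, max_iter, threshold)
        for group, edges in edges_by_group.items()
    }


rows = [("a", 1, 2), ("a", 1, 3), ("b", 1, 2), ("b", 2, 1)]
result = hits_by_group(rows)
```

In this toy input, vertex 3 appears only in group "a", so it receives no score in group "b": each group is treated as a separate graph, which is the behavior a grouping column is meant to provide.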