GitHub user kaknikhil opened a pull request:
https://github.com/apache/madlib/pull/195
Feature: Add grouping support to HITS
JIRA: MADLIB-1151
Changes to support a grouping column in HITS. Update queries to
use GROUP BY and other necessary SQL constructs.
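As a rough illustration of what grouping means here, the sketch below partitions the edges by a grouping key and runs HITS independently on each partition. This is not the MADlib implementation, which expresses the computation in SQL. The function names hits and hits_by_group, the (group_key, src, dest) row layout, and the L2 normalization are all assumptions made for this example.

```python
from collections import defaultdict
from math import sqrt


def _normalize(scores):
    """Scale scores to unit L2 norm (assumed normalization for this sketch)."""
    norm = sqrt(sum(x * x for x in scores.values())) or 1.0
    return {v: x / norm for v, x in scores.items()}


def hits(edges, max_iter=100, threshold=1e-6):
    """Plain HITS on (src, dest) edges; returns {vertex: (authority, hub)}."""
    vertices = {v for edge in edges for v in edge}
    if not vertices:
        return {}
    auth = {v: 1.0 for v in vertices}
    hub = {v: 1.0 for v in vertices}
    for _ in range(max_iter):
        # Authority of a vertex: sum of hub scores of vertices pointing at it.
        new_auth = {v: 0.0 for v in vertices}
        for src, dest in edges:
            new_auth[dest] += hub[src]
        new_auth = _normalize(new_auth)
        # Hub of a vertex: sum of authority scores of vertices it points to.
        new_hub = {v: 0.0 for v in vertices}
        for src, dest in edges:
            new_hub[src] += new_auth[dest]
        new_hub = _normalize(new_hub)
        # Stop once no score moved by more than the threshold.
        delta = max(
            max(abs(new_auth[v] - auth[v]), abs(new_hub[v] - hub[v]))
            for v in vertices
        )
        auth, hub = new_auth, new_hub
        if delta < threshold:
            break
    return {v: (auth[v], hub[v]) for v in vertices}


def hits_by_group(rows, **kwargs):
    """Run HITS separately per group; rows are (group_key, src, dest)."""
    grouped = defaultdict(list)
    for key, src, dest in rows:
        grouped[key].append((src, dest))
    return {key: hits(edges, **kwargs) for key, edges in grouped.items()}


if __name__ == "__main__":
    rows = [("a", 1, 2), ("a", 1, 3), ("b", 1, 2)]
    for key, scores in sorted(hits_by_group(rows).items()):
        print(key, scores)
```

Each group gets its own vertex set, normalization, and convergence check, so scores in one group do not depend on edges in another.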
You can merge this pull request into a Git repository by running:
$ git pull https://github.com/kaknikhil/madlib hits_grouping
Alternatively you can review and apply these changes as the patch at:
https://github.com/apache/madlib/pull/195.patch
To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:
This closes #195
----
commit 9b47a4345f9efd60be65f5adf3a884e6c660050c
Author: Jingyi Mei <[email protected]>
Date: 2017-10-05T19:18:32Z
Feature: Add grouping support to HITS
JIRA: MADLIB-1151
Changes to support grouping column in HITS. Update queries to
use group by and other necessary sql constructs.
commit 5f88d0553d85251575eb4919d2d69b89b2fec162
Author: Nikhil Kak <[email protected]>
Date: 2017-10-30T23:01:34Z
Add install check test for hits grouping
- Add tests for calling hits with grouping cols
- Improve error messages if the result from hits_out does not meet
expectations
commit c7ebe372fe3c0038ee47cfe9245d39d32c1ab95b
Author: Nikhil Kak <[email protected]>
Date: 2017-11-02T17:20:02Z
Update HITS examples and usage with grouping columns
commit 81b03cf88598627a2df1bad3b57302f6b2a0aa9d
Author: Nikhil Kak <[email protected]>
Date: 2017-11-02T22:41:18Z
Add user docs for grouping column
Also adds a few more examples for threshold and max_iter.
commit 5267d272150fb6071f2604a6516d1bf4a3db05d1
Author: Nikhil Kak <[email protected]>
Date: 2017-11-03T18:31:38Z
Refactor code, add comments, change var names.
- Refactor logic into functions
- This also changes some code in pagerank, since we pulled some
functions out of the HITS code that pagerank can use too.
- Fix indentation to follow the PEP 8 standard
----