[ https://issues.apache.org/jira/browse/GIRAPH-26?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ]
Sean Choi updated GIRAPH-26: ---------------------------- Attachment: GIRAPH-26-1.patch Adding a more realistic graph generator based on this paper cs.stanford.edu/~jure/pubs/kronFit-icml07.pdf The input takes two arguments -K which represents the number of Kronecker power -M which represents the initial matrix in matlab notation for example the arguments -K 10 -M "0.9 0.5; 0.5 0.1;" will give a matrix of size 2^10 with randomly generated edges with power law distributions and all the goodies... > Improve PseudoRandomVertexInputFormat to create a more realistic synthetic > graph (e.g. power-law distributed vertex-cardinality). > --------------------------------------------------------------------------------------------------------------------------------- > > Key: GIRAPH-26 > URL: https://issues.apache.org/jira/browse/GIRAPH-26 > Project: Giraph > Issue Type: Test > Components: benchmark > Reporter: Jake Mannix > Assignee: Jake Mannix > Priority: Minor > Attachments: GIRAPH-26-1.patch > > > The PageRankBenchmark class, to be a proper benchmark, should run over graphs > which look more like data seen in the wild, and web link graphs, social > network graphs, and text corpora (represented as a bipartite graph) all have > power-law distributions, so benchmarking a synthetic graph which looks more > like this would be a nice test which would stress cases of uneven > split-distribution and bottlenecks of subclusters of the graph of heavily > connected vertices. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira