[ https://issues.apache.org/jira/browse/GIRAPH-273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13425610#comment-13425610 ]
Avery Ching commented on GIRAPH-273: ------------------------------------ I think that as an option, writing to HDFS should be fine, but the default should be in-memory, as writing to HDFS is likely to be a bit slow. Again, moving this out of Zookeeper should improve our scalability a lot, even with say 100k aggregators, this shouldn't be an issue (assuming they are small objects). The master doesn't require a lot of memory for other things, so keeping it in memory should be fine. > Aggregators shouldn't use Zookeeper > ----------------------------------- > > Key: GIRAPH-273 > URL: https://issues.apache.org/jira/browse/GIRAPH-273 > Project: Giraph > Issue Type: Improvement > Reporter: Maja Kabiljo > Assignee: Maja Kabiljo > > We use Zookeeper znodes to transfer aggregated values from workers to master > and back. Zookeeper is supposed to be used for coordination, and it also has > a memory limit which prevents users from having aggregators with large value > objects. These are the reasons why we should implement aggregators gathering > and distribution in a different way. -- This message is automatically generated by JIRA. If you think it was sent incorrectly, please contact your JIRA administrators: https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa For more information on JIRA, see: http://www.atlassian.com/software/jira