[ 
https://issues.apache.org/jira/browse/GIRAPH-326?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13456009#comment-13456009
 ] 

Eli Reisman commented on GIRAPH-326:
------------------------------------

This makes a lot of sense. I have seen ZooKeeper really bog down when lots of 
writes and reads are all occurring concurrently and the quorum is syncing all 
the time.

On the other hand, I have run countless jobs with many machines and many 
inputsplits, and have not encountered a big slowdown in this initial write 
phase. The threads should be accessing different znode path to write splits to 
all the time. What was the setup where this problem is encountered? Lots of 
machines, or few machines and lots of splits?

Interesting stuff, I look trying this out & hearing more about this problem.

                
> Writing input splits to ZooKeeper in parallel
> ---------------------------------------------
>
>                 Key: GIRAPH-326
>                 URL: https://issues.apache.org/jira/browse/GIRAPH-326
>             Project: Giraph
>          Issue Type: Improvement
>            Reporter: Maja Kabiljo
>         Attachments: GIRAPH-326.patch
>
>
> (Posting issue and the patch from a colleague)
> Writing input splits to zookeeper can take a lot of time. From his 
> experiments: serial 2m45s, with 16 cores 15s.

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators
For more information on JIRA, see: http://www.atlassian.com/software/jira

Reply via email to