Mark Lu created GIRAPH-1016:
-------------------------------

             Summary: Number of Workers and Giraph Speed
                 Key: GIRAPH-1016
                 URL: https://issues.apache.org/jira/browse/GIRAPH-1016
             Project: Giraph
          Issue Type: Task
    Affects Versions: 1.1.0
         Environment: aws ec2 Linux.
            Reporter: Mark Lu


I am trying to run Giraph's SimpleShortestPathsComputation to process a 
small graph dataset with nearly 77510 vertices and 898900 edges on AWS EC2 
instances (t2.micro, with 1 master and 2 slave nodes). The Hadoop version is 
1.2.1. The Giraph command is:

    hadoop jar giraph-with-dependencies.jar org.apache.giraph.GiraphRunner \
      org.apache.giraph.examples.SimpleShortestPathsComputation \
      -vif org.apache.giraph.io.formats.JsonLongDoubleFloatDoubleVertexInputFormat \
      -vip /user/ec2-user/a2.txt \
      -vof org.apache.giraph.io.formats.IdWithValueTextOutputFormat \
      -op /user/ec2-user/output1 \
      -w 1

As I increase the number of workers (i.e., -w 2, -w 3, ...), both the CPU time 
and the total time of the Giraph computation increase. Shouldn't the CPU time 
and the computation time decrease when more workers are added? What should I do?




--
This message was sent by Atlassian JIRA
(v6.3.4#6332)
