Mark Lu created GIRAPH-1016:
-------------------------------

             Summary: Number of Workers and Giraph Speed
                 Key: GIRAPH-1016
                 URL: https://issues.apache.org/jira/browse/GIRAPH-1016
             Project: Giraph
          Issue Type: Task
    Affects Versions: 1.1.0
         Environment: aws ec2 Linux.
            Reporter: Mark Lu

I am trying to run Giraph's SimpleShortestPathsComputation to process a small graph dataset with about 77510 vertices and 898900 edges on AWS EC2 instances (t2.micro, with 1 master and 2 slave nodes). The Hadoop version is 1.2.1.

The Giraph command is:

    hadoop jar giraph-with-dependencies.jar org.apache.giraph.GiraphRunner org.apache.giraph.examples.SimpleShortestPathsComputation -vif org.apache.giraph.io.formats.JsonLongDoubleFloatDoubleVertexInputFormat -vip /user/ec2-user/a2.txt -vof org.apache.giraph.io.formats.IdWithValueTextOutputFormat -op /user/ec2-user/output1 -w 1

As I increase the number of workers (i.e., -w 2, 3, ...), both the CPU time and the total time of the Giraph computation increase. Shouldn't the CPU time and the computation time decrease when more workers are added? What should I do?

--
This message was sent by Atlassian JIRA
(v6.3.4#6332)