Btw, the jobs we run typically run for hours, so total time is mostly just sum 
of input + supersteps for us, since a very little extra time is negligible. 
However, I see your job itself is so small, so to be more accuratetotal time = 
time between start of your job (once all machines were allocated) and the end 
of your job (when u can see in logs that all workers are done with the last 
superstep)

From: pava...@outlook.com
To: user@giraph.apache.org
Subject: RE: input superstep of giraph.
Date: Fri, 18 Apr 2014 23:58:53 +0530




Please take a look at GIRAPH-838Note that there is a little window between end 
of one superstep & start of the other. So, this 120 s can be accounted for 
that. But what I meant was total time is as good as sum of input + other 
supersteps (though only approximately because of this slight extra time)

Date: Fri, 18 Apr 2014 16:28:48 +0100
Subject: Re: input superstep of giraph.
From: ghufran1ma...@gmail.com
To: user@giraph.apache.org

Hi Pavan, 
I might have misunderstood your explanation. But from the giraph timers I 
received, Total does not seem to be:

 sum of input time + sum of time in all supersteps



For example the following timers were outputted after I ran the 
ConnectedComponents algorithm: 

        Giraph Timers
                Initialize (ms)=424             Input superstep (ms)=1457       
        Setup (ms)=85
                Shutdown (ms)=11666             Superstep 0 
ConnectedComponentsComputation (ms)=903             Superstep 1 
ConnectedComponentsComputation (ms)=4565
                Superstep 10 ConnectedComponentsComputation (ms)=475            
Superstep 11 ConnectedComponentsComputation (ms)=454
                Superstep 12 ConnectedComponentsComputation (ms)=342            
Superstep 2 ConnectedComponentsComputation (ms)=3094
                Superstep 3 ConnectedComponentsComputation (ms)=1399            
Superstep 4 ConnectedComponentsComputation (ms)=783
                Superstep 5 ConnectedComponentsComputation (ms)=591             
Superstep 6 ConnectedComponentsComputation (ms)=458
                Superstep 7 ConnectedComponentsComputation (ms)=458             
Superstep 8 ConnectedComponentsComputation (ms)=483             Superstep 9 
ConnectedComponentsComputation (ms)=458
                Total (ms)=27675
Input superstep = sum of input time? 
Input superstep + sum of supersteps = 1457 + 14463                              
                         = 15920


and the total is: 27675
there is still 11755 ms unaccounted for? 
Or have I miss understood what sum of input time should be? 

Kind regards, 

Ghufran 




On Fri, Apr 18, 2014 at 3:52 PM, ghufran malik <ghufran1ma...@gmail.com> wrote:

Hi, 
Thank you for the explanation :) 
It was confusing when reading it, some of the timers I can intuitively 
understand, however I think it would be beneficial if these explanations were 
added to the API docs, then if anyone else is confused they can look up the 
meanings there. 



https://giraph.apache.org/giraph-core/apidocs/org/apache/giraph/counters/GiraphTimers.html
 


Thanks, 
Ghufran

On Fri, Apr 18, 2014 at 3:25 PM, Pavan Kumar A <pava...@outlook.com> wrote:





I wrote the Initialize counter :) Please tell me if the name seems confusing
So,Initialize = the time spent by job waiting for resources. In a shared pool 
the job you launch may not get all the machines needed to start the job. So for 
instance you want to run a job with 200 workers, giraph does not start until 
all the workers have are allocated & register with the master.


Setup = once you have all the machines allocated, how much time it takes before 
starting to read input
Shutdown = once you have written your output howmuch time it takes to stop 
verify that everything is done & shutdown resources & notify user - for 
instance wait for all network connections to close, all threads to join, etc.


Total = sum of input time + sum of time in all superstepsi.e., actual time 
taken to run by your application after it got all the resources (does not 
include time waiting to get resources which is initialize or shutdown time)



Date: Fri, 18 Apr 2014 13:28:47 +0100
Subject: Re: input superstep of giraph.
From: ghufran1ma...@gmail.com
To: user@giraph.apache.org



Hi, 

Could you also explain what the following timers correspond to as well please: 

Giraph Timers           Initialize (ms)=775


                                Setup (ms)=105
                Shutdown (ms)=12537

                Total (ms)=27075                
Thanks, 

Ghufran



On Thu, Apr 17, 2014 at 9:10 PM, Pavan Kumar A <pava...@outlook.com> wrote:






Input consists of > reading the input (vertices and/or edges as provided) into 
memory on individual workers> assigning vertices to partitions and partitions 
to workers


> moving all partitions (i.e., vertices & their out-edges) to a worker (which 
> owns the partition)> doing some bookkeeping of internal data-structures to be 
> used during computation

Date: Thu, 17 Apr 2014 10:06:03 -0500



Subject: input superstep of giraph.
From: suijian.z...@gmail.com
To: user@giraph.apache.org




Hi, 
  From the screen output of a successful giraph program run, what does the 
following line mean?

Input superstep (ms)=22884



 Does it mean the time used to load the input graph into memory? Thanks.

  Best Regards,
  Suijian



                                          

                                          



                                                                                
  

Reply via email to