Gary,

On Wed, Apr 9, 2008 at 1:18 AM,  <[EMAIL PROTECTED]> wrote:
>
>  I have a customer who has an application that when run under pfmon reports
>  154 billion CPU_CYCLES used (appears to be a reasonable value).  When this
>  same application is run under Hpcrun (from HPCToolkit using PAPI) it only
>  reports about 2 billion CPU_CYCLES used.  These tests are run on an Intel
>  IA64 platform.
>
You need to tell me which kernel version, which perfmon version.

Also how did you calculate those 2 numbers? What this simlpe counting and
derived from the samples you are getting.

The 'losing interrupts' should not affect you because it is related to handling
of signals in multi-threaded programs.


As for the log mail them to me directly.

Thanks.

>  This application runs as a single thread and does not set a signal handler
>  or mask the SIGIO signal. Hpcrun produces 8 data output files when run on
>  this application.  One for the application itself, 4 for bash scripts the
>  application runs, 2 for 'rm' commands the application executes and 1 for a
>  gzip command it runs.
>
>  The customer wants to know why Hpcrun only reports a little over 1% of the
>  cpu
>  cycles used.  I have been trying to compare what pfmon does to what hpcrun
>  does
>  and it seems that the only debug data available for both runs is the kernel
>  debug
>  data written by perfmon.  This data clearly shows that Hpcrun/Papi is using
>  the perfmon services differently than pfmon does.  I tried to attach the
>  debug output for these two runs to this mail but that exceeded the allowed
>  message
>  size for the list.
>
>  I tried adding code (as a test case) to the Papi signal handler to count
>  and print
>  the number of signals paid during the run.  The values printed seemed to
>  pretty
>  much match the values reported as number of samples when hpcprof is run on
>  the
>  hpcrun data files.  This was an attempt to detect if my problem was
>  handling signals
>  or getting them and I think this test showed the problem is in getting
>  them.
>
>  I have also browsed this mailing list and found a thread called
>  "papi on compute node linux" which was last updated 2008-03-10.  The
>  discussion in this thread sounds to me like it could easily explain what
>  I am seeing.
>
>  Is there a way I can determine if this discussion (ie: loosing interrupts)
>  is what I am seeing ?
>
>  Thanks for any help you can provide.
>
>  Gary
>
>
>  -------------------------------------------------------------------------
>  This SF.net email is sponsored by the 2008 JavaOne(SM) Conference
>  Don't miss this year's exciting event. There's still time to save $100.
>  Use priority code J8TL2D2.
>  
> http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone
>  _______________________________________________
>  perfmon2-devel mailing list
>  [email protected]
>  https://lists.sourceforge.net/lists/listinfo/perfmon2-devel
>

-------------------------------------------------------------------------
This SF.net email is sponsored by the 2008 JavaOne(SM) Conference 
Don't miss this year's exciting event. There's still time to save $100. 
Use priority code J8TL2D2. 
http://ad.doubleclick.net/clk;198757673;13503038;p?http://java.sun.com/javaone
_______________________________________________
perfmon2-devel mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/perfmon2-devel

Reply via email to