---------- Forwarded message ----------
From: Hernan Laffitte <[email protected]>
Date: Wed, Jun 22, 2016 at 4:45 PM
Subject: Re: [Collectl-interest] colmux time format error
To: Mark Seger <[email protected]>


Hello Mark,

Thanks for the reply! I finally had some time to run the additional tests
you requested. Some comments below...

On Thu, Jun 16, 2016 at 5:30 AM, Mark Seger <[email protected]> wrote:

> maybe you have a different kernel?
>

The machine where I am having this issue is running Debian "jessie/sid".
Kernel is:

   Linux spaa-1 3.13.0-85-generic #129-Ubuntu SMP Thu Mar 17 20:50:15 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux

The version of colmux I have installed is "colmux: 4.7.1 (Term::ReadKey:
V2.31 Threads: 1.86)"

When running the command in 'test' mode, columns 10, 20, 30, ... were
the "%Idle" of the CPUs. Columns 11, 21, 31, ... were the "%Total" of the
CPUs.

In both cases, the commands give this error all the time (not an
intermittent error). One or two "-1" rows appear, followed by the message:

   Minute '60' out of range 0..59 at /usr/bin/colmux line 1699.
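
(Illustrative note: the message above says a time conversion was handed a
minute field of 60. The Python sketch below is not colmux code; colmux is
written in Perl, and what line 1699 actually does is not shown in this
thread. It only demonstrates, using a hypothetical helper named
normalize_hms, how an overflowing minute or second field could be carried
into the next larger unit before a strict conversion is attempted.)

```python
def normalize_hms(hour, minute, second):
    """Carry overflowing second/minute fields into the next larger unit.

    Hypothetical helper for illustration only; not taken from colmux.
    """
    # Collapse everything to seconds since midnight, so an out-of-range
    # field such as minute=60 simply contributes 3600 seconds.
    total = hour * 3600 + minute * 60 + second
    total %= 24 * 3600  # wrap around past midnight
    h, rem = divmod(total, 3600)
    m, s = divmod(rem, 60)
    return h, m, s


# A minute value of 60 rolls over into the next hour.
print(normalize_hms(16, 60, 0))  # (17, 0, 0)
```

Whether rollover like this or rejecting the sample is the right behaviour
depends on where the out-of-range value originates, which this sketch does
not address.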

> What you didn't say is does this fail all the time or intermittently.  If
> intermittent it will indeed be hard to track down, but there is hope too ;)
>
>
The error occurs every time I try this command.



> Have you tried playing back a file with colmux yet?
>

I am gathering the output of collectl from all the machines into an NFS
directory. All the machines in the cluster have /var/log/collectl
 symlinked to /nfs/mnt/path/to/collectl

If I run the command via replay, the error doesn't occur:

colmux -addr 'spaa-[1-3]' -command "-sC -oT -P -p '/nfs/mnt/path/to/collectl/*20160621*raw.gz'" -cols 11,21 | less

However, in every row, all 3 machines show the same values for CPU0 and
CPU1. Something like:

#Time    spaa-1 spaa-2 spaa-3 |  spaa-1 spaa-2 spaa-3
...
      1      1      1 |       3      3      3
      0      0      0 |      11     11     11
...


> and since this is a playback command, you can use time ranges as well to
> limit what is being displayed so I may help zero in on where in the data
> the problem is and then maybe even send me a subset of the problem raw file
> [use collectl --extract to create a new raw file from the time slice of an
> old one].  then, maybe I can track down why this is happening.
>
> -mark
>
>
Thanks Mark, I will send a copy of the raw files in a private email.

Regards,

Hernan
_______________________________________________
Collectl-interest mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/collectl-interest
