re playback: can you try it out? even if only for a little bit? I'm
guessing you'll see the problem there as well but I'd really like to
understand what is happening. All you need to do is add "--rawtoo -f/tmp"
to the DaemonCommands in your /etc/collectl.conf, restart collectl and it
will write a raw file to /tmp. Then, if you can run your tests and save
the interactive output with timestamps - include -oT, you should be able to
play back the data with 'collectl -p file -sx -oT' from the raw file and
see almost identical numbers OR 1/2 the values. Just remember to reset
/etc/collectl.conf when you're done.
It would provide a useful data point. You can even play back the data with
--export lexpr.
Meanwhile we can try to reproduce what you're seeing. Actually have you
seen this with earlier versions of collectl? I haven't touch the IB code
in years, at least I don't remember doing so, but I have touched lexpr,
that's why it's important to try and understand where the actual problem
lies.
-mark
On Tue, Nov 26, 2013 at 8:09 AM, Dragseth Roy Einar <[email protected]>wrote:
> Hi Mark. Yes, its been a while...
>
> I must admit I have never used playback mode so I do not know. We do not
> have any .raw files produced by collectl.
>
> r.
>
> ------------------------------
> *From:* Mark Seger [[email protected]]
> *Sent:* Tuesday, November 26, 2013 1:52 PM
> *To:* Dragseth Roy Einar
> *Cc:* [email protected]
> *Subject:* Re: [Collectl-interest] collectl disagrees with itself
> regarding infiniband bandwidth.
>
> hi roy - long time no chat...
>
> This is indeed an interesting one I haven't seen. Just to be clear,
> because you said it reports half as a daemon when using lexpr. Does it
> also record 1/2 as a daemon and playback as 1/2 w/o lexpr?
>
> -mark
>
>
> On Tue, Nov 26, 2013 at 4:01 AM, Roy Dragseth <[email protected]> wrote:
>
>> Collectl seems to disagree with itself when reporting infiniband bandwidth
>> usage.
>>
>> I'm running a bandwidth benchmark that reports appr. 7 GB/s bidirectional
>> bandwidth on our QDR infiniband network:
>>
>> Benchmark exchange(MPI_Sendrecv)
>> ================================
>> lenght iterations elapsed time transfer rate latency
>> (bytes) (count) (seconds) (Mbytes/s) (usec)
>> --------------------------------------------------------------------------
>> 12582912 8578 30.626 7048.6 1785.2
>>
>>
>> Running collectl interactively shows approximately the same
>>
>> [root@c10-13 etc]# collectl -s x
>> Couldn't find 'ofed_info'. Won't be able to determine OFED version
>> waiting for 1 second sample...
>> #<-----------InfiniBand----------->
>> # KBIn PktIn KBOut PktOut Errs
>> 3472553 1717K 3472483 1717K 0
>> 3472962 1717K 3472977 1717K 0
>> 3472570 1717K 3472629 1717K 0
>> 3470588 1716K 3470598 1716K 0
>> 3472094 1717K 3472105 1717K 0
>> 3471221 1716K 3471156 1716K 0
>> 3472378 1717K 3472409 1717K 0
>>
>> But if I run it as a daemon, with this addition to DaemonCommands in
>> collectl.conf, -P --export lexpr,f=/tmp/L, (*) it only reports half the
>> bandwidth usage
>>
>> [root@c10-13 etc]# grep iconnect /tmp/L
>> iconnect.kbin 1677721
>> iconnect.pktin 1722455
>> iconnect.kbout 1677721
>> iconnect.pktout 1722455
>>
>>
>> Is this a bug? Any workarounds?
>> The test was done with collectl 3.6.9.
>>
>>
>> * I use this to report infiniband traffic in ganglia,
>>
>> https://wiki.rocksclusters.org/wiki/index.php/Roy_Dragseth#Integrating_collectl_and_ganglia
>>
>>
>>
>>
>> --
>>
>> The Computer Center, University of Tromsø, N-9037 TROMSØ Norway.
>> phone:+47 77 64 41 07, fax:+47 77 64 41 00
>> Roy Dragseth, Team Leader, High Performance Computing
>> Direct call: +47 77 64 62 56. email: [email protected]
>>
>>
>>
>> ------------------------------------------------------------------------------
>> Shape the Mobile Experience: Free Subscription
>> Software experts and developers: Be at the forefront of tech innovation.
>> Intel(R) Software Adrenaline delivers strategic insight and game-changing
>> conversations that shape the rapidly evolving mobile landscape. Sign up
>> now.
>>
>> http://pubads.g.doubleclick.net/gampad/clk?id=63431311&iu=/4140/ostg.clktrk
>> _______________________________________________
>> Collectl-interest mailing list
>> [email protected]
>> https://lists.sourceforge.net/lists/listinfo/collectl-interest
>>
>
>
>
> ------------------------------------------------------------------------------
> Shape the Mobile Experience: Free Subscription
> Software experts and developers: Be at the forefront of tech innovation.
> Intel(R) Software Adrenaline delivers strategic insight and game-changing
> conversations that shape the rapidly evolving mobile landscape. Sign up
> now.
> http://pubads.g.doubleclick.net/gampad/clk?id=63431311&iu=/4140/ostg.clktrk
> _______________________________________________
> Collectl-interest mailing list
> [email protected]
> https://lists.sourceforge.net/lists/listinfo/collectl-interest
>
>
------------------------------------------------------------------------------
Shape the Mobile Experience: Free Subscription
Software experts and developers: Be at the forefront of tech innovation.
Intel(R) Software Adrenaline delivers strategic insight and game-changing
conversations that shape the rapidly evolving mobile landscape. Sign up now.
http://pubads.g.doubleclick.net/gampad/clk?id=63431311&iu=/4140/ostg.clktrk
_______________________________________________
Collectl-interest mailing list
[email protected]
https://lists.sourceforge.net/lists/listinfo/collectl-interest