Re: GNUnet build failure on mips64el

2014-02-22 Thread Sree Harsha Totakura
On 02/22/2014 08:44 AM, Mark H Weaver wrote:
 PASS: test_local_revocation.py
 Feb 22 07:17:50-660583 testbed-api-topology-16547 WARNING Error while 
 establishing a link: 0x4: Timeout while connecting to CORE of peer with id: 0 
 -- Retrying
 Feb 22 07:17:51-098510 testbed-api-topology-16547 WARNING Error while 
 establishing a link: 0x5: Timeout while connecting to CORE of peer with id: 1 
 -- Retrying
 Feb 22 07:17:56-127624 testbed-api-topology-16547 WARNING Error while 
 establishing a link: 0x7: Timeout while acquiring HELLO of peer 6ULB -- 
 Retrying
 Feb 22 07:17:56-134033 testbed-api-topology-16547 WARNING Error while 
 establishing a link: 0x6: Timeout while acquiring HELLO of peer 6DER -- 
 Retrying
 Feb 22 07:18:01-142761 testbed-api-topology-16547 WARNING Error while 
 establishing a link: 0x9: Timeout while acquiring HELLO of peer 6DER -- 
 Retrying
 Feb 22 07:18:01-143920 testbed-api-topology-16547 WARNING Error while 
 establishing a link: 0x8: Timeout while acquiring HELLO of peer 6ULB -- 
 Retrying
 --8---cut here---end---8---

These warnings are likely caused due to the system getting overloaded.
The testcase has a test ramp-up part which starts 2 peers and then
connects them.  This part is failing as peers are unable to connect.
 
 After printing the text above, it gets stuck in
 'src/revocation/.libs/test_revocation', with the two
 'gnunet-service-nse' subprocesses both running continuously and all
 other gnunet processes sleeping.

This is likely to cause the load on the system.  'gnunet-service-nse'
tries to find a proof-of-work to participate in the size estimation
application of GNUnet.  This proof-of-work involves find a value whose
hash matches its first n digits; this is required to avoid Sybil attacks
and can be computationally expensive and can be taxing on the loongson
you have.

I am not sure if the NSE service is needed for this testcase.  If not it
should be easy to fix.

In any case, the testcase should not run for more than ~1 minute.  I'd
like to debug this.  Would it be possible to provide me access to this
machine?

Sree



Re: GNUnet build failure on mips64el

2014-02-22 Thread Christian Grothoff
On 02/22/2014 09:46 AM, Sree Harsha Totakura wrote:
 This is likely to cause the load on the system.  'gnunet-service-nse'
 tries to find a proof-of-work to participate in the size estimation
 application of GNUnet.  This proof-of-work involves find a value whose
 hash matches its first n digits; this is required to avoid Sybil attacks
 and can be computationally expensive and can be taxing on the loongson
 you have.
 
 I am not sure if the NSE service is needed for this testcase.  If not it
 should be easy to fix.

I've disabled the POW calculation in SVN 32456, so the CPU load issue
should be fixed now.


0x48426C7E.asc
Description: application/pgp-keys


Re: GNUnet build failure on mips64el

2014-02-21 Thread Andreas Enge
On Fri, Feb 21, 2014 at 06:49:43PM +0100, Ludovic Courtès wrote:
  The same holds for Qt - its build times out after 2 hours, when it is simply
  not finished. Could we raise the timeout on hydra?
 IIUC the code, there’s not absolute timeout by default, only a
 timeout-on-silence.  Which one do we hit here?

Things seem to stop after exactly two hours, and the last line of the log
looks as if this happens in the middle of compilation.

  Another thing: We have four cores (I suppose actually two hyperthreaded
  ones) on hydra. On my machine, top shows four times 100%. On hydra, I get
  the impression that only one core is working. Is this normal (due to outside
  load, maybe), or do we need to tweak something?
 It’s supposed to use all the cores.  What makes you think only one core
 is used?

top does not reach the 4 times 100%, but is usually rather below 100%.

Andreas




Re: GNUnet build failure on mips64el

2014-02-21 Thread Mark H Weaver
Sree Harsha Totakura sreehar...@totakura.in writes:

 On 02/21/2014 07:22 AM, Mark H Weaver wrote:
 Any idea what's going on here?  In case it's helpful, I've also attached
 the entire build log.

 Can you go into the build directory
 '/tmp/nix-build-gnunet-0.10.0.drv-1/gnunet-0.10.0/src/revocation' and
 run 'make check' and see if you get the same output?

That failed immediately.  The reason is that apparently GNUnet's test
suite only works if GNUnet has already been installed.  In this case,
since the build failed, the guix-daemon deleted the /nix/store/*
directory where it had been installed :-(

I think it's going to hard to reproduce the build environment outside of
guix-daemon.  I think it would be better to try a modified recipe that
only runs a subset of the tests.

  Mark



Re: GNUnet build failure on mips64el

2014-02-21 Thread Sree Harsha Totakura
On 02/21/2014 04:31 PM, Ludovic Courtès wrote:
 On the GNUnet side, it would be great if the tests were less timing sensitive.

We faced these problems earlier when we added Sheevaplug and RasberryPi
into our buildbots.  Unfortunately, since we are having a network
application, the timeouts are necessary.

We can try increasing the timeouts but that makes the test runs long if
a testcase were to fail genuinely.  Guess we'll have to come up with
some automated measurement for timeouts depending on the host
configuration.  I just don't know how; any ideas?

Sree