Hi

I'm having an issue with ganglia-3.0.1. The build appeared to succeed,
but gmetad segfaults when I run it while gmond is running.

I have tried on Solaris 8 and RedHat 3, with the same result on both OSes.

End of the truss output (Solaris):

19315:  brk(0x0009A690)                                 = 0
hash_create size = 19315:       write(2, " h a s h _ c r e a t e  ".., 19)      = 19
5019315:        write(2, " 5 0", 2)                             = 2

19315:  write(2, "\n", 1)                               = 1
hash->size is 19315:    write(2, " h a s h - > s i z e   i".., 14)      = 14
5319315:        write(2, " 5 3", 2)                             = 2

19315:  write(2, "\n", 1)                               = 1
19315:  brk(0x0009A690)                                 = 0
19315:  brk(0x0009C690)                                 = 0
hash_create size = 19315:       write(2, " h a s h _ c r e a t e  ".., 19)      = 19
5019315:        write(2, " 5 0", 2)                             = 2

19315:  write(2, "\n", 1)                               = 1
hash->size is 19315:    write(2, " h a s h - > s i z e   i".., 14)      = 14
5319315:        write(2, " 5 3", 2)                             = 2

19315:  write(2, "\n", 1)                               = 1
Updating host 19315:    write(2, " U p d a t i n g   h o s".., 14)      = 14
andy19315:      write(2, " a n d y", 4)                         = 4
, metric 19315: write(2, " ,   m e t r i c  ", 9)               = 9
cpu_num19315:   write(2, " c p u _ n u m", 7)                   = 7

19315:  write(2, "\n", 1)                               = 1
19315:  mkdir("/var/lib/ganglia/rrds/unspecified", 0755) Err#17 EEXIST
19315:  mkdir("/var/lib/ganglia/rrds/unspecified/andy", 0755) Err#17 EEXIST
19315:  stat("/var/lib/ganglia/rrds/unspecified/andy/cpu_num.rrd", 0xEBC05258) Err#2 ENOENT
19315:  time()                                          = 1115224171
19315:  time()                                          = 1115224171
19315:  open("/usr/share/lib/zoneinfo/GB", O_RDONLY)    = 9
19315:  read(9, " T Z i f\0\0\0\0\0\0\0\0".., 8192)     = 1323
19315:  close(9)                                        = 0
19315:  brk(0x0009C690)                                 = 0
19315:  brk(0x0009E690)                                 = 0
19315:      Incurred fault #6, FLTBOUNDS  %pc = 0xFF0CFA14
19315:        siginfo: SIGSEGV SEGV_ACCERR addr=0x00060083
19315:      Received signal #11, SIGSEGV [default]
19315:        siginfo: SIGSEGV SEGV_ACCERR addr=0x00060083
19315:          *** process killed ***


The same run under gdb:

(gdb) run
Starting program: /usr/sbin/gmetad 
[New LWP 1]
[New LWP 2]
[New LWP 3]
[New LWP 4]
Going to run as user nobody
Sources are ...
Source: [my cluster, step 15] has 1 sources
        127.0.0.1
xml listening on port 8651
interactive xml listening on port 8652
cleanup thread has been started
[New LWP 5]
[New LWP 6]
[New LWP 7]
[New LWP 8]
Data thread 19 is monitoring [my cluster] data source
        127.0.0.1
[New LWP 9]
[my cluster] is an OLD version
hash_create size = 1024
hash->size is 1031
hash_create size = 50
hash->size is 53
hash_create size = 50
hash->size is 53
Updating host andy, metric cpu_num

Program received signal SIGSEGV, Segmentation fault.
[Switching to LWP 7]
0xff0cfa14 in strtok_r () from /usr/lib/libc.so.1
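
For context on the crash site: strtok_r() writes NUL bytes into the string it scans, and SEGV_ACCERR means the faulting address is mapped but the access was not permitted. One possible cause (an assumption on my part, I have not confirmed it in the gmetad source) is that a read-only or otherwise invalid string is being handed to strtok_r(). The sketch below is not ganglia code; count_tokens is a hypothetical helper that only shows the usual safe pattern of tokenizing a writable copy.

```c
#define _POSIX_C_SOURCE 200809L  /* expose strtok_r() under strict C modes */
#include <assert.h>
#include <stdlib.h>
#include <string.h>

/* Hypothetical helper, NOT part of ganglia: counts the tokens in `input`
 * separated by any character in `delims`.
 *
 * strtok_r() overwrites delimiters with NUL bytes in the buffer it scans,
 * so we tokenize a private heap copy instead of the caller's string, which
 * may live in read-only memory (for example a string literal).
 * Returns -1 if the copy cannot be allocated. */
int count_tokens(const char *input, const char *delims)
{
    assert(input != NULL && delims != NULL);

    size_t len = strlen(input) + 1;      /* include the terminating NUL */
    char *copy = malloc(len);
    if (copy == NULL)
        return -1;
    memcpy(copy, input, len);

    int count = 0;
    char *saveptr = NULL;                /* per-call state, so thread-safe */
    for (char *tok = strtok_r(copy, delims, &saveptr);
         tok != NULL;
         tok = strtok_r(NULL, delims, &saveptr))
        count++;

    free(copy);
    return count;
}
```

Calling count_tokens("unspecified/andy/cpu_num.rrd", "/") walks the three path components without touching the original string. Calling strtok_r() directly on a string literal is undefined behaviour and typically faults inside strtok_r(), as in the trace above.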

Does anybody have an idea what I have done wrong?

Cheers

Andy
