Hi I'm having an issue with ganglia-3.0.1. The build appeared to be successful, but when running gmetad (when gmond is running) it segfaults.
I have tried on Solaris 8 and RedHat 3 - same result on both OS's. end of truss: 19315: brk(0x0009A690) = 0 hash_create size = 19315: write(2, " h a s h _ c r e a t e ".., 19) = 19 5019315: write(2, " 5 0", 2) = 2 19315: write(2, "\n", 1) = 1 hash->size is 19315: write(2, " h a s h - > s i z e i".., 14) = 14 5319315: write(2, " 5 3", 2) = 2 19315: write(2, "\n", 1) = 1 19315: brk(0x0009A690) = 0 19315: brk(0x0009C690) = 0 hash_create size = 19315: write(2, " h a s h _ c r e a t e ".., 19) = 19 5019315: write(2, " 5 0", 2) = 2 19315: write(2, "\n", 1) = 1 hash->size is 19315: write(2, " h a s h - > s i z e i".., 14) = 14 5319315: write(2, " 5 3", 2) = 2 19315: write(2, "\n", 1) = 1 Updating host 19315: write(2, " U p d a t i n g h o s".., 14) = 14 andy19315: write(2, " a n d y", 4) = 4 , metric 19315: write(2, " , m e t r i c ", 9) = 9 cpu_num19315: write(2, " c p u _ n u m", 7) = 7 19315: write(2, "\n", 1) = 1 19315: mkdir("/var/lib/ganglia/rrds/unspecified", 0755) Err#17 EEXIST 19315: mkdir("/var/lib/ganglia/rrds/unspecified/andy", 0755) Err#17 EEXIST 19315: stat("/var/lib/ganglia/rrds/unspecified/andy/cpu_num.rrd", 0xEBC05258) Err#2 ENOENT 19315: time() = 1115224171 19315: time() = 1115224171 19315: open("/usr/share/lib/zoneinfo/GB", O_RDONLY) = 9 19315: read(9, " T Z i f\0\0\0\0\0\0\0\0".., 8192) = 1323 19315: close(9) = 0 19315: brk(0x0009C690) = 0 19315: brk(0x0009E690) = 0 19315: Incurred fault #6, FLTBOUNDS %pc = 0xFF0CFA14 19315: siginfo: SIGSEGV SEGV_ACCERR addr=0x00060083 19315: Received signal #11, SIGSEGV [default] 19315: siginfo: SIGSEGV SEGV_ACCERR addr=0x00060083 19315: *** process killed *** (gdb) run Starting program: /usr/sbin/gmetad [New LWP 1] [New LWP 2] [New LWP 3] [New LWP 4] Going to run as user nobody Sources are ... Source: [my cluster, step 15] has 1 sources 127.0.0.1 xml listening on port 8651 interactive xml listening on port 8652 cleanup thread has been started [New LWP 5] [New LWP 6] [New LWP 7] [New LWP 8] Data thread 19 is monitoring [my cluster] data source 127.0.0.1 [New LWP 9] [my cluster] is an OLD version hash_create size = 1024 hash->size is 1031 hash_create size = 50 hash->size is 53 hash_create size = 50 hash->size is 53 Updating host andy, metric cpu_num Program received signal SIGSEGV, Segmentation fault. [Switching to LWP 7] 0xff0cfa14 in strtok_r () from /usr/lib/libc.so.1 Does anybody have an idea what I have done wrong ? Cheers Andy ---------------------------------------------------------------------------------------- This is a PRIVATE message. If you are not the intended recipient, please delete without copying and kindly advise us by e-mail of the mistake in delivery. NOTE: Regardless of content, this e-mail shall not operate to bind CSC to any order or other contract unless pursuant to explicit written agreement or government initiative expressly permitting the use of e-mail for such purpose. ----------------------------------------------------------------------------------------