after running my cluster for a while performance has become unacceptable, 200+
ms for reads. if running well, i see reads <10ms. when i run iostat the disk
is being hammered by reads. seems like i/o caching isn't even being used
avg-cpu: %user %nice %system %iowait %steal %idle
2.81 0.00 1.41 13.62 0.00 82.16
Device: rrqm/s wrqm/s r/s w/s rMB/s wMB/s avgrq-sz avgqu-sz
await svctm %util
cciss/c0d0p1
0.00 0.00 0.00 0.00 0.00 0.00 0.00 0.00
0.00 0.00 0.00
cciss/c0d1p1
0.00 0.00 848.50 0.00 13.66 0.00 32.98 21.50
25.23 1.18 100.05
i run top and i see cassandra's memory usage as follows:
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
31510 bburruss 19 0 359g 37g 27g S 48.8 80.1 2137:30 java
i set -Xmx10g so it isn't java using the memory. is it mmap i/o? what would
be causing the huge memory usage? it seems reasonable that the performance is
bad because the i/o cache can't be used properly.