Re: Problem after upgrade to 1.0.1

Jonathan Ellis Thu, 03 Nov 2011 17:06:31 -0700

I found the problem and posted a patch on
https://issues.apache.org/jira/browse/CASSANDRA-3451.  If you build
with that patch and rerun scrub the exception should go away.


On Thu, Nov 3, 2011 at 2:08 PM, Bryce Godfrey <bryce.godf...@azaleos.com> wrote:
> A restart fixed the load numbers, they are back to where I expect them to be 
> now, but disk utilization is double the load #.  I'm also still get the 
> cfstats exception from any node.
>
> -----Original Message-----
> From: Jonathan Ellis [mailto:jbel...@gmail.com]
> Sent: Thursday, November 03, 2011 11:52 AM
> To: user@cassandra.apache.org
> Subject: Re: Problem after upgrade to 1.0.1
>
> Does restarting the node fix this?
>
> On Thu, Nov 3, 2011 at 1:51 PM, Bryce Godfrey <bryce.godf...@azaleos.com> 
> wrote:
>> Disk utilization is actually about 80% higher than what is reported
>> for nodetool ring across all my nodes on the data drive
>>
>>
>>
>> Bryce Godfrey | Sr. Software Engineer | Azaleos Corporation | T:
>> 206.926.1978 | M: 206.849.2477
>>
>>
>>
>> From: Dan Hendry [mailto:dan.hendry.j...@gmail.com]
>> Sent: Thursday, November 03, 2011 11:47 AM
>> To: user@cassandra.apache.org
>> Subject: RE: Problem after upgrade to 1.0.1
>>
>>
>>
>> Regarding load growth, presumably you are referring to the load as
>> reported by JMX/nodetool. Have you actually looked at the disk
>> utilization on the nodes themselves? Potential issue I have seen:
>> http://www.mail-archive.com/user@cassandra.apache.org/msg18142.html
>>
>>
>>
>> Dan
>>
>>
>>
>> From: Bryce Godfrey [mailto:bryce.godf...@azaleos.com]
>> Sent: November-03-11 14:40
>> To: user@cassandra.apache.org
>> Subject: Problem after upgrade to 1.0.1
>>
>>
>>
>> I recently upgraded from 0.8.6 to 1.0.1 and everything seemed to go
>> just fine with the rolling upgrade.  But now I'm having extreme load
>> growth on one of my nodes (and others are growing faster than usual
>> also).  I attempted to run a cfstats against the extremely large node
>> that was seeing 2x the load of others and I get this error below.  I'm
>> also went into the o.a.c.db.HintedHandoffManager mbean and attempted
>> to list pending hints to see if it was growing out of control for some
>> reason, but that just times out eventually for any node.  I'm not sure what 
>> to do next with this issue.
>>
>>
>>
>>                Column Family: HintsColumnFamily
>>
>>                 SSTable count: 3
>>
>>                 Space used (live): 12681676437
>>
>>                 Space used (total): 10233130272
>>
>>                 Number of Keys (estimate): 384
>>
>>                 Memtable Columns Count: 117704
>>
>>                 Memtable Data Size: 115107307
>>
>>                 Memtable Switch Count: 66
>>
>>                 Read Count: 0
>>
>>                 Read Latency: NaN ms.
>>
>>                 Write Count: 21203290
>>
>>                 Write Latency: 0.014 ms.
>>
>>                 Pending Tasks: 0
>>
>>                 Key cache capacity: 3
>>
>>                 Key cache size: 0
>>
>>                 Key cache hit rate: NaN
>>
>>                 Row cache: disabled
>>
>>                 Compacted row minimum size: 30130993
>>
>>                 Compacted row maximum size: 9223372036854775807
>>
>> Exception in thread "main" java.lang.IllegalStateException: Unable to
>> compute ceiling for max when histogram overflowed
>>
>>         at
>> org.apache.cassandra.utils.EstimatedHistogram.mean(EstimatedHistogram.
>> java:170)
>>
>>         at
>> org.apache.cassandra.db.DataTracker.getMeanRowSize(DataTracker.java:39
>> 5)
>>
>>         at
>> org.apache.cassandra.db.ColumnFamilyStore.getMeanRowSize(ColumnFamilyS
>> tore.java:293)
>>
>>         at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
>>
>>         at
>> sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.j
>> ava:39)
>>
>>         at
>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccess
>> orImpl.java:25)
>>
>>         at java.lang.reflect.Method.invoke(Method.java:597)
>>
>>         at
>> com.sun.jmx.mbeanserver.StandardMBeanIntrospector.invokeM2(StandardMBe
>> anIntrospector.java:93)
>>
>>         at
>> com.sun.jmx.mbeanserver.StandardMBeanIntrospector.invokeM2(StandardMBe
>> anIntrospector.java:27)
>>
>>         at
>> com.sun.jmx.mbeanserver.MBeanIntrospector.invokeM(MBeanIntrospector.ja
>> va:208)
>>
>>         at
>> com.sun.jmx.mbeanserver.PerInterface.getAttribute(PerInterface.java:65
>> )
>>
>>         at
>> com.sun.jmx.mbeanserver.MBeanSupport.getAttribute(MBeanSupport.java:21
>> 6)
>>
>>         at
>> com.sun.jmx.interceptor.DefaultMBeanServerInterceptor.getAttribute(Def
>> aultMBeanServerInterceptor.java:666)
>>
>>         at
>> com.sun.jmx.mbeanserver.JmxMBeanServer.getAttribute(JmxMBeanServer.jav
>> a:638)
>>
>>         at
>> javax.management.remote.rmi.RMIConnectionImpl.doOperation(RMIConnectio
>> nImpl.java:1404)
>>
>>         at
>> javax.management.remote.rmi.RMIConnectionImpl.access$200(RMIConnection
>> Impl.java:72)
>>
>>         at
>> javax.management.remote.rmi.RMIConnectionImpl$PrivilegedOperation.run(
>> RMIConnectionImpl.java:1265)
>>
>>         at
>> javax.management.remote.rmi.RMIConnectionImpl.doPrivilegedOperation(RM
>> IConnectionImpl.java:1360)
>>
>>         at
>> javax.management.remote.rmi.RMIConnectionImpl.getAttribute(RMIConnecti
>> onImpl.java:600)
>>
>>         at sun.reflect.GeneratedMethodAccessor15.invoke(Unknown
>> Source)
>>
>>         at
>> sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccess
>> orImpl.java:25)
>>
>>         at java.lang.reflect.Method.invoke(Method.java:597)
>>
>>         at
>> sun.rmi.server.UnicastServerRef.dispatch(UnicastServerRef.java:305)
>>
>>         at sun.rmi.transport.Transport$1.run(Transport.java:159)
>>
>>         at java.security.AccessController.doPrivileged(Native Method)
>>
>>         at sun.rmi.transport.Transport.serviceCall(Transport.java:155)
>>
>>         at
>> sun.rmi.transport.tcp.TCPTransport.handleMessages(TCPTransport.java:53
>> 5)
>>
>>         at
>> sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run0(TCPTransport
>> .java:790)
>>
>>         at
>> sun.rmi.transport.tcp.TCPTransport$ConnectionHandler.run(TCPTransport.
>> java:649)
>>
>>         at
>> java.util.concurrent.ThreadPoolExecutor$Worker.runTask(ThreadPoolExecu
>> tor.java:886)
>>
>>         at
>> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.
>> java:908)
>>
>>         at java.lang.Thread.run(Thread.java:662)
>>
>>
>>
>> Bryce Godfrey | Sr. Software Engineer | Azaleos Corporation | T:
>> 206.926.1978 | M: 206.849.2477
>>
>>
>>
>> No virus found in this incoming message.
>> Checked by AVG - www.avg.com
>> Version: 9.0.920 / Virus Database: 271.1.1/3993 - Release Date:
>> 11/03/11
>> 03:39:00
>
>
>
> --
> Jonathan Ellis
> Project Chair, Apache Cassandra
> co-founder of DataStax, the source for professional Cassandra support 
> http://www.datastax.com
>



-- 
Jonathan Ellis
Project Chair, Apache Cassandra
co-founder of DataStax, the source for professional Cassandra support
http://www.datastax.com

Re: Problem after upgrade to 1.0.1

Reply via email to