Rushabh Shah created HBASE-28437:
------------------------------------

             Summary: Region Server crash in our production environment.
                 Key: HBASE-28437
                 URL: https://issues.apache.org/jira/browse/HBASE-28437
             Project: HBase
          Issue Type: Bug
            Reporter: Rushabh Shah


Recently we have been seeing a lot of RegionServer (RS) crashes in our production
environment, each producing a core dump file and an hs_err_pid.log file.
HBase:  hbase-2.5
Java: openjdk 1.8

The contents of hs_err_pid.log are copied below:
{noformat}
#
# A fatal error has been detected by the Java Runtime Environment:
#
#  SIGSEGV (0xb) at pc=0x00007f9fb1415ba2, pid=50172, tid=0x00007f92a97ec700
#
# JRE version: OpenJDK Runtime Environment (Zulu 8.76.0.18-SA-linux64) (8.0_402-b06) (build 1.8.0_402-b06)
# Java VM: OpenJDK 64-Bit Server VM (25.402-b06 mixed mode linux-amd64 )
# Problematic frame:
# J 19801 C2 org.apache.hadoop.hbase.util.ByteBufferUtils.copyBufferToStream(Ljava/io/OutputStream;Ljava/nio/ByteBuffer;II)V (75 bytes) @ 0x00007f9fb1415ba2 [0x00007f9fb14159a0+0x202]
#
# Core dump written. Default location: /home/sfdc/core or core.50172
#
# If you would like to submit a bug report, please visit:
#   http://www.azul.com/support/
#

---------------  T H R E A D  ---------------

Current thread (0x00007f9fa2d13000):  JavaThread "RS-EventLoopGroup-1-92" daemon [_thread_in_Java, id=54547, stack(0x00007f92a96ec000,0x00007f92a97ed000)]

siginfo: si_signo: 11 (SIGSEGV), si_code: 1 (SEGV_MAPERR), si_addr: 0x0000559869daf000

Registers:
RAX=0x00007f9dbd8b6460, RBX=0x0000000000080000, RCX=0x000000000005c86b, RDX=0x00007f9dbd8b6460
RSP=0x00007f92a97eaf20, RBP=0x0000000000000002, RSI=0x00007f92d225e970, RDI=0x0000000000000069
R8 =0x000055986975f028, R9 =0x000000000064ffd8, R10=0x000000000000005f, R11=0x00007f94a778b290
R12=0x00007f9e62855ae8, R13=0x0000000000000000, R14=0x00007f9e5a14b1e0, R15=0x00007f9fa2d13000
RIP=0x00007f9fb1415ba2, EFLAGS=0x0000000000010216, CSGSFS=0x0000000000000033, ERR=0x0000000000000004
  TRAPNO=0x000000000000000e

Top of Stack: (sp=0x00007f92a97eaf20)
0x00007f92a97eaf20:   000000690064ff79 00007f9dbd8b6460
0x00007f92a97eaf30:   00007f9dbd8b6460 0000005700000003
0x00007f92a97eaf40:   00007f94a778b290 0004000100040000
0x00007f92a97eaf50:   00040000d090c130 00007f9db5522220
0x00007f92a97eaf60:   0008000000040001 00007f92a97eaf90
0x00007f92a97eaf70:   00007f92d0908648 0000000000000001
0x00007f92a97eaf80:   0000000000000001 000000000000005c
0x00007f92a97eaf90:   00007f94ee8078d0 0000000000000206
0x00007f92a97eafa0:   00007f9db5545a00 00007f9fafb63670
0x00007f92a97eafb0:   00007f9e5a13ed70 0000006900000001
0x00007f92a97eafc0:   00007f93ab8965b8 00007f93b9959210
0x00007f92a97eafd0:   00007f9db5545a00 00007f9fb04b3e30
0x00007f92a97eafe0:   00007f9e5a13ed70 00007f9300000001
0x00007f92a97eaff0:   00007f93ab8965b8 00007f93a8ae3920
0x00007f92a97eb000:   00007f93b9959210 00007f94a778b290
0x00007f92a97eb010:   00007f9b60707c20 00007f93a8938c28
0x00007f92a97eb020:   00007f94ee8078d0 00007f9b60708608
0x00007f92a97eb030:   00007f9b60707bc0 00007f9b60707c20
0x00007f92a97eb040:   0000000000000069 00007f93ab8965b8
0x00007f92a97eb050:   00007f94a778b290 00007f94a778b290
0x00007f92a97eb060:   0005c80d0005c80c 00000000a828a590
0x00007f92a97eb070:   00007f9e5a13ed70 000000010000270e
0x00007f92a97eb080:   00007f9db5545790 0144002200000000
0x00007f92a97eb090:   00007f95ddc800c0 00007f93ab89a6c8
0x00007f92a97eb0a0:   00007f93ae65c270 00007f9fb24af990
0x00007f92a97eb0b0:   00007f93ae65c290 00007f93ae65c270
0x00007f92a97eb0c0:   00007f9e5a13ed70 00007f92ca328528
0x00007f92a97eb0d0:   00007f9e5a13ed98 00007f9e5e1e88b0
0x00007f92a97eb0e0:   00007f92ca32d870 00007f9e5a13ed98
0x00007f92a97eb0f0:   00007f9e5e1e88b0 00007f93b9956288
0x00007f92a97eb100:   00007f9e5a13ed70 00007f9fb23c3aac
0x00007f92a97eb110:   00007f9317c9c8d0 00007f9b60708608 

Instructions: (pc=0x00007f9fb1415ba2)
0x00007f9fb1415b82:   44 3b d7 0f 8d 6d fe ff ff 4c 8b 40 10 45 8b ca
0x00007f9fb1415b92:   44 03 0c 24 c4 c1 f9 7e c3 4d 8b 5b 18 4d 63 c9
0x00007f9fb1415ba2:   47 0f be 04 08 4d 85 db 0f 84 49 03 00 00 4d 8b
0x00007f9fb1415bb2:   4b 08 48 b9 10 1c be 10 93 7f 00 00 4c 3b c9 0f 

Register to memory mapping:

RAX=0x00007f9dbd8b6460 is an oop
java.nio.DirectByteBuffer 
 - klass: 'java/nio/DirectByteBuffer'
RBX=0x0000000000080000 is an unknown value
RCX=0x000000000005c86b is an unknown value
RDX=0x00007f9dbd8b6460 is an oop
java.nio.DirectByteBuffer 
 - klass: 'java/nio/DirectByteBuffer'
RSP=0x00007f92a97eaf20 is pointing into the stack for thread: 0x00007f9fa2d13000
RBP=0x0000000000000002 is an unknown value
RSI=0x00007f92d225e970 is pointing into metadata
RDI=0x0000000000000069 is an unknown value
R8 =0x000055986975f028 is an unknown value
R9 =0x000000000064ffd8 is an unknown value
R10=0x000000000000005f is an unknown value
R11=0x00007f94a778b290 is an oop
org.apache.hbase.thirdparty.io.netty.buffer.PooledUnsafeDirectByteBuf 
 - klass: 'org/apache/hbase/thirdparty/io/netty/buffer/PooledUnsafeDirectByteBuf'
R12=0x00007f9e62855ae8 is an oop
java.lang.StringBuilder 
 - klass: 'java/lang/StringBuilder'
R13=0x0000000000000000 is an unknown value
R14=0x00007f9e5a14b1e0 is an oop
java.util.ArrayList$Itr 
 - klass: 'java/util/ArrayList$Itr'
R15=0x00007f9fa2d13000 is a thread


Stack: [0x00007f92a96ec000,0x00007f92a97ed000],  sp=0x00007f92a97eaf20,  free space=1019k
Native frames: (J=compiled Java code, j=interpreted, Vv=VM code, C=native code)
J 19801 C2 org.apache.hadoop.hbase.util.ByteBufferUtils.copyBufferToStream(Ljava/io/OutputStream;Ljava/nio/ByteBuffer;II)V (75 bytes) @ 0x00007f9fb1415ba2 [0x00007f9fb14159a0+0x202]
J 38915 C2 org.apache.hadoop.hbase.ByteBufferKeyValue.write(Ljava/io/OutputStream;Z)I (21 bytes) @ 0x00007f9fafb63670 [0x00007f9fafb63620+0x50]
J 37217 C2 org.apache.hadoop.hbase.KeyValueUtil.oswrite(Lorg/apache/hadoop/hbase/Cell;Ljava/io/OutputStream;Z)I (256 bytes) @ 0x00007f9fb04b3e30 [0x00007f9fb04b3de0+0x50]
J 37223 C2 org.apache.hadoop.hbase.ipc.CellBlockBuilder.encodeCellsTo(Ljava/io/OutputStream;Lorg/apache/hadoop/hbase/CellScanner;Lorg/apache/hadoop/hbase/codec/Codec;Lorg/apache/hadoop/io/compress/CompressionCodec;)V (138 bytes) @ 0x00007f9fb24af990 [0x00007f9fb24af140+0x850]
J 28122 C2 org.apache.hadoop.hbase.ipc.NettyRpcDuplexHandler.writeRequest(Lorg/apache/hbase/thirdparty/io/netty/channel/ChannelHandlerContext;Lorg/apache/hadoop/hbase/ipc/Call;Lorg/apache/hbase/thirdparty/io/netty/channel/ChannelPromise;)V (376 bytes) @ 0x00007f9fb2a4bac4 [0x00007f9fb2a4b600+0x4c4]
J 28121 C2 org.apache.hadoop.hbase.ipc.NettyRpcDuplexHandler.write(Lorg/apache/hbase/thirdparty/io/netty/channel/ChannelHandlerContext;Ljava/lang/Object;Lorg/apache/hbase/thirdparty/io/netty/channel/ChannelPromise;)V (144 bytes) @ 0x00007f9fb15531d4 [0x00007f9fb1553140+0x94]
J 41543 C2 org.apache.hbase.thirdparty.io.netty.channel.AbstractChannelHandlerContext.write(Ljava/lang/Object;ZLorg/apache/hbase/thirdparty/io/netty/channel/ChannelPromise;)V (149 bytes) @ 0x00007f9fb57da15c [0x00007f9fb57d96c0+0xa9c]
J 39549 C2 org.apache.hadoop.hbase.ipc.HBaseRpcControllerImpl.notifyOnCancel(Lorg/apache/hbase/thirdparty/com/google/protobuf/RpcCallback;Lorg/apache/hadoop/hbase/ipc/HBaseRpcController$CancellationCallback;)V (36 bytes) @ 0x00007f9fb42e2c44 [0x00007f9fb42e2520+0x724]
J 39675 C2 org.apache.hadoop.hbase.ipc.NettyRpcConnection$$Lambda$135.run()V (16 bytes) @ 0x00007f9faeb56e3c [0x00007f9faeb56ac0+0x37c]
J 24927 C2 org.apache.hbase.thirdparty.io.netty.util.concurrent.SingleThreadEventExecutor.runAllTasks(J)Z (109 bytes) @ 0x00007f9fb260a408 [0x00007f9fb2609fc0+0x448]
J 25959% C2 org.apache.hbase.thirdparty.io.netty.channel.epoll.EpollEventLoop.run()V (603 bytes) @ 0x00007f9faf772248 [0x00007f9faf772000+0x248]
j  org.apache.hbase.thirdparty.io.netty.util.concurrent.SingleThreadEventExecutor$4.run()V+44
j  org.apache.hbase.thirdparty.io.netty.util.internal.ThreadExecutorMap$2.run()V+11
j  org.apache.hbase.thirdparty.io.netty.util.concurrent.FastThreadLocalRunnable.run()V+4
J 8122 C1 java.lang.Thread.run()V (17 bytes) @ 0x00007f9faf7920c4 [0x00007f9faf791f80+0x144]
v  ~StubRoutines::call_stub
V  [libjvm.so+0x6aa9d6]  JavaCalls::call_helper(JavaValue*, methodHandle*, JavaCallArguments*, Thread*)+0xea6
V  [libjvm.so+0x6a7f73]  JavaCalls::call_virtual(JavaValue*, KlassHandle, Symbol*, Symbol*, JavaCallArguments*, Thread*)+0x263
V  [libjvm.so+0x6a8577]  JavaCalls::call_virtual(JavaValue*, Handle, KlassHandle, Symbol*, Symbol*, Thread*)+0x57
V  [libjvm.so+0x749f6c]  thread_entry(JavaThread*, Thread*)+0x6c
V  [libjvm.so+0xae8613]  JavaThread::thread_main_inner()+0x1f3
V  [libjvm.so+0xae89ad]  JavaThread::run()+0x36d
V  [libjvm.so+0x971532]  java_start(Thread*)+0x102
C  [libpthread.so.0+0x7ea5]  start_thread+0xc5

{noformat}




--
This message was sent by Atlassian Jira
(v8.20.10#820010)
