[ 
https://issues.apache.org/jira/browse/IMPALA-13165?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Fang-Yu Rao updated IMPALA-13165:
---------------------------------
    Attachment: generate_junitxml.finalize.minidumps.20240616_21_41_14.xml

> Impala daemon crashed with OMException in Ozone build
> -----------------------------------------------------
>
>                 Key: IMPALA-13165
>                 URL: https://issues.apache.org/jira/browse/IMPALA-13165
>             Project: IMPALA
>          Issue Type: Bug
>            Reporter: Fang-Yu Rao
>            Assignee: Yida Wu
>            Priority: Major
>              Labels: broken-build
>         Attachments: 
> generate_junitxml.finalize.minidumps.20240616_21_41_14.xml
>
>
> We found from an internal build that Impala daemon crashed with a lot of 
> OMException in an Ozone build.
> For instance, the backend test 
> [Multi8RandomSpillToRemoteMix()|https://github.com/apache/impala/blob/master/be/src/runtime/bufferpool/buffer-pool-test.cc#L2065C24-L2070]
>  failed with the following stack trace collected from the generated minidump.
> {code}
> Thread 502 (crashed)
>  0  libc.so.6 + 0x36387
>     rax = 0x0000000000000000   rdx = 0x0000000000000006
>     rcx = 0xffffffffffffffff   rbx = 0x000000000607d920
>     rsi = 0x0000000000000cfa   rdi = 0x00000000000028ec
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f0428
>      r8 = 0x0000000000000000    r9 = 0x00007fd6662f02e0
>     r10 = 0x0000000000000008   r11 = 0x0000000000000202
>     r12 = 0x000000000607d920   r13 = 0x000000000607d980
>     r14 = 0x0000000000000152   r15 = 0x0000000000000223
>     rip = 0x00007fd77dbd1387
>     Found by: given as instruction pointer in context
>  1  libc.so.6 + 0x37a78
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f0430
>     rip = 0x00007fd77dbd2a78
>     Found by: stack scanning
>  2  buffer-pool-test!google_breakpad::ExceptionHandler::HandleSignal(int, 
> siginfo_t*, void*) + 0x1a0
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f04b8
>     rip = 0x0000000003a29e40
>     Found by: stack scanning
>  3  buffer-pool-test!tcmalloc::ThreadCache::FetchFromCentralCache(unsigned 
> int, int, void* (*)(unsigned long)) + 0x68
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f04f0
>     rip = 0x0000000003b6f858
>     Found by: stack scanning
>  4  buffer-pool-test!tcmalloc::malloc_oom(unsigned long) + 0xc0
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f0500
>     rip = 0x0000000003d07f20
>     Found by: stack scanning
>  5  buffer-pool-test!google::(anonymous namespace)::FailureSignalHandler(int, 
> siginfo_t*, void*) [clone .part.0] + 0xad0
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f0558
>     rip = 0x00000000039faa00
>     Found by: stack scanning
>  6  buffer-pool-test!google::DumpStackTraceAndExit() [clone .cold] + 0x5
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f0560
>     rip = 0x0000000000f00e4f
>     Found by: stack scanning
>  7  libstdc++.so.6 + 0x13aa48
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f0570
>     rip = 0x00007fd78132ea48
>     Found by: stack scanning
>  8  libstdc++.so.6 + 0x13aa48
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f0580
>     rip = 0x00007fd78132ea48
>     Found by: stack scanning
>  9  libstdc++.so.6 + 0x11f8e2
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f05b0
>     rip = 0x00007fd7813138e2
>     Found by: stack scanning
> 10  
> buffer-pool-test!google::LogDestination::WaitForSinks(google::LogMessage::LogMessageData*)
>  + 0x110
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f05e0
>     rip = 0x00000000039f6460
>     Found by: stack scanning
> 11  buffer-pool-test!google::LogMessage::Fail() + 0xd 
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f0610
>     rip = 0x00000000039ef6bd
>     Found by: stack scanning
> 12  buffer-pool-test!google::LogMessage::SendToLog() + 0x244
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f0620
>     rip = 0x00000000039f15f4
>     Found by: stack scanning
> 13  libstdc++.so.6 + 0x12cae4
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f0640
>     rip = 0x00007fd781320ae4
>     Found by: stack scanning
> 14  buffer-pool-test!_fini + 0x19b3
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f0648
>     rip = 0x0000000003d0cb03
>     Found by: stack scanning
> 15  buffer-pool-test!_fini + 0xa7c14
>     rbp = 0x00007fd6662f06e0   rsp = 0x00007fd6662f0658
>     rip = 0x0000000003db2d64
>     Found by: stack scanning
> 16  buffer-pool-test!google::LogMessage::Flush() + 0x1ec
>     rsp = 0x00007fd6662f06f0   rip = 0x00000000039ef09c
>     Found by: stack scanning
> 17  libstdc++.so.6 + 0x12cae4
>     rsp = 0x00007fd6662f0730   rip = 0x00007fd781320ae4
>     Found by: stack scanning
> 18  buffer-pool-test!google::LogMessageFatal::~LogMessageFatal() + 0x9 
>     rsp = 0x00007fd6662f0790   rip = 0x00000000039f1b19
>     Found by: stack scanning
> 19  
> buffer-pool-test!impala::BufferPoolTest::TestRandomInternalImpl(impala::BufferPool*,
>  impala::TmpFileGroup*, impala::MemTracker*, 
> std::mersenne_twister_engine<unsigned long, 32ul, 624ul, 397ul, 31ul, 
> 2567483615ul, 11ul, 4294967295ul, 7ul, 2636928640ul, 15ul, 4022730752ul, 
> 18ul, 1812433253ul>*, int, bool) [buffer-pool.h : 338 + 0x8]
>     rsp = 0x00007fd6662f07a0   rip = 0x0000000000f8721f
>     Found by: stack scanning
> {code}
> During the crash we also saw quite a few OMException from the console output.
> {code}
> 08:46:11 
> hdfsOpenFile(ofs://localhost:9862/impala/tmp/impala-scratch/a44cc3c871369491_8dcaa671747530a3_0000000000000000_0000000000000000/impala-scratch-ae339172-59d6-41ef-9a6a-249c4d9ff537):
>  
> FileSystem#create((Lorg/apache/hadoop/fs/Path;ZISJ)Lorg/apache/hadoop/fs/FSDataOutputStream;)hdfsOpenFile(ofs://localhost:9862/impala/tmp/impala-scratch/a44cc3c871369491_8dcaa671747530a3_0000000000000000_0000000000000000/impala-scratch-b305d4f9-8e61-4b96-afd4-9940bd8f48b6):
>  
> FileSystem#create((Lorg/apache/hadoop/fs/Path;ZISJ)Lorg/apache/hadoop/fs/FSDataOutputStream;)
>  error:
> 08:46:11 
> hdfsOpenFile(ofs://localhost:9862/impala/tmp/impala-scratch/a44cc3c871369491_8dcaa671747530a3_0000000000000000_0000000000000000/impala-scratch-45a6a781-55f7-44aa-9a06-2ed6a6242e92):
>  
> FileSystem#create((Lorg/apache/hadoop/fs/Path;ZISJ)Lorg/apache/hadoop/fs/FSDataOutputStream;)hdfsOpenFile(ofs://localhost:9862/impala/tmp/impala-scratch/a44cc3c871369491_8dcaa671747530a3_0000000000000000_0000000000000000/impala-scratch-0f52e72b-2cc0-4697-a1aa-838891422844):
>  
> FileSystem#create((Lorg/apache/hadoop/fs/Path;ZISJ)Lorg/apache/hadoop/fs/FSDataOutputStream;)hdfsOpenFile(ofs://localhost:9862/impala/tmp/impala-scratch/a44cc3c871369491_8dcaa671747530a3_0000000000000000_0000000000000000/impala-scratch-4d43c8b9-e062-474a-945c-9e20e4d50998):
>  
> FileSystem#create((Lorg/apache/hadoop/fs/Path;ZISJ)Lorg/apache/hadoop/fs/FSDataOutputStream;)
>  error:
> 08:46:11 
> hdfsOpenFile(ofs://localhost:9862/impala/tmp/impala-scratch/a44cc3c871369491_8dcaa671747530a3_0000000000000000_0000000000000000/impala-scratch-40a6653e-3b88-4c62-8d4e-1f23e2e5eb9c):
>  
> FileSystem#create((Lorg/apache/hadoop/fs/Path;ZISJ)Lorg/apache/hadoop/fs/FSDataOutputStream;)
>  error:
> 08:46:11  error:
> 08:46:11  error:
> 08:46:11 OMException: Allocated 0 blocks. Requested 1 blocksINTERNAL_ERROR 
> org.apache.hadoop.ozone.om.exceptions.OMException: Allocated 0 blocks. 
> Requested 1 blocks
> 08:46:11      at 
> org.apache.hadoop.ozone.om.protocolPB.OzoneManagerProtocolClientSideTranslatorPB.handleError(OzoneManagerProtocolClientSideTranslatorPB.java:728)
> 08:46:11      at 
> org.apache.hadoop.ozone.om.protocolPB.OzoneManagerProtocolClientSideTranslatorPB.createFile(OzoneManagerProtocolClientSideTranslatorPB.java:2133)
> 08:46:11      at 
> org.apache.hadoop.ozone.client.rpc.RpcClient.createFile(RpcClient.java:2001)
> 08:46:11      at 
> org.apache.hadoop.ozone.client.OzoneBucket.createFile(OzoneBucket.java:822)
> 08:46:11      at 
> org.apache.hadoop.fs.ozone.BasicRootedOzoneClientAdapterImpl.createFile(BasicRootedOzoneClientAdapterImpl.java:389)
> 08:46:11      at 
> org.apache.hadoop.fs.ozone.BasicRootedOzoneFileSystem.createOutputStream(BasicRootedOzoneFileSystem.java:299)
> 08:46:11      at 
> org.apache.hadoop.fs.ozone.BasicRootedOzoneFileSystem.create(BasicRootedOzoneFileSystem.java:261)
> 08:46:11      at org.apache.hadoop.fs.FileSystem.create(FileSystem.java:1177)
> 08:46:11      at org.apache.hadoop.fs.FileSystem.create(FileSystem.java:1157)
> {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org

Reply via email to