[ 
https://issues.apache.org/jira/browse/KUDU-3479?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17727790#comment-17727790
 ] 

ASF subversion and git services commented on KUDU-3479:
-------------------------------------------------------

Commit 8cb9a7d843fa1c5b97237de1cf201436d274fe53 in kudu's branch 
refs/heads/master from Marton Greber
[ https://gitbox.apache.org/repos/asf?p=kudu.git;h=8cb9a7d84 ]

KUDU-3479 Put some patches back into LLVM 11

Commit fb6dc60621aa49f214b06d70527860c6a26c0e15 added two patches for
LLVM to fix build issue on Ubuntu 21.10. However, in commit
543e128d473f8f7836e605bba8cd6512fa918550 when I upgraded LLVM from 9 to
11, I forgot to check whether the previously added patches were
contained in the newer LLVM version. As it turns out not. This commit
backports those two patches for LLVM 11 and adds them back into the
build.

This issue came up during RHEL9 build experiments.

Change-Id: Ic7f9fa27aa1cde5e6ee93a87e536e71b20a67785
Reviewed-on: http://gerrit.cloudera.org:8080/19933
Tested-by: Kudu Jenkins
Reviewed-by: Zoltan Chovan <zcho...@cloudera.com>
Reviewed-by: Alexey Serbin <ale...@apache.org>


> codegen is broken if compiled with GCC11 and newer
> --------------------------------------------------
>
>                 Key: KUDU-3479
>                 URL: https://issues.apache.org/jira/browse/KUDU-3479
>             Project: Kudu
>          Issue Type: Bug
>          Components: codegen
>    Affects Versions: 1.17.0
>            Reporter: Alexey Serbin
>            Assignee: Marton Greber
>            Priority: Critical
>
> When compiled with GCC11 and newer, Kudu masters and tablet servers are 
> crashing.  The stack trace of {{kudu-master}} looks like the following:
> {noformat}
> PC: @     0x7ff822b7cd1d __memmove_avx_unaligned_erms
> *** SIGSEGV (@0x8000) received by PID 189412 (TID 0x7ff81b4d5700) from PID 
> 32768; stack trace: ***
>     @           0xe26c81 google::(anonymous namespace)::FailureSignalHandler()
>     @     0x7ff8246298c0 (unknown)
>     @     0x7ff822b7cd1d __memmove_avx_unaligned_erms
>     @     0x7ff824ea20fc (unknown)
>     @          0x132ceeb kudu::tablet::(anonymous 
> namespace)::MRSRowProjectorImpl<>::ProjectRowForRead()
>     @          0x132c543 kudu::tablet::MemRowSet::Iterator::FetchRows()
>     @          0x132cb61 kudu::tablet::MemRowSet::Iterator::NextBlock()
>     @          0x2cddb7c kudu::PredicateEvaluatingIterator::NextBlock()
>     @          0x2cde488 kudu::UnionIterator::NextBlock()
>     @          0x12b5a29 kudu::tablet::Tablet::Iterator::NextBlock()
>     @           0xd6dfda kudu::master::SysCatalogTable::ProcessRows<>()
>     @           0xd66cae kudu::master::SysCatalogTable::VisitTables()
>     @           0xddeba8 
> kudu::master::MasterPathHandlers::HandleDumpEntities()
>     @          0x1275c2b kudu::Webserver::RunPathHandler()
>     @          0x12767b1 kudu::Webserver::BeginRequestCallback()
>     @          0x12b08fc handle_request
>     @          0x12b377c process_new_connection
>     @          0x12b3e80 worker_thread
>     @     0x7ff82461d6ea start_thread
>     @     0x7ff822b13a6f __GI___clone
> {noformat}
> The litmus test is to run {{codegen-test}} that is crashing with a similar 
> stack trace:
> {noformat}
> # ./bin/codegen-test 
> [==========] Running 12 tests from 1 test suite.
> [----------] Global test environment set-up.
> [----------] 12 tests from CodegenTest
> [ RUN      ] CodegenTest.ObservablesTest
> I0420 17:19:27.839332 175031 test_util.cc:255] Using random seed: -1104489386
> [       OK ] CodegenTest.ObservablesTest (217 ms)
> [ RUN      ] CodegenTest.TestEmpty
> I0420 17:19:28.048970 175031 test_util.cc:255] Using random seed: -1104279736
> [       OK ] CodegenTest.TestEmpty (138 ms)
> [ RUN      ] CodegenTest.TestKey
> I0420 17:19:28.186726 175031 test_util.cc:255] Using random seed: -1104141979
> [       OK ] CodegenTest.TestKey (125 ms)
> [ RUN      ] CodegenTest.TestInts
> I0420 17:19:28.312000 175031 test_util.cc:255] Using random seed: -1104016705
> [       OK ] CodegenTest.TestInts (144 ms)
> [ RUN      ] CodegenTest.TestStrings
> I0420 17:19:28.455729 175031 test_util.cc:255] Using random seed: -1103872977
> *** Aborted at 1682011168 (unix time) try "date -d @1682011168" if you are 
> using GNU date ***
> PC: @     0x7f3a14924508 __memmove_evex_unaligned_erms
> *** SIGSEGV (@0x0) received by PID 175031 (TID 0x7f3a1621bcc0) from PID 0; 
> stack trace: ***
>     @           0x7c9d92 google::(anonymous namespace)::FailureSignalHandler()
>     @     0x7f3a15c7a8c0 (unknown)
>     @     0x7f3a14924508 __memmove_evex_unaligned_erms
>     @     0x7f3a162c50d4 (unknown)
>     @           0x7b5a0f kudu::CodegenTest::ProjectTestRows<>()
>     @           0x7bc13c kudu::CodegenTest::TestProjection<>()
>     @           0x7ad8ee kudu::CodegenTest_TestStrings_Test::TestBody()
>     @           0x84e517 
> testing::internal::HandleExceptionsInMethodIfSupported<>()
>     @           0x842e86 testing::Test::Run()
>     @           0x842ff5 testing::TestInfo::Run()
>     @           0x8430e5 testing::TestSuite::Run()
>     @           0x84362e testing::internal::UnitTestImpl::RunAllTests()
>     @           0x84e9f7 
> testing::internal::HandleExceptionsInMethodIfSupported<>()
>     @           0x843837 testing::UnitTest::Run()
>     @           0x75876a main
>     @     0x7f3a147ca29d __libc_start_main
>     @           0x7aaf0a _start
> Segmentation fault (core dumped)
> {noformat}
> As a workaround, disable the codegen when running {{kudu-master}} and 
> {{kudu-tserver}} processes:
> {noformat}
> --mrs_use_codegen=false
> {noformat}
> Fixing this issue should unblock Kudu adoption on contemporary Linux 
> distributions where GCC11 or newer is a system compiler (RH/CentOS 9, Ubuntu 
> 22, etc.)



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Reply via email to