Paul Rogers created IMPALA-8114:
-----------------------------------

             Summary: Build test failure in test_breakpad.py
                 Key: IMPALA-8114
                 URL: https://issues.apache.org/jira/browse/IMPALA-8114
             Project: IMPALA
          Issue Type: Bug
          Components: Infrastructure
    Affects Versions: Impala 3.1.0
            Reporter: Paul Rogers
            Assignee: Tim Armstrong


Recent builds have failed due to a failure in {{test_breakpad.py}}. Assigning 
to Tim as the person who most recently touched this file.

Test output:

{noformat}
09:04:35 ==================================== ERRORS 
====================================
09:04:35 ___ ERROR at teardown of 
TestBreakpadExhaustive.test_minidump_cleanup_thread ___
09:04:35 custom_cluster/test_breakpad.py:49: in teardown_method
09:04:35     self.kill_cluster(SIGKILL)
09:04:35 custom_cluster/test_breakpad.py:80: in kill_cluster
09:04:35     self.kill_processes(processes, signal)
09:04:35 custom_cluster/test_breakpad.py:85: in kill_processes
09:04:35     process.kill(signal)
09:04:35 common/impala_cluster.py:330: in kill
09:04:35     assert 0, "No processes %s found" % self.cmd
09:04:35 E   AssertionError: No processes 
['/data/jenkins/workspace/impala-cdh6.x-exhaustive-release/repos/Impala/be/build/latest/service/impalad',
 '-kudu_client_rpc_timeout_ms', '0', '-kudu_master_hosts', 'localhost', 
'--mem_limit=12884901888', '-logbufsecs=5', '-v=1', '-max_log_files=0', 
'-log_filename=impalad', 
'-log_dir=/data/jenkins/workspace/impala-cdh6.x-exhaustive-release/repos/Impala/logs/custom_cluster_tests',
 '-beeswax_port=21000', '-hs2_port=21050', '-be_port=22000', 
'-krpc_port=27000', '-state_store_subscriber_port=23000', 
'-webserver_port=25000', '-max_minidumps=2', '-logbufsecs=1', 
'-minidump_path=/tmp/tmpKaSw_w', '--default_query_options='] found
{noformat}

Distilled {{TEST-impala-custom-cluster.xml}} output:

{noformat}
-- 2019-01-23 08:00:43,585 INFO     MainThread: Found 3 impalad/1 statestored/1 
catalogd process(es)
…
-- 2019-01-23 08:00:43,667 INFO     MainThread: Killing: 
/data/jenkins/workspace/impala-cdh6.x-exhaustive-release/repos/Impala/be/build/latest/service/statestored
 -logbufsecs=5 -v=1 -max_log_files=0 -log_filename=statestored 
-log_dir=/data/jenkins/workspace/impala-cdh6.x-exhaustive-release/repos/Impala/logs/custom_cluster_tests
 -max_minidumps=2 -logbufsecs=1 -minidump_path=/tmp/tmpKaSw_w (PID: 16809) with 
signal 10
-- 2019-01-23 08:00:43,692 INFO     MainThread: Found 6 impalad/1 statestored/1 
catalogd process(es)
...
E   AssertionError: No processes 
['/data/jenkins/workspace/impala-cdh6.x-exhaustive-release/repos/Impala/be/build/latest/service/impalad
{noformat}

Notice that the main thread appaars to be killing statestore, but fails to kill 
impalad. Notice that a message appears that says that all impalads are running 
in the midst of the code that tries to shut down the cluster. Is this test 
multi-threaded? Is there more than one “main thread” Are these main threads 
working at cross purposes? What recent change may have caused this?

Also, looks like the script is sending signal 10 (SIGUSR1) while the statestore 
(in its log) says it got a SIGTERM (15):

{noformat}
I0123 08:00:44.086009 16868 thrift-client.cc:78] Couldn't open transport for 
impala-ec2-centoCaught signal: SIGTERM. Daemon will exit.
{noformat}

Not terribly familiar with this area of the product, so bumping it over to the 
BE team.



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-all-unsubscr...@impala.apache.org
For additional commands, e-mail: issues-all-h...@impala.apache.org

Reply via email to