[ 
https://issues.apache.org/jira/browse/HIVE-26806?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17643213#comment-17643213
 ] 

Stamatis Zampetakis commented on HIVE-26806:
--------------------------------------------

The recent builds on master (1513, 1514) are now back to normal and each split 
takes at most ~2h.

[~asolimando] [~ayushtkn] I am planning to revert the timeout back to 6h by 
committing directly to master in a few hours. Please speak up if there is any 
reason not to do this.

[~akshatm] The Jenkins plugin that is used to split the test into buckets uses 
the last successful build of the job as a guide. Each PR corresponds to a 
separate Jenkins Job 
(http://ci.hive.apache.org/job/hive-precommit/view/change-requests/). The last 
successful build for your PR is 
http://ci.hive.apache.org/job/hive-precommit/job/PR-3803/8/ so this is what 
will be used to split the tests. This is not good because that successful run has 
3K fewer tests than what exists in master, so the splitting will be pretty bad. I 
see three ways to unblock the current situation and overcome the problem: 
# Close PR-3803 and open a new one.
# Manually delete every successful build for job PR-3803 and start a new one.
# Increase the timeout in the Jenkinsfile and try again.
None of these is perfect but I have higher hopes for 1 and 2.
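
To illustrate why a stale reference build hurts the splitting, here is a minimal sketch of a timing-based splitter. It is NOT the plugin's actual algorithm: the greedy strategy, the function names, the test names, and the choice to plan unknown tests at zero cost are all assumptions made for the example.

```python
import heapq


def split_into_buckets(known_durations, all_tests, n_buckets, default_duration=0.0):
    """Greedy split: longest planned test first, always into the least-loaded bucket.

    Planning only sees durations recorded in the reference build. Tests missing
    from that build are planned at default_duration (an assumption of this sketch).
    """
    planned = {t: known_durations.get(t, default_duration) for t in all_tests}
    heap = [(0.0, i) for i in range(n_buckets)]  # (planned load, bucket index)
    heapq.heapify(heap)
    buckets = [[] for _ in range(n_buckets)]
    for test in sorted(all_tests, key=lambda t: planned[t], reverse=True):
        load, i = heapq.heappop(heap)
        buckets[i].append(test)
        heapq.heappush(heap, (load + planned[test], i))
    return buckets


def actual_load(bucket, real_durations):
    """Real time a bucket takes, regardless of what the planner believed."""
    return sum(real_durations[t] for t in bucket)


# Hypothetical durations in minutes; "new1" and "new2" exist in master only.
real = {"a": 10, "b": 10, "c": 10, "d": 10, "new1": 30, "new2": 30}
stale_reference = {"a": 10, "b": 10, "c": 10, "d": 10}  # old successful PR build

stale_split = split_into_buckets(stale_reference, list(real), n_buckets=2)
fresh_split = split_into_buckets(real, list(real), n_buckets=2)

stale_worst = max(actual_load(b, real) for b in stale_split)
fresh_worst = max(actual_load(b, real) for b in fresh_split)
print("slowest split with stale reference:", stale_worst)
print("slowest split with fresh reference:", fresh_worst)
```

In this toy model the planner cannot see the cost of the tests missing from the reference build, so they can pile up in a single split, and that split is the one that hits the timeout. Options 1 and 2 above change which build is used as the reference; option 3 only gives the slow split more time.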

> Precommit tests in CI are timing out after HIVE-26796
> -----------------------------------------------------
>
>                 Key: HIVE-26806
>                 URL: https://issues.apache.org/jira/browse/HIVE-26806
>             Project: Hive
>          Issue Type: Bug
>          Components: Testing Infrastructure
>            Reporter: Stamatis Zampetakis
>            Assignee: Stamatis Zampetakis
>            Priority: Major
>
> http://ci.hive.apache.org/job/hive-precommit/job/master/1506/
> {noformat}
> Cancelling nested steps due to timeout
> 15:22:08  Sending interrupt signal to process
> 15:22:08  Killing processes
> 15:22:09  kill finished with exit code 0
> 15:22:19  Terminated
> 15:22:19  script returned exit code 143
> [Pipeline] }
> [Pipeline] // withEnv
> [Pipeline] }
> 15:22:19  Deleting 1 temporary files
> [Pipeline] // configFileProvider
> [Pipeline] }
> [Pipeline] // stage
> [Pipeline] stage
> [Pipeline] { (PostProcess)
> [Pipeline] sh
> [Pipeline] sh
> [Pipeline] sh
> [Pipeline] junit
> 15:22:25  Recording test results
> 15:22:32  [Checks API] No suitable checks publisher found.
> [Pipeline] }
> [Pipeline] // stage
> [Pipeline] }
> [Pipeline] // container
> [Pipeline] }
> [Pipeline] // node
> [Pipeline] }
> [Pipeline] // timeout
> [Pipeline] }
> [Pipeline] // podTemplate
> [Pipeline] }
> 15:22:32  Failed in branch split-01
> [Pipeline] // parallel
> [Pipeline] }
> [Pipeline] // stage
> [Pipeline] stage
> [Pipeline] { (Archive)
> [Pipeline] podTemplate
> [Pipeline] {
> [Pipeline] timeout
> 15:22:33  Timeout set to expire in 6 hr 0 min
> {noformat}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
