[
https://issues.apache.org/jira/browse/SPARK-23775?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
]
Dongjoon Hyun updated SPARK-23775:
----------------------------------
Description:
DataFrameRangeSuite.test("Cancelling stage in a query with Range.") stays
sometimes in an infinite loop and times out the build.
I presume the original intention of this test is to start a job with range and
just cancel it.
The submitted job has 2 stages but I think the author tried to cancel the first
stage with ID 0 which is not the case here:
{code:java}
eventually(timeout(10.seconds), interval(1.millis)) {
assert(DataFrameRangeSuite.stageToKill > 0)
}
{code}
All in all if the first stage is slower than 10 seconds it throws
TestFailedDueToTimeoutException and cancelStage will be never ever called.
-
https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test%20(Dashboard)/job/spark-master-test-maven-hadoop-2.7/4780/
was:
DataFrameRangeSuite.test("Cancelling stage in a query with Range.") stays
sometimes in an infinite loop and times out the build.
I presume the original intention of this test is to start a job with range and
just cancel it.
The submitted job has 2 stages but I think the author tried to cancel the first
stage with ID 0 which is not the case here:
{code:java}
eventually(timeout(10.seconds), interval(1.millis)) {
assert(DataFrameRangeSuite.stageToKill > 0)
}
{code}
All in all if the first stage is slower than 10 seconds it throws
TestFailedDueToTimeoutException and cancelStage will be never ever called.
> Flaky test: DataFrameRangeSuite
> -------------------------------
>
> Key: SPARK-23775
> URL: https://issues.apache.org/jira/browse/SPARK-23775
> Project: Spark
> Issue Type: Bug
> Components: SQL, Tests
> Affects Versions: 2.4.0
> Reporter: Gabor Somogyi
> Priority: Major
> Attachments: filtered.log, filtered_more_logs.log
>
>
> DataFrameRangeSuite.test("Cancelling stage in a query with Range.") stays
> sometimes in an infinite loop and times out the build.
> I presume the original intention of this test is to start a job with range
> and just cancel it.
> The submitted job has 2 stages but I think the author tried to cancel the
> first stage with ID 0 which is not the case here:
> {code:java}
> eventually(timeout(10.seconds), interval(1.millis)) {
> assert(DataFrameRangeSuite.stageToKill > 0)
> }
> {code}
> All in all if the first stage is slower than 10 seconds it throws
> TestFailedDueToTimeoutException and cancelStage will be never ever called.
> -
> https://amplab.cs.berkeley.edu/jenkins/view/Spark%20QA%20Test%20(Dashboard)/job/spark-master-test-maven-hadoop-2.7/4780/
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)
---------------------------------------------------------------------
To unsubscribe, e-mail: [email protected]
For additional commands, e-mail: [email protected]