roa created FLINK-29550:
---------------------------

             Summary: example "basic-checkpoint-ha.yaml" not working
                 Key: FLINK-29550
                 URL: https://issues.apache.org/jira/browse/FLINK-29550
             Project: Flink
          Issue Type: Bug
          Components: Kubernetes Operator
    Affects Versions: 1.15.0
         Environment: * Kubernetes: EKS 1.22
 * Node: Bottlerocket Linux
 * Manifest: https://github.com/apache/flink-kubernetes-operator/blob/release-1.1/examples/basic-checkpoint-ha.yaml
            Reporter: roa


Hi,

I'm a Flink beginner, and I'm considering using the Kubernetes operator.

Before adopting it, we are testing its features and examples.

However, when I tried to apply basic-checkpoint-ha.yaml, I got the error below.
{code:java}
2022-10-08 17:04:08,261 ERROR org.apache.flink.runtime.entrypoint.ClusterEntrypoint        [] - Fatal error occurred in the cluster entrypoint.
java.util.concurrent.CompletionException: java.lang.IllegalStateException: The base directory of the JobResultStore isn't accessible. No dirty JobResults can be restored.
    at java.util.concurrent.CompletableFuture.encodeThrowable(Unknown Source) ~[?:?]
    at java.util.concurrent.CompletableFuture.completeThrowable(Unknown Source) [?:?]
    at java.util.concurrent.CompletableFuture$AsyncSupply.run(Unknown Source) [?:?]
    at java.util.concurrent.ThreadPoolExecutor.runWorker(Unknown Source) [?:?]
    at java.util.concurrent.ThreadPoolExecutor$Worker.run(Unknown Source) [?:?]
    at java.lang.Thread.run(Unknown Source) [?:?]
Caused by: java.lang.IllegalStateException: The base directory of the JobResultStore isn't accessible. No dirty JobResults can be restored.
    at org.apache.flink.util.Preconditions.checkState(Preconditions.java:193) ~[flink-dist-1.15.2.jar:1.15.2]
    at org.apache.flink.runtime.highavailability.FileSystemJobResultStore.getDirtyResultsInternal(FileSystemJobResultStore.java:181) ~[flink-dist-1.15.2.jar:1.15.2]
    at org.apache.flink.runtime.highavailability.AbstractThreadsafeJobResultStore.withReadLock(AbstractThreadsafeJobResultStore.java:118) ~[flink-dist-1.15.2.jar:1.15.2]
    at org.apache.flink.runtime.highavailability.AbstractThreadsafeJobResultStore.getDirtyResults(AbstractThreadsafeJobResultStore.java:100) ~[flink-dist-1.15.2.jar:1.15.2]
    at org.apache.flink.runtime.dispatcher.runner.SessionDispatcherLeaderProcess.getDirtyJobResults(SessionDispatcherLeaderProcess.java:190) ~[flink-dist-1.15.2.jar:1.15.2]
    at org.apache.flink.runtime.dispatcher.runner.AbstractDispatcherLeaderProcess.supplyUnsynchronizedIfRunning(AbstractDispatcherLeaderProcess.java:198) ~[flink-dist-1.15.2.jar:1.15.2]
    at org.apache.flink.runtime.dispatcher.runner.SessionDispatcherLeaderProcess.getDirtyJobResultsIfRunning(SessionDispatcherLeaderProcess.java:184) ~[flink-dist-1.15.2.jar:1.15.2]
    ... 4 more
2022-10-08 17:04:08,268 INFO  org.apache.flink.runtime.blob.BlobServer                     [] - Stopped BLOB server at 0.0.0.0:6124
2022-10-08 17:04:08,270 INFO  org.apache.flink.runtime.entrypoint.ClusterEntrypoint        [] - Shutting KubernetesApplicationClusterEntrypoint down with application status UNKNOWN. Diagnostics Cluster entrypoint has been closed externally..
{code}


Could you let me know why that error occurs?



--
This message was sent by Atlassian Jira
(v8.20.10#820010)
