Shahar Frank created BEAM-6289:
----------------------------------
Summary: Running a join on two Cassandra tables using FlinkRunner
fails
Key: BEAM-6289
URL: https://issues.apache.org/jira/browse/BEAM-6289
Project: Beam
Issue Type: Bug
Components: io-java-cassandra, runner-flink
Affects Versions: 2.9.0, 2.8.0
Environment: Tested on Ubuntu 18
Beam 2.8
Tested with Flink:
1) [local]
2) Cluster inside a K8S cluster on minikube
3) Cluster inside a K8S cluster on GCP
Tested using Cassandra [cqlsh 5.0.1 | Cassandra 3.11.3 | CQL spec 3.4.4 |
Native protocol v4]:
1) In a local container
2) Cluster inside a K8S cluster on minikube
3) Cluster inside a K8S cluster on GCP
Reporter: Shahar Frank
Assignee: Jean-Baptiste Onofré
Can't make a simple join on two Cassandra tables when using FlinkRunner.
The same code works with a DirectRunner fails when used with FlinkRunner giving
these (as well as many other) errors:
{code:java}
Caused by: akka.pattern.AskTimeoutException: Ask timed out on
[Actor[akka://flink/user/dispatchere1f5abe7-6299-43ea-9182-24a2193e078f#-1757043920]]
after [10000 ms]. Sender[null] sent message of type
"org.apache.flink.runtime.rpc.messages.LocalFencedMessage".
at
akka.pattern.PromiseActorRef$$anonfun$1.apply$mcV$sp(AskSupport.scala:604)
at akka.actor.Scheduler$$anon$4.run(Scheduler.scala:126)
at
scala.concurrent.Future$InternalCallbackExecutor$.unbatchedExecute(Future.scala:601)
at
scala.concurrent.BatchingExecutor$class.execute(BatchingExecutor.scala:109)
at
scala.concurrent.Future$InternalCallbackExecutor$.execute(Future.scala:599)
at
akka.actor.LightArrayRevolverScheduler$TaskHolder.executeTask(LightArrayRevolverScheduler.scala:329)
at
akka.actor.LightArrayRevolverScheduler$$anon$4.executeBucket$1(LightArrayRevolverScheduler.scala:280)
at
akka.actor.LightArrayRevolverScheduler$$anon$4.nextTick(LightArrayRevolverScheduler.scala:284)
at
akka.actor.LightArrayRevolverScheduler$$anon$4.run(LightArrayRevolverScheduler.scala:236)
at java.lang.Thread.run(Thread.java:748)
{code}
The code can be found [here|https://github.com/srfrnk/beam-playground]
Steps to reproduce:
# Clone the repo to a linux (I;m on Ubuntu 18 but any *nix system would
probably work - i.e. repl.it)
# Follow the README to set up a Cassandra container + schema
# Run with
{code}
gradle --console=plain join-from-cassandra -Drunner=flink > output/build.log
2>&1{code}
to use FlinkRunner. See error in log at ./output/build.log
# Run with
{code}
gradle --console=plain join-from-cassandra -Drunner=direct > output/build.log
2>&1{code}
to use DirectRunner. See error in log at ./output/build.log
--
This message was sent by Atlassian JIRA
(v7.6.3#76005)