date:20180921

Flink can't initialize operator state backend when starting from checkpoint

2018-09-21 Thread Marvin777

Hi all, When a Flink (1.4.2) job starts, it finds the checkpoint files on HDFS, but an exception occurs during deserialization: [image: image.png] Do you have any insight into this? Thanks, Qingxiang Ma

Re: multiple flink applications on yarn are shown in one application.

2018-09-21 Thread Stefan Richter

Hi, I see from your command that you are using the same jar file twice, so I want to double-check first how you even determine which job should be started? I am also adding Till (in CC) depending on your answer to my first question, he might have some additional thoughts. Best, Stefan > Am 21

Re: Flink can't initialize operator state backend when starting from checkpoint

2018-09-21 Thread vino yang

Hi Qingxiang, Several days ago, Stefan described the causes of this anomaly in a similar problem: Typically, these problems have been observed when something was wrong with a serializer or a stateful serializer was used from multiple threads. Thanks, vino. Marvin777 on Fri, Sep 21, 2018 at 3:20 PM

Re: S3 connector Hadoop class mismatch

2018-09-21 Thread Paul Lam

Hi Stefan, Stephan, Yes, the `hadoop.security.group.mapping` option is explicitly set to `org.apache.hadoop.security.LdapGroupsMapping`. Guess that was why the classloader found an unshaded class. I don’t have the permission to change the Hadoop cluster configurations so I modified the `core-

How to join stream and batch data in Flink?

2018-09-21 Thread 徐涛

Hi All, Sometimes a "dimension table" needs to be joined with the "fact table" when the data are not joined before being sent to Kafka. If the data are joined in Flink, does the "dimension table" have to be imported as a stream, or are there other ways to achieve it? Thanks

Re: Flink can't initialize operator state backend when starting from checkpoint

2018-09-21 Thread Stefan Richter

Hi, that is correct. If you are using custom serializers you should double check their correctness, maybe using our test base for type serializers. Another reason could be that you modified the job in a way that silently changed the schema somehow. Concurrent use of serializers across different

Re: How to join stream and batch data in Flink?

2018-09-21 Thread vino yang

Hi Henry, There are three ways I can think of: 1) use DataStream API, implement a flatmap UDF to access dimension table; 2) use table/sql API, implement a UDTF to access dimension table; 3) customize the table/sql join API/statement's implementation (and change the physical plan) Thanks, vino.
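
A minimal sketch of option 1 from the reply above, assuming the dimension table is small enough to hold in memory. It is written in plain Java with no Flink dependency so that it runs on its own; in a real job the same logic would typically sit inside a `RichFlatMapFunction` that loads or queries the dimension table. The class name, the table contents, and the record layout are all hypothetical.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

class DimensionLookupSketch {
    // Hypothetical dimension table: product id -> product name.
    static final Map<String, String> DIMENSION = new HashMap<>();
    static {
        DIMENSION.put("p1", "keyboard");
        DIMENSION.put("p2", "mouse");
    }

    // flatMap-style enrichment: emits zero or one record per fact record.
    // A fact whose key is missing from the dimension table is dropped,
    // which gives inner-join semantics.
    static List<String> enrich(String productId, int quantity) {
        List<String> out = new ArrayList<>();
        String name = DIMENSION.get(productId);
        if (name != null) {
            out.add(productId + "," + name + "," + quantity);
        }
        return out;
    }

    public static void main(String[] args) {
        System.out.println(enrich("p1", 3)); // matched fact record
        System.out.println(enrich("p9", 1)); // no dimension row for this key
    }
}
```

Returning a list rather than a single value mirrors the flatMap contract, where each input may produce any number of outputs. Emitting a record with a placeholder name instead of dropping it would turn this into a left outer join.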

How does flink read data from kafka number of TM's are more than topic partitions

2018-09-21 Thread Taher Koitawala

Hi All, Let's say a topic in Kafka has 5 partitions. If I spawn 10 Task Managers with 1 slot each and the parallelism is 10, how will records be read from the Kafka topic if I use the FlinkKafkaConsumer? Will 5 TMs read and the rest be idle in that case? Is over subscribing the

Re: HA failing for 1.6.0 job cluster with docker-compose

2018-09-21 Thread Tzanko Matev

Hi Vino and Till, That's great news. Thank you! Cheers, Tzanko On Thu, Sep 20, 2018 at 11:43 AM vino yang wrote: > Hi all, > > Oh, I took this ticket, will fix it as soon as possible. > > Thanks, vino. > > Till Rohrmann wrote on Thu, Sep 20, 2018 at 4:35 PM: > >> Hi Tzanko, >> >> in order to make the cont

Re: UNCHECKED Error while confirming Checkpoint

2018-09-21 Thread Stefan Richter

Hi, could you provide some logs for this problematic job? I would like to double-check why this violated precondition actually happened. Thanks, Stefan > Am 20.09.2018 um 17:24 schrieb Stefan Richter : > > FYI, here is a link to my PR: https://github.com/apache/flink/pull/6723

Re: Data loss when restoring from savepoint

2018-09-21 Thread Andrey Zagrebin

Hi Juho, so it means that the savepoint does not lose at least some dropped records. If it is feasible for your setup, I suggest inserting one more map function after reduce and before sink. The map function should be called right after window is triggered but before flushing to s3. The resul

Re: Flink can't initialize operator state backend when starting from checkpoint

2018-09-21 Thread Marvin777

Hi, I do not use custom serializers, and my job contains only source and sink (BucketingSink). What causes this phenomenon in general? > I suggest that you also update to a newer version, at least the latest > bugfix release Which version does this sentence refer to? And could you please help l

Re: Data loss when restoring from savepoint

2018-09-21 Thread Juho Autio

Thanks, Andrey. > so it means that the savepoint does not lose at least some dropped records. I'm not sure what you mean by that? I mean, it was known from the beginning that not everything is lost before/after restoring a savepoint, just some records around the time of restoration. It's not 10

Re: How does flink read data from kafka number of TM's are more than topic partitions

2018-09-21 Thread Piotr Nowojski

Hi, Yes, in your case half of the Kafka source tasks wouldn’t read/process any records (you can check that in web UI). This shouldn’t harm you, unless your records will be redistributed after the source. For example: source.keyBy(..).process(new MyVeryHeavyOperator()).print() Should be fine, b
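
A minimal sketch of why half of the source tasks stay idle in the scenario above (5 partitions, parallelism 10). It assumes each partition is assigned to exactly one source subtask by a simple modulo rule. That rule is an illustration only and not Flink's actual assignment code, and the class and method names are hypothetical.

```java
import java.util.ArrayList;
import java.util.List;

class PartitionAssignmentSketch {
    // For each subtask index, the list of partition ids it would read.
    // Assumption: partition p goes to subtask (p % parallelism).
    static List<List<Integer>> assign(int numPartitions, int parallelism) {
        List<List<Integer>> bySubtask = new ArrayList<>();
        for (int i = 0; i < parallelism; i++) {
            bySubtask.add(new ArrayList<>());
        }
        for (int p = 0; p < numPartitions; p++) {
            bySubtask.get(p % parallelism).add(p);
        }
        return bySubtask;
    }

    // Number of subtasks that received no partition at all.
    static long idleSubtasks(int numPartitions, int parallelism) {
        return assign(numPartitions, parallelism).stream()
                .filter(List::isEmpty)
                .count();
    }

    public static void main(String[] args) {
        System.out.println(assign(5, 10));       // one partition each for subtasks 0..4
        System.out.println(idleSubtasks(5, 10)); // subtasks 5..9 have nothing to read
    }
}
```

Because a partition is never split across subtasks under this rule, any parallelism above the partition count leaves the surplus source subtasks without input.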

Re: How does flink read data from kafka number of TM's are more than topic partitions

2018-09-21 Thread Taher Koitawala

Thanks a lot for the explanation. That was exactly what I thought should happen. However, it is always good to have a clear confirmation. Regards, Taher Koitawala GS Lab Pune +91 8407979163 On Fri, Sep 21, 2018 at 6:26 PM Piotr Nowojski wrote: > Hi, > > Yes, in your case half of the Kafka source t

Re: How does flink read data from kafka number of TM's are more than topic partitions

2018-09-21 Thread Piotr Nowojski

No problem :) Piotrek > On 21 Sep 2018, at 15:04, Taher Koitawala wrote: > > Thanks a lot for the explanation. That was exactly what I thought should > happen. However, it is always good to have a clear confirmation. > > > Regards, > Taher Koitawala > GS Lab Pune > +91 8407979163 > > > On Fri,

Re: Running Flink in Google Cloud Platform (GCP) - can Flink be truly elastic?

2018-09-21 Thread Dawid Wysakowicz

Hi Alexander, I've redirected your question to the user mailing list. The goal of the community list is "Broader community discussions related to meetups, conferences, blog posts and job offers". The quick answer to your question is that dynamic scaling of Flink jobs is a work in progress. Maybe Gary or

Flink 1.5.4 -- issues w/ TaskManager connecting to ResourceManager

2018-09-21 Thread Jamie Grier

Anybody else seen this? I'm running both the JM and TM on the same host in this setup. This was working fine w/ Flink 1.5.3. On the TaskManager: 00:31:30.268 INFO o.a.f.r.t.TaskExecutor - Could not resolve ResourceManager address akka.tcp://flink@localhost:6123/user/resourcemanager, retrying i

Re: How to join stream and batch data in Flink?

2018-09-21 Thread Hequn Cheng

Hi, +1 for vino's answer. Also, this kind of join will be supported in FLINK-9712. You can check more details in the jira. Best, Hequn On Fri, Sep 21, 2018 at 4:51 PM vino yang wrote: > Hi Henry, > > There are three ways I can think of: > > 1)

19 matches
