Re: delay one of the datastream when performing join operation on event-time and watermark

2018-12-06 Thread jincheng sun
Hi Pakesh Kuma, I think you can using the interval-join, e.g.: orderStream .keyBy() .intervalJoin(invoiceStream.keyBy()) .between(Time.minutes(-5), Time.minutes(5)) The semantics of interval-join and detailed usage description can refer to

runtime.resourcemanager

2018-12-06 Thread Alieh
Hello all, I have an algorithm x () which contains several joins and usage of 3 times of gelly ConnectedComponents. The problem is that if I call x() inside a script more than three times, I receive the messages listed below in the log and the program is somehow stopped. It happens even if I

Re: number of files in checkpoint directory grows endlessly

2018-12-06 Thread Andrey Zagrebin
Hi Bernd, Does this directory contains approximately 300 files on task executor? The 'transactionState' seems to be ok and compacted. It takes most of data size, up to 100Mb. Do you use any Flink timers? They are also kept in RocksDb and take very little, just about 300Kb. It might need more

Re: Backpressure metrics as JMX Bean

2018-12-06 Thread sayat
got it. thank you! On Thu, Dec 6, 2018 at 4:26 PM Chesnay Schepler wrote: > No this is not possible. > > On 06.12.2018 16:04, sayat wrote: > > Dear Flink community, > > > > Does anyone know if it is possible to expose Flink BackPressure number > > via JMX MBean? The one that shows in Flink UI?

Re: Backpressure metrics as JMX Bean

2018-12-06 Thread Chesnay Schepler
No this is not possible. On 06.12.2018 16:04, sayat wrote: Dear Flink community, Does anyone know if it is possible to expose Flink BackPressure number via JMX MBean? The one that shows in Flink UI?

Backpressure metrics as JMX Bean

2018-12-06 Thread sayat
Dear Flink community, Does anyone know if it is possible to expose Flink BackPressure number via JMX MBean? The one that shows in Flink UI?

Re: Query big mssql Data Source [Batch]

2018-12-06 Thread Flavio Pompermaier
That inputformat is a batch one, so there's no state backend. You need to output the fetched data somewhere AFAIK On Thu, Dec 6, 2018 at 3:49 PM miki haiat wrote: > Hi Flavio , > That working fine for and im able to pull ~17m rows in 20 seconds. > > Im a bit confuse regarding the state backhand

Re: Query big mssql Data Source [Batch]

2018-12-06 Thread miki haiat
Hi Flavio , That working fine for and im able to pull ~17m rows in 20 seconds. Im a bit confuse regarding the state backhand , I could find a way to configure it so im guessing the data is in the memory ... thanks, Miki On Thu, Dec 6, 2018 at 12:06 PM Flavio Pompermaier wrote: > the

Re: Run simple flink application via "java -jar"

2018-12-06 Thread Timo Walther
Hi Krishna, yes this should work given that you included all dependencies that are marked as "provided" in a Flink example project. In general, when you develop a Flink application, you can can simply press the run button in your IDE. This will start a mini cluster locally for debugging

Re: Flink 1.7 job cluster (restore from checkpoint error)

2018-12-06 Thread Hao Sun
Thanks for the tip! I did change the jobGraph this time. Hao Sun Team Lead 1019 Market St. 7F San Francisco, CA 94103 On Thu, Dec 6, 2018 at 2:47 AM Till Rohrmann wrote: > Hi Hao, > > if Flink tries to recover from a checkpoint, then the JobGraph should not > be modified and the system should

Run simple flink application via "java -jar"

2018-12-06 Thread Krishna Kalyan
Hello, This is a very n00b question. Can we run a flink job (for example wordcount) using "java -jar " in standalone mode. I usually see examples using "$FLINK_HOME/bin/flink run ". If yes can someone please point me to an example. Regards, Krishna -- Standorte in Stuttgart und Berlin

Re: flink-s3-fs-presto:1.7.0 is missing shaded com/facebook/presto/hadoop

2018-12-06 Thread Chesnay Schepler
I've filed a JIRA: https://issues.apache.org/jira/browse/FLINK-11085 On 06.12.2018 14:03, Sergei Poganshev wrote: When I try to configure checkpointing using Presto in 1.7.0 the following exception occurs: java.lang.NoClassDefFoundError:

flink-s3-fs-presto:1.7.0 is missing shaded com/facebook/presto/hadoop

2018-12-06 Thread Sergei Poganshev
When I try to configure checkpointing using Presto in 1.7.0 the following exception occurs: java.lang.NoClassDefFoundError: org/apache/flink/fs/s3presto/shaded/com/facebook/presto/hadoop/HadoopFileStatus at

AW: number of files in checkpoint directory grows endlessly

2018-12-06 Thread Bernd.Winterstein
Seems that some file deletion is disabled by default. There are some log entries in the file Von: Andrey Zagrebin [mailto:and...@data-artisans.com] Gesendet: Donnerstag, 6. Dezember 2018 12:07 An: Winterstein, Bernd Cc: Yun Tang; Kostas Kloudas; user; Stefan Richter; Till Rohrmann; Stephan Ewen

Re: number of files in checkpoint directory grows endlessly

2018-12-06 Thread Yun Tang
Hi Bernd RocksDB would not delete expired entries until compaction triggered. I saw your code set the 'readOnly' as false already, and from your description, the files in shared directory continue to increase each day. I think the mechanism of TTL for rocksDB should work fine. If you want to

delay one of the datastream when performing join operation on event-time and watermark

2018-12-06 Thread Rakesh Kumar
Hi, I have two data sources one is for order data and another one is for invoice data, these two data i am pushing into kafka topic in json form. I wanted to delay order data for 5 mins because invoice data comes only after order data is generated. So, for that i have written a flink program

Re: number of files in checkpoint directory grows endlessly

2018-12-06 Thread Andrey Zagrebin
Hi Bernd, Thanks for sharing the code. I understand your TTL requirement. It definitely makes sense for your application. My recommendation is still to try running job with original backend without TtlDb modification to narrow down the problem and understand where these small files come

Re: Flink 1.7 job cluster (restore from checkpoint error)

2018-12-06 Thread Till Rohrmann
Hi Hao, if Flink tries to recover from a checkpoint, then the JobGraph should not be modified and the system should be able to restore the state. Have you changed the JobGraph and are you now trying to recover from the latest checkpoint which is stored in ZooKeeper? If so, then you can also

Re: Is there an example of flink cluster "as a job" deployment on k8s ?

2018-12-06 Thread Dawid Wysakowicz
Hi Vishal, You might want to have a look at the flink-container/kubernetes module: https://github.com/apache/flink/tree/master/flink-container/kubernetes Best, Dawid On 05/12/2018 22:50, Vishal Santoshi wrote: > signature.asc Description: OpenPGP digital signature

Re: NoClassDefFoundError javax.xml.bind.DatatypeConverterImpl

2018-12-06 Thread Chesnay Schepler
Small correction: Flink 1.7 does not support jdk9; we only fixed some of the issues, not all of them. On 06.12.2018 07:13, Mike Mintz wrote: Hi Flink developers, We're running some new DataStream jobs on Flink 1.7.0 using the shaded Hadoop S3 file system, and running into frequent errors

Re: Discuss [FLINK-9740] Support group windows over intervals of months

2018-12-06 Thread Timo Walther
Hi, thanks for working on this. The user mailing list is not the right place to start development discussions. Please use the dev@ mailing list. Can you attach you design to the Jira issue? We can then further discuss there. Thanks, Timo Am 06.12.18 um 09:45 schrieb x1q1j1: Hi! Timo

Re: Query big mssql Data Source [Batch]

2018-12-06 Thread Flavio Pompermaier
the constructor of NumericBetweenParametersProvider takes 3 params: long fetchSize, long minVal, long maxVal. If you want parallelism you should use a 1 < fetchSize < maxVal. In your case, if you do new NumericBetweenParametersProvider(50, 3, 300) you will produce 6 parallel tasks: 1. SELECT

Re: Query big mssql Data Source [Batch]

2018-12-06 Thread miki haiat
hi Flavio , This is the query that im trying to coordinate > .setQuery("SELECT a, b, c, \n" + > "FROM dbx.dbo.x as tls\n"+ > "WHERE tls.a BETWEEN ? and ?" > > And this is the way im trying to parameterized ParameterValuesProvider pramProvider = new

Discuss [FLINK-9740] Support group windows over intervals of months

2018-12-06 Thread x1q1j1
Hi! Timo Walther??Jark Thank you very much. I have some thoughts and ideas about this isuse, which are attached in the email. https://issues.apache.org/jira/browse/FLINK-9740 Best wishes. Yours truly, ForwardXu About [FLINK-9740] Support group windows over intervals of months.pdf

About [FLINK-9740] Support group windows over intervals of months

2018-12-06 Thread x1q1j1
Hi! Timo Walther Thank you very much. I have some thoughts and ideas about this isuse, which are attached in the email. https://issues.apache.org/jira/browse/FLINK-9740 Best wishes. Yours truly, ForwardXu About [FLINK-9740] Support group windows over intervals of months.pdf Description: