[jira] [Created] (FLINK-27696) Add bin-pack strategy to split the whole bucket data files into several small splits for append-only table.

2022-05-19 Thread Zheng Hu (Jira)
Zheng Hu created FLINK-27696: Summary: Add bin-pack strategy to split the whole bucket data files into several small splits for append-only table. Key: FLINK-27696 URL: https://issues.apache.org/jira/browse/FLINK-2769

Re: [DISCUSS] FLIP-91: Support SQL Client Gateway

2022-05-19 Thread Timo Walther
Hi Shengkai, thanks again for the update. I don't have much to add: > I still think we should use a new state machine Thanks for the explanation about async/sync behavior. I thought we also want to use the Gateway for job status updates. But if FINISHED only refers to the job submission, the

[DISCUSS] FLINK-27625: Add a new query hint for async lookup join

2022-05-19 Thread Lincoln Lee
Hi everyone, I would like to open a discussion on adding a new query hint for async lookup join. Since the changes were relatively minor, no new flip was created, and if needed, I will create one. FLINK-27623 adds a global parameter 'table.exec.async-lookup.output-mode' for table users so that al

Re: [VOTE] FLIP-229: Introduces Join Hint for Flink SQL Batch Job

2022-05-19 Thread Timo Walther
+1 (binding) Thanks for driving this! Timo On 19.05.22 08:44, Leonard Xu wrote: Thanks Xuyang for driving this work. +1(binding) Best, Leonard 2022年5月19日 上午10:46,Yun Tang 写道: Thanks for driving, +1 (binding) Best Yun Tang From: Jark Wu Sent: Wednes

Re: [DISCUSS] FLIP-221 Abstraction for lookup source cache and metric

2022-05-19 Thread Александр Смирнов
Hi Qingsheng and Jark, 1. Builders vs 'of' I understand that builders are used when we have multiple parameters. I suggested them because we could add parameters later. To prevent Builder for ScanRuntimeProvider from looking redundant I can suggest one more config now - "rescanStartTime". It's a t

Re: [DISCUSS] FLIP-235: Hybrid Shuffle Mode

2022-05-19 Thread Yangze Guo
Thanks for driving this. Xintong and Weijie. I believe this feature will make Flink a better batch/OLAP engine. +1 for the overall design. Some questions: 1. How do we decide the size of the buffer pool in MemoryDataManager and the read buffers in FileDataManager? 2. Is there an upper limit for t

[jira] [Created] (FLINK-27697) Restoring from a checkpoint will start duplicated application when mixing use streaming sinks and sql sinks in code.

2022-05-19 Thread Ying Lin (Jira)
Ying Lin created FLINK-27697: Summary: Restoring from a checkpoint will start duplicated application when mixing use streaming sinks and sql sinks in code. Key: FLINK-27697 URL: https://issues.apache.org/jira/browse/F

Flink application on yarn cluster - main method not found

2022-05-19 Thread Zain Haider Nemati
Hi, Im running flink application on yarn cluster it is giving me this error, it is working fine on standalone cluster. Any idea what could be causing this? Exception in thread "main" java.lang.NoSuchMethodError: org.apache.flink.client.deployment.application.ClassPathPackagedProgramRetriever.newBu

[jira] [Created] (FLINK-27698) [JUnit5 Migration] Module: flink-table-api-java-bridge

2022-05-19 Thread Sergey Nuyanzin (Jira)
Sergey Nuyanzin created FLINK-27698: --- Summary: [JUnit5 Migration] Module: flink-table-api-java-bridge Key: FLINK-27698 URL: https://issues.apache.org/jira/browse/FLINK-27698 Project: Flink

Re: [DISCUSS] FLIP-221 Abstraction for lookup source cache and metric

2022-05-19 Thread Александр Смирнов
Also I have a few additions: 1) maybe rename 'lookup.cache.maximum-size' to 'lookup.cache.max-rows'? I think it will be more clear that we talk not about bytes, but about the number of rows. Plus it fits more, considering my optimization with filters. 2) How will users enable rescanning? Are we goi

Re: [DISCUSS] FLIP-224: Blacklist Mechanism

2022-05-19 Thread Lijie Wang
Hi Konstantin, We found that Flink REST URL does not support the format ":merge" , which will be recognized as a parameter in the URL(due to start with a colon). We will keep the previous way, i.e. POST: http://{jm_rest_address:port}/blocklist/taskmanagers and the "id" and "merge" flag are put i

Final reminder: ApacheCon North America call for presentations closing soon

2022-05-19 Thread Rich Bowen
[Note: You're receiving this because you are subscribed to one or more Apache Software Foundation project mailing lists.] This is your final reminder that the Call for Presetations for ApacheCon North America 2022 will close at 00:01 GMT on Monday, May 23rd, 2022. Please don't wait! Get your talk

[DISCUSS] FLIP-234: Support Retryable Lookup Join To Solve Delayed Updates Issue In External Systems

2022-05-19 Thread Lincoln Lee
Dear Flink developers, I would like to open a discussion on FLIP 234 [1] to support retryable lookup join to solve delayed updates issue, as a pre-work for this solution, we proposed FLIP-232[2] which adds a generic retry support for Async I/O. We prefer to offer this retry capability via query hi

Re: [DISCUSS] FLIP-235: Hybrid Shuffle Mode

2022-05-19 Thread weijie guo
Yangze, Thank you for the feedback! Here's my thoughts for your questions: >>> How do we decide the size of the buffer pool in MemoryDataManager and the read buffers in FileDataManager? The BufferPool in MemoryDataManager is the LocalBufferPool used by ResultPartition, and the size is the same as

Re: [VOTE] Creating an Apache Flink slack workspace

2022-05-19 Thread Yuan Mei
+1 (binding) This facilitates people collaborating on the same project from different organizations. I really like this idea. On Thu, May 19, 2022 at 12:43 PM Peter Huang wrote: > +1 (non-binding) > > > Best Regards > Peter Huang > > On Wed, May 18, 2022 at 9:33 PM Leonard Xu wrote: > > > Than

Re: Flink application on yarn cluster - main method not found

2022-05-19 Thread Zain Haider Nemati
Hi Folks, Would appreciate it if someone could help me out with this ! Cheers On Thu, May 19, 2022 at 1:49 PM Zain Haider Nemati wrote: > Hi, > Im running flink application on yarn cluster it is giving me this error, > it is working fine on standalone cluster. Any idea what could be causing > t

[jira] [Created] (FLINK-27699) Align Pulsar source atPublishTime method

2022-05-19 Thread LuNng Wang (Jira)
LuNng Wang created FLINK-27699: -- Summary: Align Pulsar source atPublishTime method Key: FLINK-27699 URL: https://issues.apache.org/jira/browse/FLINK-27699 Project: Flink Issue Type: Improvement

Re: [VOTE] Creating an Apache Flink slack workspace

2022-05-19 Thread Konstantin Knauf
+1 for the user community, not the contributor/dev community Am Do., 19. Mai 2022 um 12:44 Uhr schrieb Yuan Mei : > +1 (binding) > > This facilitates people collaborating on the same project from different > organizations. I really like this idea. > > On Thu, May 19, 2022 at 12:43 PM Peter Huang

Re: [DISCUSS] FLIP-91: Support SQL Client Gateway

2022-05-19 Thread Shengkai Fang
Hi, Timo. Thanks for your feedback! > SQLGatewayService.getFunction / UserDefinedFunctionInfo Yes. I miss some parts in the FLIP. I have fix the errors now. > configure_session Thanks for your inputs. Considering the difference, I am still prone to use the `configure_session`. > `./sql-gat

Re: Flink application on yarn cluster - main method not found

2022-05-19 Thread Weihua Hu
Hi, Which version of flink are you using? It looks like there is a conflict between the flink version of the cluster and the version in userjar Best, Weihua > 2022年5月19日 下午4:49,Zain Haider Nemati 写道: > > Hi, > Im running flink application on yarn cluster it is giving me this error, it > is w

[jira] [Created] (FLINK-27701) HashMapStateBackendWindowITCase. testAggregateWindowStateReader failed with Not all required tasks are currently running

2022-05-19 Thread Huang Xingbo (Jira)
Huang Xingbo created FLINK-27701: Summary: HashMapStateBackendWindowITCase. testAggregateWindowStateReader failed with Not all required tasks are currently running Key: FLINK-27701 URL: https://issues.apache.org

[jira] [Created] (FLINK-27702) Flink table code splitter does not throw anything if result is not compiled

2022-05-19 Thread Sergey Nuyanzin (Jira)
Sergey Nuyanzin created FLINK-27702: --- Summary: Flink table code splitter does not throw anything if result is not compiled Key: FLINK-27702 URL: https://issues.apache.org/jira/browse/FLINK-27702 Pro

[jira] [Created] (FLINK-27703) FileChannelManagerImplTest.testDirectoriesCleanupOnKillWithoutCallerHook failed with The marker file was not found within 10000 msecs

2022-05-19 Thread Huang Xingbo (Jira)
Huang Xingbo created FLINK-27703: Summary: FileChannelManagerImplTest.testDirectoriesCleanupOnKillWithoutCallerHook failed with The marker file was not found within 1 msecs Key: FLINK-27703 URL: https://issue

Re:[VOTE] Creating an Apache Flink slack workspace

2022-05-19 Thread Roc Marshal
+1. (non-binding.) Best Regards Roc. At 2022-05-17 16:23:59, "Xintong Song" wrote: >Hi everyone, > >As previously discussed in [1], I would like to open a vote on creating an >Apache Flink slack workspace channel. > >The proposed actions include: >- Creating a dedicated slack workspace with t

Re: [DISCUSS] FLIP-233: Introduce HTTP Connector

2022-05-19 Thread Ber, Jeremy
Hi Austin, Thanks for the recommendations! After internal discussion I have decided to park this FLIP for now until I have more capacity to commit to it. Jeremy On 5/17/22, 10:26 AM, "Austin Cawley-Edwards" wrote: CAUTION: This email originated from outside of the organization. Do not c

Re: [DISCUSS] FLIP-91: Support SQL Client Gateway

2022-05-19 Thread Jark Wu
Thank Shengkai for the new round of updating. I don't have comments on the updates. +1 for starting the vote. Best, Jark On Thu, 19 May 2022 at 20:18, Shengkai Fang wrote: > Hi, Timo. > > Thanks for your feedback! > > > SQLGatewayService.getFunction / UserDefinedFunctionInfo > > Yes. I miss so

[jira] [Created] (FLINK-27704) Java 17 compatibility

2022-05-19 Thread Jordan Kaye (Jira)
Jordan Kaye created FLINK-27704: --- Summary: Java 17 compatibility Key: FLINK-27704 URL: https://issues.apache.org/jira/browse/FLINK-27704 Project: Flink Issue Type: Improvement Repor

Re: [DISCUSS] FLIP-235: Hybrid Shuffle Mode

2022-05-19 Thread Aitozi
Hi Weijie: Thanks for the nice FLIP, I have couple questions about this: 1) In the hybrid shuffle mode, the shuffle mode is decided by the resource. If there is an unbounded data source, but only have resource to schedule the first stage, will it bring the big burden to the disk/shuffle serv

Job Logs - Yarn Application Mode

2022-05-19 Thread Zain Haider Nemati
Hey All, How can I check logs for my job when it is running in application mode via yarn

[RESULT] [VOTE] FLIP-229: Introduces Join Hint for Flink SQL Batch Job

2022-05-19 Thread Xuyang
Hi everyone, FLIP-229 [1] has been accepted. There [2] were 6 binding votes in favor. None against. Votes are in the order of arrival: - Binding: Godfrey He- Binding: Jingsong Li- Binding: Jark Wu- Binding: Yun Tang- Binding: Leonard Xu- Binding: Timo Walther Thanks again for every one who con

Re: Job Logs - Yarn Application Mode

2022-05-19 Thread Weihua Hu
Hi, You can get the logs from Flink Web UI if job is running. Best, Weihua > 2022年5月19日 下午10:56,Zain Haider Nemati 写道: > > Hey All, > How can I check logs for my job when it is running in application mode via > yarn

[jira] [Created] (FLINK-27705) num-sorted-run.compaction-trigger should not interfere the num-levels

2022-05-19 Thread Jane Chan (Jira)
Jane Chan created FLINK-27705: - Summary: num-sorted-run.compaction-trigger should not interfere the num-levels Key: FLINK-27705 URL: https://issues.apache.org/jira/browse/FLINK-27705 Project: Flink

[VOTE] FLIP-91: Support SQL Gateway

2022-05-19 Thread Shengkai Fang
Hi, everyone. Thanks for your feedback for FLIP-91: Support SQL Gateway[1] on the discussion thread[2]. I'd like to start a vote for it. The vote will be open for at least 72 hours unless there is an objection or not enough votes. Best, Shengkai [1] https://cwiki.apache.org/confluence/display/FL

Re: [DISCUSS] FLIP-235: Hybrid Shuffle Mode

2022-05-19 Thread weijie guo
Hi, Aitozi: Thank you for the feedback! Here are some of my thoughts on your question >>> 1.If there is an unbounded data source, but only have resource to schedule the first stage, will it bring the big burden to the disk/shuffle service which will occupy all the resource I think. First of all,

Re: [VOTE] FLIP-91: Support SQL Gateway

2022-05-19 Thread Jark Wu
+1 (binding) Best, Jark On Fri, 20 May 2022 at 10:39, Shengkai Fang wrote: > Hi, everyone. > > Thanks for your feedback for FLIP-91: Support SQL Gateway[1] on the > discussion thread[2]. I'd like to start a vote for it. The vote will be > open for at least 72 hours unless there is an objection

Re: Job Logs - Yarn Application Mode

2022-05-19 Thread Shengkai Fang
Hi. I am not familiar with the YARN application mode. Because the job manager is started when submit the jobs. So how can users know the address of the JM? Do we need to look up the Yarn UI to search the submitted job with the JobID? Best, Shengkai Weihua Hu 于2022年5月20日周五 10:23写道: > Hi, > You

Re: Job Logs - Yarn Application Mode

2022-05-19 Thread Biao Geng
Hi there, @Zain, Weihua's suggestion should be able to fulfill the request to check JM logs. If you do want to use YARN cli for running Flink applications, it is possible to check JM's log with the YARN command like: *yarn logs -applicationId application_xxx_yyy -am -1 -logFiles jobmanager.log* For

[jira] [Created] (FLINK-27706) Refactor all subclasses of FileStoreTableITCase to use the batchSql.

2022-05-19 Thread Zheng Hu (Jira)
Zheng Hu created FLINK-27706: Summary: Refactor all subclasses of FileStoreTableITCase to use the batchSql. Key: FLINK-27706 URL: https://issues.apache.org/jira/browse/FLINK-27706 Project: Flink

[jira] [Created] (FLINK-27707) Implement TableStoreFactory#onCompactTable

2022-05-19 Thread Jane Chan (Jira)
Jane Chan created FLINK-27707: - Summary: Implement TableStoreFactory#onCompactTable Key: FLINK-27707 URL: https://issues.apache.org/jira/browse/FLINK-27707 Project: Flink Issue Type: Sub-task

Re: [VOTE] FLIP-91: Support SQL Gateway

2022-05-19 Thread Paul Lam
+1 (non-binding) Best, Paul Lam > 2022年5月20日 10:48,Jark Wu 写道: > > +1 (binding) > > Best, > Jark > > On Fri, 20 May 2022 at 10:39, Shengkai Fang wrote: > >> Hi, everyone. >> >> Thanks for your feedback for FLIP-91: Support SQL Gateway[1] on the >> discussion thread[2]. I'd like to start a

Re: [DISCUSS] FLIP-222: Support full query lifecycle statements in SQL client

2022-05-19 Thread Paul Lam
Hi Jark, WRT “DROP QUERY”, I agree that it’s not very intuitive, and that’s part of the reason why I proposed “STOP/CANCEL QUERY” at the beginning. The downside of it is that it’s not ANSI-SQL compatible. Another question is, what should be the syntax for ungracefully canceling a query? As Sheng

[jira] [Created] (FLINK-27708) Add background compaction task for append-only table when ingesting.

2022-05-19 Thread Zheng Hu (Jira)
Zheng Hu created FLINK-27708: Summary: Add background compaction task for append-only table when ingesting. Key: FLINK-27708 URL: https://issues.apache.org/jira/browse/FLINK-27708 Project: Flink

[jira] [Created] (FLINK-27709) Add comment to schema

2022-05-19 Thread Jingsong Lee (Jira)
Jingsong Lee created FLINK-27709: Summary: Add comment to schema Key: FLINK-27709 URL: https://issues.apache.org/jira/browse/FLINK-27709 Project: Flink Issue Type: Sub-task Componen

Re: [DISCUSS] FLIP-231: Introduce SupportStatisticReport to support reporting statistics from source connectors

2022-05-19 Thread Jark Wu
Hi Godfrey, I just left some comments here: 1) SupportStatisticReport => SupportsStatisticReport All the ability interfaces begin with "Supports" instead of "Support". 2) table.optimizer.source.connect-statistics-enabled The "connect" word should be "collect"? 3) CatalogStatistics I was a littl

Re: [DISCUSS] FLIP-235: Hybrid Shuffle Mode

2022-05-19 Thread Aitozi
Thanks Weijie for your answer. So you mean the hybrid shuffle mode will limit its usage to the bounded source, Right ? One more question, with the bounded data and partly of the stage is running in the Pipelined shuffle mode, what will be the behavior of the task failure, Is the checkpoint enabled

[jira] [Created] (FLINK-27710) Improve logs to better display Execution

2022-05-19 Thread Zhu Zhu (Jira)
Zhu Zhu created FLINK-27710: --- Summary: Improve logs to better display Execution Key: FLINK-27710 URL: https://issues.apache.org/jira/browse/FLINK-27710 Project: Flink Issue Type: Improvement

Re: [DISCUSS] FLIP-235: Hybrid Shuffle Mode

2022-05-19 Thread Xintong Song
Hi Aitozi, In which case we can use the hybrid shuffle mode Just to directly answer this question, in addition to Weijie's explanations. For batch workload, if you want the workload to take advantage of as many resources as available, which ranges from a single slot to as many slots as the total

[VOTE] FLIP-224: Blocklist Mechanism

2022-05-19 Thread Lijie Wang
Hi everyone, Thanks for the feedback for FLIP-224: Blocklist Mechanism [1] on the discussion thread [2] I'd like to start a vote for it. The vote will last for at least 72 hours unless there is an objection or insufficient votes. [1] https://cwiki.apache.org/confluence/display/FLINK/FLIP-224%3A+

Re: [DISCUSS] FLIP-224: Blacklist Mechanism

2022-05-19 Thread Lijie Wang
Hi everyone, I have started a vote for this FLIP [1]. Please cast your vote there or ask additional questions here. [1] https://lists.apache.org/thread/3416vks1j35co9608gkmsoplvcjjz7bg Best, Lijie Lijie Wang 于2022年5月19日周四 17:34写道: > Hi Konstantin, > > We found that Flink REST URL does not supp

Re: [DISCUSS] FLIP-235: Hybrid Shuffle Mode

2022-05-19 Thread Aitozi
Hi Xintong Thanks for your detailed explanation, I misunderstand the spill behavior at first glance, I get your point now. I think it will be a good addition to the current execution mode. Looking forward to it :) Best, Aitozi Xintong Song 于2022年5月20日周五 12:26写道: > Hi Aitozi, > > In which ca

[jira] [Created] (FLINK-27711) Align setTopicPattern for Pulsar Connector

2022-05-19 Thread LuNng Wang (Jira)
LuNng Wang created FLINK-27711: -- Summary: Align setTopicPattern for Pulsar Connector Key: FLINK-27711 URL: https://issues.apache.org/jira/browse/FLINK-27711 Project: Flink Issue Type: Improvemen

[jira] [Created] (FLINK-27712) Job failed to start due to "Time should be non negative"

2022-05-19 Thread Sharon Xie (Jira)
Sharon Xie created FLINK-27712: -- Summary: Job failed to start due to "Time should be non negative" Key: FLINK-27712 URL: https://issues.apache.org/jira/browse/FLINK-27712 Project: Flink Issue Ty

Re: Job Logs - Yarn Application Mode

2022-05-19 Thread Shengkai Fang
Thanks for Biao's explanation. Best, Shengkai Biao Geng 于2022年5月20日周五 11:16写道: > Hi there, > @Zain, Weihua's suggestion should be able to fulfill the request to check > JM logs. If you do want to use YARN cli for running Flink applications, it > is possible to check JM's log with the YARN comma

Re: [DISCUSS] FLIP-232: Add Retry Support For Async I/O In DataStream API

2022-05-19 Thread Lincoln Lee
Hi everyone, By comparing the two internal implementations of delayed retries, we prefer the timer-based solution, which obtains precise delay control through simple logic and only needs to pay (what we consider to be acceptable) timer instance cost for the retry element. The FLIP[1] doc has b