date:20240321



boring-cyborg[bot] commented on PR #108:
URL: 
https://github.com/apache/flink-connector-jdbc/pull/108#issuecomment-2011343599

   Thanks for opening this pull request! Please check out our contributing 
guidelines. (https://flink.apache.org/contributing/how-to-contribute.html)
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@flink.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Re: [PR] [FLINK-34668][checkpoint] Report state handle of file merging directory to JM [flink]



masteryhx commented on code in PR #24513:
URL: https://github.com/apache/flink/pull/24513#discussion_r155369


##
flink-runtime/src/test/java/org/apache/flink/runtime/state/OperatorStateBackendTest.java:
##
@@ -418,6 +430,62 @@ void testSnapshotEmpty() throws Exception {
 assertThat(stateHandle).isNull();
 }
 
+@Test
+void testFileMergingSnapshotEmpty(@TempDir File tmpFolder) throws 
Exception {

Review Comment:
   Could we also test the registery of the new handle ?
   Or test that the subsumed checkpoint will discard correctly.



##
flink-runtime/src/main/java/org/apache/flink/runtime/state/filemerging/DirectoryStreamStateHandle.java:
##
@@ -0,0 +1,92 @@
+/*
+ * Licensed to the Apache Software Foundation (ASF) under one
+ * or more contributor license agreements.  See the NOTICE file
+ * distributed with this work for additional information
+ * regarding copyright ownership.  The ASF licenses this file
+ * to you under the Apache License, Version 2.0 (the
+ * "License"); you may not use this file except in compliance
+ * with the License.  You may obtain a copy of the License at
+ *
+ * http://www.apache.org/licenses/LICENSE-2.0
+ *
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.
+ */
+
+package org.apache.flink.runtime.state.filemerging;
+
+import org.apache.flink.core.fs.FSDataInputStream;
+import org.apache.flink.runtime.state.DirectoryStateHandle;
+import org.apache.flink.runtime.state.PhysicalStateHandleID;
+import org.apache.flink.runtime.state.SharedStateRegistryKey;
+import org.apache.flink.runtime.state.StreamStateHandle;
+import org.apache.flink.util.FileUtils;
+
+import javax.annotation.Nonnull;
+
+import java.io.IOException;
+import java.nio.file.Path;
+import java.util.Optional;
+
+/** Wrap {@link DirectoryStateHandle} to a {@link StreamStateHandle}. */
+public class DirectoryStreamStateHandle extends DirectoryStateHandle 
implements StreamStateHandle {
+
+private static final long serialVersionUID = -6453596108675892492L;
+
+public DirectoryStreamStateHandle(@Nonnull Path directory, long 
directorySize) {
+super(directory, directorySize);
+}
+
+@Override
+public FSDataInputStream openInputStream() {
+throw new UnsupportedOperationException();
+}
+
+@Override
+public Optional asBytesIfInMemory() {
+return Optional.empty();
+}
+
+@Override
+public PhysicalStateHandleID getStreamStateHandleID() {
+return new PhysicalStateHandleID(getDirectory().toString());
+}
+
+@Override
+public boolean equals(Object o) {
+if (this == o) {
+return true;
+}
+if (o == null || getClass() != o.getClass()) {
+return false;
+}
+
+DirectoryStreamStateHandle that = (DirectoryStreamStateHandle) o;
+
+return getDirectory().equals(that.getDirectory());
+}
+
+@Override
+public String toString() {
+return "DirectoryStreamStateHandle{" + "directory=" + getDirectory() + 
'}';
+}
+
+public static DirectoryStreamStateHandle forPathWithSize(@Nonnull Path 
directory) {
+long size;
+try {
+size = FileUtils.getDirectoryFilesSize(directory);
+} catch (IOException e) {
+size = 0L;
+}
+return new DirectoryStreamStateHandle(directory, size);
+}
+
+public static SharedStateRegistryKey createStateRegistryKey(

Review Comment:
   Could this be a member function ?



##
flink-runtime/src/main/java/org/apache/flink/runtime/state/filemerging/SegmentFileStateHandle.java:
##
@@ -62,19 +77,58 @@ public class SegmentFileStateHandle implements 
StreamStateHandle {
  * @param scope The state's scope, whether it is exclusive or shared.
  */
 public SegmentFileStateHandle(
-Path filePath, long startPos, long stateSize, 
CheckpointedStateScope scope) {
+Path directoryPath,
+Path filePath,
+long startPos,
+long stateSize,
+CheckpointedStateScope scope) {
 this.filePath = filePath;
 this.stateSize = stateSize;
 this.startPos = startPos;
 this.scope = scope;
+this.directoryStateHandle =
+DirectoryStreamStateHandle.forPathWithSize(
+new File(directoryPath.getPath()).toPath());

Review Comment:
   +1, At least we should not calculate for every SegmentFileStateHandle in the 
same directory.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues

[jira] [Commented] (FLINK-34898) Cannot create named STRUCT with a single field

2024-03-21 Thread Chloe He (Jira)



[ 
https://issues.apache.org/jira/browse/FLINK-34898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17829402#comment-17829402
 ] 

Chloe He commented on FLINK-34898:
--

[~hackergin] Thanks for the pointer. 
{code:java}
SELECT CAST(ROW(1) as ROW) AS row1; {code}
works for me.

I want to also wrap this in an ARRAY, so that the data in this cell looks like 
`[\{"a": 1}]` (i.e., it's an ARRAY constructed from named STRUCTs). 
{code:java}
SELECT ARRAY[CAST(ROW(1) as ROW)] AS row1; {code}
does not work and it seems that the only way that I can get this to work 
properly is to wrap this in two ROWs, i.e.,
{code:java}
SELECT ROW(ROW(CAST(ROW(1) as ROW))) AS row1; {code}
Is this the only way to achieve this?

> Cannot create named STRUCT with a single field
> --
>
> Key: FLINK-34898
> URL: https://issues.apache.org/jira/browse/FLINK-34898
> Project: Flink
>  Issue Type: Bug
>Reporter: Chloe He
>Priority: Major
> Attachments: image-2024-03-21-12-00-00-183.png
>
>
> I'm trying to create named structs using Flink SQL and I found a previous 
> ticket https://issues.apache.org/jira/browse/FLINK-9161 that mentions the use 
> of the following syntax:
> {code:java}
> SELECT CAST(('a', 1) as ROW) AS row1;
> {code}
> However, my named struct has a single field and effectively it should look 
> something like `\{"a": 1}`. I can't seem to be able to find a way to 
> construct this. I have experimented with a few different syntax and it either 
> throws parsing error or casting error:
> {code:java}
> Cast function cannot convert value of type INTEGER to type 
> RecordType(VARCHAR(2147483647) a) {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Created] (FLINK-34902) INSERT INTO column mismatch leads to IndexOutOfBoundsException

2024-03-21 Thread Timo Walther (Jira)

Timo Walther created FLINK-34902:


 Summary: INSERT INTO column mismatch leads to 
IndexOutOfBoundsException
 Key: FLINK-34902
 URL: https://issues.apache.org/jira/browse/FLINK-34902
 Project: Flink
  Issue Type: Bug
  Components: Table SQL / Planner
Reporter: Timo Walther


SQL:
{code}
INSERT INTO t (a, b) SELECT 1;
{code}

 

Stack trace:
{code}

org.apache.flink.table.api.ValidationException: SQL validation failed. Index 1 
out of bounds for length 1
    at 
org.apache.flink.table.planner.calcite.FlinkPlannerImpl.org$apache$flink$table$planner$calcite$FlinkPlannerImpl$$validate(FlinkPlannerImpl.scala:200)
    at 
org.apache.flink.table.planner.calcite.FlinkPlannerImpl.validate(FlinkPlannerImpl.scala:117)
    at
Caused by: java.lang.IndexOutOfBoundsException: Index 1 out of bounds for 
length 1
    at 
java.base/jdk.internal.util.Preconditions.outOfBounds(Preconditions.java:64)
    at 
java.base/jdk.internal.util.Preconditions.outOfBoundsCheckIndex(Preconditions.java:70)
    at 
java.base/jdk.internal.util.Preconditions.checkIndex(Preconditions.java:248)
    at java.base/java.util.Objects.checkIndex(Objects.java:374)
    at java.base/java.util.ArrayList.get(ArrayList.java:459)
    at 
org.apache.flink.table.planner.calcite.PreValidateReWriter$.$anonfun$reorder$1(PreValidateReWriter.scala:355)
    at 
org.apache.flink.table.planner.calcite.PreValidateReWriter$.$anonfun$reorder$1$adapted(PreValidateReWriter.scala:355)

{code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Re: [PR] [FLINK-34731][runtime] Remove SpeculativeScheduler and incorporate its features into AdaptiveBatchScheduler. [flink]



zhuzhurk commented on code in PR #24524:
URL: https://github.com/apache/flink/pull/24524#discussion_r1533346303


##
flink-runtime/src/test/java/org/apache/flink/runtime/scheduler/DefaultSchedulerBuilder.java:
##
@@ -322,39 +321,14 @@ public DefaultScheduler build() throws Exception {
 }
 
 public AdaptiveBatchScheduler buildAdaptiveBatchJobScheduler() throws 
Exception {
-return new AdaptiveBatchScheduler(
-log,
-jobGraph,
-ioExecutor,
-jobMasterConfiguration,
-componentMainThreadExecutor -> {},
-delayExecutor,
-userCodeLoader,
-checkpointCleaner,
-checkpointRecoveryFactory,
-jobManagerJobMetricGroup,
-new 
VertexwiseSchedulingStrategy.Factory(inputConsumableDeciderFactory),
-failoverStrategyFactory,
-restartBackoffTimeStrategy,
-executionOperations,
-executionVertexVersioner,
-executionSlotAllocatorFactory,
-System.currentTimeMillis(),
-mainThreadExecutor,
-jobStatusListener,
-failureEnrichers,
-createExecutionGraphFactory(true),
-shuffleMaster,
-rpcTimeout,
-vertexParallelismAndInputInfosDecider,
-defaultMaxParallelism,
-hybridPartitionDataConsumeConstraint,
-
ForwardGroupComputeUtil.computeForwardGroupsAndCheckParallelism(
-jobGraph.getVerticesSortedTopologicallyFromSources()));
+return buildAdaptiveBatchJobScheduler(false);
 }
 
-public SpeculativeScheduler buildSpeculativeScheduler() throws Exception {
-return new SpeculativeScheduler(
+public AdaptiveBatchScheduler buildAdaptiveBatchJobScheduler(boolean 
enableSpeculativeExecution)
+throws Exception {
+jobMasterConfiguration.set(
+BatchExecutionOptions.SPECULATIVE_ENABLED, 
enableSpeculativeExecution);
+return new AdaptiveBatchScheduler(

Review Comment:
   Is it possible that we use `AdaptiveBatchSchedulerFactory` to create 
scheduler, so that more production code can be covered by this test?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@flink.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

[jira] [Updated] (FLINK-34731) Remove SpeculativeScheduler and incorporate its features into AdaptiveBatchScheduler

2024-03-21 Thread ASF GitHub Bot (Jira)



 [ 
https://issues.apache.org/jira/browse/FLINK-34731?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated FLINK-34731:
---
Labels: pull-request-available  (was: )

> Remove SpeculativeScheduler and incorporate its features into 
> AdaptiveBatchScheduler
> 
>
> Key: FLINK-34731
> URL: https://issues.apache.org/jira/browse/FLINK-34731
> Project: Flink
>  Issue Type: Technical Debt
>  Components: Runtime / Coordination
>Reporter: Junrui Li
>Assignee: Junrui Li
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.20.0
>
>
> Presently, speculative execution is exposed to users as a feature of the 
> AdaptiveBatchScheduler.
> To streamline our codebase and reduce maintenance overhead, this ticket will 
> consolidate the SpeculativeScheduler into the AdaptiveBatchScheduler, 
> eliminating the need for a separate SpeculativeScheduler class.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Re: [PR] [FLINK-26088][Connectors/ElasticSearch] Add Elasticsearch 8.0 support [flink-connector-elasticsearch]



drorventura commented on PR #53:
URL: 
https://github.com/apache/flink-connector-elasticsearch/pull/53#issuecomment-2011373385

   when is the next release planned? 


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@flink.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

[jira] [Created] (FLINK-34903) Add mysql-pipeline-connector with table.exclude.list option to exclude unnecessary tables

2024-03-21 Thread shiyuyang (Jira)

shiyuyang created FLINK-34903:
-

 Summary: Add mysql-pipeline-connector with  table.exclude.list 
option to exclude unnecessary tables 
 Key: FLINK-34903
 URL: https://issues.apache.org/jira/browse/FLINK-34903
 Project: Flink
  Issue Type: Improvement
  Components: Flink CDC
Reporter: shiyuyang
 Fix For: cdc-3.1.0


    When using the MySQL Pipeline connector for whole-database synchronization, 
users currently cannot exclude unnecessary tables. Taking reference from 
Debezium's parameters, specifically the {*}table.exclude.list{*}, if the 
*table.include.list* is declared, then the *table.exclude.list* parameter will 
not take effect. However, the tables specified in the tables parameter of the 
MySQL Pipeline connector are effectively added to the *table.include.list* in 
Debezium's context.

    In summary, it is necessary to introduce an externally-exposed 
*table.exclude.list* parameter within the MySQL Pipeline connector to 
facilitate the exclusion of tables. This is because the current setup does not 
allow for excluding unnecessary tables when including others through the tables 
parameter.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Commented] (FLINK-34903) Add mysql-pipeline-connector with table.exclude.list option to exclude unnecessary tables



[ 
https://issues.apache.org/jira/browse/FLINK-34903?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17829418#comment-17829418
 ] 

Thorne commented on FLINK-34903:


I will take a pr for this

> Add mysql-pipeline-connector with  table.exclude.list option to exclude 
> unnecessary tables 
> ---
>
> Key: FLINK-34903
> URL: https://issues.apache.org/jira/browse/FLINK-34903
> Project: Flink
>  Issue Type: Improvement
>  Components: Flink CDC
>Reporter: Thorne
>Priority: Major
>  Labels: cdc
> Fix For: cdc-3.1.0
>
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
>     When using the MySQL Pipeline connector for whole-database 
> synchronization, users currently cannot exclude unnecessary tables. Taking 
> reference from Debezium's parameters, specifically the 
> {*}table.exclude.list{*}, if the *table.include.list* is declared, then the 
> *table.exclude.list* parameter will not take effect. However, the tables 
> specified in the tables parameter of the MySQL Pipeline connector are 
> effectively added to the *table.include.list* in Debezium's context.
>     In summary, it is necessary to introduce an externally-exposed 
> *table.exclude.list* parameter within the MySQL Pipeline connector to 
> facilitate the exclusion of tables. This is because the current setup does 
> not allow for excluding unnecessary tables when including others through the 
> tables parameter.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Updated] (FLINK-34903) Add mysql-pipeline-connector with table.exclude.list option to exclude unnecessary tables



 [ 
https://issues.apache.org/jira/browse/FLINK-34903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thorne updated FLINK-34903:
---
Docs Text:   (was: i will take a pr for this)

> Add mysql-pipeline-connector with  table.exclude.list option to exclude 
> unnecessary tables 
> ---
>
> Key: FLINK-34903
> URL: https://issues.apache.org/jira/browse/FLINK-34903
> Project: Flink
>  Issue Type: Improvement
>  Components: Flink CDC
>Reporter: Thorne
>Priority: Major
>  Labels: cdc
> Fix For: cdc-3.1.0
>
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
>     When using the MySQL Pipeline connector for whole-database 
> synchronization, users currently cannot exclude unnecessary tables. Taking 
> reference from Debezium's parameters, specifically the 
> {*}table.exclude.list{*}, if the *table.include.list* is declared, then the 
> *table.exclude.list* parameter will not take effect. However, the tables 
> specified in the tables parameter of the MySQL Pipeline connector are 
> effectively added to the *table.include.list* in Debezium's context.
>     In summary, it is necessary to introduce an externally-exposed 
> *table.exclude.list* parameter within the MySQL Pipeline connector to 
> facilitate the exclusion of tables. This is because the current setup does 
> not allow for excluding unnecessary tables when including others through the 
> tables parameter.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Updated] (FLINK-34903) Add mysql-pipeline-connector with table.exclude.list option to exclude unnecessary tables



 [ 
https://issues.apache.org/jira/browse/FLINK-34903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thorne updated FLINK-34903:
---
Attachment: screenshot-1.png

> Add mysql-pipeline-connector with  table.exclude.list option to exclude 
> unnecessary tables 
> ---
>
> Key: FLINK-34903
> URL: https://issues.apache.org/jira/browse/FLINK-34903
> Project: Flink
>  Issue Type: Improvement
>  Components: Flink CDC
>Reporter: Thorne
>Priority: Major
>  Labels: cdc
> Fix For: cdc-3.1.0
>
> Attachments: screenshot-1.png
>
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
>     When using the MySQL Pipeline connector for whole-database 
> synchronization, users currently cannot exclude unnecessary tables. Taking 
> reference from Debezium's parameters, specifically the 
> {*}table.exclude.list{*}, if the *table.include.list* is declared, then the 
> *table.exclude.list* parameter will not take effect. However, the tables 
> specified in the tables parameter of the MySQL Pipeline connector are 
> effectively added to the *table.include.list* in Debezium's context.
>     In summary, it is necessary to introduce an externally-exposed 
> *table.exclude.list* parameter within the MySQL Pipeline connector to 
> facilitate the exclusion of tables. This is because the current setup does 
> not allow for excluding unnecessary tables when including others through the 
> tables parameter.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Updated] (FLINK-34903) Add mysql-pipeline-connector with table.exclude.list option to exclude unnecessary tables



 [ 
https://issues.apache.org/jira/browse/FLINK-34903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thorne updated FLINK-34903:
---
Attachment: screenshot-2.png

> Add mysql-pipeline-connector with  table.exclude.list option to exclude 
> unnecessary tables 
> ---
>
> Key: FLINK-34903
> URL: https://issues.apache.org/jira/browse/FLINK-34903
> Project: Flink
>  Issue Type: Improvement
>  Components: Flink CDC
>Reporter: Thorne
>Priority: Major
>  Labels: cdc
> Fix For: cdc-3.1.0
>
> Attachments: screenshot-1.png, screenshot-2.png
>
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
>     When using the MySQL Pipeline connector for whole-database 
> synchronization, users currently cannot exclude unnecessary tables. Taking 
> reference from Debezium's parameters, specifically the 
> {*}table.exclude.list{*}, if the *table.include.list* is declared, then the 
> *table.exclude.list* parameter will not take effect. However, the tables 
> specified in the tables parameter of the MySQL Pipeline connector are 
> effectively added to the *table.include.list* in Debezium's context.
>     In summary, it is necessary to introduce an externally-exposed 
> *table.exclude.list* parameter within the MySQL Pipeline connector to 
> facilitate the exclusion of tables. This is because the current setup does 
> not allow for excluding unnecessary tables when including others through the 
> tables parameter.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Updated] (FLINK-34903) Add mysql-pipeline-connector with table.exclude.list option to exclude unnecessary tables



 [ 
https://issues.apache.org/jira/browse/FLINK-34903?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Thorne updated FLINK-34903:
---
Attachment: screenshot-3.png

> Add mysql-pipeline-connector with  table.exclude.list option to exclude 
> unnecessary tables 
> ---
>
> Key: FLINK-34903
> URL: https://issues.apache.org/jira/browse/FLINK-34903
> Project: Flink
>  Issue Type: Improvement
>  Components: Flink CDC
>Reporter: Thorne
>Priority: Major
>  Labels: cdc
> Fix For: cdc-3.1.0
>
> Attachments: screenshot-1.png, screenshot-2.png, screenshot-3.png
>
>   Original Estimate: 72h
>  Remaining Estimate: 72h
>
>     When using the MySQL Pipeline connector for whole-database 
> synchronization, users currently cannot exclude unnecessary tables. Taking 
> reference from Debezium's parameters, specifically the 
> {*}table.exclude.list{*}, if the *table.include.list* is declared, then the 
> *table.exclude.list* parameter will not take effect. However, the tables 
> specified in the tables parameter of the MySQL Pipeline connector are 
> effectively added to the *table.include.list* in Debezium's context.
>     In summary, it is necessary to introduce an externally-exposed 
> *table.exclude.list* parameter within the MySQL Pipeline connector to 
> facilitate the exclusion of tables. This is because the current setup does 
> not allow for excluding unnecessary tables when including others through the 
> tables parameter.



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Re: [PR] [Flink 32701] [cep] Fix CEP Operator Memory Leak Issue [flink]



dawidwys merged PR #24084:
URL: https://github.com/apache/flink/pull/24084


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@flink.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

[jira] [Assigned] (FLINK-32701) Potential Memory Leak in Flink CEP due to Persistent Starting States in NFAState

2024-03-21 Thread Dawid Wysakowicz (Jira)



 [ 
https://issues.apache.org/jira/browse/FLINK-32701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dawid Wysakowicz reassigned FLINK-32701:


Assignee: Puneet Duggal

> Potential Memory Leak in Flink CEP due to Persistent Starting States in 
> NFAState
> 
>
> Key: FLINK-32701
> URL: https://issues.apache.org/jira/browse/FLINK-32701
> Project: Flink
>  Issue Type: Bug
>  Components: Library / CEP
>Affects Versions: 1.17.0, 1.16.1, 1.16.2, 1.17.1
>Reporter: Puneet Duggal
>Assignee: Puneet Duggal
>Priority: Major
>  Labels: CEP, auto-deprioritized-critical, cep
> Attachments: Screenshot 2023-07-26 at 11.45.06 AM.png, Screenshot 
> 2023-07-26 at 11.50.28 AM.png
>
>
> Our team has encountered a potential memory leak issue while working with the 
> Complex Event Processing (CEP) library in Flink v1.17.
> h2. Context
> The CEP Operator maintains a keyed state called NFAState, which holds two 
> queues: one for partial matches and one for completed matches. When a key is 
> first encountered, the CEP creates a starting computation state and stores it 
> in the partial matches queue. As more events occur that match the defined 
> conditions (e.g., a TAKE condition), additional computation states get added 
> to the queue, with their specific type (normal, pending, end) depending on 
> the pattern sequence.
> However, I have noticed that the starting computation state remains in the 
> partial matches queue even after the pattern sequence has been completely 
> matched. This is also the case for keys that have already timed out. As a 
> result, the state gets stored for all keys that the CEP ever encounters, 
> leading to a continual increase in the checkpoint size.
> h2.  How to reproduce this
>  # Pattern Sequence - A not_followed_by B within 5 mins
>  # Time Characteristic - EventTime
>  # StateBackend - HashMapStateBackend
> On my local machine, I started this pipeline and started sending events at 
> the rate of 10 events per second (only A) and as expected after 5 mins, CEP 
> started sending pattern matched output with the same rate. But the issue was 
> that after every 2 mins (checkpoint interval), checkpoint size kept on 
> increasing. Expectation was that after 5 mins (2-3 checkpoints), checkpoint 
> size will remain constant since any window of 5 mins will consist of the same 
> number of unique keys (older ones will get matched or timed out hence removed 
> from state). But as you can see below attached images, checkpoint size kept 
> on increasing till 40 checkpoints (around 1.5hrs).
> P.S. - After 3 checkpoints (6 mins), the checkpoint size was around 1.78MB. 
> Hence assumption is that ideal checkpoint size for a 5 min window should be 
> less than 1.78MB.
> As you can see after 39 checkpoints, I triggered a savepoint for this 
> pipeline. After that I used a savepoint reader to investigate what all is 
> getting stored in CEP states. Below code investigates NFAState of CEPOperator 
> for potential memory leak.
> {code:java}
> import lombok.AllArgsConstructor;
> import lombok.Data;
> import lombok.NoArgsConstructor;
> import org.apache.flink.api.common.state.ValueState;
> import org.apache.flink.api.common.state.ValueStateDescriptor;
> import org.apache.flink.cep.nfa.NFAState;
> import org.apache.flink.cep.nfa.NFAStateSerializer;
> import org.apache.flink.configuration.Configuration;
> import org.apache.flink.runtime.state.filesystem.FsStateBackend;
> import org.apache.flink.state.api.OperatorIdentifier;
> import org.apache.flink.state.api.SavepointReader;
> import org.apache.flink.state.api.functions.KeyedStateReaderFunction;
> import org.apache.flink.streaming.api.datastream.DataStream;
> import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
> import org.apache.flink.util.Collector;
> import org.junit.jupiter.api.Test;
> import java.io.Serializable;
> import java.util.Objects;
> public class NFAStateReaderTest {
> private static final String NFA_STATE_NAME = "nfaStateName";
> @Test
> public void testNfaStateReader() throws Exception {
> StreamExecutionEnvironment environment = 
> StreamExecutionEnvironment.getExecutionEnvironment();
> SavepointReader savepointReader =
> SavepointReader.read(environment, 
> "file:///opt/flink/savepoints/savepoint-093404-9bc0a38654df", new 
> FsStateBackend("file:///abc"));
> DataStream stream = 
> savepointReader.readKeyedState(OperatorIdentifier.forUid("select_pattern_events"),
>  new NFAStateReaderTest.NFAStateReaderFunction());
> stream.print();
> environment.execute();
> }
> static class NFAStateReaderFunction extends 
> KeyedStateReaderFunction {
> private ValueState computationStates;

[jira] [Closed] (FLINK-32701) Potential Memory Leak in Flink CEP due to Persistent Starting States in NFAState

2024-03-21 Thread Dawid Wysakowicz (Jira)



 [ 
https://issues.apache.org/jira/browse/FLINK-32701?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Dawid Wysakowicz closed FLINK-32701.

Fix Version/s: 1.20.0
   Resolution: Fixed

Fixed in a9cde49118bab4b32b2d1ae1f97beb94eb967f9b

> Potential Memory Leak in Flink CEP due to Persistent Starting States in 
> NFAState
> 
>
> Key: FLINK-32701
> URL: https://issues.apache.org/jira/browse/FLINK-32701
> Project: Flink
>  Issue Type: Bug
>  Components: Library / CEP
>Affects Versions: 1.17.0, 1.16.1, 1.16.2, 1.17.1
>Reporter: Puneet Duggal
>Assignee: Puneet Duggal
>Priority: Major
>  Labels: CEP, auto-deprioritized-critical, cep
> Fix For: 1.20.0
>
> Attachments: Screenshot 2023-07-26 at 11.45.06 AM.png, Screenshot 
> 2023-07-26 at 11.50.28 AM.png
>
>
> Our team has encountered a potential memory leak issue while working with the 
> Complex Event Processing (CEP) library in Flink v1.17.
> h2. Context
> The CEP Operator maintains a keyed state called NFAState, which holds two 
> queues: one for partial matches and one for completed matches. When a key is 
> first encountered, the CEP creates a starting computation state and stores it 
> in the partial matches queue. As more events occur that match the defined 
> conditions (e.g., a TAKE condition), additional computation states get added 
> to the queue, with their specific type (normal, pending, end) depending on 
> the pattern sequence.
> However, I have noticed that the starting computation state remains in the 
> partial matches queue even after the pattern sequence has been completely 
> matched. This is also the case for keys that have already timed out. As a 
> result, the state gets stored for all keys that the CEP ever encounters, 
> leading to a continual increase in the checkpoint size.
> h2.  How to reproduce this
>  # Pattern Sequence - A not_followed_by B within 5 mins
>  # Time Characteristic - EventTime
>  # StateBackend - HashMapStateBackend
> On my local machine, I started this pipeline and started sending events at 
> the rate of 10 events per second (only A) and as expected after 5 mins, CEP 
> started sending pattern matched output with the same rate. But the issue was 
> that after every 2 mins (checkpoint interval), checkpoint size kept on 
> increasing. Expectation was that after 5 mins (2-3 checkpoints), checkpoint 
> size will remain constant since any window of 5 mins will consist of the same 
> number of unique keys (older ones will get matched or timed out hence removed 
> from state). But as you can see below attached images, checkpoint size kept 
> on increasing till 40 checkpoints (around 1.5hrs).
> P.S. - After 3 checkpoints (6 mins), the checkpoint size was around 1.78MB. 
> Hence assumption is that ideal checkpoint size for a 5 min window should be 
> less than 1.78MB.
> As you can see after 39 checkpoints, I triggered a savepoint for this 
> pipeline. After that I used a savepoint reader to investigate what all is 
> getting stored in CEP states. Below code investigates NFAState of CEPOperator 
> for potential memory leak.
> {code:java}
> import lombok.AllArgsConstructor;
> import lombok.Data;
> import lombok.NoArgsConstructor;
> import org.apache.flink.api.common.state.ValueState;
> import org.apache.flink.api.common.state.ValueStateDescriptor;
> import org.apache.flink.cep.nfa.NFAState;
> import org.apache.flink.cep.nfa.NFAStateSerializer;
> import org.apache.flink.configuration.Configuration;
> import org.apache.flink.runtime.state.filesystem.FsStateBackend;
> import org.apache.flink.state.api.OperatorIdentifier;
> import org.apache.flink.state.api.SavepointReader;
> import org.apache.flink.state.api.functions.KeyedStateReaderFunction;
> import org.apache.flink.streaming.api.datastream.DataStream;
> import org.apache.flink.streaming.api.environment.StreamExecutionEnvironment;
> import org.apache.flink.util.Collector;
> import org.junit.jupiter.api.Test;
> import java.io.Serializable;
> import java.util.Objects;
> public class NFAStateReaderTest {
> private static final String NFA_STATE_NAME = "nfaStateName";
> @Test
> public void testNfaStateReader() throws Exception {
> StreamExecutionEnvironment environment = 
> StreamExecutionEnvironment.getExecutionEnvironment();
> SavepointReader savepointReader =
> SavepointReader.read(environment, 
> "file:///opt/flink/savepoints/savepoint-093404-9bc0a38654df", new 
> FsStateBackend("file:///abc"));
> DataStream stream = 
> savepointReader.readKeyedState(OperatorIdentifier.forUid("select_pattern_events"),
>  new NFAStateReaderTest.NFAStateReaderFunction());
> stream.print();
> environment.execute();
> }
> static class NFAStateRea

[jira] [Created] (FLINK-34904) [Feature] submit Flink CDC pipeline job to yarn application cluster.

2024-03-21 Thread ZhengYu Chen (Jira)

ZhengYu Chen created FLINK-34904:


 Summary: [Feature] submit Flink CDC pipeline job to yarn 
application cluster.
 Key: FLINK-34904
 URL: https://issues.apache.org/jira/browse/FLINK-34904
 Project: Flink
  Issue Type: Improvement
  Components: Flink CDC
Affects Versions: 3.1.0
Reporter: ZhengYu Chen
 Fix For: 3.1.0


support flink cdc cli submit pipeline job to yarn application cluster.discuss 
in FLINK-34853



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Commented] (FLINK-34904) [Feature] submit Flink CDC pipeline job to yarn application cluster.

2024-03-21 Thread ZhengYu Chen (Jira)



[ 
https://issues.apache.org/jira/browse/FLINK-34904?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17829434#comment-17829434
 ] 

ZhengYu Chen commented on FLINK-34904:
--

cc [~czy006] 

> [Feature] submit Flink CDC pipeline job to yarn application cluster.
> 
>
> Key: FLINK-34904
> URL: https://issues.apache.org/jira/browse/FLINK-34904
> Project: Flink
>  Issue Type: Improvement
>  Components: Flink CDC
>Affects Versions: 3.1.0
>Reporter: ZhengYu Chen
>Priority: Minor
> Fix For: 3.1.0
>
>
> support flink cdc cli submit pipeline job to yarn application cluster.discuss 
> in FLINK-34853



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Re: [PR] [FLINK-26088][Connectors/ElasticSearch] Add Elasticsearch 8.0 support [flink-connector-elasticsearch]



reswqa commented on PR #53:
URL: 
https://github.com/apache/flink-connector-elasticsearch/pull/53#issuecomment-2011649582

   > when is the next release planned?
   
   TBH, I'm not really sure. But I think we will probably release a series of 
connectors that supporting flink-1.19 in the near future. You can make a 
release request on the flink mailing list.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@flink.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Re: [PR] [FLINK-34892][ci] Fix python test failure due to config file change [flink-connector-aws]



dannycranmer merged PR #133:
URL: https://github.com/apache/flink-connector-aws/pull/133


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@flink.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

[jira] [Commented] (FLINK-34892) Nightly AWS connectors build fails on running python tests

2024-03-21 Thread Danny Cranmer (Jira)



[ 
https://issues.apache.org/jira/browse/FLINK-34892?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17829440#comment-17829440
 ] 

Danny Cranmer commented on FLINK-34892:
---

Merged commit 
[{{e8ba71e}}|https://github.com/apache/flink-connector-aws/commit/e8ba71ec3c27903c838701d536a8ae05bc5bb523]
 into apache:main 

> Nightly AWS connectors build fails on running python tests
> --
>
> Key: FLINK-34892
> URL: https://issues.apache.org/jira/browse/FLINK-34892
> Project: Flink
>  Issue Type: Bug
>  Components: Connectors / AWS
>Affects Versions: aws-connector-4.2.0
>Reporter: Aleksandr Pilipenko
>Priority: Major
>  Labels: pull-request-available
>
> Build for externalized python connector code fails: 
> [https://github.com/apache/flink-connector-aws/actions/runs/8351768294/job/22860710449]
> {code:java}
> 2024-03-20T00:14:35.5215863Z __ 
> FlinkKinesisTest.test_kinesis_streams_sink __
> 2024-03-20T00:14:35.5216781Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/testing/test_case_utils.py:149:
>  in setUp
> 2024-03-20T00:14:35.5217584Z self.env = 
> StreamExecutionEnvironment.get_execution_environment()
> 2024-03-20T00:14:35.5218901Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/datastream/stream_execution_environment.py:876:
>  in get_execution_environment
> 2024-03-20T00:14:35.5219751Z gateway = get_gateway()
> 2024-03-20T00:14:35.5220635Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/java_gateway.py:64: in 
> get_gateway
> 2024-03-20T00:14:35.5221378Z _gateway = launch_gateway()
> 2024-03-20T00:14:35.5222111Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/java_gateway.py:110: 
> in launch_gateway
> 2024-03-20T00:14:35.5222956Z p = launch_gateway_server_process(env, args)
> 2024-03-20T00:14:35.5223854Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/pyflink_gateway_server.py:262:
>  in launch_gateway_server_process
> 2024-03-20T00:14:35.5224649Z java_executable = find_java_executable()
> 2024-03-20T00:14:35.5225583Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/pyflink_gateway_server.py:75:
>  in find_java_executable
> 2024-03-20T00:14:35.5226449Z java_home = 
> read_from_config(KEY_ENV_JAVA_HOME, None, flink_conf_file)
> 2024-03-20T00:14:35.5227099Z _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
> _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
> 2024-03-20T00:14:35.5227450Z 
> 2024-03-20T00:14:35.5227774Z key = 'env.java.home', default_value = None
> 2024-03-20T00:14:35.5228925Z flink_conf_file = 
> '/home/runner/work/flink-connector-aws/flink-connector-aws/flink-python/.tox/py310-cython/lib/python3.10/site-packages/pyflink/conf/flink-conf.yaml'
> 2024-03-20T00:14:35.5229778Z 
> 2024-03-20T00:14:35.5230010Z def read_from_config(key, default_value, 
> flink_conf_file):
> 2024-03-20T00:14:35.5230581Z value = default_value
> 2024-03-20T00:14:35.5231236Z # get the realpath of tainted path value 
> to avoid CWE22 problem that constructs a path or URI
> 2024-03-20T00:14:35.5232195Z # using the tainted value and might 
> allow an attacker to access, modify, or test the existence
> 2024-03-20T00:14:35.5232940Z # of critical or sensitive files.
> 2024-03-20T00:14:35.5233417Z >   with 
> open(os.path.realpath(flink_conf_file), "r") as f:
> 2024-03-20T00:14:35.5234874Z E   FileNotFoundError: [Errno 2] No such 
> file or directory: 
> '/home/runner/work/flink-connector-aws/flink-connector-aws/flink-python/.tox/py310-cython/lib/python3.10/site-packages/pyflink/conf/flink-conf.yaml'
> 2024-03-20T00:14:35.5235954Z 
> 2024-03-20T00:14:35.5236484Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/pyflink_gateway_server.py:58:
>  FileNotFoundError {code}
> Failure started after the release of apache-flink python package for 1.19.0 
> due to change of default config file provided within artifact.
>  
>  
> Issue comes from outdated copy of pyflink_gateway_server.py created as part 
> of [https://github.com/apache/flink-connector-kafka/pull/69] (same change is 
> duplicated in AWS connectors repository).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Assigned] (FLINK-34892) Nightly AWS connectors build fails on running python tests

2024-03-21 Thread Danny Cranmer (Jira)



 [ 
https://issues.apache.org/jira/browse/FLINK-34892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Danny Cranmer reassigned FLINK-34892:
-

Assignee: Aleksandr Pilipenko

> Nightly AWS connectors build fails on running python tests
> --
>
> Key: FLINK-34892
> URL: https://issues.apache.org/jira/browse/FLINK-34892
> Project: Flink
>  Issue Type: Bug
>  Components: Connectors / AWS
>Affects Versions: aws-connector-4.2.0
>Reporter: Aleksandr Pilipenko
>Assignee: Aleksandr Pilipenko
>Priority: Major
>  Labels: pull-request-available
>
> Build for externalized python connector code fails: 
> [https://github.com/apache/flink-connector-aws/actions/runs/8351768294/job/22860710449]
> {code:java}
> 2024-03-20T00:14:35.5215863Z __ 
> FlinkKinesisTest.test_kinesis_streams_sink __
> 2024-03-20T00:14:35.5216781Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/testing/test_case_utils.py:149:
>  in setUp
> 2024-03-20T00:14:35.5217584Z self.env = 
> StreamExecutionEnvironment.get_execution_environment()
> 2024-03-20T00:14:35.5218901Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/datastream/stream_execution_environment.py:876:
>  in get_execution_environment
> 2024-03-20T00:14:35.5219751Z gateway = get_gateway()
> 2024-03-20T00:14:35.5220635Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/java_gateway.py:64: in 
> get_gateway
> 2024-03-20T00:14:35.5221378Z _gateway = launch_gateway()
> 2024-03-20T00:14:35.5222111Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/java_gateway.py:110: 
> in launch_gateway
> 2024-03-20T00:14:35.5222956Z p = launch_gateway_server_process(env, args)
> 2024-03-20T00:14:35.5223854Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/pyflink_gateway_server.py:262:
>  in launch_gateway_server_process
> 2024-03-20T00:14:35.5224649Z java_executable = find_java_executable()
> 2024-03-20T00:14:35.5225583Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/pyflink_gateway_server.py:75:
>  in find_java_executable
> 2024-03-20T00:14:35.5226449Z java_home = 
> read_from_config(KEY_ENV_JAVA_HOME, None, flink_conf_file)
> 2024-03-20T00:14:35.5227099Z _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
> _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
> 2024-03-20T00:14:35.5227450Z 
> 2024-03-20T00:14:35.5227774Z key = 'env.java.home', default_value = None
> 2024-03-20T00:14:35.5228925Z flink_conf_file = 
> '/home/runner/work/flink-connector-aws/flink-connector-aws/flink-python/.tox/py310-cython/lib/python3.10/site-packages/pyflink/conf/flink-conf.yaml'
> 2024-03-20T00:14:35.5229778Z 
> 2024-03-20T00:14:35.5230010Z def read_from_config(key, default_value, 
> flink_conf_file):
> 2024-03-20T00:14:35.5230581Z value = default_value
> 2024-03-20T00:14:35.5231236Z # get the realpath of tainted path value 
> to avoid CWE22 problem that constructs a path or URI
> 2024-03-20T00:14:35.5232195Z # using the tainted value and might 
> allow an attacker to access, modify, or test the existence
> 2024-03-20T00:14:35.5232940Z # of critical or sensitive files.
> 2024-03-20T00:14:35.5233417Z >   with 
> open(os.path.realpath(flink_conf_file), "r") as f:
> 2024-03-20T00:14:35.5234874Z E   FileNotFoundError: [Errno 2] No such 
> file or directory: 
> '/home/runner/work/flink-connector-aws/flink-connector-aws/flink-python/.tox/py310-cython/lib/python3.10/site-packages/pyflink/conf/flink-conf.yaml'
> 2024-03-20T00:14:35.5235954Z 
> 2024-03-20T00:14:35.5236484Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/pyflink_gateway_server.py:58:
>  FileNotFoundError {code}
> Failure started after the release of apache-flink python package for 1.19.0 
> due to change of default config file provided within artifact.
>  
>  
> Issue comes from outdated copy of pyflink_gateway_server.py created as part 
> of [https://github.com/apache/flink-connector-kafka/pull/69] (same change is 
> duplicated in AWS connectors repository).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Resolved] (FLINK-34892) Nightly AWS connectors build fails on running python tests

2024-03-21 Thread Danny Cranmer (Jira)



 [ 
https://issues.apache.org/jira/browse/FLINK-34892?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Danny Cranmer resolved FLINK-34892.
---
Resolution: Fixed

> Nightly AWS connectors build fails on running python tests
> --
>
> Key: FLINK-34892
> URL: https://issues.apache.org/jira/browse/FLINK-34892
> Project: Flink
>  Issue Type: Bug
>  Components: Connectors / AWS
>Affects Versions: aws-connector-4.2.0
>Reporter: Aleksandr Pilipenko
>Assignee: Aleksandr Pilipenko
>Priority: Major
>  Labels: pull-request-available
>
> Build for externalized python connector code fails: 
> [https://github.com/apache/flink-connector-aws/actions/runs/8351768294/job/22860710449]
> {code:java}
> 2024-03-20T00:14:35.5215863Z __ 
> FlinkKinesisTest.test_kinesis_streams_sink __
> 2024-03-20T00:14:35.5216781Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/testing/test_case_utils.py:149:
>  in setUp
> 2024-03-20T00:14:35.5217584Z self.env = 
> StreamExecutionEnvironment.get_execution_environment()
> 2024-03-20T00:14:35.5218901Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/datastream/stream_execution_environment.py:876:
>  in get_execution_environment
> 2024-03-20T00:14:35.5219751Z gateway = get_gateway()
> 2024-03-20T00:14:35.5220635Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/java_gateway.py:64: in 
> get_gateway
> 2024-03-20T00:14:35.5221378Z _gateway = launch_gateway()
> 2024-03-20T00:14:35.5222111Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/java_gateway.py:110: 
> in launch_gateway
> 2024-03-20T00:14:35.5222956Z p = launch_gateway_server_process(env, args)
> 2024-03-20T00:14:35.5223854Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/pyflink_gateway_server.py:262:
>  in launch_gateway_server_process
> 2024-03-20T00:14:35.5224649Z java_executable = find_java_executable()
> 2024-03-20T00:14:35.5225583Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/pyflink_gateway_server.py:75:
>  in find_java_executable
> 2024-03-20T00:14:35.5226449Z java_home = 
> read_from_config(KEY_ENV_JAVA_HOME, None, flink_conf_file)
> 2024-03-20T00:14:35.5227099Z _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
> _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ _ 
> 2024-03-20T00:14:35.5227450Z 
> 2024-03-20T00:14:35.5227774Z key = 'env.java.home', default_value = None
> 2024-03-20T00:14:35.5228925Z flink_conf_file = 
> '/home/runner/work/flink-connector-aws/flink-connector-aws/flink-python/.tox/py310-cython/lib/python3.10/site-packages/pyflink/conf/flink-conf.yaml'
> 2024-03-20T00:14:35.5229778Z 
> 2024-03-20T00:14:35.5230010Z def read_from_config(key, default_value, 
> flink_conf_file):
> 2024-03-20T00:14:35.5230581Z value = default_value
> 2024-03-20T00:14:35.5231236Z # get the realpath of tainted path value 
> to avoid CWE22 problem that constructs a path or URI
> 2024-03-20T00:14:35.5232195Z # using the tainted value and might 
> allow an attacker to access, modify, or test the existence
> 2024-03-20T00:14:35.5232940Z # of critical or sensitive files.
> 2024-03-20T00:14:35.5233417Z >   with 
> open(os.path.realpath(flink_conf_file), "r") as f:
> 2024-03-20T00:14:35.5234874Z E   FileNotFoundError: [Errno 2] No such 
> file or directory: 
> '/home/runner/work/flink-connector-aws/flink-connector-aws/flink-python/.tox/py310-cython/lib/python3.10/site-packages/pyflink/conf/flink-conf.yaml'
> 2024-03-20T00:14:35.5235954Z 
> 2024-03-20T00:14:35.5236484Z 
> .tox/py310-cython/lib/python3.10/site-packages/pyflink/pyflink_gateway_server.py:58:
>  FileNotFoundError {code}
> Failure started after the release of apache-flink python package for 1.19.0 
> due to change of default config file provided within artifact.
>  
>  
> Issue comes from outdated copy of pyflink_gateway_server.py created as part 
> of [https://github.com/apache/flink-connector-kafka/pull/69] (same change is 
> duplicated in AWS connectors repository).



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Created] (FLINK-34905) The default length of CHAR/BINARY data type of Add column DDL

2024-03-21 Thread Qishang Zhong (Jira)

Qishang Zhong created FLINK-34905:
-

 Summary: The default length of CHAR/BINARY data type of Add column 
DDL
 Key: FLINK-34905
 URL: https://issues.apache.org/jira/browse/FLINK-34905
 Project: Flink
  Issue Type: Bug
  Components: Flink CDC
Reporter: Qishang Zhong


I run the DDL in mysql
{code:java}
ALTER TABLE test.products ADD Column1 BINARY NULL;  
ALTER TABLE test.products ADD Column2 CHAR NULL; {code}
Encountered the follow error:
{code:java}

Caused by: java.lang.IllegalArgumentException: Binary string length must be 
between 1 and 2147483647 (both inclusive).
at 
org.apache.flink.cdc.common.types.BinaryType.(BinaryType.java:53)
at 
org.apache.flink.cdc.common.types.BinaryType.(BinaryType.java:61)
at org.apache.flink.cdc.common.types.DataTypes.BINARY(DataTypes.java:42)
at 
org.apache.flink.cdc.connectors.mysql.utils.MySqlTypeUtils.convertFromColumn(MySqlTypeUtils.java:221)
at 
org.apache.flink.cdc.connectors.mysql.utils.MySqlTypeUtils.fromDbzColumn(MySqlTypeUtils.java:111)
at 
org.apache.flink.cdc.connectors.mysql.source.parser.CustomAlterTableParserListener.toCdcColumn(CustomAlterTableParserListener.java:256)
at 
org.apache.flink.cdc.connectors.mysql.source.parser.CustomAlterTableParserListener.lambda$exitAlterByAddColumn$0(CustomAlterTableParserListener.java:126)
at 
io.debezium.connector.mysql.antlr.MySqlAntlrDdlParser.runIfNotNull(MySqlAntlrDdlParser.java:358)
at 
org.apache.flink.cdc.connectors.mysql.source.parser.CustomAlterTableParserListener.exitAlterByAddColumn(CustomAlterTableParserListener.java:98)
at 
io.debezium.ddl.parser.mysql.generated.MySqlParser$AlterByAddColumnContext.exitRule(MySqlParser.java:15459)
at 
io.debezium.antlr.ProxyParseTreeListenerUtil.delegateExitRule(ProxyParseTreeListenerUtil.java:64)
at 
org.apache.flink.cdc.connectors.mysql.source.parser.CustomMySqlAntlrDdlParserListener.exitEveryRule(CustomMySqlAntlrDdlParserListener.java:124)
at 
org.antlr.v4.runtime.tree.ParseTreeWalker.exitRule(ParseTreeWalker.java:48)
at 
org.antlr.v4.runtime.tree.ParseTreeWalker.walk(ParseTreeWalker.java:30)
at 
org.antlr.v4.runtime.tree.ParseTreeWalker.walk(ParseTreeWalker.java:28)
at 
org.antlr.v4.runtime.tree.ParseTreeWalker.walk(ParseTreeWalker.java:28)
at 
org.antlr.v4.runtime.tree.ParseTreeWalker.walk(ParseTreeWalker.java:28)
at 
org.antlr.v4.runtime.tree.ParseTreeWalker.walk(ParseTreeWalker.java:28)
at 
org.antlr.v4.runtime.tree.ParseTreeWalker.walk(ParseTreeWalker.java:28)
at io.debezium.antlr.AntlrDdlParser.parse(AntlrDdlParser.java:87)
at 
org.apache.flink.cdc.connectors.mysql.source.MySqlEventDeserializer.deserializeSchemaChangeRecord(MySqlEventDeserializer.java:88)
at 
org.apache.flink.cdc.debezium.event.SourceRecordEventDeserializer.deserialize(SourceRecordEventDeserializer.java:52)
at 
org.apache.flink.cdc.debezium.event.DebeziumEventDeserializationSchema.deserialize(DebeziumEventDeserializationSchema.java:93)
at 
org.apache.flink.cdc.connectors.mysql.source.reader.MySqlRecordEmitter.emitElement(MySqlRecordEmitter.java:119)
at 
org.apache.flink.cdc.connectors.mysql.source.reader.MySqlRecordEmitter.processElement(MySqlRecordEmitter.java:96)
at 
org.apache.flink.cdc.connectors.mysql.source.reader.MySqlPipelineRecordEmitter.processElement(MySqlPipelineRecordEmitter.java:120)
at 
org.apache.flink.cdc.connectors.mysql.source.reader.MySqlRecordEmitter.emitRecord(MySqlRecordEmitter.java:73)
at 
org.apache.flink.cdc.connectors.mysql.source.reader.MySqlRecordEmitter.emitRecord(MySqlRecordEmitter.java:46)
at 
org.apache.flink.connector.base.source.reader.SourceReaderBase.pollNext(SourceReaderBase.java:160)
at 
org.apache.flink.streaming.api.operators.SourceOperator.emitNext(SourceOperator.java:419)
at 
org.apache.flink.streaming.runtime.io.StreamTaskSourceInput.emitNext(StreamTaskSourceInput.java:68)
at 
org.apache.flink.streaming.runtime.io.StreamOneInputProcessor.processInput(StreamOneInputProcessor.java:65)
at 
org.apache.flink.streaming.runtime.tasks.StreamTask.processInput(StreamTask.java:562)
at 
org.apache.flink.streaming.runtime.tasks.mailbox.MailboxProcessor.runMailboxLoop(MailboxProcessor.java:231)
at 
org.apache.flink.streaming.runtime.tasks.StreamTask.runMailboxLoop(StreamTask.java:858)
at 
org.apache.flink.streaming.runtime.tasks.StreamTask.invoke(StreamTask.java:807)
at 
org.apache.flink.runtime.taskmanager.Task.runWithSystemExitMonitoring(Task.java:953)
at 
org.apache.flink.runtime.taskmanager.Task.restoreAndInvoke(Task.java:932)
at org.apache.flink.runtime.taskmanager.Task.doRun(Task.java:746)
at org.apache.flink.runtime.taskmanager.Task

Re: [PR] [FLINK-34731][runtime] Remove SpeculativeScheduler and incorporate its features into AdaptiveBatchScheduler. [flink]



zhuzhurk commented on code in PR #24524:
URL: https://github.com/apache/flink/pull/24524#discussion_r1533349277


##
flink-runtime/src/main/java/org/apache/flink/runtime/scheduler/adaptivebatch/AdaptiveBatchScheduler.java:
##
@@ -183,21 +193,59 @@ public AdaptiveBatchScheduler(
 this.hybridPartitionDataConsumeConstraint = 
hybridPartitionDataConsumeConstraint;
 
 this.sourceParallelismFuturesByJobVertexId = new HashMap<>();
+
+// 

+//  Speculative execution handler
+// 


Review Comment:
   I would avoid the `// =` comments.
   It's also better to introduce a `createSpeculativeExecutionHandler` method.



##
flink-runtime/src/main/java/org/apache/flink/runtime/scheduler/adaptivebatch/DefaultSpeculativeExecutionHandler.java:
##
@@ -7,62 +7,37 @@
  * "License"); you may not use this file except in compliance
  * with the License.  You may obtain a copy of the License at
  *
- *   http://www.apache.org/licenses/LICENSE-2.0
+ * http://www.apache.org/licenses/LICENSE-2.0
  *
- * Unless required by applicable law or agreed to in writing,
- * software distributed under the License is distributed on an
- * "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY
- * KIND, either express or implied.  See the License for the
- * specific language governing permissions and limitations
- * under the License.
+ * Unless required by applicable law or agreed to in writing, software
+ * distributed under the License is distributed on an "AS IS" BASIS,
+ * WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+ * See the License for the specific language governing permissions and
+ * limitations under the License.

Review Comment:
   Is the change to the license format expected?



##
flink-runtime/src/main/java/org/apache/flink/runtime/scheduler/adaptivebatch/AdaptiveBatchScheduler.java:
##
@@ -183,21 +193,59 @@ public AdaptiveBatchScheduler(
 this.hybridPartitionDataConsumeConstraint = 
hybridPartitionDataConsumeConstraint;
 
 this.sourceParallelismFuturesByJobVertexId = new HashMap<>();
+
+// 

+//  Speculative execution handler
+// 

+if 
(jobMasterConfiguration.get(BatchExecutionOptions.SPECULATIVE_ENABLED)) {
+speculativeExecutionHandler =
+new DefaultSpeculativeExecutionHandler(
+jobMasterConfiguration.get(
+
BatchExecutionOptions.SPECULATIVE_MAX_CONCURRENT_EXECUTIONS),
+jobMasterConfiguration.get(
+
BatchExecutionOptions.BLOCK_SLOW_NODE_DURATION),
+blocklistOperations,
+new 
ExecutionTimeBasedSlowTaskDetector(jobMasterConfiguration),
+new SimpleCounter(),
+this::getExecutionVertex,
+() -> 
getExecutionGraph().getRegisteredExecutions(),
+(newSpeculativeExecutions, verticesToDeploy) ->
+executionDeployer.allocateSlotsAndDeploy(
+newSpeculativeExecutions,
+
executionVertexVersioner.getExecutionVertexVersions(
+verticesToDeploy)),
+log);
+} else {
+speculativeExecutionHandler = new 
DummySpeculativeExecutionHandler();
+}
 }
 
 @Override
 protected void startSchedulingInternal() {
+speculativeExecutionHandler.registerMetrics(jobManagerJobMetricGroup);
+
 tryComputeSourceParallelismThenRunAsync(
 (Void value, Throwable throwable) -> {
 if (getExecutionGraph().getState() == JobStatus.CREATED) {
 initializeVerticesIfPossible();
 super.startSchedulingInternal();
 }
 });
+
+speculativeExecutionHandler.startSlowTaskDetector(

Review Comment:
   I think we can merge the two methods `startSlowTaskDetector` and 
`registerMetrics` to one method `speculativeExecutionHandler.init()` .



##
flink-runtime/src/main/java/org/apache/flink/runtime/scheduler/adaptivebatch/AdaptiveBatchScheduler.java:
##
@@ -183,21 +193,59 @@ public AdaptiveBatchScheduler(
 this.hybridPartitionDataConsumeConstraint = 
hybridPartitionDataConsumeConstraint;
 
 this.sourceParallelismFuturesByJobVertexId = new HashMap<>();
+
+// 
=

[jira] [Commented] (FLINK-34898) Cannot create named STRUCT with a single field

2024-03-21 Thread Martijn Visser (Jira)



[ 
https://issues.apache.org/jira/browse/FLINK-34898?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17829454#comment-17829454
 ] 

Martijn Visser commented on FLINK-34898:


It does look like this is a user support ticket, not an actual bug problem. 
Questions like these should be posted on the User mailing list, Slack or 
Stackoverflow. 

> Cannot create named STRUCT with a single field
> --
>
> Key: FLINK-34898
> URL: https://issues.apache.org/jira/browse/FLINK-34898
> Project: Flink
>  Issue Type: Bug
>Reporter: Chloe He
>Priority: Major
> Attachments: image-2024-03-21-12-00-00-183.png
>
>
> I'm trying to create named structs using Flink SQL and I found a previous 
> ticket https://issues.apache.org/jira/browse/FLINK-9161 that mentions the use 
> of the following syntax:
> {code:java}
> SELECT CAST(('a', 1) as ROW) AS row1;
> {code}
> However, my named struct has a single field and effectively it should look 
> something like `\{"a": 1}`. I can't seem to be able to find a way to 
> construct this. I have experimented with a few different syntax and it either 
> throws parsing error or casting error:
> {code:java}
> Cast function cannot convert value of type INTEGER to type 
> RecordType(VARCHAR(2147483647) a) {code}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Created] (FLINK-34906) Don't start autoscaling when some tasks are not running

Rui Fan created FLINK-34906:
---

 Summary: Don't start autoscaling when some tasks are not running
 Key: FLINK-34906
 URL: https://issues.apache.org/jira/browse/FLINK-34906
 Project: Flink
  Issue Type: Improvement
  Components: Autoscaler
Reporter: Rui Fan
Assignee: Rui Fan
 Fix For: 1.9.0
 Attachments: image-2024-03-21-17-40-23-523.png

Currently, the autoscaler will scale a job when the JobStatus is RUNNING. But 
the JobStatus will be RUNNING once job starts schedule, so it doesn't mean all 
tasks are running. Especially, when the resource isn't enough or job recovers 
from large state.

The autoscaler will throw exception and generate the AutoscalerError event when 
tasks are not ready, such as: 

 !image-2024-03-21-17-40-23-523.png! 


Solution: we only scale job that all tasks are running(some of tasks may be 
finished). 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Updated] (FLINK-34751) RestClusterClient APIs doesn't work with running Flink application on YARN

2024-03-21 Thread Martijn Visser (Jira)



 [ 
https://issues.apache.org/jira/browse/FLINK-34751?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Martijn Visser updated FLINK-34751:
---
Component/s: Deployment / YARN

> RestClusterClient APIs doesn't work with running Flink application on YARN
> --
>
> Key: FLINK-34751
> URL: https://issues.apache.org/jira/browse/FLINK-34751
> Project: Flink
>  Issue Type: Bug
>  Components: Deployment / YARN
>Reporter: Venkata krishnan Sowrirajan
>Priority: Major
>
> Apache YARN uses web proxy in Resource Manager to expose the endpoints 
> available through the AM process (in this case RestServerEndpoint that run as 
> part of AM). Note: this is in the context of running Flink cluster in YARN 
> application mode.
> For eg: in the case of RestClusterClient#listJobs -
> {{Standalone listJobs}} makes the request as - 
> {{{}https://:/v1/{}}}{{{}jobs{}}}{{{}/overview{}}}
> YARN the same request has to be proxified as -  
> {{{}https://:/proxy//v1/{}}}{{{}jobs{}}}{{{}/overview?proxyapproved=true{}}}



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Re: [PR] [FLINK-34044] Copy dynamic table options before mapping deprecated configs [flink-connector-aws]



vahmed-hamdy commented on code in PR #132:
URL: 
https://github.com/apache/flink-connector-aws/pull/132#discussion_r1533542650


##
flink-connector-aws/flink-connector-aws-kinesis-streams/src/main/java/org/apache/flink/connector/kinesis/table/util/KinesisStreamsConnectorOptionsUtils.java:
##
@@ -148,13 +149,13 @@ public static class KinesisProducerOptionsMapper {
 public KinesisProducerOptionsMapper(
 ReadableConfig tableOptions, Map 
resolvedOptions) {
 this.tableOptions = tableOptions;
-this.resolvedOptions = resolvedOptions;
+this.resolvedOptions = new HashMap<>(resolvedOptions);
 }
 
 @VisibleForTesting
 public KinesisProducerOptionsMapper(Map allOptions) {

Review Comment:
   The constructor is used differently, the one with `allOPtions` should 
contain both `tableOptions` and `resolvedOptions`. you can see it is parsed 
into both in this constructor.



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@flink.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

[jira] [Updated] (FLINK-34906) Don't start autoscaling when some tasks are not running



 [ 
https://issues.apache.org/jira/browse/FLINK-34906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rui Fan updated FLINK-34906:

Description: 
Currently, the autoscaler will scale a job when the JobStatus is RUNNING. But 
the JobStatus will be RUNNING once job starts schedule, so it doesn't mean all 
tasks are running. Especially, when the resource isn't enough or job recovers 
from large state.

The autoscaler will throw exception and generate the AutoscalerError event when 
tasks are not ready, such as: 

 !image-2024-03-21-17-40-23-523.png! 


Also, we don't need to scale it when some tasks are not ready.

Solution: we only scale job that all tasks are running(some of tasks may be 
finished). 

  was:
Currently, the autoscaler will scale a job when the JobStatus is RUNNING. But 
the JobStatus will be RUNNING once job starts schedule, so it doesn't mean all 
tasks are running. Especially, when the resource isn't enough or job recovers 
from large state.

The autoscaler will throw exception and generate the AutoscalerError event when 
tasks are not ready, such as: 

 !image-2024-03-21-17-40-23-523.png! 


Solution: we only scale job that all tasks are running(some of tasks may be 
finished). 


> Don't start autoscaling when some tasks are not running
> ---
>
> Key: FLINK-34906
> URL: https://issues.apache.org/jira/browse/FLINK-34906
> Project: Flink
>  Issue Type: Improvement
>  Components: Autoscaler
>Reporter: Rui Fan
>Assignee: Rui Fan
>Priority: Major
> Fix For: 1.9.0
>
> Attachments: image-2024-03-21-17-40-23-523.png
>
>
> Currently, the autoscaler will scale a job when the JobStatus is RUNNING. But 
> the JobStatus will be RUNNING once job starts schedule, so it doesn't mean 
> all tasks are running. Especially, when the resource isn't enough or job 
> recovers from large state.
> The autoscaler will throw exception and generate the AutoscalerError event 
> when tasks are not ready, such as: 
>  !image-2024-03-21-17-40-23-523.png! 
> Also, we don't need to scale it when some tasks are not ready.
> Solution: we only scale job that all tasks are running(some of tasks may be 
> finished). 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[PR] [FLINK-34906] Only scale when all tasks are running [flink-kubernetes-operator]

1996fanrui opened a new pull request, #801:
URL: https://github.com/apache/flink-kubernetes-operator/pull/801

## What is the purpose of the change

Currently, the autoscaler will scale a job when the JobStatus is RUNNING.
But the JobStatus will be RUNNING once job starts schedule, so it doesn't mean
all tasks are running. Especially, when the resource isn't enough or job
recovers from large state.

The autoscaler will throw exception and generate the AutoscalerError event
when tasks are not ready. Also, we don't need to scale it when some tasks are
not ready.

## Brief change log

- [FLINK-34906] Only scale when all tasks are running
- Solution: we only scale job that all tasks are running(some of tasks may
be finished).

We can know how many tasks are running from `JobDetailsInfo`:

![image](https://github.com/apache/flink-kubernetes-operator/assets/38427477/b440ac9d-eddc-49b7-b534-b6755fa9e181)

## Verifying this change

Manually test is done, unit test is still writing.

## Does this pull request potentially affect one of the following parts:

- Dependencies (does it add or upgrade a dependency): no
- The public API, i.e., is any changes to the `CustomResourceDescriptors`:
no
- Core observer or reconciler logic that is regularly executed: no

## Documentation

- Does this pull request introduce a new feature? no

--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@flink.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

[jira] [Updated] (FLINK-34906) Don't start autoscaling when some tasks are not running

2024-03-21 Thread ASF GitHub Bot (Jira)



 [ 
https://issues.apache.org/jira/browse/FLINK-34906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated FLINK-34906:
---
Labels: pull-request-available  (was: )

> Don't start autoscaling when some tasks are not running
> ---
>
> Key: FLINK-34906
> URL: https://issues.apache.org/jira/browse/FLINK-34906
> Project: Flink
>  Issue Type: Improvement
>  Components: Autoscaler
>Reporter: Rui Fan
>Assignee: Rui Fan
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.9.0
>
> Attachments: image-2024-03-21-17-40-23-523.png
>
>
> Currently, the autoscaler will scale a job when the JobStatus is RUNNING. But 
> the JobStatus will be RUNNING once job starts schedule, so it doesn't mean 
> all tasks are running. Especially, when the resource isn't enough or job 
> recovers from large state.
> The autoscaler will throw exception and generate the AutoscalerError event 
> when tasks are not ready, such as: 
>  !image-2024-03-21-17-40-23-523.png! 
> Also, we don't need to scale it when some tasks are not ready.
> Solution: we only scale job that all tasks are running(some of tasks may be 
> finished). 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Assigned] (FLINK-34746) Switching to the Apache CDN for Dockerfile

2024-03-21 Thread Martijn Visser (Jira)



 [ 
https://issues.apache.org/jira/browse/FLINK-34746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Martijn Visser reassigned FLINK-34746:
--

Assignee: Martijn Visser

> Switching to the Apache CDN for Dockerfile
> --
>
> Key: FLINK-34746
> URL: https://issues.apache.org/jira/browse/FLINK-34746
> Project: Flink
>  Issue Type: Improvement
>  Components: flink-docker
>Reporter: lincoln lee
>Assignee: Martijn Visser
>Priority: Major
> Fix For: 1.18.2, 1.20.0, 1.19.1
>
>
> During publishing the official image, we received some comments
> for Switching to the Apache CDN
>  
> See
> https://github.com/docker-library/official-images/pull/16114
> https://github.com/docker-library/official-images/pull/16430
>  
> Reason for switching: [https://apache.org/history/mirror-history.html] (also 
> [https://www.apache.org/dyn/closer.cgi] and [https://www.apache.org/mirrors])



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Updated] (FLINK-34746) Switching to the Apache CDN for Dockerfile

2024-03-21 Thread ASF GitHub Bot (Jira)



 [ 
https://issues.apache.org/jira/browse/FLINK-34746?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

ASF GitHub Bot updated FLINK-34746:
---
Labels: pull-request-available  (was: )

> Switching to the Apache CDN for Dockerfile
> --
>
> Key: FLINK-34746
> URL: https://issues.apache.org/jira/browse/FLINK-34746
> Project: Flink
>  Issue Type: Improvement
>  Components: flink-docker
>Reporter: lincoln lee
>Assignee: Martijn Visser
>Priority: Major
>  Labels: pull-request-available
> Fix For: 1.18.2, 1.20.0, 1.19.1
>
>
> During publishing the official image, we received some comments
> for Switching to the Apache CDN
>  
> See
> https://github.com/docker-library/official-images/pull/16114
> https://github.com/docker-library/official-images/pull/16430
>  
> Reason for switching: [https://apache.org/history/mirror-history.html] (also 
> [https://www.apache.org/dyn/closer.cgi] and [https://www.apache.org/mirrors])



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Re: [PR] [FLINK-34906] Only scale when all tasks are running [flink-kubernetes-operator]



gyfora commented on PR #801:
URL: 
https://github.com/apache/flink-kubernetes-operator/pull/801#issuecomment-2011816958

   This issue only affects the standalone autoscaler as the kubernetes operator 
has this logic already in place for setting the RUNNING state. Can we somehow 
deduplicate this logic?


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@flink.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

[jira] [Updated] (FLINK-34906) Don't start autoscaling when some tasks are not running



 [ 
https://issues.apache.org/jira/browse/FLINK-34906?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Rui Fan updated FLINK-34906:

Fix Version/s: kubernetes-operator-1.9.0
   (was: 1.9.0)

> Don't start autoscaling when some tasks are not running
> ---
>
> Key: FLINK-34906
> URL: https://issues.apache.org/jira/browse/FLINK-34906
> Project: Flink
>  Issue Type: Improvement
>  Components: Autoscaler
>Reporter: Rui Fan
>Assignee: Rui Fan
>Priority: Major
>  Labels: pull-request-available
> Fix For: kubernetes-operator-1.9.0
>
> Attachments: image-2024-03-21-17-40-23-523.png
>
>
> Currently, the autoscaler will scale a job when the JobStatus is RUNNING. But 
> the JobStatus will be RUNNING once job starts schedule, so it doesn't mean 
> all tasks are running. Especially, when the resource isn't enough or job 
> recovers from large state.
> The autoscaler will throw exception and generate the AutoscalerError event 
> when tasks are not ready, such as: 
>  !image-2024-03-21-17-40-23-523.png! 
> Also, we don't need to scale it when some tasks are not ready.
> Solution: we only scale job that all tasks are running(some of tasks may be 
> finished). 



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Commented] (FLINK-34643) JobIDLoggingITCase failed

2024-03-21 Thread Ryan Skraba (Jira)



[ 
https://issues.apache.org/jira/browse/FLINK-34643?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17829466#comment-17829466
 ] 

Ryan Skraba commented on FLINK-34643:
-

* 
[https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=58455&view=logs&j=2c3cbe13-dee0-5837-cf47-3053da9a8a78&t=b78d9d30-509a-5cea-1fef-db7abaa325ae&l=8349]
 * 
[https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=58455&view=logs&j=8fd9202e-fd17-5b26-353c-ac1ff76c8f28&t=ea7cf968-e585-52cb-e0fc-f48de023a7ca&l=7898]

> JobIDLoggingITCase failed
> -
>
> Key: FLINK-34643
> URL: https://issues.apache.org/jira/browse/FLINK-34643
> Project: Flink
>  Issue Type: Bug
>  Components: Runtime / Coordination
>Affects Versions: 1.20.0
>Reporter: Matthias Pohl
>Assignee: Roman Khachatryan
>Priority: Major
>  Labels: pull-request-available, test-stability
> Fix For: 1.20.0
>
>
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=58187&view=logs&j=8fd9202e-fd17-5b26-353c-ac1ff76c8f28&t=ea7cf968-e585-52cb-e0fc-f48de023a7ca&l=7897
> {code}
> Mar 09 01:24:23 01:24:23.498 [ERROR] Tests run: 1, Failures: 0, Errors: 1, 
> Skipped: 0, Time elapsed: 4.209 s <<< FAILURE! -- in 
> org.apache.flink.test.misc.JobIDLoggingITCase
> Mar 09 01:24:23 01:24:23.498 [ERROR] 
> org.apache.flink.test.misc.JobIDLoggingITCase.testJobIDLogging(ClusterClient) 
> -- Time elapsed: 1.459 s <<< ERROR!
> Mar 09 01:24:23 java.lang.IllegalStateException: Too few log events recorded 
> for org.apache.flink.runtime.jobmaster.JobMaster (12) - this must be a bug in 
> the test code
> Mar 09 01:24:23   at 
> org.apache.flink.util.Preconditions.checkState(Preconditions.java:215)
> Mar 09 01:24:23   at 
> org.apache.flink.test.misc.JobIDLoggingITCase.assertJobIDPresent(JobIDLoggingITCase.java:148)
> Mar 09 01:24:23   at 
> org.apache.flink.test.misc.JobIDLoggingITCase.testJobIDLogging(JobIDLoggingITCase.java:132)
> Mar 09 01:24:23   at java.lang.reflect.Method.invoke(Method.java:498)
> Mar 09 01:24:23   at 
> java.util.concurrent.RecursiveAction.exec(RecursiveAction.java:189)
> Mar 09 01:24:23   at 
> java.util.concurrent.ForkJoinTask.doExec(ForkJoinTask.java:289)
> Mar 09 01:24:23   at 
> java.util.concurrent.ForkJoinPool$WorkQueue.runTask(ForkJoinPool.java:1056)
> Mar 09 01:24:23   at 
> java.util.concurrent.ForkJoinPool.runWorker(ForkJoinPool.java:1692)
> Mar 09 01:24:23   at 
> java.util.concurrent.ForkJoinWorkerThread.run(ForkJoinWorkerThread.java:175)
> Mar 09 01:24:23 
> {code}
> The other test failures of this build were also caused by the same test:
> * 
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=58187&view=logs&j=2c3cbe13-dee0-5837-cf47-3053da9a8a78&t=b78d9d30-509a-5cea-1fef-db7abaa325ae&l=8349
> * 
> https://dev.azure.com/apache-flink/apache-flink/_build/results?buildId=58187&view=logs&j=a596f69e-60d2-5a4b-7d39-dc69e4cdaed3&t=712ade8c-ca16-5b76-3acd-14df33bc1cb1&l=8209



--
This message was sent by Atlassian Jira
(v8.20.10#820010)

[jira] [Created] (FLINK-34907) jobRunningTs should be the timestamp that all tasks are running

Rui Fan created FLINK-34907:
---

 Summary: jobRunningTs should be the timestamp that all tasks are 
running
 Key: FLINK-34907
 URL: https://issues.apache.org/jira/browse/FLINK-34907
 Project: Flink
  Issue Type: Improvement
  Components: Autoscaler
Reporter: Rui Fan
Assignee: Rui Fan
 Fix For: kubernetes-operator-1.9.0


Currently, we consider the timestamp that JobStatus is changed to RUNNING as 
jobRunningTs. But the JobStatus will be RUNNING once job starts schedule, so it 
doesn't mean all tasks are running. 

It will let the isStabilizing or estimating restart time are not accurate.

Solution: jobRunningTs should be the timestamp that all tasks are running.

It can be got from SubtasksTimesHeaders rest api.




--
This message was sent by Atlassian Jira
(v8.20.10#820010)

Re: [PR] [hotfix][Connectors/AWS] Update Flink versions in CI [flink-connector-aws]



dannycranmer merged PR #134:
URL: https://github.com/apache/flink-connector-aws/pull/134


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@flink.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Re: [PR] [FLINK-34906] Only scale when all tasks are running [flink-kubernetes-operator]