[GitHub] [flink] wanglijie95 commented on a diff in pull request #21570: [FLINK-29666][runtime] Let adaptive batch scheduler divide subpartition range according to amount of data

2023-01-04 Thread GitBox
wanglijie95 commented on code in PR #21570: URL: https://github.com/apache/flink/pull/21570#discussion_r1062196039 ## flink-runtime/src/main/java/org/apache/flink/runtime/scheduler/adaptivebatch/DefaultVertexParallelismDecider.java: ## @@ -150,6 +193,202 @@ private int

[GitHub] [flink] wanglijie95 commented on a diff in pull request #21570: [FLINK-29666][runtime] Let adaptive batch scheduler divide subpartition range according to amount of data

2023-01-04 Thread GitBox
wanglijie95 commented on code in PR #21570: URL: https://github.com/apache/flink/pull/21570#discussion_r1062195484 ## flink-runtime/src/main/java/org/apache/flink/runtime/scheduler/adaptivebatch/DefaultVertexParallelismDecider.java: ## @@ -150,6 +193,202 @@ private int

[GitHub] [flink] wanglijie95 commented on a diff in pull request #21570: [FLINK-29666][runtime] Let adaptive batch scheduler divide subpartition range according to amount of data

2023-01-04 Thread GitBox
wanglijie95 commented on code in PR #21570: URL: https://github.com/apache/flink/pull/21570#discussion_r1062195219 ## flink-runtime/src/main/java/org/apache/flink/runtime/scheduler/adaptivebatch/DefaultVertexParallelismDecider.java: ## @@ -77,46 +92,74 @@ private

[GitHub] [flink] wanglijie95 commented on a diff in pull request #21570: [FLINK-29666][runtime] Let adaptive batch scheduler divide subpartition range according to amount of data

2023-01-04 Thread GitBox
wanglijie95 commented on code in PR #21570: URL: https://github.com/apache/flink/pull/21570#discussion_r1062194655 ## flink-runtime/src/main/java/org/apache/flink/runtime/scheduler/adaptivebatch/DefaultVertexParallelismDecider.java: ## @@ -55,7 +70,7 @@ public class

[GitHub] [flink] wanglijie95 commented on a diff in pull request #21570: [FLINK-29666][runtime] Let adaptive batch scheduler divide subpartition range according to amount of data

2023-01-04 Thread GitBox
wanglijie95 commented on code in PR #21570: URL: https://github.com/apache/flink/pull/21570#discussion_r1062194496 ## flink-runtime/src/main/java/org/apache/flink/runtime/scheduler/adaptivebatch/DefaultVertexParallelismDecider.java: ## @@ -19,15 +19,30 @@ package

[GitHub] [flink] wanglijie95 commented on a diff in pull request #21570: [FLINK-29666][runtime] Let adaptive batch scheduler divide subpartition range according to amount of data

2023-01-04 Thread GitBox
wanglijie95 commented on code in PR #21570: URL: https://github.com/apache/flink/pull/21570#discussion_r1062193741 ## flink-runtime/src/main/java/org/apache/flink/runtime/scheduler/adaptivebatch/VertexParallelismDecider.java: ## @@ -18,19 +18,27 @@ package

[GitHub] [flink] wanglijie95 commented on a diff in pull request #21570: [FLINK-29666][runtime] Let adaptive batch scheduler divide subpartition range according to amount of data

2023-01-04 Thread GitBox
wanglijie95 commented on code in PR #21570: URL: https://github.com/apache/flink/pull/21570#discussion_r1062193326 ## flink-runtime/src/main/java/org/apache/flink/runtime/scheduler/adaptivebatch/AbstractBlockingResultInfo.java: ## @@ -43,12 +43,21 @@ abstract class

[jira] [Commented] (FLINK-30561) ChangelogStreamHandleReaderWithCache cause FileNotFoundException

2023-01-04 Thread Feifan Wang (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654843#comment-17654843 ] Feifan Wang commented on FLINK-30561: - Hi [~masteryhx], FLINK-30107 and FLINK-28898 are caused by

[GitHub] [flink] wanglijie95 commented on a diff in pull request #21570: [FLINK-29666][runtime] Let adaptive batch scheduler divide subpartition range according to amount of data

2023-01-04 Thread GitBox
wanglijie95 commented on code in PR #21570: URL: https://github.com/apache/flink/pull/21570#discussion_r1062192798 ## flink-runtime/src/main/java/org/apache/flink/runtime/executiongraph/ExecutionVertexInputInfo.java: ## @@ -68,4 +68,9 @@ public boolean equals(Object obj) {

[jira] [Updated] (FLINK-30568) Add benchmark for PolyNomialExpansion, Normalizer, Binarizer, Interaction, MaxAbsScaler, VectorSlicer, ElementWiseProduct and Featurehasher

2023-01-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30568?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated FLINK-30568: --- Labels: pull-request-available (was: ) > Add benchmark for PolyNomialExpansion,

[GitHub] [flink-ml] weibozhao opened a new pull request, #198: [FLINK-30568] Add benchmark for PolyNomialExpansion, Normalizer, Binarizer, Interaction, MaxAbsScaler, VectorSlicer, ElementWiseProduct a

2023-01-04 Thread GitBox
weibozhao opened a new pull request, #198: URL: https://github.com/apache/flink-ml/pull/198 ## What is the purpose of the change Add benchmark configurations for some of the existing algorithms. ## Brief change log Add benchmark configurations of PolyNomialExpansion,

[jira] [Comment Edited] (FLINK-29427) LookupJoinITCase failed with classloader problem

2023-01-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-29427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654837#comment-17654837 ] Matthias Pohl edited comment on FLINK-29427 at 1/5/23 7:41 AM: --- Same

[jira] [Commented] (FLINK-29427) LookupJoinITCase failed with classloader problem

2023-01-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-29427?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654837#comment-17654837 ] Matthias Pohl commented on FLINK-29427: ---

[jira] [Commented] (FLINK-28881) PubSubConsumingTest.testStoppingConnectorWhenDeserializationSchemaIndicatesEndOfStream test failure

2023-01-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-28881?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654836#comment-17654836 ] Matthias Pohl commented on FLINK-28881: ---

[jira] [Updated] (FLINK-28881) PubSubConsumingTest.testStoppingConnectorWhenDeserializationSchemaIndicatesEndOfStream test failure

2023-01-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-28881?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthias Pohl updated FLINK-28881: -- Affects Version/s: gcp-pubsub-3.0.0 >

[jira] [Comment Edited] (FLINK-29859) TPC-DS end-to-end test with adaptive batch scheduler failed due to oo non-empty .out files.

2023-01-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-29859?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654835#comment-17654835 ] Matthias Pohl edited comment on FLINK-29859 at 1/5/23 7:38 AM: --- I'm

[jira] [Reopened] (FLINK-29859) TPC-DS end-to-end test with adaptive batch scheduler failed due to oo non-empty .out files.

2023-01-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-29859?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthias Pohl reopened FLINK-29859: --- I'm reopening this issue because the error reappeared even though it contained the fix of

[GitHub] [flink-ml] weibozhao commented on pull request #189: [FLINK-30348] Add HasSeed param for RandomSplitter

2023-01-04 Thread GitBox
weibozhao commented on PR #189: URL: https://github.com/apache/flink-ml/pull/189#issuecomment-1371875726 > OK -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To

[jira] [Comment Edited] (FLINK-25500) ZooKeeperLeaderElectionITCase.testJobExecutionOnClusterWithLeaderChange failed on azure

2023-01-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-25500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654827#comment-17654827 ] Matthias Pohl edited comment on FLINK-25500 at 1/5/23 7:36 AM: ---

[jira] [Comment Edited] (FLINK-25500) ZooKeeperLeaderElectionITCase.testJobExecutionOnClusterWithLeaderChange failed on azure

2023-01-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-25500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654827#comment-17654827 ] Matthias Pohl edited comment on FLINK-25500 at 1/5/23 7:35 AM: ---

[jira] [Updated] (FLINK-30516) Support files table in table store

2023-01-04 Thread Shammon (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shammon updated FLINK-30516: Description: Add a files table to Table Store so that users can query the row count from `mytable$files` (was: Add

[GitHub] [flink-ml] weibozhao commented on a diff in pull request #189: [FLINK-30348] Add HasSeed param for RandomSplitter

2023-01-04 Thread GitBox
weibozhao commented on code in PR #189: URL: https://github.com/apache/flink-ml/pull/189#discussion_r1062181069 ## docs/content/docs/operators/feature/randomsplitter.md: ## @@ -31,9 +31,10 @@ An AlgoOperator which splits a table into N tables according to the given weight

[jira] [Comment Edited] (FLINK-25500) ZooKeeperLeaderElectionITCase.testJobExecutionOnClusterWithLeaderChange failed on azure

2023-01-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-25500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654827#comment-17654827 ] Matthias Pohl edited comment on FLINK-25500 at 1/5/23 7:34 AM: ---

[jira] [Updated] (FLINK-25500) ZooKeeperLeaderElectionITCase.testJobExecutionOnClusterWithLeaderChange failed on azure

2023-01-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-25500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthias Pohl updated FLINK-25500: -- Affects Version/s: 1.16.0 >

[jira] [Updated] (FLINK-30516) Support files table in table store

2023-01-04 Thread Shammon (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30516?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shammon updated FLINK-30516: Summary: Support files table in table store (was: Add file count and row count in snapshots table) >

[jira] [Updated] (FLINK-25500) ZooKeeperLeaderElectionITCase.testJobExecutionOnClusterWithLeaderChange failed on azure

2023-01-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-25500?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthias Pohl updated FLINK-25500: -- Priority: Major (was: Minor) >

[jira] [Commented] (FLINK-25500) ZooKeeperLeaderElectionITCase.testJobExecutionOnClusterWithLeaderChange failed on azure

2023-01-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-25500?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654827#comment-17654827 ] Matthias Pohl commented on FLINK-25500: ---

[GitHub] [flink-ml] zhipeng93 commented on pull request #189: [FLINK-30348] Refine Transformer for RandomSplitter

2023-01-04 Thread GitBox
zhipeng93 commented on PR #189: URL: https://github.com/apache/flink-ml/pull/189#issuecomment-1371872841 @weibozhao Thanks for the update :) Can you also update the PR title and the commit message to make them more informative? (e.g., `Add HasSeed param for RandomSplitter`)

[GitHub] [flink-ml] zhipeng93 commented on a diff in pull request #189: [FLINK-30348] Refine Transformer for RandomSplitter

2023-01-04 Thread GitBox
zhipeng93 commented on code in PR #189: URL: https://github.com/apache/flink-ml/pull/189#discussion_r1062179082 ## docs/content/docs/operators/feature/randomsplitter.md: ## @@ -31,9 +31,10 @@ An AlgoOperator which splits a table into N tables according to the given weight

[jira] [Commented] (FLINK-30093) [Flink SQL][Protobuf] CompileException when querying Kafka topic using google.protobuf.Timestamp

2023-01-04 Thread Tomoyuki NAKAMURA (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30093?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654822#comment-17654822 ] Tomoyuki NAKAMURA commented on FLINK-30093: --- I have investigated the Flink 1.16 implementation

[GitHub] [flink-ml] lindong28 commented on a diff in pull request #193: [FLINK-30515] Add benchmark for CountVectorizer, Imputer, RobustScale…

2023-01-04 Thread GitBox
lindong28 commented on code in PR #193: URL: https://github.com/apache/flink-ml/pull/193#discussion_r1062178236 ## flink-ml-benchmark/src/main/resources/robustscaler-benchmark.json: ## @@ -0,0 +1,42 @@ +// Licensed to the Apache Software Foundation (ASF) under one or more +//

[GitHub] [flink-ml] zhipeng93 commented on pull request #189: [FLINK-30348] Refine Transformer for RandomSplitter

2023-01-04 Thread GitBox
zhipeng93 commented on PR #189: URL: https://github.com/apache/flink-ml/pull/189#issuecomment-1371870055 Thanks for the PR. LGTM.

[jira] [Closed] (FLINK-30533) SourceOperator#emitNext() should push records to DataOutput in a while loop

2023-01-04 Thread Dong Lin (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30533?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dong Lin closed FLINK-30533. Resolution: Fixed > SourceOperator#emitNext() should push records to DataOutput in a while loop >

[jira] [Updated] (FLINK-30507) EventTimeWindowCheckpointingITCase.testTumblingTimeWindowWithKVStateMinMaxParallelism could not be initialized

2023-01-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30507?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Matthias Pohl updated FLINK-30507: -- Labels: test-stability (was: ) >

[GitHub] [flink-ml] lindong28 commented on a diff in pull request #193: [FLINK-30515] Add benchmark for CountVectorizer, Imputer, RobustScale…

2023-01-04 Thread GitBox
lindong28 commented on code in PR #193: URL: https://github.com/apache/flink-ml/pull/193#discussion_r1062172506 ## flink-ml-benchmark/src/test/java/org/apache/flink/ml/benchmark/BenchmarkTest.java: ## @@ -54,6 +60,44 @@ public void testParseJsonFile() throws Exception {

[GitHub] [flink-ml] jiangxin369 commented on a diff in pull request #193: [FLINK-30515] Add benchmark for CountVectorizer, Imputer, RobustScale…

2023-01-04 Thread GitBox
jiangxin369 commented on code in PR #193: URL: https://github.com/apache/flink-ml/pull/193#discussion_r1062172361 ## flink-ml-benchmark/src/main/resources/robustscaler-benchmark.json: ## @@ -0,0 +1,42 @@ +// Licensed to the Apache Software Foundation (ASF) under one or more +//

[jira] [Commented] (FLINK-29594) RMQSourceITCase.testMessageDelivery timed out

2023-01-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-29594?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654815#comment-17654815 ] Matthias Pohl commented on FLINK-29594: ---

[jira] [Commented] (FLINK-30507) EventTimeWindowCheckpointingITCase.testTumblingTimeWindowWithKVStateMinMaxParallelism could not be initialized

2023-01-04 Thread Matthias Pohl (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30507?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654814#comment-17654814 ] Matthias Pohl commented on FLINK-30507: ---

[jira] [Commented] (FLINK-30516) Add file count and row count in snapshots table

2023-01-04 Thread Shammon (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30516?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654813#comment-17654813 ] Shammon commented on FLINK-30516: - For this issue, I think maybe we should add statistics such as total

[GitHub] [flink-ml] lindong28 commented on a diff in pull request #193: [FLINK-30515] Add benchmark for CountVectorizer, Imputer, RobustScale…

2023-01-04 Thread GitBox
lindong28 commented on code in PR #193: URL: https://github.com/apache/flink-ml/pull/193#discussion_r1062167754 ## flink-ml-benchmark/src/test/java/org/apache/flink/ml/benchmark/BenchmarkTest.java: ## @@ -54,6 +60,44 @@ public void testParseJsonFile() throws Exception {

[GitHub] [flink-ml] jiangxin369 commented on a diff in pull request #193: [FLINK-30515] Add benchmark for CountVectorizer, Imputer, RobustScale…

2023-01-04 Thread GitBox
jiangxin369 commented on code in PR #193: URL: https://github.com/apache/flink-ml/pull/193#discussion_r1062165249 ## flink-ml-benchmark/src/test/java/org/apache/flink/ml/benchmark/BenchmarkTest.java: ## @@ -54,6 +60,44 @@ public void testParseJsonFile() throws Exception {

[jira] [Closed] (FLINK-30558) The metric 'numRestarts' reported in SchedulerBase will be overridden by metric 'fullRestarts'

2023-01-04 Thread Zhu Zhu (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhu Zhu closed FLINK-30558. --- Resolution: Fixed master: b50f42673fcd44bdd6de73f77d646df5c2ad1eed 1.16:

[jira] [Updated] (FLINK-30569) File Format can not change with data file exists

2023-01-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30569?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated FLINK-30569: --- Labels: pull-request-available (was: ) > File Format can not change with data file exists

[jira] [Created] (FLINK-30569) File Format can not change with data file exists

2023-01-04 Thread Jingsong Lee (Jira)
Jingsong Lee created FLINK-30569: Summary: File Format can not change with data file exists Key: FLINK-30569 URL: https://issues.apache.org/jira/browse/FLINK-30569 Project: Flink Issue Type:

[GitHub] [flink-table-store] JingsongLi opened a new pull request, #459: [FLINK-30569] File Format can not change with data file exists

2023-01-04 Thread GitBox
JingsongLi opened a new pull request, #459: URL: https://github.com/apache/flink-table-store/pull/459 1. Set file format to orc. 2. Write records. 3. Set file format to parquet. 4. Write records. 5. Read -> throws an exception... We should support changing the file format.

[GitHub] [flink-ml] jiangxin369 commented on a diff in pull request #193: [FLINK-30515] Add benchmark for CountVectorizer, Imputer, RobustScale…

2023-01-04 Thread GitBox
jiangxin369 commented on code in PR #193: URL: https://github.com/apache/flink-ml/pull/193#discussion_r1062159232 ## flink-ml-benchmark/src/test/java/org/apache/flink/ml/benchmark/BenchmarkTest.java: ## @@ -54,6 +60,44 @@ public void testParseJsonFile() throws Exception {

[jira] [Updated] (FLINK-30558) The metric 'numRestarts' reported in SchedulerBase will be overridden by metric 'fullRestarts'

2023-01-04 Thread Zhu Zhu (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhu Zhu updated FLINK-30558: Affects Version/s: 1.16.0 > The metric 'numRestarts' reported in SchedulerBase will be overridden by >

[GitHub] [flink] yuchuanchen commented on a diff in pull request #21563: [FLINK-19889][connectors/hive/filesystem][format/parquet] Supports nested projection pushdown for filesystem connector of colum

2023-01-04 Thread GitBox
yuchuanchen commented on code in PR #21563: URL: https://github.com/apache/flink/pull/21563#discussion_r1062158485 ## flink-formats/flink-parquet/src/main/java/org/apache/flink/formats/parquet/ParquetVectorizedInputFormat.java: ## @@ -123,7 +151,7 @@ public ParquetReader

[jira] [Updated] (FLINK-30558) The metric 'numRestarts' reported in SchedulerBase will be overridden by metric 'fullRestarts'

2023-01-04 Thread Zhu Zhu (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30558?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhu Zhu updated FLINK-30558: Fix Version/s: 1.16.1 > The metric 'numRestarts' reported in SchedulerBase will be overridden by >

[GitHub] [flink] zhuzhurk merged pull request #21598: [FLINK-30558][runtime] Use different metric object to avoid metric 'numRestarts' being overridden by 'fullRestarts' in SchedulerBase

2023-01-04 Thread GitBox
zhuzhurk merged PR #21598: URL: https://github.com/apache/flink/pull/21598

[GitHub] [flink] xintongsong commented on a diff in pull request #21579: [FLINK-30536][runtime] Remove CountingOutput from per-record code path for most operators

2023-01-04 Thread GitBox
xintongsong commented on code in PR #21579: URL: https://github.com/apache/flink/pull/21579#discussion_r1062140088 ## flink-streaming-java/src/main/java/org/apache/flink/streaming/runtime/tasks/ChainingOutput.java: ## @@ -40,33 +38,24 @@ private static final Logger LOG =

[jira] [Commented] (FLINK-30561) ChangelogStreamHandleReaderWithCache cause FileNotFoundException

2023-01-04 Thread Hangxiang Yu (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30561?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654744#comment-17654744 ] Hangxiang Yu commented on FLINK-30561: -- It seems FLINK-30107 and FLINK-28898 are caused by this?

[GitHub] [flink] flinkbot commented on pull request #21602: [FLINK-30567][hive] convert table name to lower case when decide whether it's overwrite

2023-01-04 Thread GitBox
flinkbot commented on PR #21602: URL: https://github.com/apache/flink/pull/21602#issuecomment-1371783090 ## CI report: * 21d92c71e94f00b97ffad0180dd88ec06d4e17f2 UNKNOWN Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run azure`

[jira] [Updated] (FLINK-30567) Wrong insert overwrite behavior when the table contains uppercase character with Hive dialect

2023-01-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30567?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated FLINK-30567: --- Labels: pull-request-available (was: ) > Wrong insert overwrite behavior when the table

[GitHub] [flink] luoyuxia opened a new pull request, #21602: [FLINK-30567][hive] convert table name to lower case when decide whether it's overwrite

2023-01-04 Thread GitBox
luoyuxia opened a new pull request, #21602: URL: https://github.com/apache/flink/pull/21602 ## What is the purpose of the change *(For example: This pull request makes task deployment go through the blob server, rather than through RPC. That way we avoid re-transferring them

[jira] [Closed] (FLINK-30560) Add more description of 'Overwriting a Partition' to doc 'Writing Tables'

2023-01-04 Thread Jingsong Lee (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jingsong Lee closed FLINK-30560. Fix Version/s: table-store-0.3.0 Assignee: yuzelin Resolution: Fixed master:

[GitHub] [flink-table-store] JingsongLi merged pull request #457: [FLINK-30560] Add more description of 'Overwriting a Partition' to doc 'Writing Tables'

2023-01-04 Thread GitBox
JingsongLi merged PR #457: URL: https://github.com/apache/flink-table-store/pull/457

[jira] [Updated] (FLINK-30565) Flink-parquet free for flink-table-store-format

2023-01-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30565?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated FLINK-30565: --- Labels: pull-request-available (was: ) > Flink-parquet free for flink-table-store-format >

[GitHub] [flink-table-store] JingsongLi opened a new pull request, #458: [FLINK-30565] Flink-parquet free for flink-table-store-format

2023-01-04 Thread GitBox
JingsongLi opened a new pull request, #458: URL: https://github.com/apache/flink-table-store/pull/458 Copy code from flink-parquet.

[jira] [Closed] (FLINK-30497) Introduce TableChange to represent DROP change

2023-01-04 Thread Shengkai Fang (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30497?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Shengkai Fang closed FLINK-30497. - Resolution: Implemented Merged into master: cce059c556bc2538cba8918e281320516c4be222 >

[GitHub] [flink] fsk119 merged pull request #21592: [FLINK-30497][table-api] Introduce TableChange to represent DROP change

2023-01-04 Thread GitBox
fsk119 merged PR #21592: URL: https://github.com/apache/flink/pull/21592

[jira] [Created] (FLINK-30568) Add benchmark for PolyNomialExpansion, Normalizer, Binarizer, Interaction, MaxAbsScaler, VectorSlicer, ElementWiseProduct and Featurehasher

2023-01-04 Thread weibo zhao (Jira)
weibo zhao created FLINK-30568: -- Summary: Add benchmark for PolyNomialExpansion, Normalizer, Binarizer, Interaction, MaxAbsScaler, VectorSlicer, ElementWiseProduct and Featurehasher Key: FLINK-30568 URL:

[jira] [Commented] (FLINK-30567) Wrong insert overwrite behavior when the table contains uppercase character with Hive dialect

2023-01-04 Thread luoyuxia (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30567?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654725#comment-17654725 ] luoyuxia commented on FLINK-30567: -- Introduced by FLINK-29337; we missed converting the table name into

[jira] [Created] (FLINK-30567) Wrong insert overwrite behavior when the table contains uppercase character with Hive dialect

2023-01-04 Thread luoyuxia (Jira)
luoyuxia created FLINK-30567: Summary: Wrong insert overwrite behavior when the table contains uppercase character with Hive dialect Key: FLINK-30567 URL: https://issues.apache.org/jira/browse/FLINK-30567

[jira] [Assigned] (FLINK-30566) Add benchmark configurations for agglomerativeclustering, hashingtf, idf, kbinsdiscretizer, linearregression, linearsvc, logisticregression, ngram, regextokenizer, tok

2023-01-04 Thread Zhipeng Zhang (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zhipeng Zhang reassigned FLINK-30566: - Assignee: Zhipeng Zhang > Add benchmark configurations for agglomerativeclustering,

[jira] [Updated] (FLINK-30566) Add benchmark configurations for agglomerativeclustering, hashingtf, idf, kbinsdiscretizer, linearregression, linearsvc, logisticregression, ngram, regextokenizer, toke

2023-01-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30566?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated FLINK-30566: --- Labels: pull-request-available (was: ) > Add benchmark configurations for

[GitHub] [flink-ml] zhipeng93 opened a new pull request, #197: [FLINK-30566] Add benchmark for AgglomerativeClustering, HashingTF, IDF, KBinsDiscretizer, LinearRegression, LinearSVC, LogisticRegre

2023-01-04 Thread GitBox
zhipeng93 opened a new pull request, #197: URL: https://github.com/apache/flink-ml/pull/197 ## What is the purpose of the change - Add benchmark configurations for some of the existing algorithms. ## Brief change log - Add benchmark configuration files for

[jira] [Created] (FLINK-30566) Add benchmark configurations for agglomerativeclustering, hashingtf, idf, kbinsdiscretizer, linearregression, linearsvc, logisticregression, ngram, regextokenizer, toke

2023-01-04 Thread Zhipeng Zhang (Jira)
Zhipeng Zhang created FLINK-30566: - Summary: Add benchmark configurations for agglomerativeclustering, hashingtf, idf, kbinsdiscretizer, linearregression, linearsvc, logisticregression, ngram, regextokenizer, tokenizer and vectorindexer

[jira] [Updated] (FLINK-30563) Update training exercises to use Flink 1.16

2023-01-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30563?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated FLINK-30563: --- Labels: pull-request-available (was: ) > Update training exercises to use Flink 1.16 >

[GitHub] [flink-training] TanYuxin-tyx opened a new pull request, #50: [FLINK-30563] Update to Flink 1.16

2023-01-04 Thread GitBox
TanYuxin-tyx opened a new pull request, #50: URL: https://github.com/apache/flink-training/pull/50 Update the Flink version to Flink 1.16.

[jira] [Commented] (FLINK-30563) Update training exercises to use Flink 1.16

2023-01-04 Thread Yuxin Tan (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30563?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654714#comment-17654714 ] Yuxin Tan commented on FLINK-30563: --- Thanks [~danderson] for reporting this. I want to take a look at

[jira] [Created] (FLINK-30565) Flink-parquet free for flink-table-store-format

2023-01-04 Thread Jingsong Lee (Jira)
Jingsong Lee created FLINK-30565: Summary: Flink-parquet free for flink-table-store-format Key: FLINK-30565 URL: https://issues.apache.org/jira/browse/FLINK-30565 Project: Flink Issue Type:

[jira] [Closed] (FLINK-30552) Pulsar connector shouldn't assert the BatchMessageId size.

2023-01-04 Thread Zili Chen (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30552?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Zili Chen closed FLINK-30552. - Resolution: Fixed master via https://github.com/apache/flink-connector-pulsar/pull/11 > Pulsar

[GitHub] [flink-connector-pulsar] tisonkun merged pull request #11: [FLINK-30552][Connector/Pulsar] drop batch message size assertion, better set the cursor position.

2023-01-04 Thread GitBox
tisonkun merged PR #11: URL: https://github.com/apache/flink-connector-pulsar/pull/11

[GitHub] [flink] xintongsong commented on a diff in pull request #21565: [FLINK-18229][ResourceManager] Support cancel pending workers if no longer needed.

2023-01-04 Thread GitBox
xintongsong commented on code in PR #21565: URL: https://github.com/apache/flink/pull/21565#discussion_r1062064266 ## flink-yarn/src/main/java/org/apache/flink/yarn/YarnResourceManagerDriver.java: ## @@ -435,9 +487,19 @@ private void startTaskExecutorInContainerAsync(

[GitHub] [flink] flinkbot commented on pull request #21601: [FLINK-27920][hive] Supports native avg function for hive dialect

2023-01-04 Thread GitBox
flinkbot commented on PR #21601: URL: https://github.com/apache/flink/pull/21601#issuecomment-1371700252 ## CI report: * 2b2b2edad3807c999105bb9163bf21da8bffe6c6 UNKNOWN Bot commands The @flinkbot bot supports the following commands: - `@flinkbot run azure`

[GitHub] [flink] lsyldliu opened a new pull request, #21601: [FLINK-27920][hive] Supports native avg function for hive dialect

2023-01-04 Thread GitBox
lsyldliu opened a new pull request, #21601: URL: https://github.com/apache/flink/pull/21601 ## What is the purpose of the change *Implementing a native hive avg function based on the expression aggregate function, can improve the agg performance by using hash-agg strategy.*

[GitHub] [flink-table-store] yuzelin commented on a diff in pull request #457: [FLINK-30560] Add more description of 'Overwriting a Partition' to doc 'Writing Tables'

2023-01-04 Thread GitBox
yuzelin commented on code in PR #457: URL: https://github.com/apache/flink-table-store/pull/457#discussion_r1062053382 ## docs/content/docs/how-to/writing-tables.md: ## @@ -87,3 +87,66 @@ INSERT OVERWRITE MyTable PARTITION (key1 = value1, key2 = value2, ...) SELECT .. {{<

[jira] [Closed] (FLINK-30547) Flink-table-runtime free for flink-table-store-common

2023-01-04 Thread Jingsong Lee (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30547?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Jingsong Lee closed FLINK-30547. Resolution: Fixed master: f2314e46617d4d0dcbdc5f65a2699209072a08e3 > Flink-table-runtime free

[jira] [Comment Edited] (FLINK-30556) Improve the logic for enumerating splits for Hive source to avoid potential OOM

2023-01-04 Thread luoyuxia (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654702#comment-17654702 ] luoyuxia edited comment on FLINK-30556 at 1/5/23 2:15 AM: -- [~Wencong Liu]

[GitHub] [flink-table-store] JingsongLi merged pull request #452: [FLINK-30547] Flink-table-runtime free for flink-table-store-common

2023-01-04 Thread GitBox
JingsongLi merged PR #452: URL: https://github.com/apache/flink-table-store/pull/452

[jira] [Closed] (FLINK-29796) pyflink protobuf requirement out of date

2023-01-04 Thread Huang Xingbo (Jira)
[ https://issues.apache.org/jira/browse/FLINK-29796?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Huang Xingbo closed FLINK-29796. Resolution: Fixed > pyflink protobuf requirement out of date >

[jira] [Commented] (FLINK-30556) Improve the logic for enumerating splits for Hive source to avoid potential OOM

2023-01-04 Thread luoyuxia (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654702#comment-17654702 ] luoyuxia commented on FLINK-30556: -- [~Wencong Liu] Thanks for the investigation. TBH, that's what I thought.

[jira] [Updated] (FLINK-30560) Add more description of 'Overwriting a Partition' to doc 'Writing Tables'

2023-01-04 Thread ASF GitHub Bot (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30560?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] ASF GitHub Bot updated FLINK-30560: --- Labels: pull-request-available (was: ) > Add more description of 'Overwriting a Partition'

[GitHub] [flink-table-store] JingsongLi commented on a diff in pull request #457: [FLINK-30560] Add more description of 'Overwriting a Partition' to doc 'Writing Tables'

2023-01-04 Thread GitBox
JingsongLi commented on code in PR #457: URL: https://github.com/apache/flink-table-store/pull/457#discussion_r1062035797 ## docs/content/docs/how-to/writing-tables.md: ## @@ -87,3 +87,66 @@ INSERT OVERWRITE MyTable PARTITION (key1 = value1, key2 = value2, ...) SELECT .. {{<

[jira] [Assigned] (FLINK-30084) Remove unused ResourceActions.notifyAllocationFailure

2023-01-04 Thread Xintong Song (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30084?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xintong Song reassigned FLINK-30084: Assignee: Weihua Hu > Remove unused ResourceActions.notifyAllocationFailure >

[jira] [Assigned] (FLINK-29869) make ResourceActions declarative

2023-01-04 Thread Xintong Song (Jira)
[ https://issues.apache.org/jira/browse/FLINK-29869?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xintong Song reassigned FLINK-29869: Assignee: Weihua Hu > make ResourceActions declarative >

[jira] [Assigned] (FLINK-29870) split ResourceNotEnoughNotifier from ResourceActions

2023-01-04 Thread Xintong Song (Jira)
[ https://issues.apache.org/jira/browse/FLINK-29870?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Xintong Song reassigned FLINK-29870: Assignee: Weihua Hu > split ResourceNotEnoughNotifier from ResourceActions >

[GitHub] [flink-ml] weibozhao commented on pull request #189: [FLINK-30348] Refine Transformer for RandomSplitter

2023-01-04 Thread GitBox
weibozhao commented on PR #189: URL: https://github.com/apache/flink-ml/pull/189#issuecomment-1371647105 > > Hi @weibozhao, thanks for the update. Could you please modify the description section of this PR and its corresponding Jira ticket, explaining the changes made in

[jira] [Commented] (FLINK-30030) Unexpected behavior for overwrite in Hive dialect

2023-01-04 Thread luoyuxia (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30030?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17654699#comment-17654699 ] luoyuxia commented on FLINK-30030: -- After some investigation, the expected behavior should consider

[jira] [Closed] (FLINK-30030) Unexpected behavior for overwrite in Hive dialect

2023-01-04 Thread luoyuxia (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30030?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] luoyuxia closed FLINK-30030. Fix Version/s: 1.16.1 1.16.0 Resolution: Invalid > Unexpected behavior for

[jira] [Updated] (FLINK-30561) ChangelogStreamHandleReaderWithCache cause FileNotFoundException

2023-01-04 Thread Feifan Wang (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feifan Wang updated FLINK-30561: Priority: Critical (was: Major) > ChangelogStreamHandleReaderWithCache cause

[jira] [Updated] (FLINK-30561) ChangelogStreamHandleReaderWithCache cause FileNotFoundException

2023-01-04 Thread Feifan Wang (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30561?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Feifan Wang updated FLINK-30561: Priority: Major (was: Critical) > ChangelogStreamHandleReaderWithCache cause

[GitHub] [flink-ml] lindong28 commented on a diff in pull request #195: [FLINK-30532] Add benchmark for DCT, SQLTransformer and StopWordsRemover

2023-01-04 Thread GitBox
lindong28 commented on code in PR #195: URL: https://github.com/apache/flink-ml/pull/195#discussion_r1062018596 ## flink-ml-benchmark/src/main/resources/dct-benchmark.json: ## @@ -0,0 +1,40 @@ +// Licensed to the Apache Software Foundation (ASF) under one or more +//

[GitHub] [flink-ml] lindong28 commented on a diff in pull request #193: [FLINK-30515] Add benchmark for CountVectorizer, Imputer, RobustScale…

2023-01-04 Thread GitBox
lindong28 commented on code in PR #193: URL: https://github.com/apache/flink-ml/pull/193#discussion_r1062011981 ## flink-ml-benchmark/src/test/java/org/apache/flink/ml/benchmark/BenchmarkTest.java: ## @@ -54,6 +60,44 @@ public void testParseJsonFile() throws Exception {

[jira] [Assigned] (FLINK-25509) FLIP-208: Add RecordEvaluator to dynamically stop source based on de-serialized records

2023-01-04 Thread Dong Lin (Jira)
[ https://issues.apache.org/jira/browse/FLINK-25509?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Dong Lin reassigned FLINK-25509: Assignee: Hang Ruan (was: Dong Lin) > FLIP-208: Add RecordEvaluator to dynamically stop source

[jira] [Created] (FLINK-30564) Select from a new table with Kafka LogStore crashes with UnknownTopicOrPartitionException

2023-01-04 Thread Alex Sorokoumov (Jira)
Alex Sorokoumov created FLINK-30564: --- Summary: Select from a new table with Kafka LogStore crashes with UnknownTopicOrPartitionException Key: FLINK-30564 URL: https://issues.apache.org/jira/browse/FLINK-30564

[GitHub] [flink] kristoffSC commented on pull request #21393: [Draft][FLINK-27246_master] - Split generated java methods - Work in progress

2023-01-04 Thread GitBox
kristoffSC commented on PR #21393: URL: https://github.com/apache/flink/pull/21393#issuecomment-1371489783 @flinkbot run azure

[jira] [Assigned] (FLINK-30483) Make Avro format support for TIMESTAMP_LTZ

2023-01-04 Thread Martijn Visser (Jira)
[ https://issues.apache.org/jira/browse/FLINK-30483?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Martijn Visser reassigned FLINK-30483: -- Assignee: Jagadesh Adireddi > Make Avro format support for TIMESTAMP_LTZ >
