[jira] [Commented] (FLINK-21005) Introduce new provider for unified Sink API and implement in planner

2021-01-18 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-21005?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17267062#comment-17267062
 ] 

huajiewang commented on FLINK-21005:


please assign to me

> Introduce new provider for unified Sink API and implement in planner
> 
>
> Key: FLINK-21005
> URL: https://issues.apache.org/jira/browse/FLINK-21005
> Project: Flink
>  Issue Type: Sub-task
>  Components: Table SQL / API, Table SQL / Planner
>Reporter: Jark Wu
>Priority: Major
> Fix For: 1.13.0
>
>
> FLIP-143 [1] introduced the unified sink API, we should add a 
> {{SinkRuntimeProvider}} for it and support it in planner. So that Table SQL 
> users can also use the unified sink APIs. 
> [1]: 
> https://cwiki.apache.org/confluence/display/FLINK/FLIP-143%3A+Unified+Sink+API



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-15 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17265817#comment-17265817
 ] 

huajiewang edited comment on FLINK-20972 at 1/15/21, 8:39 AM:
--

[~gaoyunhaii]
My code is just an example,What you said makes sense, but in my opinion, this 
is a notification message output by the flink framework to tell the user which 
batch of checkpoint completed, but currently there are no requirements and 
restrictions for the Transaction type, so that users can Free definition, a 
little carelessness will cause this problem, unless the user is very familiar 
with the processing logic of this code, in order to effectively avoid this 
problem, about this Transaction class, if the flink output information when the 
checkpoint is completed requires user participation, then Flink can completely 
define an interface type (Transaction), allowing users to implement this 
interface. So I think this is the issue of Flink


was (Author: benjobs):
[~gaoyunhaii] What you said is reasonable, but in my opinion, this is a 
notification message output from the internal Flink framework, which is used to 
tell the user which batch of checkpoint completed. However, at present, there 
are no requirements and restrictions for the transaction type, so that the user 
can freely define it. A little carelessness will cause this problem, unless the 
user is very familiar with the processing logic of this code, In order to 
effectively avoid this problem, regarding the transaction class, if the user is 
required to participate in the output information when the checkpoint is 
completed, the Flink can completely define an interface type (Transaction) for 
the user to implement the interface. Therefore, I think this is the issue of 
Flink

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> !1610682498960.jpg|width=838,height=630!
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance) 
> TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-15 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17265817#comment-17265817
 ] 

huajiewang commented on FLINK-20972:


[~gaoyunhaii] What you said is reasonable, but in my opinion, this is a 
notification message output from the internal Flink framework, which is used to 
tell the user which batch of checkpoint completed. However, at present, there 
are no requirements and restrictions for the transaction type, so that the user 
can freely define it. A little carelessness will cause this problem, unless the 
user is very familiar with the processing logic of this code, In order to 
effectively avoid this problem, regarding the transaction class, if the user is 
required to participate in the output information when the checkpoint is 
completed, the Flink can completely define an interface type (Transaction) for 
the user to implement the interface. Therefore, I think this is the issue of 
Flink

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> !1610682498960.jpg|width=838,height=630!
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance) 
> TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17265732#comment-17265732
 ] 

huajiewang edited comment on FLINK-20972 at 1/15/21, 6:29 AM:
--

Hi [~gaoyunhaii] u are right, my Transaction class like this:

scala: 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, 
sqlList: ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sqlList += text
}{code}
java: 
{code:java}
class Transaction {
  private String transactionId;
  private List sqlList;

  public Transaction() {
this.transactionId = UUID.randomUUID().toString;
this.sqlList = new ArrayList<>;
  }

  public void add(String sql) {
 this.sqlList.add(sql)
  }
}{code}
 log.info will be invoke Transaction'toString, all data of sqlList will be 
output.
{code:java}
LOG.info( "{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}", name(), checkpointId, pendingTransaction, 
pendingTransactionCheckpointId); {code}


was (Author: benjobs):
Hi [~gaoyunhaii] u are right, my Transaction class like this:

scala: 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 
{code:java}
class Transaction {
  private String transactionId;
  private List sqlList;

  public Transaction() {
this.transactionId = UUID.randomUUID().toString;
this.sqlList = new ArrayList<>;
  }

  public void add(String sql) {
 this.sqlList.add(sql)
  }
}{code}
 log.info will be invoke Transaction'toString, all data of sqlList will be 
output.
{code:java}
LOG.info( "{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}", name(), checkpointId, pendingTransaction, 
pendingTransactionCheckpointId); {code}

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> !1610682498960.jpg|width=838,height=630!
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance) 
> TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17265737#comment-17265737
 ] 

huajiewang edited comment on FLINK-20972 at 1/15/21, 6:27 AM:
--

Hi [~gaoyunhaii] i can overwrite toString method in my Transaction, If don't 
fix the TwoPhaseCommitSinkFunctionOutput, it means that all transaction class 
may have such a problem. To solve this problem, you need to overwrite the 
toString method in your Transaction class


was (Author: benjobs):
Hi [~gaoyunhaii] i can overwrite toString method in my Transaction, If don't 
fix the TwoPhaseCommitSinkFunctionOutput, it means that all transaction class 
may have such a problem. To solve this problem, you need to reset the toString 
method.

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> !1610682498960.jpg|width=838,height=630!
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance) 
> TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17265737#comment-17265737
 ] 

huajiewang edited comment on FLINK-20972 at 1/15/21, 6:25 AM:
--

Hi [~gaoyunhaii] i can overwrite toString method in my Transaction, If don't 
fix the TwoPhaseCommitSinkFunctionOutput, it means that all transaction class 
may have such a problem. To solve this problem, you need to reset the toString 
method.


was (Author: benjobs):
Hi [~gaoyunhaii] i can overwrite toString method for my Transaction, If don't 
fix the TwoPhaseCommitSinkFunctionOutput, it means that all transaction class 
may have such a problem. To solve this problem, you need to reset the toString 
method.

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> !1610682498960.jpg|width=838,height=630!
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance) 
> TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17265737#comment-17265737
 ] 

huajiewang commented on FLINK-20972:


Hi [~gaoyunhaii] i can overwrite toString method for my Transaction, If don't 
fix the TwoPhaseCommitSinkFunctionOutput, it means that all transaction class 
may have such a problem. To solve this problem, you need to reset the toString 
method.

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> !1610682498960.jpg|width=838,height=630!
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance) 
> TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17265732#comment-17265732
 ] 

huajiewang edited comment on FLINK-20972 at 1/15/21, 6:20 AM:
--

Hi [~gaoyunhaii] u are right, my Transaction class like this:

scala: 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 
{code:java}
class Transaction {
  private String transactionId;
  private List sqlList;

  public Transaction() {
this.transactionId = UUID.randomUUID().toString;
this.sqlList = new ArrayList<>;
  }

  public void add(String sql) {
 this.sqlList.add(sql)
  }
}{code}
 log.info will be invoke Transaction'toString, all data of sqlList will be 
output.
{code:java}
LOG.info( "{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}", name(), checkpointId, pendingTransaction, 
pendingTransactionCheckpointId); {code}


was (Author: benjobs):
Hi [~gaoyunhaii] u are right, my Transaction class like this:

scala: 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 
{code:java}
class Transaction {
  private String transactionId;
  private List sqlList;

  public Transaction() {
this.transactionId = UUID.randomUUID().toString;
this.sqlList = new ArrayList<>;
  }

  public void add(String sql) {
 this.sqlList.add(sql)
  }
}{code}
 

 log.info will be invoke Transaction'toString
{code:java}
LOG.info( "{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}", name(), checkpointId, pendingTransaction, 
pendingTransactionCheckpointId); {code}

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> !1610682498960.jpg|width=838,height=630!
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance) 
> TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17265732#comment-17265732
 ] 

huajiewang edited comment on FLINK-20972 at 1/15/21, 6:19 AM:
--

Hi [~gaoyunhaii] u are right, my Transaction class like this:

scala: 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 
{code:java}
class Transaction {
  private String transactionId;
  private List sqlList;

  public Transaction() {
this.transactionId = UUID.randomUUID().toString;
this.sqlList = new ArrayList<>;
  }

  public void add(String sql) {
 this.sqlList.add(sql)
  }
}{code}
 

 log.info will be invoke Transaction'toString
{code:java}
LOG.info( "{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}", name(), checkpointId, pendingTransaction, 
pendingTransactionCheckpointId); {code}


was (Author: benjobs):
Hi [~gaoyunhaii] u are right, my Transaction class like this:

scala: 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 
{code:java}
class Transaction {
  private String transactionId;
  private List sqlList;

  public Transaction() {
this.transactionId = UUID.randomUUID().toString;
this.sqlList = new ArrayList<>;
  }

  public void add(String sql) {
 this.sqlList.add(sql)
  }
}{code}
 

 log.info will be invoke Transaction'toString
{code:java}
LOG.info( "{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}", name(), checkpointId, pendingTransaction, 
pendingTransactionCheckpointId); {code}

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> !1610682498960.jpg|width=838,height=630!
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance) 
> TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17265732#comment-17265732
 ] 

huajiewang edited comment on FLINK-20972 at 1/15/21, 6:18 AM:
--

Hi [~gaoyunhaii] u are right, my Transaction class like this:

scala: 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 
{code:java}
class Transaction {
  private String transactionId;
  private List sqlList;

  public Transaction() {
this.transactionId = UUID.randomUUID().toString;
this.sqlList = new ArrayList<>;
  }

  public void add(String sql) {
 this.sqlList.add(sql)
  }
}{code}
 

 log.info will be invoke Transaction'toString
{code:java}
LOG.info( "{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}", name(), checkpointId, pendingTransaction, 
pendingTransactionCheckpointId); {code}


was (Author: benjobs):
Hi [~gaoyunhaii] u are right, my Transaction class like this:

scala: 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 
{code:java}
class Transaction {
  private String transactionId;
  private List sqlList;

  public Transaction() {
this.transactionId = UUID.randomUUID().toString;
this.sqlList = new ArrayList<>;
  }

  public void add(String sql) {
 this.sqlList.add(sql)
  }
}{code}
 

 

 

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> !1610682498960.jpg|width=838,height=630!
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance) 
> TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17265732#comment-17265732
 ] 

huajiewang edited comment on FLINK-20972 at 1/15/21, 6:18 AM:
--

Hi [~gaoyunhaii] u are right, my Transaction class like this:

scala: 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 
{code:java}
class Transaction {
  private String transactionId;
  private List sqlList;

  public Transaction() {
this.transactionId = UUID.randomUUID().toString;
this.sqlList = new ArrayList<>;
  }

  public void add(String sql) {
 this.sqlList.add(sql)
  }
}{code}
 

 log.info will be invoke Transaction'toString
{code:java}
LOG.info( "{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}", name(), checkpointId, pendingTransaction, 
pendingTransactionCheckpointId); {code}


was (Author: benjobs):
Hi [~gaoyunhaii] u are right, my Transaction class like this:

scala: 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 
{code:java}
class Transaction {
  private String transactionId;
  private List sqlList;

  public Transaction() {
this.transactionId = UUID.randomUUID().toString;
this.sqlList = new ArrayList<>;
  }

  public void add(String sql) {
 this.sqlList.add(sql)
  }
}{code}
 

 log.info will be invoke Transaction'toString
{code:java}
LOG.info( "{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}", name(), checkpointId, pendingTransaction, 
pendingTransactionCheckpointId); {code}

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> !1610682498960.jpg|width=838,height=630!
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance) 
> TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17265732#comment-17265732
 ] 

huajiewang edited comment on FLINK-20972 at 1/15/21, 6:18 AM:
--

Hi [~gaoyunhaii] u are right, my Transaction class like this:

scala: 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 
{code:java}
class Transaction {
  private String transactionId;
  private List sqlList;

  public Transaction() {
this.transactionId = UUID.randomUUID().toString;
this.sqlList = new ArrayList<>;
  }

  public void add(String sql) {
 this.sqlList.add(sql)
  }
}{code}
 

 log.info will be invoke Transaction'toString
{code:java}
LOG.info( "{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}", name(), checkpointId, pendingTransaction, 
pendingTransactionCheckpointId); {code}


was (Author: benjobs):
Hi [~gaoyunhaii] u are right, my Transaction class like this:

scala: 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 
{code:java}
class Transaction {
  private String transactionId;
  private List sqlList;

  public Transaction() {
this.transactionId = UUID.randomUUID().toString;
this.sqlList = new ArrayList<>;
  }

  public void add(String sql) {
 this.sqlList.add(sql)
  }
}{code}
 

 log.info will be invoke Transaction'toString
{code:java}
LOG.info( "{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}", name(), checkpointId, pendingTransaction, 
pendingTransactionCheckpointId); {code}

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> !1610682498960.jpg|width=838,height=630!
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance) 
> TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17265732#comment-17265732
 ] 

huajiewang edited comment on FLINK-20972 at 1/15/21, 6:17 AM:
--

Hi [~gaoyunhaii] u are right, my Transaction class like this:

scala: 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 
{code:java}
class Transaction {
  private String transactionId;
  private List sqlList;

  public Transaction() {
this.transactionId = UUID.randomUUID().toString;
this.sqlList = new ArrayList<>;
  }

  public void add(String sql) {
 this.sqlList.add(sql)
  }
}{code}
 

 

 


was (Author: benjobs):
Hi [~gaoyunhaii] u are right, my Transaction class like this:

scala: 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 
{code:java}
class Transaction {
  private String transactionId;
  private List sqlList;

  public Transaction() {
this.transactionId = UUID.randomUUID().toString;
this.sqlList = new ArrayList<>;
  }

  public void add(String sql) {
 this.sqlList.add(sql)
  }
}{code}
 

 

 

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> !1610682498960.jpg|width=838,height=630!
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance) 
> TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17265732#comment-17265732
 ] 

huajiewang edited comment on FLINK-20972 at 1/15/21, 6:16 AM:
--

Hi [~gaoyunhaii] u are right, my Transaction class like this:

scala: 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 
{code:java}
class Transaction {
  private String transactionId;
  private List sqlList;

  public Transaction() {
this.transactionId = UUID.randomUUID().toString;
this.sqlList = new ArrayList<>;
  }

  public void add(String sql) {
 this.sqlList.add(sql)
  }
}{code}
 

 

 


was (Author: benjobs):
u are right, my Transaction class is here:

scala:

 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 

 

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> !1610682498960.jpg|width=838,height=630!
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance) 
> TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17265732#comment-17265732
 ] 

huajiewang commented on FLINK-20972:


u are right, my Transaction class is here:

scala:

 
{code:java}
case class Transaction(transactionId: String = UUID.randomUUID().toString, sql: 
ListBuffer[String] = ListBuffer.empty) extends Serializable {
  def +(text: String): Unit = sql += text
}{code}
java: 

 

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> !1610682498960.jpg|width=838,height=630!
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance) 
> TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Description: 
in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
output(log.info),which will cause IO bottleneck and disk waste.

 
 my code in the attachment, A large number event data output in the log output 
by flink , e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
in TwoPhaseCommitSinkFunction about LOG.info code is as follows:

!1610682498960.jpg|width=838,height=630!
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
will be invoke pendingTransaction'toString method (pendingTransaction is 
TransactionHolder'instance) 

TransactionHolder'toString method code is:

!1610682603148.jpg|width=859,height=327!
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 

  was:
in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
output(log.info),which will cause IO bottleneck and disk waste.

 
 my code in the attachment, A large number event data output in the log output 
by flink , e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
!1610682498960.jpg|width=838,height=630!

will be invoke pendingTransaction'toString method (pendingTransaction is 
TransactionHolder'instance), TransactionHolder'toString method code is:

!1610682603148.jpg|width=859,height=327!
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 


> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> !1610682498960.jpg|width=838,height=630!
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance) 
> TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String 

[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Description: 
in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
output(log.info),which will cause IO bottleneck and disk waste.

 
 my code in the attachment, A large number event data output in the log output 
by flink , e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
!1610682498960.jpg|width=838,height=630!

will be invoke pendingTransaction'toString method (pendingTransaction is 
TransactionHolder'instance), TransactionHolder'toString method code is:

!1610682603148.jpg|width=859,height=327!
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 

  was:
in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
output(log.info),which will cause IO bottleneck and disk waste.

 
 my code in the attachment, A large number event data output in the log output 
by flink , e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
will be invoke pendingTransaction'toString method (pendingTransaction is 
TransactionHolder'instance), TransactionHolder'toString method code is:
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 


> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> !1610682498960.jpg|width=838,height=630!
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance), TransactionHolder'toString method code is:
> !1610682603148.jpg|width=859,height=327!
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  

[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Attachment: 1610682603148.jpg

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, 1610682603148.jpg, 
> Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance), TransactionHolder'toString method code is:
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Attachment: 1610682498960.jpg

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: 1610682498960.jpg, Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance), TransactionHolder'toString method code is:
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Description: 
in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
output(log.info),which will cause IO bottleneck and disk waste.

 
 my code in the attachment, A large number event data output in the log output 
by flink , e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
will be invoke pendingTransaction'toString method (pendingTransaction is 
TransactionHolder'instance), TransactionHolder'toString method code is:
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 

  was:
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete, Maybe A 
large number of EventData will be output (log.info),which will cause IO 
bottleneck and disk waste.

 
 my code in the attachment, A large number event data output in the log output 
by flink , e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
will be invoke pendingTransaction'toString method (pendingTransaction is 
TransactionHolder'instance), TransactionHolder'toString method code is:
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 


> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> in TwoPhaseCommitSinkFunctionOutput Maybe A large number of EventData will be 
> output(log.info),which will cause IO bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance), TransactionHolder'toString method code is:
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>  handle is the concrete realization of my Transaction! 

[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Description: 
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete, Maybe A 
large number of EventData will be output (log.info),which will cause IO 
bottleneck and disk waste.

 
 my code in the attachment, A large number event data output in the log output 
by flink , e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
will be invoke pendingTransaction'toString method (pendingTransaction is 
TransactionHolder'instance), TransactionHolder'toString method code is:
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 

  was:
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete, Maybe A 
large number of EventData will be output (log.info),which will cause IO 
bottleneck and disk waste.

 
 my code in the attachment, A large number event data output in the log output 
by flink , e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
 

in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
 

will be invoke pendingTransaction'toString method (pendingTransaction is 
TransactionHolder'instance), TransactionHolder'toString method code is:
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}
 
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 


> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete, Maybe 
> A large number of EventData will be output (log.info),which will cause IO 
> bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance), TransactionHolder'toString method code is:
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> 

[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Description: 
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete, Maybe A 
large number of EventData will be output (log.info),which will cause IO 
bottleneck and disk waste.

 
 my code in the attachment, A large number event data output in the log output 
by flink , e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
 

in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
 

will be invoke pendingTransaction'toString method (pendingTransaction is 
TransactionHolder'instance), TransactionHolder'toString method code is:
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}
 
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 

  was:
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete, Maybe A 
large number of EventData will be output (log.info),which will cause IO 
bottleneck and disk waste.

 
 my code in the attachment, A large number event data output in the log output 
by flink , e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
 in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
will be invoke pendingTransaction'toString method (pendingTransaction is 
TransactionHolder'instance), TransactionHolder'toString method code is:

 
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}
 
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 


> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete, Maybe 
> A large number of EventData will be output (log.info),which will cause IO 
> bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
>  
> in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
>  
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance), TransactionHolder'toString method code is:
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + 

[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Description: 
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete, Maybe A 
large number of EventData will be output (log.info),which will cause IO 
bottleneck and disk waste.

 
 my code in the attachment, A large number event data output in the log output 
by flink , e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
 in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
will be invoke pendingTransaction'toString method (pendingTransaction is 
TransactionHolder'instance), TransactionHolder'toString method code is:

 
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}
 
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 

  was:
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete, Maybe A 
large number of EventData will be output (log.info),which will cause IO 
bottleneck and disk waste.

 
 my code in the attachment, A large number event data output in the log output 
by flink , output e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
 in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
will be invoke pendingTransaction'toString method (pendingTransaction is 
TransactionHolder'instance), TransactionHolder'toString method code is:

 
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}
 
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 


> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete, Maybe 
> A large number of EventData will be output (log.info),which will cause IO 
> bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
>  in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance), TransactionHolder'toString method code is:
>  
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + 

[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Description: 
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete, Maybe A 
large number of EventData will be output (log.info),which will cause IO 
bottleneck and disk waste.

 
 my code in the attachment, A large number event data output in the log output 
by flink , output e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
 in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
will be invoke pendingTransaction'toString method (pendingTransaction is 
TransactionHolder'instance), TransactionHolder'toString method code is:

 
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}
 
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 

  was:
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,

Maybe A large number of EventData will be output (log.info)

,which will cause IO bottleneck and disk waste

 
 my code in the attachment, A large number event data output in the log output 
by flink, 
  
 e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
 in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
will be invoke pendingTransaction'toString method (pendingTransaction is 
TransactionHolder'instance), TransactionHolder'toString method code is:

 
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}
 
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 


> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete, Maybe 
> A large number of EventData will be output (log.info),which will cause IO 
> bottleneck and disk waste.
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink , output e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
>  in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance), TransactionHolder'toString method code is:
>  
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + 

[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Description: 
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,

Maybe A large number of EventData will be output (log.info)

,which will cause IO bottleneck and disk waste

 
 my code in the attachment, A large number event data output in the log output 
by flink, 
  
 e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
 in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
will be invoke pendingTransaction'toString method (pendingTransaction is 
TransactionHolder'instance), TransactionHolder'toString method code is:

 
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}
 
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 

  was:
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,

Maybe A large number of EventData will be output (log.info)

,which will cause IO bottleneck and disk waste

 
 my code in the attachment, A large number event data output in the log output 
by flink, 
  
 e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
 method notifyCheckpointComplete of TwoPhaseCommitSinkFunction,  about LOG.info 
code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
will be call the toString method of pendingTransaction (TransactionHolder), 
TransactionHolder'toString method code is:

 
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}

  
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 


> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,
> Maybe A large number of EventData will be output (log.info)
> ,which will cause IO bottleneck and disk waste
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink, 
>   
>  e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
>  in TwoPhaseCommitSinkFunction about LOG.info code is as follows:
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be invoke pendingTransaction'toString method (pendingTransaction is 
> TransactionHolder'instance), TransactionHolder'toString method code is:
>  
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
>

[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Description: 
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,

Maybe A large number of EventData will be output (log.info)

,which will cause IO bottleneck and disk waste

 
 my code in the attachment, A large number event data output in the log output 
by flink, 
  
 e.g: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
 method notifyCheckpointComplete of TwoPhaseCommitSinkFunction,  about LOG.info 
code is as follows:
{code:java}
LOG.info(
"{} - checkpoint {} complete, committing transaction {} from checkpoint 
{}",
name(),
checkpointId,
pendingTransaction,
pendingTransactionCheckpointId); {code}
will be call the toString method of pendingTransaction (TransactionHolder), 
TransactionHolder'toString method code is:

 
{code:java}
@Override
public String toString() {
return "TransactionHolder{"
+ "handle="
+  handle
+ ", transactionStartTime="
+ transactionStartTime
+ '}';
}{code}

  
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 

  was:
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,

Maybe A large number of EventData will be output (log.info)

,which will cause IO bottleneck and disk waste

 
 my code in the attachment, A large number event data output in the log output 
by flink, 
  
 like: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
 method notifyCheckpointComplete of TwoPhaseCommitSinkFunction,  about LOG.info 
code is as follows:
{code:java}
  LOG.info("{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}", name(), checkpointId, pendingTransaction, 
pendingTransactionCheckpointId);  
 {code}
will be call the toString method of pendingTransaction (TransactionHolder), 
TransactionHolder'toString method code is:

 


 @Override
 public String toString() {
 return "TransactionHolder

{" + "handle=" + handle + ", transactionStartTime=" + transactionStartTime + '}

';
 }
  
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 


> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,
> Maybe A large number of EventData will be output (log.info)
> ,which will cause IO bottleneck and disk waste
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink, 
>   
>  e.g: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
>  method notifyCheckpointComplete of TwoPhaseCommitSinkFunction,  about 
> LOG.info code is as follows:
> {code:java}
> LOG.info(
> "{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
> name(),
> checkpointId,
> pendingTransaction,
> pendingTransactionCheckpointId); {code}
> will be call the toString method of pendingTransaction (TransactionHolder), 
> TransactionHolder'toString method code is:
>  
> {code:java}
> @Override
> public String toString() {
> return "TransactionHolder{"
> + "handle="
> +  handle
> + ", transactionStartTime="
> + transactionStartTime
> + '}';
> }{code}
>   
>  handle is the 

[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Description: 
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,

Maybe A large number of EventData will be output (log.info)

,which will cause IO bottleneck and disk waste

 
 my code in the attachment, A large number event data output in the log output 
by flink, 
  
 like: 
{code:java}
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4{code}
 method notifyCheckpointComplete of TwoPhaseCommitSinkFunction,  about LOG.info 
code is as follows:
{code:java}
  LOG.info("{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}", name(), checkpointId, pendingTransaction, 
pendingTransactionCheckpointId);  
 {code}
will be call the toString method of pendingTransaction (TransactionHolder), 
TransactionHolder'toString method code is:

 


 @Override
 public String toString() {
 return "TransactionHolder

{" + "handle=" + handle + ", transactionStartTime=" + transactionStartTime + '}

';
 }
  
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 

  was:
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,

Maybe A large number of EventData will be output (log.info)

,which will cause IO bottleneck and disk waste

 
 my code in the attachment, A large number event data output in the log output 
by flink, 
  
 like: 


 Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde \{handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4

 

method notifyCheckpointComplete of TwoPhaseCommitSinkFunction,  about LOG.info 
code is as follows:

LOG.info("{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}",
 name(), checkpointId, pendingTransaction, pendingTransactionCheckpointId);

 
 will be call the toString method of pendingTransaction (TransactionHolder), 
TransactionHolder'toString method code is:
 @Override
 public String toString() {
 return "TransactionHolder

{" + "handle=" + handle + ", transactionStartTime=" + transactionStartTime + '}

';
 }
  
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 


> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,
> Maybe A large number of EventData will be output (log.info)
> ,which will cause IO bottleneck and disk waste
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink, 
>   
>  like: 
> {code:java}
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde {handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4{code}
>  method notifyCheckpointComplete of TwoPhaseCommitSinkFunction,  about 
> LOG.info code is as follows:
> {code:java}
>   LOG.info("{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}", name(), checkpointId, pendingTransaction, 
> pendingTransactionCheckpointId);  
>  {code}
> will be call the toString method of pendingTransaction (TransactionHolder), 
> TransactionHolder'toString method code is:
>  
>  @Override
>  public String toString() {
>  return "TransactionHolder
> {" + "handle=" + handle + ", transactionStartTime=" + transactionStartTime + 
> '}
> ';
>  }
>   
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Description: 
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,

Maybe A large number of EventData will be output (log.info)

,which will cause IO bottleneck and disk waste

 
 my code in the attachment, A large number event data output in the log output 
by flink, 
  
 like: 


 Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde \{handle=Transaction(b420c880a951403984f231dd7e33597b, 
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4

 

method notifyCheckpointComplete of TwoPhaseCommitSinkFunction,  about LOG.info 
code is as follows:

LOG.info("{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}",
 name(), checkpointId, pendingTransaction, pendingTransactionCheckpointId);

 
 will be call the toString method of pendingTransaction (TransactionHolder), 
TransactionHolder'toString method code is:
 @Override
 public String toString() {
 return "TransactionHolder

{" + "handle=" + handle + ", transactionStartTime=" + transactionStartTime + '}

';
 }
  
 handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
  
  

 

  was:
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,

Maybe A large number of EventData will be output (log.info)

,which will cause IO bottleneck and disk waste

 
my code in the attachment, A large number event data output in the log output 
by flink, 
 
like: 
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde

{handle=Transaction(b420c880a951403984f231dd7e33597b, ListBuffer(insert into 
table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532}

from checkpoint 4

 

 

method notifyCheckpointComplete of TwoPhaseCommitSinkFunction,  about LOG.info 
code is as follows:

LOG.info("{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}",
 name(), checkpointId, pendingTransaction, pendingTransactionCheckpointId);

 
will be call the toString method of pendingTransaction (TransactionHolder), 
TransactionHolder'toString method code is:
@Override
public String toString() {
 return "TransactionHolder{"
 + "handle="
 + handle
 + ", transactionStartTime="
 + transactionStartTime
 + '}';
}
 
handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
 
 

 


> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,
> Maybe A large number of EventData will be output (log.info)
> ,which will cause IO bottleneck and disk waste
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink, 
>   
>  like: 
>  Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde \{handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4
>  
> method notifyCheckpointComplete of TwoPhaseCommitSinkFunction,  about 
> LOG.info code is as follows:
> LOG.info("{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
>  name(), checkpointId, pendingTransaction, pendingTransactionCheckpointId);
>  
>  will be call the toString method of pendingTransaction (TransactionHolder), 
> TransactionHolder'toString method code is:
>  @Override
>  public String toString() {
>  return "TransactionHolder
> {" + "handle=" + handle + ", transactionStartTime=" + transactionStartTime + 
> '}
> ';
>  }
>   
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


[ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17264793#comment-17264793
 ] 

huajiewang commented on FLINK-20972:


yes!

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix, pull-request-available
> Attachments: Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,
> Maybe A large number of EventData will be output (log.info)
> ,which will cause IO bottleneck and disk waste
>  
>  my code in the attachment, A large number event data output in the log 
> output by flink, 
>   
>  like: 
>  Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde \{handle=Transaction(b420c880a951403984f231dd7e33597b, 
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4
>  
> method notifyCheckpointComplete of TwoPhaseCommitSinkFunction,  about 
> LOG.info code is as follows:
> LOG.info("{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
>  name(), checkpointId, pendingTransaction, pendingTransactionCheckpointId);
>  
>  will be call the toString method of pendingTransaction (TransactionHolder), 
> TransactionHolder'toString method code is:
>  @Override
>  public String toString() {
>  return "TransactionHolder
> {" + "handle=" + handle + ", transactionStartTime=" + transactionStartTime + 
> '}
> ';
>  }
>   
>  handle is the concrete realization of my Transaction! There is a parameter 
> of List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>   
>   
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Description: 
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,

Maybe A large number of EventData will be output (log.info)

,which will cause IO bottleneck and disk waste

 
my code in the attachment, A large number event data output in the log output 
by flink, 
 
like: 
Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolde

{handle=Transaction(b420c880a951403984f231dd7e33597b, ListBuffer(insert into 
table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532}

from checkpoint 4

 

 

method notifyCheckpointComplete of TwoPhaseCommitSinkFunction,  about LOG.info 
code is as follows:

LOG.info("{} - checkpoint {} complete, committing transaction {} from 
checkpoint {}",
 name(), checkpointId, pendingTransaction, pendingTransactionCheckpointId);

 
will be call the toString method of pendingTransaction (TransactionHolder), 
TransactionHolder'toString method code is:
@Override
public String toString() {
 return "TransactionHolder{"
 + "handle="
 + handle
 + ", transactionStartTime="
 + transactionStartTime
 + '}';
}
 
handle is the concrete realization of my Transaction! There is a parameter of 
List type in my Transaction, which is used to receive data. as a result, these 
data are printed out(log.info)
 
 

 

  was:
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,

Maybe A large number of EventData will be output (log.info)

,which will cause IO bottleneck and disk waste

 

 

Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolder{handle=Transaction(b420c880a951403984f231dd7e33597b,
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4

 


> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix
> Attachments: Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,
> Maybe A large number of EventData will be output (log.info)
> ,which will cause IO bottleneck and disk waste
>  
> my code in the attachment, A large number event data output in the log output 
> by flink, 
>  
> like: 
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolde
> {handle=Transaction(b420c880a951403984f231dd7e33597b, ListBuffer(insert into 
> table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532}
> from checkpoint 4
>  
>  
> method notifyCheckpointComplete of TwoPhaseCommitSinkFunction,  about 
> LOG.info code is as follows:
> LOG.info("{} - checkpoint {} complete, committing transaction {} from 
> checkpoint {}",
>  name(), checkpointId, pendingTransaction, pendingTransactionCheckpointId);
>  
> will be call the toString method of pendingTransaction (TransactionHolder), 
> TransactionHolder'toString method code is:
> @Override
> public String toString() {
>  return "TransactionHolder{"
>  + "handle="
>  + handle
>  + ", transactionStartTime="
>  + transactionStartTime
>  + '}';
> }
>  
> handle is the concrete realization of my Transaction! There is a parameter of 
> List type in my Transaction, which is used to receive data. as a result, 
> these data are printed out(log.info)
>  
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Description: 
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,

Maybe A large number of EventData will be output (log.info)

,which will cause IO bottleneck and disk waste

 

 

Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
TransactionHolder{handle=Transaction(b420c880a951403984f231dd7e33597b,
ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
transactionStartTime=1610426158532} from checkpoint 4

 

  was:
when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,

Maybe A large number of EventData will be output (log.info)

,which will cause IO bottleneck and disk waste

 


> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix
> Attachments: Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,
> Maybe A large number of EventData will be output (log.info)
> ,which will cause IO bottleneck and disk waste
>  
>  
> Jdbc2PCSinkFunction 1/1 - checkpoint 4 complete, committing transaction 
> TransactionHolder{handle=Transaction(b420c880a951403984f231dd7e33597b,
> ListBuffer(insert into table(field1,field2) value ('11','22') ... ... ), 
> transactionStartTime=1610426158532} from checkpoint 4
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)


 [ 
https://issues.apache.org/jira/browse/FLINK-20972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

huajiewang updated FLINK-20972:
---
Attachment: Jdbc2PCSinkFunction.scala

> TwoPhaseCommitSinkFunction Output a large amount of EventData
> -
>
> Key: FLINK-20972
> URL: https://issues.apache.org/jira/browse/FLINK-20972
> Project: Flink
>  Issue Type: Improvement
>  Components: API / DataStream
>Affects Versions: 1.12.0
> Environment: flink 1.4.0 +
>Reporter: huajiewang
>Priority: Minor
>  Labels: easyfix
> Attachments: Jdbc2PCSinkFunction.scala
>
>   Original Estimate: 1h
>  Remaining Estimate: 1h
>
> when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,
> Maybe A large number of EventData will be output (log.info)
> ,which will cause IO bottleneck and disk waste
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (FLINK-20972) TwoPhaseCommitSinkFunction Output a large amount of EventData

2021-01-14 Thread huajiewang (Jira)
huajiewang created FLINK-20972:
--

 Summary: TwoPhaseCommitSinkFunction Output a large amount of 
EventData
 Key: FLINK-20972
 URL: https://issues.apache.org/jira/browse/FLINK-20972
 Project: Flink
  Issue Type: Improvement
  Components: API / DataStream
Affects Versions: 1.12.0
 Environment: flink 1.4.0 +
Reporter: huajiewang


when TwoPhaseCommitSinkFunctionOutput tigger notifyCheckpointComplete,

Maybe A large number of EventData will be output (log.info)

,which will cause IO bottleneck and disk waste

 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)