[jira] [Updated] (BEAM-10224) Test using a DATE field in an aggregation

2020-06-10 Thread Robin Qiu (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10224?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robin Qiu updated BEAM-10224:
-
Labels: beam-fixit  (was: )

> Test using a DATE field in an aggregation
> -
>
> Key: BEAM-10224
> URL: https://issues.apache.org/jira/browse/BEAM-10224
> Project: Beam
>  Issue Type: Task
>  Components: dsl-sql
>Reporter: Brian Hulette
>Assignee: Robin Qiu
>Priority: P2
>  Labels: beam-fixit
>
> Since logical types are aggregated with their representation type this may be 
> an issue. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9179) Refactor Beam ZetaSQL type translation code

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9179?focusedWorklogId=444136&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444136
 ]

ASF GitHub Bot logged work on BEAM-9179:


Author: ASF GitHub Bot
Created on: 11/Jun/20 06:46
Start Date: 11/Jun/20 06:46
Worklog Time Spent: 10m 
  Work Description: robinyqiu opened a new pull request #11988:
URL: https://github.com/apache/beam/pull/11988


   Simple code cleanup (adding tests, removing dead code, etc.). Does not 
change functionality.
   
   r: @apilloud 
   
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Samza | Spark
   --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/)
   Python | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Python35/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python35/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Python36/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python36/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Python37/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python37/lastCompletedBuild/)
 | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Py_VR_Dataflow/lastCompletedBuild/badge/icon)](https://builds.ap

[jira] [Commented] (BEAM-9178) Support ZetaSQL TIMESTAMP functions in BeamSQL

2020-06-10 Thread Robin Qiu (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9178?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132956#comment-17132956
 ] 

Robin Qiu commented on BEAM-9178:
-

The majority of TIMESTAMP functions are already supported. The remaining ones 
requires support of other data types (e.g. TIME, DATETIME). After those types 
are implemented we can complete this issue.

> Support ZetaSQL TIMESTAMP functions in BeamSQL
> --
>
> Key: BEAM-9178
> URL: https://issues.apache.org/jira/browse/BEAM-9178
> Project: Beam
>  Issue Type: New Feature
>  Components: dsl-sql-zetasql
>Reporter: Robin Qiu
>Priority: P2
>  Time Spent: 3.5h
>  Remaining Estimate: 0h
>
> Support *all* TIMESTAMP functions defined in ZetaSQL (BigQuery Standard SQL). 
> See the full list of functions below:
> [https://cloud.google.com/bigquery/docs/reference/standard-sql/timestamp_functions]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-9178) Support ZetaSQL TIMESTAMP functions in BeamSQL

2020-06-10 Thread Robin Qiu (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9178?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robin Qiu reassigned BEAM-9178:
---

Assignee: Zijie Song

> Support ZetaSQL TIMESTAMP functions in BeamSQL
> --
>
> Key: BEAM-9178
> URL: https://issues.apache.org/jira/browse/BEAM-9178
> Project: Beam
>  Issue Type: New Feature
>  Components: dsl-sql-zetasql
>Reporter: Robin Qiu
>Assignee: Zijie Song
>Priority: P2
>  Time Spent: 3.5h
>  Remaining Estimate: 0h
>
> Support *all* TIMESTAMP functions defined in ZetaSQL (BigQuery Standard SQL). 
> See the full list of functions below:
> [https://cloud.google.com/bigquery/docs/reference/standard-sql/timestamp_functions]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-10240) Support ZetaSQL DATETIME functions in BeamSQL

2020-06-10 Thread Robin Qiu (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10240?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robin Qiu updated BEAM-10240:
-
Status: Open  (was: Triage Needed)

> Support ZetaSQL DATETIME functions in BeamSQL
> -
>
> Key: BEAM-10240
> URL: https://issues.apache.org/jira/browse/BEAM-10240
> Project: Beam
>  Issue Type: New Feature
>  Components: dsl-sql-zetasql
>Reporter: Robin Qiu
>Assignee: Zijie Song
>Priority: P2
>
> Support *all* DATETIME functions defined in ZetaSQL (BigQuery Standard SQL). 
> See the list of functions below:
> [https://github.com/google/zetasql/blob/master/docs/datetime_functions.md|https://github.com/google/zetasql/blob/master/docs/time_functions.md]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-10239) Support ZetaSQL NUMERIC type in BeamSQL

2020-06-10 Thread Robin Qiu (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10239?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robin Qiu updated BEAM-10239:
-
Status: Open  (was: Triage Needed)

> Support ZetaSQL NUMERIC type in BeamSQL
> ---
>
> Key: BEAM-10239
> URL: https://issues.apache.org/jira/browse/BEAM-10239
> Project: Beam
>  Issue Type: New Feature
>  Components: dsl-sql-zetasql
>Reporter: Robin Qiu
>Assignee: Zijie Song
>Priority: P2
>
> Support NUMERIC type defined in ZetaSQL (BigQuery Standard SQL). See the list 
> of functions below:
> [https://github.com/google/zetasql/blob/master/docs/data-types.md#numeric-type]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-10240) Support ZetaSQL DATETIME functions in BeamSQL

2020-06-10 Thread Robin Qiu (Jira)
Robin Qiu created BEAM-10240:


 Summary: Support ZetaSQL DATETIME functions in BeamSQL
 Key: BEAM-10240
 URL: https://issues.apache.org/jira/browse/BEAM-10240
 Project: Beam
  Issue Type: New Feature
  Components: dsl-sql-zetasql
Reporter: Robin Qiu
Assignee: Zijie Song


Support *all* DATETIME functions defined in ZetaSQL (BigQuery Standard SQL). 
See the list of functions below:

[https://github.com/google/zetasql/blob/master/docs/datetime_functions.md|https://github.com/google/zetasql/blob/master/docs/time_functions.md]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-10239) Support ZetaSQL NUMERIC type in BeamSQL

2020-06-10 Thread Robin Qiu (Jira)
Robin Qiu created BEAM-10239:


 Summary: Support ZetaSQL NUMERIC type in BeamSQL
 Key: BEAM-10239
 URL: https://issues.apache.org/jira/browse/BEAM-10239
 Project: Beam
  Issue Type: New Feature
  Components: dsl-sql-zetasql
Reporter: Robin Qiu
Assignee: Zijie Song


Support NUMERIC type defined in ZetaSQL (BigQuery Standard SQL). See the list 
of functions below:

[https://github.com/google/zetasql/blob/master/docs/data-types.md#numeric-type]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-10238) Improve Beam ZetaSQL type system

2020-06-10 Thread Robin Qiu (Jira)
Robin Qiu created BEAM-10238:


 Summary: Improve Beam ZetaSQL type system
 Key: BEAM-10238
 URL: https://issues.apache.org/jira/browse/BEAM-10238
 Project: Beam
  Issue Type: New Feature
  Components: dsl-sql-zetasql
Reporter: Robin Qiu
Assignee: Robin Qiu


This is an umbrella issue for several improvements we plan to make to the Beam 
ZetaSQL type system.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-10238) Improve Beam ZetaSQL type system

2020-06-10 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10238?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-10238:
---
Status: Open  (was: Triage Needed)

> Improve Beam ZetaSQL type system
> 
>
> Key: BEAM-10238
> URL: https://issues.apache.org/jira/browse/BEAM-10238
> Project: Beam
>  Issue Type: New Feature
>  Components: dsl-sql-zetasql
>Reporter: Robin Qiu
>Assignee: Robin Qiu
>Priority: P2
>
> This is an umbrella issue for several improvements we plan to make to the 
> Beam ZetaSQL type system.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (BEAM-6764) OutOfMemory Exception in Dataflow Spanner Write Mutations

2020-06-10 Thread Mai Nakagawa (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-6764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132934#comment-17132934
 ] 

Mai Nakagawa edited comment on BEAM-6764 at 6/11/20, 5:56 AM:
--

[~jleoirab] - Are you using [Streaming Engine|#dataflow-streaming-engine]? We 
faced the same situation while using it. We could work this around by not using 
Streaming Engine.


was (Author: nakagawa@gmail.com):
[~jleoirab] - Are you using [Streaming Engine|#dataflow-streaming-engine]]? We 
faced the same situation while using it. We could work this around by not using 
Streaming Engine.

> OutOfMemory Exception in Dataflow Spanner Write Mutations
> -
>
> Key: BEAM-6764
> URL: https://issues.apache.org/jira/browse/BEAM-6764
> Project: Beam
>  Issue Type: Bug
>  Components: io-java-gcp
>Affects Versions: 2.9.0, 2.10.0, 2.11.0
>Reporter: Joshua
>Priority: P2
>  Labels: stale-P2
>
> Since I upgraded my apache beam sdk version to >= 2.9.0, I have been noticing 
> OOM exceptions while using the dataflow runner to write mutations to spanner. 
> I have been using n1-standard-4 since version 2.9.0. On that version, it 
> works. But on higher versions, I get the exception.
>  
> The stackdriver logs is provided below
> {code:java}
> java.lang.OutOfMemoryError: Java heap space
> java.util.ArrayList.(ArrayList.java:152)
> org.apache.beam.sdk.io.gcp.spanner.SpannerIO$GatherBundleAndSortFn.initSorter(SpannerIO.java:1056)
> org.apache.beam.sdk.io.gcp.spanner.SpannerIO$GatherBundleAndSortFn.startBundle(SpannerIO.java:1049)
> {code}
> I have a very basic PTransform for writing to Spanner:
> {code:java}
> SpannerIO.write().withInstanceId(options.getSpannerInstanceId()) 
> .withDatabaseId(options.getSpannerDatabaseId()) 
> .withProjectId(options.getProject()) 
> .withFailureMode(SpannerIO.FailureMode.REPORT_FAILURES);
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (BEAM-6764) OutOfMemory Exception in Dataflow Spanner Write Mutations

2020-06-10 Thread Mai Nakagawa (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-6764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132934#comment-17132934
 ] 

Mai Nakagawa edited comment on BEAM-6764 at 6/11/20, 5:56 AM:
--

[~jleoirab] - Are you using [Streaming Engine|#dataflow-streaming-engine]]? We 
faced the same situation while using it. We could work this around by not using 
Streaming Engine.


was (Author: nakagawa@gmail.com):
[~jleoirab] - Are you using [Streaming 
Engine|[https://cloud.google.com/dataflow/docs/guides/deploying-a-pipeline#dataflow-streaming-engine]|https://cloud.google.com/dataflow/docs/guides/deploying-a-pipeline#dataflow-streaming-engine]?]?
 We faced the same situation while using it. We could work this around by not 
using Streaming Engine.

> OutOfMemory Exception in Dataflow Spanner Write Mutations
> -
>
> Key: BEAM-6764
> URL: https://issues.apache.org/jira/browse/BEAM-6764
> Project: Beam
>  Issue Type: Bug
>  Components: io-java-gcp
>Affects Versions: 2.9.0, 2.10.0, 2.11.0
>Reporter: Joshua
>Priority: P2
>  Labels: stale-P2
>
> Since I upgraded my apache beam sdk version to >= 2.9.0, I have been noticing 
> OOM exceptions while using the dataflow runner to write mutations to spanner. 
> I have been using n1-standard-4 since version 2.9.0. On that version, it 
> works. But on higher versions, I get the exception.
>  
> The stackdriver logs is provided below
> {code:java}
> java.lang.OutOfMemoryError: Java heap space
> java.util.ArrayList.(ArrayList.java:152)
> org.apache.beam.sdk.io.gcp.spanner.SpannerIO$GatherBundleAndSortFn.initSorter(SpannerIO.java:1056)
> org.apache.beam.sdk.io.gcp.spanner.SpannerIO$GatherBundleAndSortFn.startBundle(SpannerIO.java:1049)
> {code}
> I have a very basic PTransform for writing to Spanner:
> {code:java}
> SpannerIO.write().withInstanceId(options.getSpannerInstanceId()) 
> .withDatabaseId(options.getSpannerDatabaseId()) 
> .withProjectId(options.getProject()) 
> .withFailureMode(SpannerIO.FailureMode.REPORT_FAILURES);
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-6764) OutOfMemory Exception in Dataflow Spanner Write Mutations

2020-06-10 Thread Mai Nakagawa (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-6764?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132934#comment-17132934
 ] 

Mai Nakagawa commented on BEAM-6764:


[~jleoirab] - Are you using [Streaming 
Engine|[https://cloud.google.com/dataflow/docs/guides/deploying-a-pipeline#dataflow-streaming-engine]|https://cloud.google.com/dataflow/docs/guides/deploying-a-pipeline#dataflow-streaming-engine]?]?
 We faced the same situation while using it. We could work this around by not 
using Streaming Engine.

> OutOfMemory Exception in Dataflow Spanner Write Mutations
> -
>
> Key: BEAM-6764
> URL: https://issues.apache.org/jira/browse/BEAM-6764
> Project: Beam
>  Issue Type: Bug
>  Components: io-java-gcp
>Affects Versions: 2.9.0, 2.10.0, 2.11.0
>Reporter: Joshua
>Priority: P2
>  Labels: stale-P2
>
> Since I upgraded my apache beam sdk version to >= 2.9.0, I have been noticing 
> OOM exceptions while using the dataflow runner to write mutations to spanner. 
> I have been using n1-standard-4 since version 2.9.0. On that version, it 
> works. But on higher versions, I get the exception.
>  
> The stackdriver logs is provided below
> {code:java}
> java.lang.OutOfMemoryError: Java heap space
> java.util.ArrayList.(ArrayList.java:152)
> org.apache.beam.sdk.io.gcp.spanner.SpannerIO$GatherBundleAndSortFn.initSorter(SpannerIO.java:1056)
> org.apache.beam.sdk.io.gcp.spanner.SpannerIO$GatherBundleAndSortFn.startBundle(SpannerIO.java:1049)
> {code}
> I have a very basic PTransform for writing to Spanner:
> {code:java}
> SpannerIO.write().withInstanceId(options.getSpannerInstanceId()) 
> .withDatabaseId(options.getSpannerDatabaseId()) 
> .withProjectId(options.getProject()) 
> .withFailureMode(SpannerIO.FailureMode.REPORT_FAILURES);
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9179) Refactor Beam ZetaSQL type translation code

2020-06-10 Thread Robin Qiu (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9179?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robin Qiu updated BEAM-9179:

Summary: Refactor Beam ZetaSQL type translation code  (was: Simplify types 
in Beam ZetaSQL)

> Refactor Beam ZetaSQL type translation code
> ---
>
> Key: BEAM-9179
> URL: https://issues.apache.org/jira/browse/BEAM-9179
> Project: Beam
>  Issue Type: Improvement
>  Components: dsl-sql-zetasql
>Reporter: Robin Qiu
>Assignee: Robin Qiu
>Priority: P2
>  Labels: stale-assigned
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> Currently, how types work in Beam ZetaSQL is very confusing: code that does 
> type conversion between Calcite/Beam/ZetaSQL type systems are everywhere and 
> is likely to be not consistent. In addition, the recent migration to 
> BeamZetaSqlCalcRel makes the situation even worse: it adds another layer for 
> type conversion and makes a lot of code (for making ZetaSQL functions 
> manually) obsolete. This makes debugging and supporting new features very 
> hard. A few examples:
>  * Support new ZetaSQL types (e.g. DATE/TIME/DATETIME)
>  * Fix the long-standing precision & timezone issue of TIMESTMAP type
>  * Support BeamSQL query parameters
> Therefore, I propose we work on the following items to make the story of Beam 
> ZetaSQL type clearer:
>  * Define a clear mapping between Calcite/Beam/ZetaSQL types and values
>  * Refactor the codebase to move all type conversion code to a single source 
> of truth
>  * Add thorough testing for it
>  * Remove obsolete code that is added to make functions work before 
> BeamZetaSqlCalcRel



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9951) Create Go SDK synthetic sources.

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9951?focusedWorklogId=444123&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444123
 ]

ASF GitHub Bot logged work on BEAM-9951:


Author: ASF GitHub Bot
Created on: 11/Jun/20 05:20
Start Date: 11/Jun/20 05:20
Worklog Time Spent: 10m 
  Work Description: lostluck commented on pull request #11870:
URL: https://github.com/apache/beam/pull/11870#issuecomment-642414004


   Run Go PostCommit



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444123)
Time Spent: 4h 50m  (was: 4h 40m)

> Create Go SDK synthetic sources.
> 
>
> Key: BEAM-9951
> URL: https://issues.apache.org/jira/browse/BEAM-9951
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
>  Time Spent: 4h 50m
>  Remaining Estimate: 0h
>
> Create synthetic sources for the Go SDK like 
> [Java|https://github.com/apache/beam/tree/master/sdks/java/io/synthetic/src/main/java/org/apache/beam/sdk/io/synthetic]
>  and 
> [Python|https://github.com/apache/beam/blob/master/sdks/python/apache_beam/testing/synthetic_pipeline.py]
>  have.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10169) ParDo* functions should declare the correct output N in their error message

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10169?focusedWorklogId=444119&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444119
 ]

ASF GitHub Bot logged work on BEAM-10169:
-

Author: ASF GitHub Bot
Created on: 11/Jun/20 05:06
Start Date: 11/Jun/20 05:06
Worklog Time Spent: 10m 
  Work Description: lostluck commented on a change in pull request #11976:
URL: https://github.com/apache/beam/pull/11976#discussion_r438475207



##
File path: sdks/go/pkg/beam/core/util/reflectx/functions_test.go
##
@@ -41,3 +45,19 @@ func TestLoadFunction(t *testing.T) {
t.Errorf("got %d, wanted %d", out[0].Int(), testFunction())
}
 }
+
+func TestFunctionOutputSize(t *testing.T) {
+   expected := 1
+   received := FunctionOutputSize(testFunction)
+   if received != expected {
+   t.Errorf("got %d, wanted %d", received, expected)
+   }
+}
+
+func TestFunction2OutputSize(t *testing.T) {

Review comment:
   Conventionally, if there are multiple cases for a given test, they 
should be combined into the same function, and run as a loop.
   
   See https://gobyexample.com/testing for an excellent example.
   
   Otherwise, this function should be named TestFunctionOutputSize_2

##
File path: sdks/go/pkg/beam/pardo.go
##
@@ -414,7 +414,45 @@ func ParDo6(s Scope, dofn interface{}, col PCollection, 
opts ...Option) (PCollec
 func ParDo7(s Scope, dofn interface{}, col PCollection, opts ...Option) 
(PCollection, PCollection, PCollection, PCollection, PCollection, PCollection, 
PCollection) {
ret := MustN(TryParDo(s, dofn, col, opts...))
if len(ret) != 7 {
-   panic(fmt.Sprintf("expected 7 output. Found: %v", ret))
+   panic(ParDoErrorFormatter(dofn, ParDo7))
}
return ret[0], ret[1], ret[2], ret[3], ret[4], ret[5], ret[6]
 }
+
+//ParDoErrorFormatter is a helper function to provide a more concise error
+// message to the users when a DoFn and its ParDo pairing is incorrect.
+func ParDoErrorFormatter(doFn interface{}, parDo interface{}) string {
+   doFnName := reflectx.FunctionName(doFn)
+   doFnOutSize := reflectx.FunctionOutputSize(doFn)
+
+   parDoName := reflectx.FunctionName(parDo)
+   parDoOutSize := reflectx.FunctionOutputSize(parDo)
+
+   useParDo := reflectx.FunctionName(RecommendParDo(doFnOutSize))
+   return fmt.Sprintf("DoFn %v has %v outptus, but %v requires %v outputs, 
Use %v instead.", doFnName, doFnOutSize, parDoName, parDoOutSize, useParDo)
+
+}
+
+// recommendParDo takes a in a DoFns emit dimension and recommends the correct
+// ParDo to use.
+func RecommendParDo(emitDim int) interface{} {
+   switch {
+   case emitDim == 0:
+   return ParDo0
+   case emitDim == 1:
+   return ParDo
+   case emitDim == 2:
+   return ParDo2
+   case emitDim == 3:
+   return ParDo3
+   case emitDim == 4:
+   return ParDo4
+   case emitDim == 5:
+   return ParDo5
+   case emitDim == 6:
+   return ParDo6
+   case emitDim == 7:
+   return ParDo7
+   }
+   return ParDoN
+}

Review comment:
   What I'd recommend here instead though is to return a string instead of 
the function.
   ```
   switch emitDim {
case 0,2,3,4,5,6,7:
 return fmt.Sprintf("ParDo%d", emitDim)
   case 1:
 return "ParDo"  
   default:
 return "ParDoN"
   }
   ```
   
   Easier to read, and see if it's correct. Read [Effective Go for more about 
Switches](https://golang.org/doc/effective_go.html#switch)

##
File path: sdks/go/pkg/beam/pardo.go
##
@@ -414,7 +414,45 @@ func ParDo6(s Scope, dofn interface{}, col PCollection, 
opts ...Option) (PCollec
 func ParDo7(s Scope, dofn interface{}, col PCollection, opts ...Option) 
(PCollection, PCollection, PCollection, PCollection, PCollection, PCollection, 
PCollection) {
ret := MustN(TryParDo(s, dofn, col, opts...))
if len(ret) != 7 {
-   panic(fmt.Sprintf("expected 7 output. Found: %v", ret))
+   panic(ParDoErrorFormatter(dofn, ParDo7))
}
return ret[0], ret[1], ret[2], ret[3], ret[4], ret[5], ret[6]
 }
+
+//ParDoErrorFormatter is a helper function to provide a more concise error
+// message to the users when a DoFn and its ParDo pairing is incorrect.
+func ParDoErrorFormatter(doFn interface{}, parDo interface{}) string {
+   doFnName := reflectx.FunctionName(doFn)
+   doFnOutSize := reflectx.FunctionOutputSize(doFn)
+
+   parDoName := reflectx.FunctionName(parDo)
+   parDoOutSize := reflectx.FunctionOutputSize(parDo)
+
+   useParDo := reflectx.FunctionName(RecommendParDo(doFnOutSize))
+   return fmt.Sprintf("DoFn %v has %v outptus, but %v requires %v outputs, 
Use %v instead.", doFnName, doFnO

[jira] [Commented] (BEAM-9466) SnowflakeIO in python

2020-06-10 Thread Shashanka Balakuntala Srinivasa (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9466?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132903#comment-17132903
 ] 

Shashanka Balakuntala Srinivasa commented on BEAM-9466:
---

Hi [~kenn], there was a discussion in dev-mailer list which can be found here 
[1] where [~kkucharczyk] mentioned that they were working on a cross platform 
connector. Hence I parked on this. But if this still can make sense...please 
let me know i will work on this and complete it at my earliest.

 

[1] - 
[https://lists.apache.org/thread.html/r4759f0aa5ba830e239589c3dd2724f1f9b145c31fbdd6b0e41719a6b%40%3Cdev.beam.apache.org%3E]

> SnowflakeIO in python
> -
>
> Key: BEAM-9466
> URL: https://issues.apache.org/jira/browse/BEAM-9466
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Shashanka Balakuntala Srinivasa
>Priority: P3
>
> As Elias Djurfeldt mentioned, there is a Snowflake Python library available 
> [1], using this to create an SnowflakeIO for python
> [1] [https://docs.snowflake.net/manuals/user-guide/python-connector.html]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-10237) Clean up Experimental usage and come up with a graduation policy

2020-06-10 Thread Kenneth Knowles (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-10237?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132902#comment-17132902
 ] 

Kenneth Knowles commented on BEAM-10237:


I think [~iemejia] expressed interest, too. (apologies if I am misremembering)

> Clean up Experimental usage and come up with a graduation policy
> 
>
> Key: BEAM-10237
> URL: https://issues.apache.org/jira/browse/BEAM-10237
> Project: Beam
>  Issue Type: Bug
>  Components: beam-model
>Reporter: Ahmet Altay
>Priority: P2
>
> Beam, APIs use experimental features quite a lot. Experimental features are 
> growing, but at some point we should consider either removing or graduating 
> them.
> This is a proposal to:
> - Track: All experimental features will be continually tracked in the change 
> log explicitly.
> - For features staying in experimental mode longer than 12 months,
> --> Graduate: we should review whether the feature meets the usage bar to be 
> long term maintained, 
> -- > Deprecate: or should be deprecated to reduce maintenance load.
> It would be good to add a github ci action to check new usage of experimental 
> tags are added to the change log.
> /cc [~kenn] [~lcwik] [~robertwb]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Closed] (BEAM-3194) Support annotating that a DoFn requires stable / deterministic input for replay/retry

2020-06-10 Thread Robin Qiu (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-3194?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Robin Qiu closed BEAM-3194.
---
Fix Version/s: 2.21.0
   Resolution: Fixed

> Support annotating that a DoFn requires stable / deterministic input for 
> replay/retry
> -
>
> Key: BEAM-3194
> URL: https://issues.apache.org/jira/browse/BEAM-3194
> Project: Beam
>  Issue Type: New Feature
>  Components: beam-model, sdk-java-core
>Reporter: Kenneth Knowles
>Priority: P2
>  Labels: stale-P2
> Fix For: 2.21.0
>
>  Time Spent: 3h 10m
>  Remaining Estimate: 0h
>
> See the thread: 
> https://lists.apache.org/thread.html/5fd81ce371aeaf642665348f8e6940e308e04275dd7072f380f9f945@%3Cdev.beam.apache.org%3E
> We need this in order to have truly cross-runner end-to-end exactly once via 
> replay + idempotence.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-3194) Support annotating that a DoFn requires stable / deterministic input for replay/retry

2020-06-10 Thread Robin Qiu (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-3194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132892#comment-17132892
 ] 

Robin Qiu commented on BEAM-3194:
-

[~kenn] Yes the annotation already exists. I will close this JIRA. Feel free to 
re-open if you disagree.

> Support annotating that a DoFn requires stable / deterministic input for 
> replay/retry
> -
>
> Key: BEAM-3194
> URL: https://issues.apache.org/jira/browse/BEAM-3194
> Project: Beam
>  Issue Type: New Feature
>  Components: beam-model, sdk-java-core
>Reporter: Kenneth Knowles
>Priority: P2
>  Labels: stale-P2
>  Time Spent: 3h 10m
>  Remaining Estimate: 0h
>
> See the thread: 
> https://lists.apache.org/thread.html/5fd81ce371aeaf642665348f8e6940e308e04275dd7072f380f9f945@%3Cdev.beam.apache.org%3E
> We need this in order to have truly cross-runner end-to-end exactly once via 
> replay + idempotence.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-1261) State API should allow state to be managed in different windows

2020-06-10 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-1261?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-1261:
--
Issue Type: Wish  (was: New Feature)

> State API should allow state to be managed in different windows
> ---
>
> Key: BEAM-1261
> URL: https://issues.apache.org/jira/browse/BEAM-1261
> Project: Beam
>  Issue Type: Wish
>  Components: beam-model
>Reporter: Ben Chambers
>Priority: P2
>  Labels: stale-P2
>
> For example, even if the elements are being processed in fixed windows of an 
> hour, it may be desirable for the state to "roll over" between windows (or be 
> available to all windows).
> It will also be necessary to figure out when this state should be deleted 
> (TTL? maximum retention?)
> Another problem is how to deal with out of order data. If data comes in from 
> the 10:00 AM window, should its state changes be visible to the data in the 
> 9:00 AM window? 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-2182) AvroCoder is quite overconfident: it claims to be able to create a schema for things it certainly cannot

2020-06-10 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-2182?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-2182:
--
Summary: AvroCoder is quite overconfident: it claims to be able to create a 
schema for things it certainly cannot  (was: AvroCoder is quite overconfident)

> AvroCoder is quite overconfident: it claims to be able to create a schema for 
> things it certainly cannot
> 
>
> Key: BEAM-2182
> URL: https://issues.apache.org/jira/browse/BEAM-2182
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Reporter: Kenneth Knowles
>Priority: P2
>  Labels: stale-P2
>
> AvoCoder claims it can create a schema for:
> {code}
> public static class X {
>   public Thread t = Thread.currentThread();
> }
> {code}
> The issue goes all the way to Avro, we think. To be investigated. Meanwhile, 
> we need AvroCoder to say no to such things.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10201) Enhance JsonToRow to add Deadletter Support

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10201?focusedWorklogId=444088&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444088
 ]

ASF GitHub Bot logged work on BEAM-10201:
-

Author: ASF GitHub Bot
Created on: 11/Jun/20 02:30
Start Date: 11/Jun/20 02:30
Worklog Time Spent: 10m 
  Work Description: rezarokni commented on pull request #11929:
URL: https://github.com/apache/beam/pull/11929#issuecomment-642368157


   @reuvenlax @pabloem 
   The latest commit covers most of the points now, with the exception of the 
three open questions I raised.
   - Return of the Schema that failed. @pabloem 
   - Discussion about withExtendedErrMsg @reuvenlax 
   - use of transient with ObjectMapper @reuvenlax 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444088)
Time Spent: 3h  (was: 2h 50m)

> Enhance JsonToRow to add Deadletter Support
> ---
>
> Key: BEAM-10201
> URL: https://issues.apache.org/jira/browse/BEAM-10201
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>Reporter: Reza ardeshir rokni
>Assignee: Reza ardeshir rokni
>Priority: P2
>  Time Spent: 3h
>  Remaining Estimate: 0h
>
> Current JsonToRow transform does not support Dead Letter pattern on parse 
> failures.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10201) Enhance JsonToRow to add Deadletter Support

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10201?focusedWorklogId=444087&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444087
 ]

ASF GitHub Bot logged work on BEAM-10201:
-

Author: ASF GitHub Bot
Created on: 11/Jun/20 02:27
Start Date: 11/Jun/20 02:27
Worklog Time Spent: 10m 
  Work Description: rezarokni commented on a change in pull request #11929:
URL: https://github.com/apache/beam/pull/11929#discussion_r438508022



##
File path: 
sdks/java/core/src/main/java/org/apache/beam/sdk/transforms/JsonToRow.java
##
@@ -116,4 +131,267 @@ private ObjectMapper objectMapper() {
   return this.objectMapper;
 }
   }
+
+  /**
+   * Enable Dead letter support. If this value is set errors in the parsing 
layer are returned as
+   * Row objects within a {@link ParseResult}
+   *
+   * You can access the results by using:
+   *
+   * ParseResult results = 
jsonPersons.apply(JsonToRow.withDeadLetter(PERSON_SCHEMA));
+   *
+   * {@link ParseResult#getResults()}
+   *
+   * {@Code PCollection personRows = results.getResults()}
+   *
+   * {@link ParseResult#getFailedToParseLines()}
+   *
+   * {@Code PCollection errorsLines = results.getFailedToParseLines()}
+   *
+   * To access the reason for the failure you will need to first enable 
extended error reporting.
+   * {@Code ParseResult results =
+   * 
jsonPersons.apply(JsonToRow.withDeadLetter(PERSON_SCHEMA).withExtendedErrorInfo());
 }
+   *
+   * {@link ParseResult#getFailedToParseLinesWithErr()}
+   *
+   * {@Code PCollection errorsLinesWithErrMsg = 
results.getFailedToParseLines()}
+   *
+   * @return {@link JsonToRowWithErrFn}
+   */
+  @Experimental(Kind.SCHEMAS)
+  public static JsonToRowWithErrFn withDeadLetter(Schema rowSchema) {
+return JsonToRowWithErrFn.forSchema(rowSchema);
+  }
+
+  @AutoValue
+  abstract static class JsonToRowWithErrFn extends 
PTransform, ParseResult> {
+
+private Pipeline pipeline;
+
+private PCollection parsedLine;
+private PCollection failedParse;
+private PCollection failedParseWithErr;
+
+private static final String LINE_FIELD_NAME = "line";
+private static final String ERROR_FIELD_NAME = "err";
+
+public static final Schema ERROR_ROW_SCHEMA =
+Schema.of(Field.of(LINE_FIELD_NAME, FieldType.STRING));
+
+public static final Schema ERROR_ROW_WITH_ERR_MSG_SCHEMA =
+Schema.of(
+Field.of(LINE_FIELD_NAME, FieldType.STRING),
+Field.of(ERROR_FIELD_NAME, FieldType.STRING));
+
+static final TupleTag PARSED_LINE = new TupleTag() {};
+static final TupleTag PARSE_ERROR_LINE = new TupleTag() {};
+static final TupleTag PARSE_ERROR_LINE_WITH_MSG = new TupleTag() 
{};
+
+public abstract Schema getSchema();
+
+public abstract String getLineFieldName();
+
+public abstract String getErrorFieldName();
+
+public abstract boolean getExtendedErrorInfo();
+
+PCollection deadLetterCollection;
+
+public abstract Builder toBuilder();
+
+@AutoValue.Builder
+public abstract static class Builder {
+  public abstract Builder setSchema(Schema value);
+
+  public abstract Builder setLineFieldName(String value);
+
+  public abstract Builder setErrorFieldName(String value);
+
+  public abstract Builder setExtendedErrorInfo(boolean value);
+
+  public abstract JsonToRowWithErrFn build();
+}
+
+public static JsonToRowWithErrFn forSchema(Schema rowSchema) {
+  // Throw exception if this schema is not supported by RowJson
+  RowJson.verifySchemaSupported(rowSchema);
+  return new AutoValue_JsonToRow_JsonToRowWithErrFn.Builder()
+  .setSchema(rowSchema)
+  .setExtendedErrorInfo(false)
+  .setLineFieldName(LINE_FIELD_NAME)
+  .setErrorFieldName(ERROR_FIELD_NAME)
+  .build();
+}
+
+/**
+ * Adds the error message to the returned error Row.
+ *
+ * @return {@link JsonToRow}
+ */
+public JsonToRowWithErrFn withExtendedErrorInfo() {
+  return this.toBuilder().setExtendedErrorInfo(true).build();
+}
+
+/**
+ * Sets the field name for the line field in the returned Row.
+ *
+ * @return {@link JsonToRow}
+ */
+public JsonToRowWithErrFn setLineField(String lineField) {
+  return this.toBuilder().setLineFieldName(lineField).build();
+}
+
+/**
+ * Adds the error message to the returned error Row.
+ *
+ * @return {@link JsonToRow}
+ */
+public JsonToRowWithErrFn setErrorField(String errorField) {
+  if (!this.getExtendedErrorInfo()) {
+throw new IllegalArgumentException(
+"This option is only available with Extended Error Info.");
+  }
+  return this.toBuilder().setErrorFieldName(errorField).build();
+}
+
+@Override
+public ParseResult expand(PCollection jsonStrings) {
+
+  PCollectionTuple result =
+ 

[jira] [Work logged] (BEAM-10201) Enhance JsonToRow to add Deadletter Support

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10201?focusedWorklogId=444086&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444086
 ]

ASF GitHub Bot logged work on BEAM-10201:
-

Author: ASF GitHub Bot
Created on: 11/Jun/20 02:26
Start Date: 11/Jun/20 02:26
Worklog Time Spent: 10m 
  Work Description: rezarokni commented on pull request #11929:
URL: https://github.com/apache/beam/pull/11929#issuecomment-642367042


   @TheNeuralBit If you have time, would be great if you could scan the 
ParseResult and see if the API would suite all the things PubSubIO needs.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444086)
Time Spent: 2h 40m  (was: 2.5h)

> Enhance JsonToRow to add Deadletter Support
> ---
>
> Key: BEAM-10201
> URL: https://issues.apache.org/jira/browse/BEAM-10201
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>Reporter: Reza ardeshir rokni
>Assignee: Reza ardeshir rokni
>Priority: P2
>  Time Spent: 2h 40m
>  Remaining Estimate: 0h
>
> Current JsonToRow transform does not support Dead Letter pattern on parse 
> failures.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-10237) Clean up Experimental usage and come up with a graduation policy

2020-06-10 Thread Ahmet Altay (Jira)
Ahmet Altay created BEAM-10237:
--

 Summary: Clean up Experimental usage and come up with a graduation 
policy
 Key: BEAM-10237
 URL: https://issues.apache.org/jira/browse/BEAM-10237
 Project: Beam
  Issue Type: Bug
  Components: beam-model
Reporter: Ahmet Altay


Beam, APIs use experimental features quite a lot. Experimental features are 
growing, but at some point we should consider either removing or graduating 
them.

This is a proposal to:

- Track: All experimental features will be continually tracked in the change 
log explicitly.
- For features staying in experimental mode longer than 12 months,
--> Graduate: we should review whether the feature meets the usage bar to be 
long term maintained, 
-- > Deprecate: or should be deprecated to reduce maintenance load.

It would be good to add a github ci action to check new usage of experimental 
tags are added to the change log.

/cc [~kenn] [~lcwik] [~robertwb]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10232) pip check failure: rsa 4.1

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10232?focusedWorklogId=444082&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444082
 ]

ASF GitHub Bot logged work on BEAM-10232:
-

Author: ASF GitHub Bot
Created on: 11/Jun/20 02:04
Start Date: 11/Jun/20 02:04
Worklog Time Spent: 10m 
  Work Description: ibzib commented on pull request #11987:
URL: https://github.com/apache/beam/pull/11987#issuecomment-642360956


   R: @aaltay @udim 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444082)
Time Spent: 20m  (was: 10m)

> pip check failure: rsa 4.1
> --
>
> Key: BEAM-10232
> URL: https://issues.apache.org/jira/browse/BEAM-10232
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-core
>Reporter: Kyle Weaver
>Assignee: Kyle Weaver
>Priority: P2
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Execution failed for task ':sdks:python:test-suites:tox:py37:mypyPy37'.
> google-auth 1.16.1 has requirement rsa<4.1,>=3.1.4, but you have rsa 4.1.
> ERROR: InvocationError for command 
> /home/jenkins/jenkins-slave/workspace/beam_PreCommit_PythonLint_Commit/src/sdks/python/test-suites/tox/py37/build/srcs/sdks/python/target/.tox-py37-mypy/py37-mypy/bin/pip
>  check (exited with code 1)



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9798) Go SplittableDoFn performance testing and optimizations

2020-06-10 Thread Daniel Oliveira (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9798?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132836#comment-17132836
 ] 

Daniel Oliveira commented on BEAM-9798:
---

Still something I have to look into.

> Go SplittableDoFn performance testing and optimizations
> ---
>
> Key: BEAM-9798
> URL: https://issues.apache.org/jira/browse/BEAM-9798
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
>
> Add SDF benchmarks and implement performance optimizations to speed up SDFs. 
> Pretty straightforward.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-9798) Go SplittableDoFn performance testing and optimizations

2020-06-10 Thread Daniel Oliveira (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9798?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Oliveira reassigned BEAM-9798:
-

Assignee: Daniel Oliveira

> Go SplittableDoFn performance testing and optimizations
> ---
>
> Key: BEAM-9798
> URL: https://issues.apache.org/jira/browse/BEAM-9798
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
>
> Add SDF benchmarks and implement performance optimizations to speed up SDFs. 
> Pretty straightforward.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-10236) beam_PreCommit_PythonDocker_Commit failing

2020-06-10 Thread Kyle Weaver (Jira)
Kyle Weaver created BEAM-10236:
--

 Summary: beam_PreCommit_PythonDocker_Commit failing
 Key: BEAM-10236
 URL: https://issues.apache.org/jira/browse/BEAM-10236
 Project: Beam
  Issue Type: Bug
  Components: test-failures
Reporter: Kyle Weaver


Task :sdks:python:container:py35:docker FAILED

RuntimeError: Could not retrieve licences for packages [['3', 'a', 'h', 'p', 
's', 'y']] in Python3.5 environment.

Package names are scrambled due to BEAM-10234, but I'm assuming it's pysha3.





--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9753) Use cmp in fullvalue_test.go

2020-06-10 Thread Daniel Oliveira (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9753?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132835#comment-17132835
 ] 

Daniel Oliveira commented on BEAM-9753:
---

Still on my backlog.

> Use cmp in fullvalue_test.go
> 
>
> Key: BEAM-9753
> URL: https://issues.apache.org/jira/browse/BEAM-9753
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P3
>
> We could probably update the comparison helpers in 
> [fullvalue_test.go|https://github.com/apache/beam/blob/master/sdks/go/pkg/beam/core/runtime/exec/fullvalue_test.go]
>  to use cmp options and Transformers instead which would make things much 
> clearer.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-9643) Add user-facing Go SDF documentation.

2020-06-10 Thread Daniel Oliveira (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Oliveira reassigned BEAM-9643:
-

Assignee: Daniel Oliveira

> Add user-facing Go SDF documentation.
> -
>
> Key: BEAM-9643
> URL: https://issues.apache.org/jira/browse/BEAM-9643
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> This means adding the documentation about how to use SDFs and the contracts 
> of all the SDF methods to the Go SDK code, as well as updating the Go SDF 
> design doc.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (BEAM-9643) Add user-facing Go SDF documentation.

2020-06-10 Thread Daniel Oliveira (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9643?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Oliveira resolved BEAM-9643.
---
Fix Version/s: Not applicable
   Resolution: Fixed

> Add user-facing Go SDF documentation.
> -
>
> Key: BEAM-9643
> URL: https://issues.apache.org/jira/browse/BEAM-9643
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
> Fix For: Not applicable
>
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> This means adding the documentation about how to use SDFs and the contracts 
> of all the SDF methods to the Go SDK code, as well as updating the Go SDF 
> design doc.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-9753) Use cmp in fullvalue_test.go

2020-06-10 Thread Daniel Oliveira (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9753?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Oliveira reassigned BEAM-9753:
-

Assignee: Daniel Oliveira

> Use cmp in fullvalue_test.go
> 
>
> Key: BEAM-9753
> URL: https://issues.apache.org/jira/browse/BEAM-9753
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P3
>
> We could probably update the comparison helpers in 
> [fullvalue_test.go|https://github.com/apache/beam/blob/master/sdks/go/pkg/beam/core/runtime/exec/fullvalue_test.go]
>  to use cmp options and Transformers instead which would make things much 
> clearer.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-9642) Add SDF execution-time runners

2020-06-10 Thread Daniel Oliveira (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Oliveira reassigned BEAM-9642:
-

Assignee: Daniel Oliveira

> Add SDF execution-time runners
> --
>
> Key: BEAM-9642
> URL: https://issues.apache.org/jira/browse/BEAM-9642
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
>  Time Spent: 6h 10m
>  Remaining Estimate: 0h
>
> Adds execution-time SDF runner units to the exec package, and any unit tests 
> + helpers required.
> This is needed to get the expanded SDF URNs to execute in the runner harness.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work started] (BEAM-9642) Add SDF execution-time runners

2020-06-10 Thread Daniel Oliveira (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Work on BEAM-9642 started by Daniel Oliveira.
-
> Add SDF execution-time runners
> --
>
> Key: BEAM-9642
> URL: https://issues.apache.org/jira/browse/BEAM-9642
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
>  Time Spent: 6h 10m
>  Remaining Estimate: 0h
>
> Adds execution-time SDF runner units to the exec package, and any unit tests 
> + helpers required.
> This is needed to get the expanded SDF URNs to execute in the runner harness.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (BEAM-9642) Add SDF execution-time runners

2020-06-10 Thread Daniel Oliveira (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9642?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Oliveira resolved BEAM-9642.
---
Fix Version/s: Not applicable
   Resolution: Fixed

> Add SDF execution-time runners
> --
>
> Key: BEAM-9642
> URL: https://issues.apache.org/jira/browse/BEAM-9642
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
> Fix For: Not applicable
>
>  Time Spent: 6h 10m
>  Remaining Estimate: 0h
>
> Adds execution-time SDF runner units to the exec package, and any unit tests 
> + helpers required.
> This is needed to get the expanded SDF URNs to execute in the runner harness.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-9611) Add a test for Iter/ReIter parameters in ParDos (such as after a GBK).

2020-06-10 Thread Daniel Oliveira (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9611?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Oliveira reassigned BEAM-9611:
-

Assignee: Daniel Oliveira

> Add a test for Iter/ReIter parameters in ParDos (such as after a GBK).
> --
>
> Key: BEAM-9611
> URL: https://issues.apache.org/jira/browse/BEAM-9611
> Project: Beam
>  Issue Type: Test
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
>
> Some bugs popped up recently because DoFn validation caused an error with 
> ParDos that had Iter/ReIter elements. That could've been caught if a test 
> existed that checked the creation of ParDos with Iter/ReIters, so that should 
> be added, either as a new test or as part of some existing integration test.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9611) Add a test for Iter/ReIter parameters in ParDos (such as after a GBK).

2020-06-10 Thread Daniel Oliveira (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9611?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132834#comment-17132834
 ] 

Daniel Oliveira commented on BEAM-9611:
---

Still on my backlog.

> Add a test for Iter/ReIter parameters in ParDos (such as after a GBK).
> --
>
> Key: BEAM-9611
> URL: https://issues.apache.org/jira/browse/BEAM-9611
> Project: Beam
>  Issue Type: Test
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
>
> Some bugs popped up recently because DoFn validation caused an error with 
> ParDos that had Iter/ReIter elements. That could've been caught if a test 
> existed that checked the creation of ParDos with Iter/ReIters, so that should 
> be added, either as a new test or as part of some existing integration test.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-8571) Use SDK version in default Go container tag

2020-06-10 Thread Daniel Oliveira (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-8571?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132833#comment-17132833
 ] 

Daniel Oliveira commented on BEAM-8571:
---

Still something on my backlog.

> Use SDK version in default Go container tag
> ---
>
> Key: BEAM-8571
> URL: https://issues.apache.org/jira/browse/BEAM-8571
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-go
>Reporter: Kyle Weaver
>Assignee: Daniel Oliveira
>Priority: P2
>
> Currently, the Go SDK uses container `apachebeam/go_sdk:latest` by default 
> [1]. This causes confusion when using locally built containers [2], 
> especially since images are automatically pulled, meaning the release image 
> is used instead of the developer's own image (BEAM-8545).
> [1] 
> [https://github.com/apache/beam/blob/473377ef8f51949983508f70663e75ef0ee24a7f/sdks/go/pkg/beam/options/jobopts/options.go#L111]
> [[2] 
> https://lists.apache.org/thread.html/07131e314e229ec60100eaa2c0cf6dfc206bf2b0f78c3cee9ebb0bda@%3Cdev.beam.apache.org%3E|https://lists.apache.org/thread.html/07131e314e229ec60100eaa2c0cf6dfc206bf2b0f78c3cee9ebb0bda@%3Cdev.beam.apache.org%3E]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-8571) Use SDK version in default Go container tag

2020-06-10 Thread Daniel Oliveira (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8571?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Oliveira reassigned BEAM-8571:
-

Assignee: Daniel Oliveira

> Use SDK version in default Go container tag
> ---
>
> Key: BEAM-8571
> URL: https://issues.apache.org/jira/browse/BEAM-8571
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-go
>Reporter: Kyle Weaver
>Assignee: Daniel Oliveira
>Priority: P2
>
> Currently, the Go SDK uses container `apachebeam/go_sdk:latest` by default 
> [1]. This causes confusion when using locally built containers [2], 
> especially since images are automatically pulled, meaning the release image 
> is used instead of the developer's own image (BEAM-8545).
> [1] 
> [https://github.com/apache/beam/blob/473377ef8f51949983508f70663e75ef0ee24a7f/sdks/go/pkg/beam/options/jobopts/options.go#L111]
> [[2] 
> https://lists.apache.org/thread.html/07131e314e229ec60100eaa2c0cf6dfc206bf2b0f78c3cee9ebb0bda@%3Cdev.beam.apache.org%3E|https://lists.apache.org/thread.html/07131e314e229ec60100eaa2c0cf6dfc206bf2b0f78c3cee9ebb0bda@%3Cdev.beam.apache.org%3E]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-6113) Streamline beam.Init() requirement for Go pipelines.

2020-06-10 Thread Daniel Oliveira (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-6113?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132830#comment-17132830
 ] 

Daniel Oliveira commented on BEAM-6113:
---

Still something I want on my backlog.

> Streamline beam.Init() requirement for Go pipelines.
> 
>
> Key: BEAM-6113
> URL: https://issues.apache.org/jira/browse/BEAM-6113
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P3
>  Labels: documentation, usability
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> When writing a Go pipeline it's necessary to call beam.Init() at the 
> beginning of your code, but right now there is nearly no validation or 
> checking around that. This makes it very easy for a new user of Beam Go to 
> accidentally leave it out and not get a clear signal for why their pipeline 
> isn't being executed as expected. This issue is for tracking progress towards 
> streamlining this requirement to improve the user experience.
> In the short term this can be easily improved by improving documentation and 
> examples to clearly communicate the necessity of beam.Init(), and also add 
> log warnings to provide some signal if the user forgot to call it.
> In the long term, the best solution to this would be to try avoiding the 
> problem all together (for example by removing the need for beam.Init(), or 
> having it called implicitly without relying on users to call it), or to 
> create a stronger failure state, for example by having pipelines without 
> beam.Init() immediately break with a clear message notifying the user that it 
> was not called.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-6113) Streamline beam.Init() requirement for Go pipelines.

2020-06-10 Thread Daniel Oliveira (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-6113?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Oliveira reassigned BEAM-6113:
-

Assignee: Daniel Oliveira

> Streamline beam.Init() requirement for Go pipelines.
> 
>
> Key: BEAM-6113
> URL: https://issues.apache.org/jira/browse/BEAM-6113
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P3
>  Labels: documentation, usability
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> When writing a Go pipeline it's necessary to call beam.Init() at the 
> beginning of your code, but right now there is nearly no validation or 
> checking around that. This makes it very easy for a new user of Beam Go to 
> accidentally leave it out and not get a clear signal for why their pipeline 
> isn't being executed as expected. This issue is for tracking progress towards 
> streamlining this requirement to improve the user experience.
> In the short term this can be easily improved by improving documentation and 
> examples to clearly communicate the necessity of beam.Init(), and also add 
> log warnings to provide some signal if the user forgot to call it.
> In the long term, the best solution to this would be to try avoiding the 
> problem all together (for example by removing the need for beam.Init(), or 
> having it called implicitly without relying on users to call it), or to 
> create a stronger failure state, for example by having pipelines without 
> beam.Init() immediately break with a clear message notifying the user that it 
> was not called.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9882) Go SplittableDoFn testing

2020-06-10 Thread Daniel Oliveira (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9882?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132828#comment-17132828
 ] 

Daniel Oliveira commented on BEAM-9882:
---

Still working on this.

> Go SplittableDoFn testing
> -
>
> Key: BEAM-9882
> URL: https://issues.apache.org/jira/browse/BEAM-9882
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
>  Labels: stale-assigned
>
> This is a catch-all jira for all tasks needed to fully test SplittableDoFns 
> in Go. For progress on SplittableDoFns themselves, see BEAM-3301



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9883) Generalize SDF-validating restrictions

2020-06-10 Thread Daniel Oliveira (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9883?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132827#comment-17132827
 ] 

Daniel Oliveira commented on BEAM-9883:
---

This is still something I want to work on, in my backlog.

> Generalize SDF-validating restrictions
> --
>
> Key: BEAM-9883
> URL: https://issues.apache.org/jira/browse/BEAM-9883
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
>  Labels: stale-assigned
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> We have some restrictions written for the purpose of validating that SDFs 
> work in sdf_invokers_test.go, but they can be improved and generalized. The 
> main improvement is changing the validation approach so that the restriction 
> keeps track of each method it's had called on it. Then this can be 
> generalized so that it can be used in upcoming integration tests to validate 
> which methods are being called.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-2937) Fn API combiner support w/ lifting to PGBK

2020-06-10 Thread Daniel Oliveira (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-2937?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132826#comment-17132826
 ] 

Daniel Oliveira commented on BEAM-2937:
---

I'll see if I can wrap this up.

> Fn API combiner support w/ lifting to PGBK
> --
>
> Key: BEAM-2937
> URL: https://issues.apache.org/jira/browse/BEAM-2937
> Project: Beam
>  Issue Type: Improvement
>  Components: beam-model
>Reporter: Henning Rohde
>Assignee: Daniel Oliveira
>Priority: P2
>  Labels: portability, stale-assigned
>  Time Spent: 3h 50m
>  Remaining Estimate: 0h
>
> The FnAPI should support this optimization. Detailed design: 
> https://s.apache.org/beam-runner-api-combine-model
> Once design is ready, expand subtasks similarly to BEAM-2822.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-4678) Support portable combiner lifting in Java Flink Runner

2020-06-10 Thread Daniel Oliveira (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-4678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Oliveira reassigned BEAM-4678:
-

Assignee: Kyle Weaver  (was: Daniel Oliveira)

Kyle, any idea if this is already implemented? It seems likely but I wanna 
confirm before closing this.

> Support portable combiner lifting in Java Flink Runner
> --
>
> Key: BEAM-4678
> URL: https://issues.apache.org/jira/browse/BEAM-4678
> Project: Beam
>  Issue Type: Sub-task
>  Components: runner-flink
>Reporter: Daniel Oliveira
>Assignee: Kyle Weaver
>Priority: P2
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> Adjust Flink Runner to support portable combiner lifting as described in the 
> following doc:
> https://s.apache.org/beam-runner-api-combine-model



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10232) pip check failure: rsa 4.1

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10232?focusedWorklogId=444077&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444077
 ]

ASF GitHub Bot logged work on BEAM-10232:
-

Author: ASF GitHub Bot
Created on: 11/Jun/20 01:36
Start Date: 11/Jun/20 01:36
Worklog Time Spent: 10m 
  Work Description: ibzib opened a new pull request #11987:
URL: https://github.com/apache/beam/pull/11987


   **Please** add a meaningful description for your change here
   
   
   
   Thank you for your contribution! Follow this checklist to help us 
incorporate your contribution quickly and easily:
   
- [ ] [**Choose 
reviewer(s)**](https://beam.apache.org/contribute/#make-your-change) and 
mention them in a comment (`R: @username`).
- [ ] Format the pull request title like `[BEAM-XXX] Fixes bug in 
ApproximateQuantiles`, where you replace `BEAM-XXX` with the appropriate JIRA 
issue, if applicable. This will automatically link the pull request to the 
issue.
- [ ] Update `CHANGES.md` with noteworthy changes.
- [ ] If this contribution is large, please file an Apache [Individual 
Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).
   
   See the [Contributor Guide](https://beam.apache.org/contribute) for more 
tips on [how to make review process 
smoother](https://beam.apache.org/contribute/#make-reviewers-job-easier).
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Samza | Spark
   --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/)
   Python | [![Build 
Sta

[jira] [Assigned] (BEAM-4678) Support portable combiner lifting in Java Flink Runner

2020-06-10 Thread Daniel Oliveira (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-4678?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Daniel Oliveira reassigned BEAM-4678:
-

Assignee: Daniel Oliveira

> Support portable combiner lifting in Java Flink Runner
> --
>
> Key: BEAM-4678
> URL: https://issues.apache.org/jira/browse/BEAM-4678
> Project: Beam
>  Issue Type: Sub-task
>  Components: runner-flink
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> Adjust Flink Runner to support portable combiner lifting as described in the 
> following doc:
> https://s.apache.org/beam-runner-api-combine-model



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9951) Create Go SDK synthetic sources.

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9951?focusedWorklogId=444073&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444073
 ]

ASF GitHub Bot logged work on BEAM-9951:


Author: ASF GitHub Bot
Created on: 11/Jun/20 01:16
Start Date: 11/Jun/20 01:16
Worklog Time Spent: 10m 
  Work Description: youngoli commented on a change in pull request #11870:
URL: https://github.com/apache/beam/pull/11870#discussion_r438490190



##
File path: sdks/go/pkg/beam/testing/passert/count.go
##
@@ -0,0 +1,52 @@
+// Licensed to the Apache Software Foundation (ASF) under one or more
+// contributor license agreements.  See the NOTICE file distributed with
+// this work for additional information regarding copyright ownership.
+// The ASF licenses this file to You under the Apache License, Version 2.0
+// (the "License"); you may not use this file except in compliance with
+// the License.  You may obtain a copy of the License at
+//
+//http://www.apache.org/licenses/LICENSE-2.0
+//
+// Unless required by applicable law or agreed to in writing, software
+// distributed under the License is distributed on an "AS IS" BASIS,
+// WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+// See the License for the specific language governing permissions and
+// limitations under the License.
+
+package passert
+
+import (
+   "fmt"
+
+   "github.com/apache/beam/sdks/go/pkg/beam"
+   "github.com/apache/beam/sdks/go/pkg/beam/core/typex"
+)
+
+func Count(s beam.Scope, col beam.PCollection, name string, count int) {
+   s = s.Scope(fmt.Sprintf("passert.Count(%v)", name))
+
+   if typex.IsKV(col.Type()) {
+   col = beam.DropKey(s, col)
+   }
+   counted := beam.Combine(s, &elmCountCombineFn{}, col)
+   Equals(s, counted, count)

Review comment:
   I thought so, that's what I was referring to. The error message in 
passert.Sum (while better than passert.Equals) is still a bit unclear, just 
because it specifically mentions passert.Sum instead of passert.Count.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444073)
Time Spent: 4h 40m  (was: 4.5h)

> Create Go SDK synthetic sources.
> 
>
> Key: BEAM-9951
> URL: https://issues.apache.org/jira/browse/BEAM-9951
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
>  Time Spent: 4h 40m
>  Remaining Estimate: 0h
>
> Create synthetic sources for the Go SDK like 
> [Java|https://github.com/apache/beam/tree/master/sdks/java/io/synthetic/src/main/java/org/apache/beam/sdk/io/synthetic]
>  and 
> [Python|https://github.com/apache/beam/blob/master/sdks/python/apache_beam/testing/synthetic_pipeline.py]
>  have.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10235) Create stats transform that counts number of elements in a PCollection.

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10235?focusedWorklogId=444072&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444072
 ]

ASF GitHub Bot logged work on BEAM-10235:
-

Author: ASF GitHub Bot
Created on: 11/Jun/20 01:11
Start Date: 11/Jun/20 01:11
Worklog Time Spent: 10m 
  Work Description: youngoli commented on pull request #11986:
URL: https://github.com/apache/beam/pull/11986#issuecomment-642346309


   R: @lostluck 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444072)
Time Spent: 20m  (was: 10m)

> Create stats transform that counts number of elements in a PCollection.
> ---
>
> Key: BEAM-10235
> URL: https://issues.apache.org/jira/browse/BEAM-10235
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> In the process of implementing https://github.com/apache/beam/pull/11870 it 
> seemed useful to turn some of the functionality I was working on into a 
> general transform.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10235) Create stats transform that counts number of elements in a PCollection.

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10235?focusedWorklogId=444071&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444071
 ]

ASF GitHub Bot logged work on BEAM-10235:
-

Author: ASF GitHub Bot
Created on: 11/Jun/20 01:10
Start Date: 11/Jun/20 01:10
Worklog Time Spent: 10m 
  Work Description: youngoli opened a new pull request #11986:
URL: https://github.com/apache/beam/pull/11986


   This transform counts the number of elements in a PTransform.
   
   
   
   Thank you for your contribution! Follow this checklist to help us 
incorporate your contribution quickly and easily:
   
- [x] [**Choose 
reviewer(s)**](https://beam.apache.org/contribute/#make-your-change) and 
mention them in a comment (`R: @username`).
- [x] Format the pull request title like `[BEAM-XXX] Fixes bug in 
ApproximateQuantiles`, where you replace `BEAM-XXX` with the appropriate JIRA 
issue, if applicable. This will automatically link the pull request to the 
issue.
- [ ] Update `CHANGES.md` with noteworthy changes.
- [x] If this contribution is large, please file an Apache [Individual 
Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).
   
   See the [Contributor Guide](https://beam.apache.org/contribute) for more 
tips on [how to make review process 
smoother](https://beam.apache.org/contribute/#make-reviewers-job-easier).
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Samza | Spark
   --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/)
   Python | [![Build 

[jira] [Created] (BEAM-10235) Create stats transform that counts number of elements in a PCollection.

2020-06-10 Thread Daniel Oliveira (Jira)
Daniel Oliveira created BEAM-10235:
--

 Summary: Create stats transform that counts number of elements in 
a PCollection.
 Key: BEAM-10235
 URL: https://issues.apache.org/jira/browse/BEAM-10235
 Project: Beam
  Issue Type: New Feature
  Components: sdk-go
Reporter: Daniel Oliveira
Assignee: Daniel Oliveira


In the process of implementing https://github.com/apache/beam/pull/11870 it 
seemed useful to turn some of the functionality I was working on into a general 
transform.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-10235) Create stats transform that counts number of elements in a PCollection.

2020-06-10 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10235?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-10235:
---
Status: Open  (was: Triage Needed)

> Create stats transform that counts number of elements in a PCollection.
> ---
>
> Key: BEAM-10235
> URL: https://issues.apache.org/jira/browse/BEAM-10235
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-go
>Reporter: Daniel Oliveira
>Assignee: Daniel Oliveira
>Priority: P2
>
> In the process of implementing https://github.com/apache/beam/pull/11870 it 
> seemed useful to turn some of the functionality I was working on into a 
> general transform.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9852) C-ares status is not ARES_SUCCESS: Misformatted domain name

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9852?focusedWorklogId=444070&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444070
 ]

ASF GitHub Bot logged work on BEAM-9852:


Author: ASF GitHub Bot
Created on: 11/Jun/20 00:47
Start Date: 11/Jun/20 00:47
Worklog Time Spent: 10m 
  Work Description: ibzib commented on pull request #11972:
URL: https://github.com/apache/beam/pull/11972#issuecomment-642340383


   Run PythonDocker PreCommit



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444070)
Time Spent: 2h 20m  (was: 2h 10m)

> C-ares status is not ARES_SUCCESS: Misformatted domain name
> ---
>
> Key: BEAM-9852
> URL: https://issues.apache.org/jira/browse/BEAM-9852
> Project: Beam
>  Issue Type: Sub-task
>  Components: runner-flink, runner-spark
>Reporter: Kyle Weaver
>Priority: P2
>  Labels: portability-flink, portability-spark
>  Time Spent: 2h 20m
>  Remaining Estimate: 0h
>
> This affects all portable runners (Flink, Spark, Dataflow Python streaming). 
> It does not appear to cause pipelines to fail.
> Exception in thread read_grpc_client_inputs:
> Traceback (most recent call last):
> File "/usr/local/lib/python3.7/threading.py", line 926, in _bootstrap_inner
> self.run()
> File "/usr/local/lib/python3.7/threading.py", line 870, in run
> self._target(*self._args, **self._kwargs)
> File 
> "/usr/local/lib/python3.7/site-packages/apache_beam/runners/worker/data_plane.py",
>  line 545, in 
> target=lambda: self._read_inputs(elements_iterator),
> File 
> "/usr/local/lib/python3.7/site-packages/apache_beam/runners/worker/data_plane.py",
>  line 528, in _read_inputs
> for elements in elements_iterator:
> File "/usr/local/lib/python3.7/site-packages/grpc/channel.py", line 388, in 
> __next_
> return self._next()
> File "/usr/local/lib/python3.7/site-packages/grpc/_channel.py", line 365, in 
> _next
> raise self
> grpc._channel._Rendezvous: <_Rendezvous of RPC that terminated with:
> status = StatusCode.UNAVAILABLE
> details = "DNS resolution failed"
> debug_error_string = 
> "{"created":"@1587426512.443144965","description":"Failed to pick 
> subchannel","file":"src/core/ext/filters/client_channel/client_channel.cc","file_line":3876,"referenced_errors":[{"created":"@1587426512.443142363","description":"Resolver
>  transient 
> failure","file":"src/core/ext/filters/client_channel/resolving_lb_policy.cc","file_line":263,"referenced_errors":[{"created":"@1587426512.443141313","description":"DNS
>  resolution 
> failed","file":"src/core/ext/filters/client_channel/resolver/dns/c_ares/dns_resolver_ares.cc","file_line":357,"grpc_status":14,"referenced_errors":[{"created":"@1587426512.443136986","description":"C-ares
>  status is not ARES_SUCCESS: Misformatted domain 
> name","file":"src/core/ext/filters/client_channel/resolver/dns/c_ares/grpc_ares_wrapper.cc","file_line":244,"referenced_errors":[
> {"created":"@1587426512.443126564","description":"C-ares status is not 
> ARES_SUCCESS: Misformatted domain 
> name","file":"src/core/ext/filters/client_channel/resolver/dns/c_ares/grpc_ares_wrapper.cc","file_line":244}
> ]}]}]}]}"
> >



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9852) C-ares status is not ARES_SUCCESS: Misformatted domain name

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9852?focusedWorklogId=444069&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444069
 ]

ASF GitHub Bot logged work on BEAM-9852:


Author: ASF GitHub Bot
Created on: 11/Jun/20 00:47
Start Date: 11/Jun/20 00:47
Worklog Time Spent: 10m 
  Work Description: ibzib edited a comment on pull request #11972:
URL: https://github.com/apache/beam/pull/11972#issuecomment-642336725


   PythonDocker is failing with a license issue:
   
   ```
 File "/tmp/license_scripts/pull_licenses_py.py", line 149, in 
   py_ver=py_ver, license_list=sorted(','.join(no_licenses)), 
how_to=how_to))
   RuntimeError: Could not retrieve licences for packages [['3', 'a', 'h', 'p', 
's', 'y']] in Python3.5 environment. 
   ```
   
   Package names are scrambled due to BEAM-10234, but I'm assuming it's pysha3.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444069)
Time Spent: 2h 10m  (was: 2h)

> C-ares status is not ARES_SUCCESS: Misformatted domain name
> ---
>
> Key: BEAM-9852
> URL: https://issues.apache.org/jira/browse/BEAM-9852
> Project: Beam
>  Issue Type: Sub-task
>  Components: runner-flink, runner-spark
>Reporter: Kyle Weaver
>Priority: P2
>  Labels: portability-flink, portability-spark
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>
> This affects all portable runners (Flink, Spark, Dataflow Python streaming). 
> It does not appear to cause pipelines to fail.
> Exception in thread read_grpc_client_inputs:
> Traceback (most recent call last):
> File "/usr/local/lib/python3.7/threading.py", line 926, in _bootstrap_inner
> self.run()
> File "/usr/local/lib/python3.7/threading.py", line 870, in run
> self._target(*self._args, **self._kwargs)
> File 
> "/usr/local/lib/python3.7/site-packages/apache_beam/runners/worker/data_plane.py",
>  line 545, in 
> target=lambda: self._read_inputs(elements_iterator),
> File 
> "/usr/local/lib/python3.7/site-packages/apache_beam/runners/worker/data_plane.py",
>  line 528, in _read_inputs
> for elements in elements_iterator:
> File "/usr/local/lib/python3.7/site-packages/grpc/channel.py", line 388, in 
> __next_
> return self._next()
> File "/usr/local/lib/python3.7/site-packages/grpc/_channel.py", line 365, in 
> _next
> raise self
> grpc._channel._Rendezvous: <_Rendezvous of RPC that terminated with:
> status = StatusCode.UNAVAILABLE
> details = "DNS resolution failed"
> debug_error_string = 
> "{"created":"@1587426512.443144965","description":"Failed to pick 
> subchannel","file":"src/core/ext/filters/client_channel/client_channel.cc","file_line":3876,"referenced_errors":[{"created":"@1587426512.443142363","description":"Resolver
>  transient 
> failure","file":"src/core/ext/filters/client_channel/resolving_lb_policy.cc","file_line":263,"referenced_errors":[{"created":"@1587426512.443141313","description":"DNS
>  resolution 
> failed","file":"src/core/ext/filters/client_channel/resolver/dns/c_ares/dns_resolver_ares.cc","file_line":357,"grpc_status":14,"referenced_errors":[{"created":"@1587426512.443136986","description":"C-ares
>  status is not ARES_SUCCESS: Misformatted domain 
> name","file":"src/core/ext/filters/client_channel/resolver/dns/c_ares/grpc_ares_wrapper.cc","file_line":244,"referenced_errors":[
> {"created":"@1587426512.443126564","description":"C-ares status is not 
> ARES_SUCCESS: Misformatted domain 
> name","file":"src/core/ext/filters/client_channel/resolver/dns/c_ares/grpc_ares_wrapper.cc","file_line":244}
> ]}]}]}]}"
> >



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10169) ParDo* functions should declare the correct output N in their error message

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10169?focusedWorklogId=444068&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444068
 ]

ASF GitHub Bot logged work on BEAM-10169:
-

Author: ASF GitHub Bot
Created on: 11/Jun/20 00:45
Start Date: 11/Jun/20 00:45
Worklog Time Spent: 10m 
  Work Description: codeBehindMe commented on pull request #11976:
URL: https://github.com/apache/beam/pull/11976#issuecomment-642339945


   R: @damondouglas 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444068)
Time Spent: 1.5h  (was: 1h 20m)

> ParDo* functions should declare the correct output N in their error message
> ---
>
> Key: BEAM-10169
> URL: https://issues.apache.org/jira/browse/BEAM-10169
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-go
>Reporter: Robert Burke
>Assignee: Aaron Tillekeratne
>Priority: P3
>  Labels: noob, starter
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> User report noted the confusion in the error if you use a DoFn with 0 outputs 
> with beam.ParDo instead of beam.ParDo0. 
> In that case, a panic stack trace is followed by the cryptic: "expected 1 
> output. Found: []"
> We can do better.
> While we can't change the return signature dynamically (that's for ParDoN 
> only), we can instead clearly indicate: 
> *  the DoFn in question.
> * the number of outputs the DoFn has
> * and recommend using ParDo0, ParDo, ParDo2,...ParDo7,  or ParDoN, as 
> appropriate.
> https://github.com/apache/beam/blob/master/sdks/go/pkg/beam/pardo.go#L361 
> would need to change as well as any of the specific cases that follow. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9852) C-ares status is not ARES_SUCCESS: Misformatted domain name

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9852?focusedWorklogId=444066&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444066
 ]

ASF GitHub Bot logged work on BEAM-9852:


Author: ASF GitHub Bot
Created on: 11/Jun/20 00:33
Start Date: 11/Jun/20 00:33
Worklog Time Spent: 10m 
  Work Description: ibzib commented on pull request #11972:
URL: https://github.com/apache/beam/pull/11972#issuecomment-642336725


   I'm getting a weird license issue too:
   
   ```
 File "/tmp/license_scripts/pull_licenses_py.py", line 149, in 
   py_ver=py_ver, license_list=sorted(','.join(no_licenses)), 
how_to=how_to))
   RuntimeError: Could not retrieve licences for packages [['3', 'a', 'h', 'p', 
's', 'y']] in Python3.5 environment. 
   ```
   
   Package names are scrambled due to BEAM-10234, but I'm assuming it's pysha3.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444066)
Time Spent: 2h  (was: 1h 50m)

> C-ares status is not ARES_SUCCESS: Misformatted domain name
> ---
>
> Key: BEAM-9852
> URL: https://issues.apache.org/jira/browse/BEAM-9852
> Project: Beam
>  Issue Type: Sub-task
>  Components: runner-flink, runner-spark
>Reporter: Kyle Weaver
>Priority: P2
>  Labels: portability-flink, portability-spark
>  Time Spent: 2h
>  Remaining Estimate: 0h
>
> This affects all portable runners (Flink, Spark, Dataflow Python streaming). 
> It does not appear to cause pipelines to fail.
> Exception in thread read_grpc_client_inputs:
> Traceback (most recent call last):
> File "/usr/local/lib/python3.7/threading.py", line 926, in _bootstrap_inner
> self.run()
> File "/usr/local/lib/python3.7/threading.py", line 870, in run
> self._target(*self._args, **self._kwargs)
> File 
> "/usr/local/lib/python3.7/site-packages/apache_beam/runners/worker/data_plane.py",
>  line 545, in 
> target=lambda: self._read_inputs(elements_iterator),
> File 
> "/usr/local/lib/python3.7/site-packages/apache_beam/runners/worker/data_plane.py",
>  line 528, in _read_inputs
> for elements in elements_iterator:
> File "/usr/local/lib/python3.7/site-packages/grpc/channel.py", line 388, in 
> __next_
> return self._next()
> File "/usr/local/lib/python3.7/site-packages/grpc/_channel.py", line 365, in 
> _next
> raise self
> grpc._channel._Rendezvous: <_Rendezvous of RPC that terminated with:
> status = StatusCode.UNAVAILABLE
> details = "DNS resolution failed"
> debug_error_string = 
> "{"created":"@1587426512.443144965","description":"Failed to pick 
> subchannel","file":"src/core/ext/filters/client_channel/client_channel.cc","file_line":3876,"referenced_errors":[{"created":"@1587426512.443142363","description":"Resolver
>  transient 
> failure","file":"src/core/ext/filters/client_channel/resolving_lb_policy.cc","file_line":263,"referenced_errors":[{"created":"@1587426512.443141313","description":"DNS
>  resolution 
> failed","file":"src/core/ext/filters/client_channel/resolver/dns/c_ares/dns_resolver_ares.cc","file_line":357,"grpc_status":14,"referenced_errors":[{"created":"@1587426512.443136986","description":"C-ares
>  status is not ARES_SUCCESS: Misformatted domain 
> name","file":"src/core/ext/filters/client_channel/resolver/dns/c_ares/grpc_ares_wrapper.cc","file_line":244,"referenced_errors":[
> {"created":"@1587426512.443126564","description":"C-ares status is not 
> ARES_SUCCESS: Misformatted domain 
> name","file":"src/core/ext/filters/client_channel/resolver/dns/c_ares/grpc_ares_wrapper.cc","file_line":244}
> ]}]}]}]}"
> >



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9217) Update DoFn javadoc for schema type translation

2020-06-10 Thread Brian Hulette (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brian Hulette updated BEAM-9217:

Labels:   (was: beam-fixit)

> Update DoFn javadoc for schema type translation
> ---
>
> Key: BEAM-9217
> URL: https://issues.apache.org/jira/browse/BEAM-9217
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Brian Hulette
>Assignee: Brian Hulette
>Priority: P2
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> see https://github.com/apache/beam/pull/10702#discussion_r372516471
> The javadoc for {{@Element}} currently states "the argument type must match 
> the input type of this DoFn", which is incorrect. We will translate between 
> mismatched types if they both have equivalent schemas.
> cc: [~reuvenlax]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9045) Implement an Ignite runner using Apache Ignite compute grid

2020-06-10 Thread Saikat Maitra (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9045?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132801#comment-17132801
 ] 

Saikat Maitra commented on BEAM-9045:
-

Thank you [~kenn] I will share PR with initial implementation to get more 
feedback.

Regards,

Saikat 

> Implement an Ignite runner using Apache Ignite compute grid
> ---
>
> Key: BEAM-9045
> URL: https://issues.apache.org/jira/browse/BEAM-9045
> Project: Beam
>  Issue Type: New Feature
>  Components: runner-ideas
>Reporter: Saikat Maitra
>Assignee: Saikat Maitra
>Priority: P2
>
> Implement an Ignite runner using Apache Ignite compute grid.
> Runner guide [https://beam.apache.org/contribute/runner-guide/]
> Capability Matrix 
> [https://beam.apache.org/documentation/runners/capability-matrix/]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9217) Update DoFn javadoc for schema type translation

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9217?focusedWorklogId=444065&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444065
 ]

ASF GitHub Bot logged work on BEAM-9217:


Author: ASF GitHub Bot
Created on: 11/Jun/20 00:29
Start Date: 11/Jun/20 00:29
Worklog Time Spent: 10m 
  Work Description: TheNeuralBit commented on pull request #11984:
URL: https://github.com/apache/beam/pull/11984#issuecomment-642335762


   R: @reuvenlax 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444065)
Time Spent: 20m  (was: 10m)

> Update DoFn javadoc for schema type translation
> ---
>
> Key: BEAM-9217
> URL: https://issues.apache.org/jira/browse/BEAM-9217
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Brian Hulette
>Assignee: Brian Hulette
>Priority: P2
>  Labels: beam-fixit
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> see https://github.com/apache/beam/pull/10702#discussion_r372516471
> The javadoc for {{@Element}} currently states "the argument type must match 
> the input type of this DoFn", which is incorrect. We will translate between 
> mismatched types if they both have equivalent schemas.
> cc: [~reuvenlax]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9217) Update DoFn javadoc for schema type translation

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9217?focusedWorklogId=444064&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444064
 ]

ASF GitHub Bot logged work on BEAM-9217:


Author: ASF GitHub Bot
Created on: 11/Jun/20 00:29
Start Date: 11/Jun/20 00:29
Worklog Time Spent: 10m 
  Work Description: TheNeuralBit opened a new pull request #11984:
URL: https://github.com/apache/beam/pull/11984


   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Samza | Spark
   --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/)
   Python | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Python35/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python35/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Python36/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python36/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Python37/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python37/lastCompletedBuild/)
 | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Py_VR_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Py_VR_Dataflow/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Py_VR_Dataflow_V2/

[jira] [Updated] (BEAM-9045) Implement an Ignite runner using Apache Ignite compute grid

2020-06-10 Thread Saikat Maitra (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9045?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Saikat Maitra updated BEAM-9045:

Labels:   (was: stale-assigned)

> Implement an Ignite runner using Apache Ignite compute grid
> ---
>
> Key: BEAM-9045
> URL: https://issues.apache.org/jira/browse/BEAM-9045
> Project: Beam
>  Issue Type: New Feature
>  Components: runner-ideas
>Reporter: Saikat Maitra
>Assignee: Saikat Maitra
>Priority: P2
>
> Implement an Ignite runner using Apache Ignite compute grid.
> Runner guide [https://beam.apache.org/contribute/runner-guide/]
> Capability Matrix 
> [https://beam.apache.org/documentation/runners/capability-matrix/]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10234) Packages missing licenses are not listed correctly

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10234?focusedWorklogId=444063&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444063
 ]

ASF GitHub Bot logged work on BEAM-10234:
-

Author: ASF GitHub Bot
Created on: 11/Jun/20 00:24
Start Date: 11/Jun/20 00:24
Worklog Time Spent: 10m 
  Work Description: ibzib opened a new pull request #11983:
URL: https://github.com/apache/beam/pull/11983


   I also yapf'd it while I was at it.
   
   R: @udim 
   
   
   
   Thank you for your contribution! Follow this checklist to help us 
incorporate your contribution quickly and easily:
   
- [ ] [**Choose 
reviewer(s)**](https://beam.apache.org/contribute/#make-your-change) and 
mention them in a comment (`R: @username`).
- [ ] Format the pull request title like `[BEAM-XXX] Fixes bug in 
ApproximateQuantiles`, where you replace `BEAM-XXX` with the appropriate JIRA 
issue, if applicable. This will automatically link the pull request to the 
issue.
- [ ] Update `CHANGES.md` with noteworthy changes.
- [ ] If this contribution is large, please file an Apache [Individual 
Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).
   
   See the [Contributor Guide](https://beam.apache.org/contribute) for more 
tips on [how to make review process 
smoother](https://beam.apache.org/contribute/#make-reviewers-job-easier).
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Samza | Spark
   --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/)
   Python | [![Build 
Status](htt

[jira] [Updated] (BEAM-9217) Update DoFn javadoc for schema type translation

2020-06-10 Thread Brian Hulette (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brian Hulette updated BEAM-9217:

Labels: beam-fixit  (was: )

> Update DoFn javadoc for schema type translation
> ---
>
> Key: BEAM-9217
> URL: https://issues.apache.org/jira/browse/BEAM-9217
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Brian Hulette
>Assignee: Brian Hulette
>Priority: P2
>  Labels: beam-fixit
>
> see https://github.com/apache/beam/pull/10702#discussion_r372516471
> The javadoc for {{@Element}} currently states "the argument type must match 
> the input type of this DoFn", which is incorrect. We will translate between 
> mismatched types if they both have equivalent schemas.
> cc: [~reuvenlax]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-9217) Update DoFn javadoc for schema type translation

2020-06-10 Thread Brian Hulette (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9217?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brian Hulette reassigned BEAM-9217:
---

Assignee: Brian Hulette

> Update DoFn javadoc for schema type translation
> ---
>
> Key: BEAM-9217
> URL: https://issues.apache.org/jira/browse/BEAM-9217
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-java-core
>Reporter: Brian Hulette
>Assignee: Brian Hulette
>Priority: P2
>
> see https://github.com/apache/beam/pull/10702#discussion_r372516471
> The javadoc for {{@Element}} currently states "the argument type must match 
> the input type of this DoFn", which is incorrect. We will translate between 
> mismatched types if they both have equivalent schemas.
> cc: [~reuvenlax]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-6892) Use temp_location for BQ FILE_LOADS on DirectRunner, and autocreate it in GCS if not specified by user.

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-6892?focusedWorklogId=444057&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444057
 ]

ASF GitHub Bot logged work on BEAM-6892:


Author: ASF GitHub Bot
Created on: 11/Jun/20 00:16
Start Date: 11/Jun/20 00:16
Worklog Time Spent: 10m 
  Work Description: pabloem commented on pull request #11982:
URL: https://github.com/apache/beam/pull/11982#issuecomment-642331818







This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444057)
Time Spent: 9.5h  (was: 9h 20m)

> Use temp_location for BQ FILE_LOADS on DirectRunner, and autocreate it in GCS 
> if not specified by user.
> ---
>
> Key: BEAM-6892
> URL: https://issues.apache.org/jira/browse/BEAM-6892
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-core
>Reporter: Valentyn Tymofieiev
>Priority: P2
>  Time Spent: 9.5h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-6892) Use temp_location for BQ FILE_LOADS on DirectRunner, and autocreate it in GCS if not specified by user.

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-6892?focusedWorklogId=444059&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444059
 ]

ASF GitHub Bot logged work on BEAM-6892:


Author: ASF GitHub Bot
Created on: 11/Jun/20 00:16
Start Date: 11/Jun/20 00:16
Worklog Time Spent: 10m 
  Work Description: pabloem commented on pull request #11982:
URL: https://github.com/apache/beam/pull/11982#issuecomment-642331957


   Run Python2_PVR_Flink PreCommit



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444059)
Time Spent: 9h 50m  (was: 9h 40m)

> Use temp_location for BQ FILE_LOADS on DirectRunner, and autocreate it in GCS 
> if not specified by user.
> ---
>
> Key: BEAM-6892
> URL: https://issues.apache.org/jira/browse/BEAM-6892
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-core
>Reporter: Valentyn Tymofieiev
>Priority: P2
>  Time Spent: 9h 50m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-6892) Use temp_location for BQ FILE_LOADS on DirectRunner, and autocreate it in GCS if not specified by user.

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-6892?focusedWorklogId=444058&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444058
 ]

ASF GitHub Bot logged work on BEAM-6892:


Author: ASF GitHub Bot
Created on: 11/Jun/20 00:16
Start Date: 11/Jun/20 00:16
Worklog Time Spent: 10m 
  Work Description: pabloem commented on pull request #11982:
URL: https://github.com/apache/beam/pull/11982#issuecomment-642331889


   Run Portable_Python PreCommit



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444058)
Time Spent: 9h 40m  (was: 9.5h)

> Use temp_location for BQ FILE_LOADS on DirectRunner, and autocreate it in GCS 
> if not specified by user.
> ---
>
> Key: BEAM-6892
> URL: https://issues.apache.org/jira/browse/BEAM-6892
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-core
>Reporter: Valentyn Tymofieiev
>Priority: P2
>  Time Spent: 9h 40m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-10201) Enhance JsonToRow to add Deadletter Support

2020-06-10 Thread Brian Hulette (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brian Hulette updated BEAM-10201:
-
Status: Open  (was: Triage Needed)

> Enhance JsonToRow to add Deadletter Support
> ---
>
> Key: BEAM-10201
> URL: https://issues.apache.org/jira/browse/BEAM-10201
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>Reporter: Reza ardeshir rokni
>Priority: P2
>  Time Spent: 2.5h
>  Remaining Estimate: 0h
>
> Current JsonToRow transform does not support Dead Letter pattern on parse 
> failures.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-10201) Enhance JsonToRow to add Deadletter Support

2020-06-10 Thread Brian Hulette (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10201?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brian Hulette reassigned BEAM-10201:


Assignee: Reza ardeshir rokni

> Enhance JsonToRow to add Deadletter Support
> ---
>
> Key: BEAM-10201
> URL: https://issues.apache.org/jira/browse/BEAM-10201
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>Reporter: Reza ardeshir rokni
>Assignee: Reza ardeshir rokni
>Priority: P2
>  Time Spent: 2.5h
>  Remaining Estimate: 0h
>
> Current JsonToRow transform does not support Dead Letter pattern on parse 
> failures.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10201) Enhance JsonToRow to add Deadletter Support

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10201?focusedWorklogId=444056&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444056
 ]

ASF GitHub Bot logged work on BEAM-10201:
-

Author: ASF GitHub Bot
Created on: 11/Jun/20 00:12
Start Date: 11/Jun/20 00:12
Worklog Time Spent: 10m 
  Work Description: TheNeuralBit commented on pull request #11929:
URL: https://github.com/apache/beam/pull/11929#issuecomment-642330786


   FYI `PubsubJsonTableProvider` has support for writing to a dead letter 
pubsub topic: 
https://github.com/apache/beam/blob/d5dd47b47cbdf0739ac1a28cb8fbd06becbdbae7/sdks/java/extensions/sql/src/main/java/org/apache/beam/sdk/extensions/sql/meta/provider/pubsub/PubsubIOJsonTable.java#L160.
 Maybe there's some duplicate logic we can remove after this PR.



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444056)
Time Spent: 2.5h  (was: 2h 20m)

> Enhance JsonToRow to add Deadletter Support
> ---
>
> Key: BEAM-10201
> URL: https://issues.apache.org/jira/browse/BEAM-10201
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>Reporter: Reza ardeshir rokni
>Priority: P2
>  Time Spent: 2.5h
>  Remaining Estimate: 0h
>
> Current JsonToRow transform does not support Dead Letter pattern on parse 
> failures.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-10234) Packages missing licenses are not listed correctly

2020-06-10 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10234?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles updated BEAM-10234:
---
Status: Open  (was: Triage Needed)

> Packages missing licenses are not listed correctly
> --
>
> Key: BEAM-10234
> URL: https://issues.apache.org/jira/browse/BEAM-10234
> Project: Beam
>  Issue Type: Bug
>  Components: build-system
>Reporter: Kyle Weaver
>Assignee: Kyle Weaver
>Priority: P2
>
> e.g.
> RuntimeError: Could not retrieve licences for packages [[',', ',', ',', ',', 
> ',', ',', ',', '-', '-', '-', '4', 'a', 'a', 'a', 'a', 'a', 'a', 'b', 'b', 
> 'c', 'c', 'd', 'e', 'e', 'e', 'e', 'e', 'e', 'f', 'f', 'f', 'g', 'g', 'i', 
> 'i', 'i', 'i', 'i', 'i', 'i', 'k', 'l', 'l', 'm', 'm', 'm', 'm', 'n', 'n', 
> 'o', 'o', 'o', 'o', 'o', 'o', 'o', 'o', 'p', 'p', 'p', 'p', 'p', 'p', 'p', 
> 'p', 'p', 'r', 'r', 'r', 'r', 'r', 's', 's', 's', 's', 't', 't', 't', 't', 
> 't', 't', 't', 'u', 'u', 'u', 'u', 'u', 'w', 'x', 'y', 'y', 'y', 'z']] in 
> Python3.7 environment. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-10234) Packages missing licenses are not listed correctly

2020-06-10 Thread Kyle Weaver (Jira)
Kyle Weaver created BEAM-10234:
--

 Summary: Packages missing licenses are not listed correctly
 Key: BEAM-10234
 URL: https://issues.apache.org/jira/browse/BEAM-10234
 Project: Beam
  Issue Type: Bug
  Components: build-system
Reporter: Kyle Weaver
Assignee: Kyle Weaver


e.g.
RuntimeError: Could not retrieve licences for packages [[',', ',', ',', ',', 
',', ',', ',', '-', '-', '-', '4', 'a', 'a', 'a', 'a', 'a', 'a', 'b', 'b', 'c', 
'c', 'd', 'e', 'e', 'e', 'e', 'e', 'e', 'f', 'f', 'f', 'g', 'g', 'i', 'i', 'i', 
'i', 'i', 'i', 'i', 'k', 'l', 'l', 'm', 'm', 'm', 'm', 'n', 'n', 'o', 'o', 'o', 
'o', 'o', 'o', 'o', 'o', 'p', 'p', 'p', 'p', 'p', 'p', 'p', 'p', 'p', 'r', 'r', 
'r', 'r', 'r', 's', 's', 's', 's', 't', 't', 't', 't', 't', 't', 't', 'u', 'u', 
'u', 'u', 'u', 'w', 'x', 'y', 'y', 'y', 'z']] in Python3.7 environment. 




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-10233) Return error message/information with existing FAILED_ROW data from BigQueryWriteFn (Python SDK)

2020-06-10 Thread Tom Hardman (Jira)
Tom Hardman created BEAM-10233:
--

 Summary: Return error message/information with existing FAILED_ROW 
data from BigQueryWriteFn (Python SDK)
 Key: BEAM-10233
 URL: https://issues.apache.org/jira/browse/BEAM-10233
 Project: Beam
  Issue Type: Improvement
  Components: io-py-gcp
Reporter: Tom Hardman


A user may call `apache_beam.io.gcp.bigquery.WriteToBigQuery` to write their 
streamed data to BQ. I any rows fail to write, this will return a tagged 
pcollection `BigQueryWriteFn.FAILED_ROWS`. This data includes a tuple 
`(destination_table, failed_row_payload)`.

My suggestion is to include the error information in the `FAILED_ROWS` 
pcollection. From the source code we can see that we have access to the error 
information, e.g. that the row failed because field `id` was `invalid` because 
`this field is not a record`. I think we should surface this to the user.

I'm happy to open a PR this myself (as I've already had to overwrite the 
original code in several projects), but it looks like we'd need a breaking 
change by either extending the tuple which would cause unpacking issues in 
existing code, or by returning a different data structure entirely.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-6892) Use temp_location for BQ FILE_LOADS on DirectRunner, and autocreate it in GCS if not specified by user.

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-6892?focusedWorklogId=444054&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444054
 ]

ASF GitHub Bot logged work on BEAM-6892:


Author: ASF GitHub Bot
Created on: 11/Jun/20 00:02
Start Date: 11/Jun/20 00:02
Worklog Time Spent: 10m 
  Work Description: pabloem commented on pull request #11982:
URL: https://github.com/apache/beam/pull/11982#issuecomment-642327786


   I've tested this on my machine



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444054)
Time Spent: 9h 20m  (was: 9h 10m)

> Use temp_location for BQ FILE_LOADS on DirectRunner, and autocreate it in GCS 
> if not specified by user.
> ---
>
> Key: BEAM-6892
> URL: https://issues.apache.org/jira/browse/BEAM-6892
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-core
>Reporter: Valentyn Tymofieiev
>Priority: P2
>  Time Spent: 9h 20m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10220) Add support for implicit nulls in de/serializing JSON

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10220?focusedWorklogId=444052&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444052
 ]

ASF GitHub Bot logged work on BEAM-10220:
-

Author: ASF GitHub Bot
Created on: 11/Jun/20 00:00
Start Date: 11/Jun/20 00:00
Worklog Time Spent: 10m 
  Work Description: rezarokni commented on pull request #11955:
URL: https://github.com/apache/beam/pull/11955#issuecomment-642327425


   Thanx for FYI @pabloem .
   
   This is awesome! we are also looking to add deadletter pattern to JsonToRow:
   
   https://github.com/apache/beam/pull/11929
   
   The null support your adding is great! 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444052)
Time Spent: 2h 50m  (was: 2h 40m)

> Add support for implicit nulls in de/serializing JSON
> -
>
> Key: BEAM-10220
> URL: https://issues.apache.org/jira/browse/BEAM-10220
> Project: Beam
>  Issue Type: New Feature
>  Components: extensions-java-json
>Reporter: Reuben van Ammers
>Assignee: Reuben van Ammers
>Priority: P3
>  Time Spent: 2h 50m
>  Remaining Estimate: 0h
>
> Currently,  RowJson.java doesn't have support for deserialising JSON with 
> fields missing - it fails, even if the beam schema is nullable. This is often 
> a problem for reading JSON as nulls are very commonly represented by omitting 
> the field. Additionally, for this reason, not writing nulls is a reasonable 
> when converting from Beam rows to JSON.
> This would help to allow https://issues.apache.org/jira/browse/BEAM-7624 to 
> be implemented. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10201) Enhance JsonToRow to add Deadletter Support

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10201?focusedWorklogId=444053&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444053
 ]

ASF GitHub Bot logged work on BEAM-10201:
-

Author: ASF GitHub Bot
Created on: 11/Jun/20 00:01
Start Date: 11/Jun/20 00:01
Worklog Time Spent: 10m 
  Work Description: rezarokni commented on a change in pull request #11929:
URL: https://github.com/apache/beam/pull/11929#discussion_r438470939



##
File path: 
sdks/java/core/src/main/java/org/apache/beam/sdk/transforms/JsonToRow.java
##
@@ -72,10 +79,44 @@
 @Experimental(Kind.SCHEMAS)
 public class JsonToRow {
 
+  private static final String LINE_FIELD_NAME = "line";
+  private static final String ERROR_FIELD_NAME = "err";
+
+  public static final Schema ERROR_ROW_SCHEMA =
+  Schema.of(
+  Field.of(LINE_FIELD_NAME, FieldType.STRING),
+  Field.of(ERROR_FIELD_NAME, FieldType.STRING));
+
+  public static final TupleTag MAIN_TUPLE_TAG = new TupleTag() {};
+  public static final TupleTag DEAD_LETTER_TUPLE_TAG = new 
TupleTag() {};
+

Review comment:
   Done.





This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444053)
Time Spent: 2h 20m  (was: 2h 10m)

> Enhance JsonToRow to add Deadletter Support
> ---
>
> Key: BEAM-10201
> URL: https://issues.apache.org/jira/browse/BEAM-10201
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-java-core
>Reporter: Reza ardeshir rokni
>Priority: P2
>  Time Spent: 2h 20m
>  Remaining Estimate: 0h
>
> Current JsonToRow transform does not support Dead Letter pattern on parse 
> failures.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-6892) Use temp_location for BQ FILE_LOADS on DirectRunner, and autocreate it in GCS if not specified by user.

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-6892?focusedWorklogId=444050&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444050
 ]

ASF GitHub Bot logged work on BEAM-6892:


Author: ASF GitHub Bot
Created on: 10/Jun/20 23:59
Start Date: 10/Jun/20 23:59
Worklog Time Spent: 10m 
  Work Description: pabloem commented on pull request #11982:
URL: https://github.com/apache/beam/pull/11982#issuecomment-642327079


   r: @udim 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444050)
Time Spent: 9h 10m  (was: 9h)

> Use temp_location for BQ FILE_LOADS on DirectRunner, and autocreate it in GCS 
> if not specified by user.
> ---
>
> Key: BEAM-6892
> URL: https://issues.apache.org/jira/browse/BEAM-6892
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-py-core
>Reporter: Valentyn Tymofieiev
>Priority: P2
>  Time Spent: 9h 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-6892) Use temp_location for BQ FILE_LOADS on DirectRunner, and autocreate it in GCS if not specified by user.

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-6892?focusedWorklogId=444049&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444049
 ]

ASF GitHub Bot logged work on BEAM-6892:


Author: ASF GitHub Bot
Created on: 10/Jun/20 23:59
Start Date: 10/Jun/20 23:59
Worklog Time Spent: 10m 
  Work Description: pabloem opened a new pull request #11982:
URL: https://github.com/apache/beam/pull/11982


   Following the logic from Java 
(https://github.com/apache/beam/blob/b56740f0e8cd80c2873412847d0b336837429fb9/sdks/java/extensions/google-cloud-platform-core/src/main/java/org/apache/beam/sdk/extensions/gcp/options/GcpOptions.java#L332).
   
   
   Thank you for your contribution! Follow this checklist to help us 
incorporate your contribution quickly and easily:
   
- [ ] [**Choose 
reviewer(s)**](https://beam.apache.org/contribute/#make-your-change) and 
mention them in a comment (`R: @username`).
- [ ] Format the pull request title like `[BEAM-XXX] Fixes bug in 
ApproximateQuantiles`, where you replace `BEAM-XXX` with the appropriate JIRA 
issue, if applicable. This will automatically link the pull request to the 
issue.
- [ ] Update `CHANGES.md` with noteworthy changes.
- [ ] If this contribution is large, please file an Apache [Individual 
Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).
   
   See the [Contributor Guide](https://beam.apache.org/contribute) for more 
tips on [how to make review process 
smoother](https://beam.apache.org/contribute/#make-reviewers-job-easier).
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Samza | Spark
   --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming

[jira] [Assigned] (BEAM-10220) Add support for implicit nulls in de/serializing JSON

2020-06-10 Thread Brian Hulette (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brian Hulette reassigned BEAM-10220:


Assignee: Reuben van Ammers

> Add support for implicit nulls in de/serializing JSON
> -
>
> Key: BEAM-10220
> URL: https://issues.apache.org/jira/browse/BEAM-10220
> Project: Beam
>  Issue Type: New Feature
>  Components: extensions-java-json
>Reporter: Reuben van Ammers
>Assignee: Reuben van Ammers
>Priority: P3
>  Time Spent: 2h 40m
>  Remaining Estimate: 0h
>
> Currently,  RowJson.java doesn't have support for deserialising JSON with 
> fields missing - it fails, even if the beam schema is nullable. This is often 
> a problem for reading JSON as nulls are very commonly represented by omitting 
> the field. Additionally, for this reason, not writing nulls is a reasonable 
> when converting from Beam rows to JSON.
> This would help to allow https://issues.apache.org/jira/browse/BEAM-7624 to 
> be implemented. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10220) Add support for implicit nulls in de/serializing JSON

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10220?focusedWorklogId=444048&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444048
 ]

ASF GitHub Bot logged work on BEAM-10220:
-

Author: ASF GitHub Bot
Created on: 10/Jun/20 23:53
Start Date: 10/Jun/20 23:53
Worklog Time Spent: 10m 
  Work Description: pabloem commented on pull request #11955:
URL: https://github.com/apache/beam/pull/11955#issuecomment-642325514


   and assigned BEAM-7624



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444048)
Time Spent: 2h 40m  (was: 2.5h)

> Add support for implicit nulls in de/serializing JSON
> -
>
> Key: BEAM-10220
> URL: https://issues.apache.org/jira/browse/BEAM-10220
> Project: Beam
>  Issue Type: New Feature
>  Components: extensions-java-json
>Reporter: Reuben van Ammers
>Priority: P3
>  Time Spent: 2h 40m
>  Remaining Estimate: 0h
>
> Currently,  RowJson.java doesn't have support for deserialising JSON with 
> fields missing - it fails, even if the beam schema is nullable. This is often 
> a problem for reading JSON as nulls are very commonly represented by omitting 
> the field. Additionally, for this reason, not writing nulls is a reasonable 
> when converting from Beam rows to JSON.
> This would help to allow https://issues.apache.org/jira/browse/BEAM-7624 to 
> be implemented. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10220) Add support for implicit nulls in de/serializing JSON

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10220?focusedWorklogId=444047&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444047
 ]

ASF GitHub Bot logged work on BEAM-10220:
-

Author: ASF GitHub Bot
Created on: 10/Jun/20 23:53
Start Date: 10/Jun/20 23:53
Worklog Time Spent: 10m 
  Work Description: pabloem commented on pull request #11955:
URL: https://github.com/apache/beam/pull/11955#issuecomment-642325342


   Added @reubenvanammers as contributor!



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444047)
Time Spent: 2.5h  (was: 2h 20m)

> Add support for implicit nulls in de/serializing JSON
> -
>
> Key: BEAM-10220
> URL: https://issues.apache.org/jira/browse/BEAM-10220
> Project: Beam
>  Issue Type: New Feature
>  Components: extensions-java-json
>Reporter: Reuben van Ammers
>Priority: P3
>  Time Spent: 2.5h
>  Remaining Estimate: 0h
>
> Currently,  RowJson.java doesn't have support for deserialising JSON with 
> fields missing - it fails, even if the beam schema is nullable. This is often 
> a problem for reading JSON as nulls are very commonly represented by omitting 
> the field. Additionally, for this reason, not writing nulls is a reasonable 
> when converting from Beam rows to JSON.
> This would help to allow https://issues.apache.org/jira/browse/BEAM-7624 to 
> be implemented. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-7624) Missing field support in JsonToRow

2020-06-10 Thread Pablo Estrada (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7624?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Pablo Estrada reassigned BEAM-7624:
---

Assignee: Reuben van Ammers

> Missing field support in JsonToRow
> --
>
> Key: BEAM-7624
> URL: https://issues.apache.org/jira/browse/BEAM-7624
> Project: Beam
>  Issue Type: Improvement
>  Components: extensions-java-json
>Reporter: Anton Kedin
>Assignee: Reuben van Ammers
>Priority: P3
>
> [JsonToRow|https://github.com/apache/beam/blob/master/sdks/java/core/src/main/java/org/apache/beam/sdk/transforms/JsonToRow.java]
>  supports only strict schema matching, so it expects all the fields that are 
> declared by the schema to be present in the input JSON data. If the field is 
> missing, [it 
> crashes|https://github.com/apache/beam/blob/master/sdks/java/core/src/main/java/org/apache/beam/sdk/util/RowJsonDeserializer.java#L114-L117].
>  In practice it is useful to default to null is such cases. We should make it 
> configurable in Beam Java SDK at least, and probably in Beam SQL (in Pubsub 
> Json TableProvider) as well.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10220) Add support for implicit nulls in de/serializing JSON

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10220?focusedWorklogId=444046&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444046
 ]

ASF GitHub Bot logged work on BEAM-10220:
-

Author: ASF GitHub Bot
Created on: 10/Jun/20 23:52
Start Date: 10/Jun/20 23:52
Worklog Time Spent: 10m 
  Work Description: pabloem commented on pull request #11955:
URL: https://github.com/apache/beam/pull/11955#issuecomment-642325145


   fyi @rezarokni @reuvenlax 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444046)
Time Spent: 2h 20m  (was: 2h 10m)

> Add support for implicit nulls in de/serializing JSON
> -
>
> Key: BEAM-10220
> URL: https://issues.apache.org/jira/browse/BEAM-10220
> Project: Beam
>  Issue Type: New Feature
>  Components: extensions-java-json
>Reporter: Reuben van Ammers
>Priority: P3
>  Time Spent: 2h 20m
>  Remaining Estimate: 0h
>
> Currently,  RowJson.java doesn't have support for deserialising JSON with 
> fields missing - it fails, even if the beam schema is nullable. This is often 
> a problem for reading JSON as nulls are very commonly represented by omitting 
> the field. Additionally, for this reason, not writing nulls is a reasonable 
> when converting from Beam rows to JSON.
> This would help to allow https://issues.apache.org/jira/browse/BEAM-7624 to 
> be implemented. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-10220) Add support for implicit nulls in de/serializing JSON

2020-06-10 Thread Brian Hulette (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10220?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brian Hulette updated BEAM-10220:
-
Status: Open  (was: Triage Needed)

> Add support for implicit nulls in de/serializing JSON
> -
>
> Key: BEAM-10220
> URL: https://issues.apache.org/jira/browse/BEAM-10220
> Project: Beam
>  Issue Type: New Feature
>  Components: extensions-java-json
>Reporter: Reuben van Ammers
>Priority: P3
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>
> Currently,  RowJson.java doesn't have support for deserialising JSON with 
> fields missing - it fails, even if the beam schema is nullable. This is often 
> a problem for reading JSON as nulls are very commonly represented by omitting 
> the field. Additionally, for this reason, not writing nulls is a reasonable 
> when converting from Beam rows to JSON.
> This would help to allow https://issues.apache.org/jira/browse/BEAM-7624 to 
> be implemented. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-10220) Add support for implicit nulls in de/serializing JSON

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-10220?focusedWorklogId=444045&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444045
 ]

ASF GitHub Bot logged work on BEAM-10220:
-

Author: ASF GitHub Bot
Created on: 10/Jun/20 23:46
Start Date: 10/Jun/20 23:46
Worklog Time Spent: 10m 
  Work Description: TheNeuralBit commented on pull request #11955:
URL: https://github.com/apache/beam/pull/11955#issuecomment-642323660


   I went to assign BEAM-7624 to you, but it looks like you need to be added as 
a Beam contributor first. @pabloem could you add Reuben on jira?



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444045)
Time Spent: 2h 10m  (was: 2h)

> Add support for implicit nulls in de/serializing JSON
> -
>
> Key: BEAM-10220
> URL: https://issues.apache.org/jira/browse/BEAM-10220
> Project: Beam
>  Issue Type: New Feature
>  Components: extensions-java-json
>Reporter: Reuben van Ammers
>Priority: P3
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>
> Currently,  RowJson.java doesn't have support for deserialising JSON with 
> fields missing - it fails, even if the beam schema is nullable. This is often 
> a problem for reading JSON as nulls are very commonly represented by omitting 
> the field. Additionally, for this reason, not writing nulls is a reasonable 
> when converting from Beam rows to JSON.
> This would help to allow https://issues.apache.org/jira/browse/BEAM-7624 to 
> be implemented. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8364) SchemaCoder inconsistent equality behavior for POJO

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8364?focusedWorklogId=444044&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444044
 ]

ASF GitHub Bot logged work on BEAM-8364:


Author: ASF GitHub Bot
Created on: 10/Jun/20 23:42
Start Date: 10/Jun/20 23:42
Worklog Time Spent: 10m 
  Work Description: TheNeuralBit opened a new pull request #11981:
URL: https://github.com/apache/beam/pull/11981


   R: @reuvenlax 
   CC: @nevillelyh 
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Samza | Spark
   --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/)
   Python | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Python35/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python35/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Python36/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python36/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Python37/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python37/lastCompletedBuild/)
 | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Py_VR_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Py_VR_Dataflow/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.

[jira] [Updated] (BEAM-8364) SchemaCoder inconsistent equality behavior for POJO

2020-06-10 Thread Brian Hulette (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brian Hulette updated BEAM-8364:

Labels: beam-fixit  (was: )

> SchemaCoder inconsistent equality behavior for POJO
> ---
>
> Key: BEAM-8364
> URL: https://issues.apache.org/jira/browse/BEAM-8364
> Project: Beam
>  Issue Type: Bug
>  Components: dsl-sql, sdk-java-core
>Affects Versions: 2.16.0
>Reporter: Neville Li
>Assignee: Brian Hulette
>Priority: P3
>  Labels: beam-fixit
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> One can create a {{SchemaCoder}} for arbitrary type {{T}} with 
> {{SchemaCoder.of(schema, toRowFunction, fromRowFunction)}}. However, in cases 
> where {{T}} lacks proper equality behavior, i.e. POJO, the result coder still 
> returns true for {{consistentWithEquals}} and {{structuralValue}}s that fail 
> equality check.
> This test reproduces the issue.
> {code:java}
> import org.apache.beam.sdk.schemas.Schema;
> import org.apache.beam.sdk.schemas.SchemaCoder;
> import org.apache.beam.sdk.values.Row;
> import org.junit.Test;
> import org.junit.runner.RunWith;
> import org.junit.runners.JUnit4;
> import java.nio.charset.Charset;
> import static org.junit.Assert.*;
> @RunWith(JUnit4.class)
> public class SchemaCoderTest {
>   public static class Pojo {
> private final byte[] bytes;
> private final String id;
> public Pojo(byte[] bytes, String id) {
>   this.bytes = bytes;
>   this.id = id;
> }
> public byte[] getBytes() {
>   return bytes;
> }
> public String getId() {
>   return id;
> }
>   }
>   @Test
>   public void testCoder() {
> Schema schema = 
> Schema.builder().addByteArrayField("bytes").addStringField("id").build();
> SchemaCoder coder = SchemaCoder.of(
> schema,
> t -> Row.withSchema(schema).addValues(t.getBytes(), 
> t.getId()).build(),
> r -> new Pojo(r.getBytes("bytes"), r.getString("id")));
> Pojo p1 = new Pojo("hello".getBytes(Charset.forName("UTF-8")), "world");
> Pojo p2 = new Pojo("hello".getBytes(Charset.forName("UTF-8")), "world");
> assertNotEquals(p1, p2); // EXPECTED, p1.equals(p2) == false
> assertFalse(coder.consistentWithEquals()); // FAIL, returns true
> assertEquals(coder.structuralValue(p1), coder.structuralValue(p2)); // 
> FAIL
>   }
> }
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9564) Remove insecure ssl options from MongoDBIO

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9564?focusedWorklogId=444042&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444042
 ]

ASF GitHub Bot logged work on BEAM-9564:


Author: ASF GitHub Bot
Created on: 10/Jun/20 23:35
Start Date: 10/Jun/20 23:35
Worklog Time Spent: 10m 
  Work Description: TheNeuralBit opened a new pull request #11980:
URL: https://github.com/apache/beam/pull/11980


   Adds batching of schema'd PCollections into dataframes based on 
BatchElements transform.
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Samza | Spark
   --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/)
   Python | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python2/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Python35/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python35/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Python36/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python36/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Python37/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Python37/lastCompletedBuild/)
 | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Py_VR_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Py_VR_Dataflow/lastCo

[jira] [Work logged] (BEAM-9564) Remove insecure ssl options from MongoDBIO

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9564?focusedWorklogId=444043&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444043
 ]

ASF GitHub Bot logged work on BEAM-9564:


Author: ASF GitHub Bot
Created on: 10/Jun/20 23:35
Start Date: 10/Jun/20 23:35
Worklog Time Spent: 10m 
  Work Description: TheNeuralBit commented on pull request #11980:
URL: https://github.com/apache/beam/pull/11980#issuecomment-642320833


   CC: @robertwb 



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444043)
Time Spent: 2h  (was: 1h 50m)

> Remove insecure ssl options from MongoDBIO
> --
>
> Key: BEAM-9564
> URL: https://issues.apache.org/jira/browse/BEAM-9564
> Project: Beam
>  Issue Type: Improvement
>  Components: io-java-mongodb
>Affects Versions: 2.21.0
>Reporter: Ismaël Mejía
>Assignee: Ismaël Mejía
>Priority: P1
>  Labels: backward-incompatible
>  Time Spent: 2h
>  Remaining Estimate: 0h
>
> The option MongoDBIO.withIgnoreSSLCertificate  and 
> withSSLInvalidHostNameAllowedslInvalidHostNameAllowed() are insecure by 
> design. We should not encourage users to be able to use them so better to 
> remove these options.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9852) C-ares status is not ARES_SUCCESS: Misformatted domain name

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9852?focusedWorklogId=444041&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444041
 ]

ASF GitHub Bot logged work on BEAM-9852:


Author: ASF GitHub Bot
Created on: 10/Jun/20 23:33
Start Date: 10/Jun/20 23:33
Worklog Time Spent: 10m 
  Work Description: ibzib commented on pull request #11972:
URL: https://github.com/apache/beam/pull/11972#issuecomment-642320097


   retest this please



This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 444041)
Time Spent: 1h 50m  (was: 1h 40m)

> C-ares status is not ARES_SUCCESS: Misformatted domain name
> ---
>
> Key: BEAM-9852
> URL: https://issues.apache.org/jira/browse/BEAM-9852
> Project: Beam
>  Issue Type: Sub-task
>  Components: runner-flink, runner-spark
>Reporter: Kyle Weaver
>Priority: P2
>  Labels: portability-flink, portability-spark
>  Time Spent: 1h 50m
>  Remaining Estimate: 0h
>
> This affects all portable runners (Flink, Spark, Dataflow Python streaming). 
> It does not appear to cause pipelines to fail.
> Exception in thread read_grpc_client_inputs:
> Traceback (most recent call last):
> File "/usr/local/lib/python3.7/threading.py", line 926, in _bootstrap_inner
> self.run()
> File "/usr/local/lib/python3.7/threading.py", line 870, in run
> self._target(*self._args, **self._kwargs)
> File 
> "/usr/local/lib/python3.7/site-packages/apache_beam/runners/worker/data_plane.py",
>  line 545, in 
> target=lambda: self._read_inputs(elements_iterator),
> File 
> "/usr/local/lib/python3.7/site-packages/apache_beam/runners/worker/data_plane.py",
>  line 528, in _read_inputs
> for elements in elements_iterator:
> File "/usr/local/lib/python3.7/site-packages/grpc/channel.py", line 388, in 
> __next_
> return self._next()
> File "/usr/local/lib/python3.7/site-packages/grpc/_channel.py", line 365, in 
> _next
> raise self
> grpc._channel._Rendezvous: <_Rendezvous of RPC that terminated with:
> status = StatusCode.UNAVAILABLE
> details = "DNS resolution failed"
> debug_error_string = 
> "{"created":"@1587426512.443144965","description":"Failed to pick 
> subchannel","file":"src/core/ext/filters/client_channel/client_channel.cc","file_line":3876,"referenced_errors":[{"created":"@1587426512.443142363","description":"Resolver
>  transient 
> failure","file":"src/core/ext/filters/client_channel/resolving_lb_policy.cc","file_line":263,"referenced_errors":[{"created":"@1587426512.443141313","description":"DNS
>  resolution 
> failed","file":"src/core/ext/filters/client_channel/resolver/dns/c_ares/dns_resolver_ares.cc","file_line":357,"grpc_status":14,"referenced_errors":[{"created":"@1587426512.443136986","description":"C-ares
>  status is not ARES_SUCCESS: Misformatted domain 
> name","file":"src/core/ext/filters/client_channel/resolver/dns/c_ares/grpc_ares_wrapper.cc","file_line":244,"referenced_errors":[
> {"created":"@1587426512.443126564","description":"C-ares status is not 
> ARES_SUCCESS: Misformatted domain 
> name","file":"src/core/ext/filters/client_channel/resolver/dns/c_ares/grpc_ares_wrapper.cc","file_line":244}
> ]}]}]}]}"
> >



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-8364) SchemaCoder inconsistent equality behavior for POJO

2020-06-10 Thread Brian Hulette (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8364?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Brian Hulette reassigned BEAM-8364:
---

Assignee: Brian Hulette

> SchemaCoder inconsistent equality behavior for POJO
> ---
>
> Key: BEAM-8364
> URL: https://issues.apache.org/jira/browse/BEAM-8364
> Project: Beam
>  Issue Type: Bug
>  Components: dsl-sql, sdk-java-core
>Affects Versions: 2.16.0
>Reporter: Neville Li
>Assignee: Brian Hulette
>Priority: P3
>
> One can create a {{SchemaCoder}} for arbitrary type {{T}} with 
> {{SchemaCoder.of(schema, toRowFunction, fromRowFunction)}}. However, in cases 
> where {{T}} lacks proper equality behavior, i.e. POJO, the result coder still 
> returns true for {{consistentWithEquals}} and {{structuralValue}}s that fail 
> equality check.
> This test reproduces the issue.
> {code:java}
> import org.apache.beam.sdk.schemas.Schema;
> import org.apache.beam.sdk.schemas.SchemaCoder;
> import org.apache.beam.sdk.values.Row;
> import org.junit.Test;
> import org.junit.runner.RunWith;
> import org.junit.runners.JUnit4;
> import java.nio.charset.Charset;
> import static org.junit.Assert.*;
> @RunWith(JUnit4.class)
> public class SchemaCoderTest {
>   public static class Pojo {
> private final byte[] bytes;
> private final String id;
> public Pojo(byte[] bytes, String id) {
>   this.bytes = bytes;
>   this.id = id;
> }
> public byte[] getBytes() {
>   return bytes;
> }
> public String getId() {
>   return id;
> }
>   }
>   @Test
>   public void testCoder() {
> Schema schema = 
> Schema.builder().addByteArrayField("bytes").addStringField("id").build();
> SchemaCoder coder = SchemaCoder.of(
> schema,
> t -> Row.withSchema(schema).addValues(t.getBytes(), 
> t.getId()).build(),
> r -> new Pojo(r.getBytes("bytes"), r.getString("id")));
> Pojo p1 = new Pojo("hello".getBytes(Charset.forName("UTF-8")), "world");
> Pojo p2 = new Pojo("hello".getBytes(Charset.forName("UTF-8")), "world");
> assertNotEquals(p1, p2); // EXPECTED, p1.equals(p2) == false
> assertFalse(coder.consistentWithEquals()); // FAIL, returns true
> assertEquals(coder.structuralValue(p1), coder.structuralValue(p2)); // 
> FAIL
>   }
> }
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Resolved] (BEAM-6972) LTS backport: CassandraIO is broken because of use of bad relocation of guava

2020-06-10 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-6972?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles resolved BEAM-6972.
---
  Assignee: Kenneth Knowles
Resolution: Fixed

> LTS backport: CassandraIO is broken because of use of bad relocation of guava
> -
>
> Key: BEAM-6972
> URL: https://issues.apache.org/jira/browse/BEAM-6972
> Project: Beam
>  Issue Type: Bug
>  Components: io-java-cassandra
>Affects Versions: 2.5.0, 2.6.0, 2.7.0, 2.8.0, 2.9.0, 2.10.0, 2.11.0
>Reporter: Arun sethia
>Assignee: Kenneth Knowles
>Priority: P2
> Fix For: 2.7.1
>
>  Time Spent: 1h 40m
>  Remaining Estimate: 0h
>
> While using apache beam to run dataflow job to read data from BigQuery and 
> Store/Write to Cassandra with following libaries:
>  # beam-sdks-java-io-cassandra - 2.6.0
>  # beam-sdks-java-io-jdbc - 2.6.0
>  # beam-sdks-java-io-google-cloud-platform - 2.6.0
>  # beam-sdks-java-core - 2.6.0
>  # google-cloud-dataflow-java-sdk-all - 2.5.0
>  # google-api-client -1.25.0
>  
> I am getting following error at the time insert/save data to Cassandra.
> {code:java}
> [error] (run-main-0) org.apache.beam.sdk.Pipeline$PipelineExecutionException: 
> java.lang.NoSuchMethodError: 
> com.datastax.driver.mapping.Mapper.saveAsync(Ljava/lang/Object;)Lorg/apache/beam/repackaged/beam_sdks_java_io_cassandra/com/google/common/util/concurrent/ListenableFuture;
> org.apache.beam.sdk.Pipeline$PipelineExecutionException: 
> java.lang.NoSuchMethodError: 
> com.datastax.driver.mapping.Mapper.saveAsync(Ljava/lang/Object;)Lorg/apache/beam/repackaged/beam_sdks_java_io_cassandra/com/google/common/util/concurrent/ListenableFuture;
>  at 
> org.apache.beam.runners.direct.DirectRunner$DirectPipelineResult.waitUntilFinish(DirectRunner.java:332)
>  at 
> org.apache.beam.runners.direct.DirectRunner$DirectPipelineResult.waitUntilFinish(DirectRunner.java:302)
>  at org.apache.beam.runners.direct.DirectRunner.run(DirectRunner.java:197)
>  at org.apache.beam.runners.direct.DirectRunner.run(DirectRunner.java:64)
>  at org.apache.beam.sdk.Pipeline.run(Pipeline.java:313)
>  at org.apache.beam.sdk.Pipeline.run(Pipeline.java:299){code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Closed] (BEAM-3197) Jenkins asfgit is commenting "SUCCESS\n\n--none--" and "FAILURE\n\n--none--" again

2020-06-10 Thread Kenneth Knowles (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-3197?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kenneth Knowles closed BEAM-3197.
-
Fix Version/s: Not applicable
   Resolution: Fixed

> Jenkins asfgit is commenting "SUCCESS\n\n--none--" and "FAILURE\n\n--none--" 
> again
> --
>
> Key: BEAM-3197
> URL: https://issues.apache.org/jira/browse/BEAM-3197
> Project: Beam
>  Issue Type: Bug
>  Components: build-system
>Reporter: Kenneth Knowles
>Priority: P2
>  Labels: stale-P2
> Fix For: Not applicable
>
>
> Example: https://github.com/apache/beam/pull/4087
> If I recall, this is because of a parent config getting merged in a 
> particular way with our own config.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-3194) Support annotating that a DoFn requires stable / deterministic input for replay/retry

2020-06-10 Thread Kenneth Knowles (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-3194?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132771#comment-17132771
 ] 

Kenneth Knowles commented on BEAM-3194:
---

[~robinyqiu] This Jira is "done" in the sense that the annotation exists, 
right? Perhaps the name is just not quite right. The annotation exists, but 
various runners have varying levels of support.

> Support annotating that a DoFn requires stable / deterministic input for 
> replay/retry
> -
>
> Key: BEAM-3194
> URL: https://issues.apache.org/jira/browse/BEAM-3194
> Project: Beam
>  Issue Type: New Feature
>  Components: beam-model, sdk-java-core
>Reporter: Kenneth Knowles
>Priority: P2
>  Labels: stale-P2
>  Time Spent: 3h 10m
>  Remaining Estimate: 0h
>
> See the thread: 
> https://lists.apache.org/thread.html/5fd81ce371aeaf642665348f8e6940e308e04275dd7072f380f9f945@%3Cdev.beam.apache.org%3E
> We need this in order to have truly cross-runner end-to-end exactly once via 
> replay + idempotence.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-3168) Capability matrix contains runners that aren't in the left nav or on master

2020-06-10 Thread Kenneth Knowles (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-3168?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132770#comment-17132770
 ] 

Kenneth Knowles commented on BEAM-3168:
---

[~tysonjh] Some potential low-hanging fruit as you remove Gearpump and Apex: 
the runners that are still on branches really shouldn't be in the public 
capability matrix. Not sure if this is just a couple more lines to delete or if 
it is a rabbit hole.

> Capability matrix contains runners that aren't in the left nav or on master
> ---
>
> Key: BEAM-3168
> URL: https://issues.apache.org/jira/browse/BEAM-3168
> Project: Beam
>  Issue Type: Bug
>  Components: website
>Reporter: Kenneth Knowles
>Priority: P2
>  Labels: stale-P2
>
> Currently the runners on feature branches are in the capability matrix, but 
> not linked to their summary pages or anything. Now we also have IBM Streams 
> that could use a link so users can find what the column is about.
> I think we can link to either Ongoing Projects or external pages for these.
> We may want a left nav item "other runners" for those that we don't want to 
> highlight as part of what the Beam project builds and ships, so we can 
> actually put them in context a bit. This might tie in to BEAM-2888.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9679) Core Transforms | Go SDK Code Katas

2020-06-10 Thread Damon Douglas (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9679?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Damon Douglas updated BEAM-9679:

Description: 
A kata devoted to core beam transforms patterns after 
[https://github.com/apache/beam/tree/master/learning/katas/java/Core%20Transforms]
 where the take away is an individual's ability to master the following using 
an Apache Beam pipeline using the Golang SDK.

 
||Transform||Pull Request||Status||
|Map|[11564|https://github.com/apache/beam/pull/11564]|Closed|
|GroupByKey|[11734|https://github.com/apache/beam/pull/11734]|Closed|
|CoGroupByKey|[11803|https://github.com/apache/beam/pull/11803]|Closed|
|Combine Simple 
Function|[11866|https://github.com/apache/beam/pull/11866]|Closed|
|CombineFn|[11883|https://github.com/apache/beam/pull/11883]|Closed|
|CombinePerKey|[11936|https://github.com/apache/beam/pull/11936]|Closed|
|Flatten|[11806|https://github.com/apache/beam/pull/11806]|Closed|
|Partition|[11979|https://github.com/apache/beam/pull/11979]|Open|
|Side Input| | |
|Side Output| | |
|Branching| | |
|Composite Transform| | |
|DoFn Additional Parameters| | |

  was:
A kata devoted to core beam transforms patterns after 
[https://github.com/apache/beam/tree/master/learning/katas/java/Core%20Transforms]
 where the take away is an individual's ability to master the following using 
an Apache Beam pipeline using the Golang SDK.

 
||Transform||Pull Request||Status||
|Map|[11564|https://github.com/apache/beam/pull/11564]|Closed|
|GroupByKey|[11734|https://github.com/apache/beam/pull/11734]|Closed|
|CoGroupByKey|[11803|https://github.com/apache/beam/pull/11803]|Closed|
|Combine Simple 
Function|[11866|https://github.com/apache/beam/pull/11866]|Closed|
|CombineFn|[11883|https://github.com/apache/beam/pull/11883]|Closed|
|CombinePerKey|[11936|https://github.com/apache/beam/pull/11936]|Closed|
|Flatten|[11806|https://github.com/apache/beam/pull/11806]|Closed|
|Partition| | |
|Side Input| | |
|Side Output| | |
|Branching| | |
|Composite Transform| | |
|DoFn Additional Parameters| | |


> Core Transforms | Go SDK Code Katas
> ---
>
> Key: BEAM-9679
> URL: https://issues.apache.org/jira/browse/BEAM-9679
> Project: Beam
>  Issue Type: Sub-task
>  Components: katas, sdk-go
>Reporter: Damon Douglas
>Assignee: Damon Douglas
>Priority: P2
>  Time Spent: 10.5h
>  Remaining Estimate: 0h
>
> A kata devoted to core beam transforms patterns after 
> [https://github.com/apache/beam/tree/master/learning/katas/java/Core%20Transforms]
>  where the take away is an individual's ability to master the following using 
> an Apache Beam pipeline using the Golang SDK.
>  
> ||Transform||Pull Request||Status||
> |Map|[11564|https://github.com/apache/beam/pull/11564]|Closed|
> |GroupByKey|[11734|https://github.com/apache/beam/pull/11734]|Closed|
> |CoGroupByKey|[11803|https://github.com/apache/beam/pull/11803]|Closed|
> |Combine Simple 
> Function|[11866|https://github.com/apache/beam/pull/11866]|Closed|
> |CombineFn|[11883|https://github.com/apache/beam/pull/11883]|Closed|
> |CombinePerKey|[11936|https://github.com/apache/beam/pull/11936]|Closed|
> |Flatten|[11806|https://github.com/apache/beam/pull/11806]|Closed|
> |Partition|[11979|https://github.com/apache/beam/pull/11979]|Open|
> |Side Input| | |
> |Side Output| | |
> |Branching| | |
> |Composite Transform| | |
> |DoFn Additional Parameters| | |



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9679) Core Transforms | Go SDK Code Katas

2020-06-10 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9679?focusedWorklogId=444037&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-444037
 ]

ASF GitHub Bot logged work on BEAM-9679:


Author: ASF GitHub Bot
Created on: 10/Jun/20 22:40
Start Date: 10/Jun/20 22:40
Worklog Time Spent: 10m 
  Work Description: damondouglas opened a new pull request #11979:
URL: https://github.com/apache/beam/pull/11979


   This pull requests adds a Partition lesson to the Go SDK katas.  I would 
like to request the following reviewers:
   
   (R: @lostluck )
   (R: @henryken )
   
   If accepted by both reviewers, please wait until the [Stepik 
course](https://stepik.org/course/70387) is updated before finally merging this 
PR.
   
   
   
   Thank you for your contribution! Follow this checklist to help us 
incorporate your contribution quickly and easily:
   
- [x] [**Choose 
reviewer(s)**](https://beam.apache.org/contribute/#make-your-change) and 
mention them in a comment (`R: @username`).
- [x] Format the pull request title like `[BEAM-XXX] Fixes bug in 
ApproximateQuantiles`, where you replace `BEAM-XXX` with the appropriate JIRA 
issue, if applicable. This will automatically link the pull request to the 
issue.
- [ ] Update `CHANGES.md` with noteworthy changes.
- [ ] If this contribution is large, please file an Apache [Individual 
Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).
   
   See the [Contributor Guide](https://beam.apache.org/contribute) for more 
tips on [how to make review process 
smoother](https://beam.apache.org/contribute/#make-reviewers-job-easier).
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Samza | Spark
   --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow_Java11/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink_Java11/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/)[![Build
 
Status](https://

[jira] [Commented] (BEAM-6556) Consider infer:eradicate for null analysis

2020-06-10 Thread Kenneth Knowles (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-6556?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17132769#comment-17132769
 ] 

Kenneth Knowles commented on BEAM-6556:
---

I did consider it, but I think my top priority for nullness is checker 
framework.

> Consider infer:eradicate for null analysis
> --
>
> Key: BEAM-6556
> URL: https://issues.apache.org/jira/browse/BEAM-6556
> Project: Beam
>  Issue Type: Test
>  Components: build-system
>Reporter: Kenneth Knowles
>Priority: P2
> Fix For: Not applicable
>
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


  1   2   3   4   5   6   7   8   9   10   >