[jira] [Commented] (BEAM-9144) Beam's own Avro TimeConversion class in beam-sdk-java-core

2020-01-17 Thread Aaron Dixon (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17018480#comment-17018480
 ] 

Aaron Dixon commented on BEAM-9144:
---

Perfect. Thanks [~suztomo], will report back

> Beam's own Avro TimeConversion class in beam-sdk-java-core 
> ---
>
> Key: BEAM-9144
> URL: https://issues.apache.org/jira/browse/BEAM-9144
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Reporter: Tomo Suzuki
>Assignee: Tomo Suzuki
>Priority: Major
> Fix For: 2.19.0
>
> Attachments: avro-beam-dependency-graph.png
>
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> From Aaron's comment in 
> https://issues.apache.org/jira/browse/BEAM-8388?focusedCommentId=17016476=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17016476
>  .
> {quote}My org must use Avro 1.9.x (due to some Avro schema resolution issues 
> resolved in 1.9.x) so downgrading Avro is not possible for us.
>  Beam 2.16.0 is compatible with our usage of Avro 1.9.x – but upgrading to 
> 2.17.0 we are broken as 2.17.0 links to Java classes in Avro 1.8.x that are 
> not available in 1.9.x.
> {quote}
> The Java class is 
> {{org.apache.avro.data.TimeConversions.TimestampConversion}} in Avro 1.8.
>  It's renamed to {{org.apache.avro.data.JodaTimeConversions}} in Avro 1.9.
> h1. Beam Java SDK cannot upgrade Avro to 1.9
> Beam has Spark runners and Spark has not yet upgraded to Avro 1.9.
> Illustration of the dependency
> !avro-beam-dependency-graph.png|width=799,height=385!
> h1. Short-term Solution
> As illustrated above, as long as Beam Java SDK uses only the intersection of 
> Avro classes, method, and fields between Avro 1.8 and 1.9, it will provide 
> flexibility in runtime Avro versions (as it did until Beam 2.16).
> h2. Difference of the TimeConversion Classes
> Avro 1.9's TimestampConversion overrides {{getRecommendedSchema}} method. 
> Details below:
> Avro 1.8's TimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
>   }
> {code}
> Avro 1.9's JodaTimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
> @Override
> public Schema getRecommendedSchema() {
>   return 
> LogicalTypes.timestampMillis().addToSchema(Schema.create(Schema.Type.LONG));
> }
>   }
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9144) Beam's own Avro TimeConversion class in beam-sdk-java-core

2020-01-17 Thread Tomo Suzuki (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17018471#comment-17018471
 ] 

Tomo Suzuki commented on BEAM-9144:
---

Yes, new artifact will be at 
https://repository.apache.org/content/repositories/snapshots/org/apache/beam/beam-sdks-java-core/2.20.0-SNAPSHOT/
 .

> Beam's own Avro TimeConversion class in beam-sdk-java-core 
> ---
>
> Key: BEAM-9144
> URL: https://issues.apache.org/jira/browse/BEAM-9144
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Reporter: Tomo Suzuki
>Assignee: Tomo Suzuki
>Priority: Major
> Fix For: 2.19.0
>
> Attachments: avro-beam-dependency-graph.png
>
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> From Aaron's comment in 
> https://issues.apache.org/jira/browse/BEAM-8388?focusedCommentId=17016476=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17016476
>  .
> {quote}My org must use Avro 1.9.x (due to some Avro schema resolution issues 
> resolved in 1.9.x) so downgrading Avro is not possible for us.
>  Beam 2.16.0 is compatible with our usage of Avro 1.9.x – but upgrading to 
> 2.17.0 we are broken as 2.17.0 links to Java classes in Avro 1.8.x that are 
> not available in 1.9.x.
> {quote}
> The Java class is 
> {{org.apache.avro.data.TimeConversions.TimestampConversion}} in Avro 1.8.
>  It's renamed to {{org.apache.avro.data.JodaTimeConversions}} in Avro 1.9.
> h1. Beam Java SDK cannot upgrade Avro to 1.9
> Beam has Spark runners and Spark has not yet upgraded to Avro 1.9.
> Illustration of the dependency
> !avro-beam-dependency-graph.png|width=799,height=385!
> h1. Short-term Solution
> As illustrated above, as long as Beam Java SDK uses only the intersection of 
> Avro classes, method, and fields between Avro 1.8 and 1.9, it will provide 
> flexibility in runtime Avro versions (as it did until Beam 2.16).
> h2. Difference of the TimeConversion Classes
> Avro 1.9's TimestampConversion overrides {{getRecommendedSchema}} method. 
> Details below:
> Avro 1.8's TimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
>   }
> {code}
> Avro 1.9's JodaTimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
> @Override
> public Schema getRecommendedSchema() {
>   return 
> LogicalTypes.timestampMillis().addToSchema(Schema.create(Schema.Type.LONG));
> }
>   }
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7861) Make it easy to change between multi-process and multi-thread mode for Python Direct runners

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7861?focusedWorklogId=373993=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373993
 ]

ASF GitHub Bot logged work on BEAM-7861:


Author: ASF GitHub Bot
Created on: 18/Jan/20 02:04
Start Date: 18/Jan/20 02:04
Worklog Time Spent: 10m 
  Work Description: tvalentyn commented on issue #10616: [BEAM-7861] update 
documentation about --direct_running_mode option with direct runner.
URL: https://github.com/apache/beam/pull/10616#issuecomment-575855799
 
 
   Run Website_Stage_GCS PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373993)
Time Spent: 4h 20m  (was: 4h 10m)

> Make it easy to change between multi-process and multi-thread mode for Python 
> Direct runners
> 
>
> Key: BEAM-7861
> URL: https://issues.apache.org/jira/browse/BEAM-7861
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 4h 20m
>  Remaining Estimate: 0h
>
> BEAM-3645 makes it possible to run a map task parallel.
> However, users need to change runner when switch between multithreading and 
> multiprocessing mode.
> We want to add a flag (ex: --use-multiprocess) to make the switch easy 
> without changing the runner each time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7861) Make it easy to change between multi-process and multi-thread mode for Python Direct runners

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7861?focusedWorklogId=373992=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373992
 ]

ASF GitHub Bot logged work on BEAM-7861:


Author: ASF GitHub Bot
Created on: 18/Jan/20 02:04
Start Date: 18/Jan/20 02:04
Worklog Time Spent: 10m 
  Work Description: tvalentyn commented on issue #10616: [BEAM-7861] update 
documentation about --direct_running_mode option with direct runner.
URL: https://github.com/apache/beam/pull/10616#issuecomment-575855782
 
 
   Run Website_Stage_GCS PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373992)
Time Spent: 4h 10m  (was: 4h)

> Make it easy to change between multi-process and multi-thread mode for Python 
> Direct runners
> 
>
> Key: BEAM-7861
> URL: https://issues.apache.org/jira/browse/BEAM-7861
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 4h 10m
>  Remaining Estimate: 0h
>
> BEAM-3645 makes it possible to run a map task parallel.
> However, users need to change runner when switch between multithreading and 
> multiprocessing mode.
> We want to add a flag (ex: --use-multiprocess) to make the switch easy 
> without changing the runner each time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7861) Make it easy to change between multi-process and multi-thread mode for Python Direct runners

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7861?focusedWorklogId=373994=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373994
 ]

ASF GitHub Bot logged work on BEAM-7861:


Author: ASF GitHub Bot
Created on: 18/Jan/20 02:04
Start Date: 18/Jan/20 02:04
Worklog Time Spent: 10m 
  Work Description: tvalentyn commented on issue #10616: [BEAM-7861] update 
documentation about --direct_running_mode option with direct runner.
URL: https://github.com/apache/beam/pull/10616#issuecomment-575855842
 
 
   Run Website PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373994)
Time Spent: 4.5h  (was: 4h 20m)

> Make it easy to change between multi-process and multi-thread mode for Python 
> Direct runners
> 
>
> Key: BEAM-7861
> URL: https://issues.apache.org/jira/browse/BEAM-7861
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 4.5h
>  Remaining Estimate: 0h
>
> BEAM-3645 makes it possible to run a map task parallel.
> However, users need to change runner when switch between multithreading and 
> multiprocessing mode.
> We want to add a flag (ex: --use-multiprocess) to make the switch easy 
> without changing the runner each time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9137) beam_PostCommit_Py_ValCont should run with dataflow_worker_jar

2020-01-17 Thread Ahmet Altay (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ahmet Altay updated BEAM-9137:
--
Parent: BEAM-8193
Issue Type: Sub-task  (was: Bug)

> beam_PostCommit_Py_ValCont should run with dataflow_worker_jar
> --
>
> Key: BEAM-9137
> URL: https://issues.apache.org/jira/browse/BEAM-9137
> Project: Beam
>  Issue Type: Sub-task
>  Components: testing
>Reporter: Boyuan Zhang
>Assignee: Valentyn Tymofieiev
>Priority: Major
>
> For the first failure, please refer to 
> https://builds.apache.org/job/beam_PostCommit_Py_ValCont/5172/#showFailuresLink



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-9137) beam_PostCommit_Py_ValCont should run with dataflow_worker_jar

2020-01-17 Thread Ahmet Altay (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Ahmet Altay reassigned BEAM-9137:
-

Assignee: Valentyn Tymofieiev

> beam_PostCommit_Py_ValCont should run with dataflow_worker_jar
> --
>
> Key: BEAM-9137
> URL: https://issues.apache.org/jira/browse/BEAM-9137
> Project: Beam
>  Issue Type: Bug
>  Components: testing
>Reporter: Boyuan Zhang
>Assignee: Valentyn Tymofieiev
>Priority: Major
>
> For the first failure, please refer to 
> https://builds.apache.org/job/beam_PostCommit_Py_ValCont/5172/#showFailuresLink



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8626) Implement status api handler in python sdk harness

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8626?focusedWorklogId=373989=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373989
 ]

ASF GitHub Bot logged work on BEAM-8626:


Author: ASF GitHub Bot
Created on: 18/Jan/20 01:08
Start Date: 18/Jan/20 01:08
Worklog Time Spent: 10m 
  Work Description: angoenka commented on pull request #10598: [BEAM-8626] 
Implement status fn api handler in python sdk
URL: https://github.com/apache/beam/pull/10598#discussion_r368193392
 
 

 ##
 File path: sdks/python/apache_beam/runners/worker/worker_status.py
 ##
 @@ -0,0 +1,139 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributor license agreements.  See the NOTICE file distributed with
+# this work for additional information regarding copyright ownership.
+# The ASF licenses this file to You under the Apache License, Version 2.0
+# (the "License"); you may not use this file except in compliance with
+# the License.  You may obtain a copy of the License at
+#
+#http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+#
+"""Worker status api handler for reporting SDK harness debug info."""
+
+from __future__ import absolute_import
+from __future__ import division
+
+import queue
+import sys
+import threading
+import traceback
+from collections import defaultdict
+
+import grpc
+
+from apache_beam.portability.api import beam_fn_api_pb2
+from apache_beam.portability.api import beam_fn_api_pb2_grpc
+from apache_beam.runners.worker.channel_factory import GRPCChannelFactory
+from apache_beam.runners.worker.worker_id_interceptor import 
WorkerIdInterceptor
+
+
+def thread_dump():
 
 Review comment:
   I agree, we can get rid of status http server.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373989)
Time Spent: 2h  (was: 1h 50m)

> Implement status api handler in python sdk harness
> --
>
> Key: BEAM-8626
> URL: https://issues.apache.org/jira/browse/BEAM-8626
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-harness
>Reporter: Yichi Zhang
>Assignee: Yichi Zhang
>Priority: Major
>  Time Spent: 2h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8626) Implement status api handler in python sdk harness

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8626?focusedWorklogId=373990=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373990
 ]

ASF GitHub Bot logged work on BEAM-8626:


Author: ASF GitHub Bot
Created on: 18/Jan/20 01:08
Start Date: 18/Jan/20 01:08
Worklog Time Spent: 10m 
  Work Description: angoenka commented on pull request #10598: [BEAM-8626] 
Implement status fn api handler in python sdk
URL: https://github.com/apache/beam/pull/10598#discussion_r368193443
 
 

 ##
 File path: sdks/python/apache_beam/runners/worker/worker_status.py
 ##
 @@ -0,0 +1,139 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributor license agreements.  See the NOTICE file distributed with
+# this work for additional information regarding copyright ownership.
+# The ASF licenses this file to You under the Apache License, Version 2.0
+# (the "License"); you may not use this file except in compliance with
+# the License.  You may obtain a copy of the License at
+#
+#http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+#
+"""Worker status api handler for reporting SDK harness debug info."""
+
+from __future__ import absolute_import
+from __future__ import division
+
+import queue
+import sys
+import threading
+import traceback
+from collections import defaultdict
+
+import grpc
+
+from apache_beam.portability.api import beam_fn_api_pb2
+from apache_beam.portability.api import beam_fn_api_pb2_grpc
+from apache_beam.runners.worker.channel_factory import GRPCChannelFactory
+from apache_beam.runners.worker.worker_id_interceptor import 
WorkerIdInterceptor
+
+
+def thread_dump():
+  # deduplicate threads with same stack trace
+  stack_traces = defaultdict(list)
+  frames = sys._current_frames()  # pylint: disable=protected-access
+
+  for t in threading.enumerate():
+stack_trace = ''.join(traceback.format_stack(frames[t.ident]))
+thread_ident_name = (t.ident, t.name)
+stack_traces[stack_trace].append(thread_ident_name)
+
+  all_traces = ['=' * 10 + 'THREAD DUMP' + '=' * 10]
+  for stack, identity in stack_traces.items():
+ident, name = identity[0]
+trace = '--- Thread #%s name: %s %s---\n' % (
+ident, name, 'and other %d threads' %
+(len(identity) - 1) if len(identity) > 1 else '')
+if len(identity) > 1:
+  trace += 'threads: %s\n' % identity
+trace += stack
+all_traces.append(trace)
+  all_traces.append('=' * 30)
+  return '\n'.join(x.encode('utf-8') for x in all_traces)
+
+
+def active_processing_bundles_state(bundle_process_cache):
+  active_bundles = ['=' * 10 + 'ACTIVE PROCESSING BUNDLES' + '=' * 10]
+  if not bundle_process_cache.active_bundle_processors:
+active_bundles.append("No active processing bundles.")
+  else:
+cache = []
+for instruction in list(
+bundle_process_cache.active_bundle_processors.keys()):
+  processor = bundle_process_cache.lookup(instruction)
+  if processor:
+info = processor.state_sampler.get_info()
+cache.append((instruction,
+  processor.process_bundle_descriptor.id,
+  info.tracked_thread, info.time_since_transition))
+# reverse sort active bundle by time since last transition, keep top 10.
+cache.sort(key=lambda x: x[-1], reverse=True)
+for s in cache[:10]:
+  state = '--- instruction %s ---\n' % s[0]
+  state += 'ProcessBundleDescriptorId: %s\n' % s[1]
+  state += "tracked thread: %s\n" % s[2]
+  state += "time since transition: %.2f seconds\n" % (s[3] / 1e9)
+  active_bundles.append(state)
+
+  active_bundles.append('=' * 30)
+  return '\n'.join(x.encode('utf-8') for x in active_bundles)
+
+
+DONE = object()
+
+
+class FnApiWorkerStatusHandler(object):
+  def __init__(self, status_address, bundle_process_cache=None):
+self._alive = True
+self._bundle_process_cache = bundle_process_cache
+ch = GRPCChannelFactory.insecure_channel(status_address)
+grpc.channel_ready_future(ch).result(timeout=60)
+self._status_channel = grpc.intercept_channel(ch, WorkerIdInterceptor())
+self._status_stub = beam_fn_api_pb2_grpc.BeamFnWorkerStatusStub(
+self._status_channel)
+self._responses = queue.Queue()
+self._server = threading.Thread(target=lambda: self._serve(),
+name='fn_api_status_handler')
+self._server.daemon = True
+self._server.start()
+
+  def _get_responses(self):
+while True:
+  response = self._responses.get()
+  if response is DONE:
+self._alive = 

[jira] [Updated] (BEAM-9137) beam_PostCommit_Py_ValCont should run with dataflow_worker_jar

2020-01-17 Thread Boyuan Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Boyuan Zhang updated BEAM-9137:
---
Component/s: (was: test-failures)
 testing

> beam_PostCommit_Py_ValCont should run with dataflow_worker_jar
> --
>
> Key: BEAM-9137
> URL: https://issues.apache.org/jira/browse/BEAM-9137
> Project: Beam
>  Issue Type: Bug
>  Components: testing
>Reporter: Boyuan Zhang
>Priority: Major
>
> For the first failure, please refer to 
> https://builds.apache.org/job/beam_PostCommit_Py_ValCont/5172/#showFailuresLink



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9137) beam_PostCommit_Py_ValCont should run with dataflow_worker_jar

2020-01-17 Thread Boyuan Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Boyuan Zhang updated BEAM-9137:
---
Affects Version/s: (was: 2.19.0)

> beam_PostCommit_Py_ValCont should run with dataflow_worker_jar
> --
>
> Key: BEAM-9137
> URL: https://issues.apache.org/jira/browse/BEAM-9137
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Reporter: Boyuan Zhang
>Priority: Major
>
> For the first failure, please refer to 
> https://builds.apache.org/job/beam_PostCommit_Py_ValCont/5172/#showFailuresLink



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9137) beam_PostCommit_Py_ValCont should run with dataflow_worker_jar

2020-01-17 Thread Boyuan Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Boyuan Zhang updated BEAM-9137:
---
Priority: Major  (was: Blocker)

> beam_PostCommit_Py_ValCont should run with dataflow_worker_jar
> --
>
> Key: BEAM-9137
> URL: https://issues.apache.org/jira/browse/BEAM-9137
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Affects Versions: 2.19.0
>Reporter: Boyuan Zhang
>Priority: Major
> Fix For: 2.19.0
>
>
> For the first failure, please refer to 
> https://builds.apache.org/job/beam_PostCommit_Py_ValCont/5172/#showFailuresLink



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9137) beam_PostCommit_Py_ValCont should run with dataflow_worker_jar

2020-01-17 Thread Boyuan Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Boyuan Zhang updated BEAM-9137:
---
Fix Version/s: (was: 2.19.0)

> beam_PostCommit_Py_ValCont should run with dataflow_worker_jar
> --
>
> Key: BEAM-9137
> URL: https://issues.apache.org/jira/browse/BEAM-9137
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Affects Versions: 2.19.0
>Reporter: Boyuan Zhang
>Priority: Major
>
> For the first failure, please refer to 
> https://builds.apache.org/job/beam_PostCommit_Py_ValCont/5172/#showFailuresLink



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8684) Beam Dependency Update Request: com.google.apis:google-api-services-bigquery

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8684?focusedWorklogId=373987=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373987
 ]

ASF GitHub Bot logged work on BEAM-8684:


Author: ASF GitHub Bot
Created on: 18/Jan/20 01:04
Start Date: 18/Jan/20 01:04
Worklog Time Spent: 10m 
  Work Description: suztomo commented on issue #10631: [BEAM-8684] Google 
proto and google-services library upgrades
URL: https://github.com/apache/beam/pull/10631#issuecomment-575849736
 
 
   Java HadoopFormatIO Performance Test failed. This is critical. I missed to 
check linkage errors of sdks:java:io:hadoop-format:
   
   ```
   19:58:37 > Task :sdks:java:io:hadoop-format:integrationTest FAILED
   19:58:37 
   19:58:37 org.apache.beam.sdk.io.hadoop.format.HadoopFormatIOIT > 
writeAndReadUsingHadoopFormat FAILED
   19:58:37 java.lang.NoSuchMethodError: 
com.google.cloud.ServiceOptions.getQuotaProjectId()Ljava/lang/String;
   ```
   
   I will investigate further next week.
   
   
   ——
   Run JavaPortabilityApi PreCommit failed:
   
   ```
   19:45:37 > Task 
:runners:google-cloud-dataflow-java:buildAndPushDockerContainer FAILED
   19:45:37 Error response from daemon: No such image: 
apachebeam/java_sdk:2.20.0-SNAPSHOT
   19:45:37 
   19:45:37 > Task :runners:google-cloud-dataflow-java:cleanUpDockerImages 
FAILED
   19:45:37 Error: No such image: 
us.gcr.io/apache-beam-testing/java-postcommit-it/java:20200118004259
   ```
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373987)
Time Spent: 2.5h  (was: 2h 20m)

> Beam Dependency Update Request: com.google.apis:google-api-services-bigquery
> 
>
> Key: BEAM-8684
> URL: https://issues.apache.org/jira/browse/BEAM-8684
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Priority: Major
>  Time Spent: 2.5h
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:39:07.113511 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:04:31.460554 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:09:35.656962 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:08:45.501496 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:08:46.385699 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:04:15.902178 
> -
> Please consider upgrading the dependency 
> 

[jira] [Commented] (BEAM-9137) PR10338 breaks beam_PostCommit_Py_ValCont

2020-01-17 Thread Boyuan Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17018452#comment-17018452
 ] 

Boyuan Zhang commented on BEAM-9137:


I'll drop it from release blocker since it doesn't make sense for now.

> PR10338 breaks beam_PostCommit_Py_ValCont
> -
>
> Key: BEAM-9137
> URL: https://issues.apache.org/jira/browse/BEAM-9137
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Affects Versions: 2.19.0
>Reporter: Boyuan Zhang
>Priority: Blocker
> Fix For: 2.19.0
>
>
> For the first failure, please refer to 
> https://builds.apache.org/job/beam_PostCommit_Py_ValCont/5172/#showFailuresLink



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9137) beam_PostCommit_Py_ValCont should run with dataflow_worker_jar

2020-01-17 Thread Boyuan Zhang (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Boyuan Zhang updated BEAM-9137:
---
Summary: beam_PostCommit_Py_ValCont should run with dataflow_worker_jar  
(was: PR10338 breaks beam_PostCommit_Py_ValCont)

> beam_PostCommit_Py_ValCont should run with dataflow_worker_jar
> --
>
> Key: BEAM-9137
> URL: https://issues.apache.org/jira/browse/BEAM-9137
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Affects Versions: 2.19.0
>Reporter: Boyuan Zhang
>Priority: Blocker
> Fix For: 2.19.0
>
>
> For the first failure, please refer to 
> https://builds.apache.org/job/beam_PostCommit_Py_ValCont/5172/#showFailuresLink



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9137) PR10338 breaks beam_PostCommit_Py_ValCont

2020-01-17 Thread Boyuan Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17018451#comment-17018451
 ] 

Boyuan Zhang commented on BEAM-9137:


Turns out the root cause should be the python ValidationContainer tests are 
kind of mis-configured. The test is running with dataflow runner, but not 
provides a dataflow_worker_jar option, which forces dataflow service pulls a 
pre-built runner image. So the mismatch between sdk(built with latest code) and 
runner(maybe last release ver) will break this test.

> PR10338 breaks beam_PostCommit_Py_ValCont
> -
>
> Key: BEAM-9137
> URL: https://issues.apache.org/jira/browse/BEAM-9137
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Affects Versions: 2.19.0
>Reporter: Boyuan Zhang
>Priority: Blocker
> Fix For: 2.19.0
>
>
> For the first failure, please refer to 
> https://builds.apache.org/job/beam_PostCommit_Py_ValCont/5172/#showFailuresLink



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-7273) :beam-sdks-java-io-hadoop-format:test failing on release-2.13.0

2020-01-17 Thread Udi Meiri (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-7273?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17018438#comment-17018438
 ] 

Udi Meiri commented on BEAM-7273:
-

I got this as well. Could you explain why this was resolved as not a problem?

> :beam-sdks-java-io-hadoop-format:test failing on release-2.13.0
> ---
>
> Key: BEAM-7273
> URL: https://issues.apache.org/jira/browse/BEAM-7273
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Reporter: Ankur Goenka
>Priority: Major
> Fix For: Not applicable
>
>
> :beam-sdks-java-io-hadoop-format:test is crashing JVM on release-2.13.0 and 
> master.
>  
> Build scan [https://scans.gradle.com/s/5uhhirrcpgtjw/]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8684) Beam Dependency Update Request: com.google.apis:google-api-services-bigquery

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8684?focusedWorklogId=373984=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373984
 ]

ASF GitHub Bot logged work on BEAM-8684:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:42
Start Date: 18/Jan/20 00:42
Worklog Time Spent: 10m 
  Work Description: aaltay commented on issue #10631: [BEAM-8684] Google 
proto and google-services library upgrades
URL: https://github.com/apache/beam/pull/10631#issuecomment-575846752
 
 
   Run Spark ValidatesRunner
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373984)
Time Spent: 2h 10m  (was: 2h)

> Beam Dependency Update Request: com.google.apis:google-api-services-bigquery
> 
>
> Key: BEAM-8684
> URL: https://issues.apache.org/jira/browse/BEAM-8684
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Priority: Major
>  Time Spent: 2h 10m
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:39:07.113511 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:04:31.460554 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:09:35.656962 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:08:45.501496 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:08:46.385699 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:04:15.902178 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:07:51.708019 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-13 12:07:45.384750 
> -
> Please consider upgrading 

[jira] [Work logged] (BEAM-8684) Beam Dependency Update Request: com.google.apis:google-api-services-bigquery

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8684?focusedWorklogId=373985=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373985
 ]

ASF GitHub Bot logged work on BEAM-8684:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:42
Start Date: 18/Jan/20 00:42
Worklog Time Spent: 10m 
  Work Description: aaltay commented on issue #10631: [BEAM-8684] Google 
proto and google-services library upgrades
URL: https://github.com/apache/beam/pull/10631#issuecomment-575846775
 
 
   Run SQL Postcommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373985)
Time Spent: 2h 20m  (was: 2h 10m)

> Beam Dependency Update Request: com.google.apis:google-api-services-bigquery
> 
>
> Key: BEAM-8684
> URL: https://issues.apache.org/jira/browse/BEAM-8684
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Priority: Major
>  Time Spent: 2h 20m
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:39:07.113511 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:04:31.460554 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:09:35.656962 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:08:45.501496 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:08:46.385699 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:04:15.902178 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:07:51.708019 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-13 12:07:45.384750 
> -
> Please consider upgrading the 

[jira] [Work logged] (BEAM-7861) Make it easy to change between multi-process and multi-thread mode for Python Direct runners

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7861?focusedWorklogId=373978=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373978
 ]

ASF GitHub Bot logged work on BEAM-7861:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:41
Start Date: 18/Jan/20 00:41
Worklog Time Spent: 10m 
  Work Description: Hannah-Jiang commented on issue #10616: [BEAM-7861] 
update documentation about --direct_running_mode option with direct runner.
URL: https://github.com/apache/beam/pull/10616#issuecomment-575846603
 
 
   In addition, it's better to have a kind of `switch-version` class which is 
similar to `switch-language` class, so users can navigate different versions 
easily. Are there something like this?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373978)
Time Spent: 3h 50m  (was: 3h 40m)

> Make it easy to change between multi-process and multi-thread mode for Python 
> Direct runners
> 
>
> Key: BEAM-7861
> URL: https://issues.apache.org/jira/browse/BEAM-7861
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 3h 50m
>  Remaining Estimate: 0h
>
> BEAM-3645 makes it possible to run a map task parallel.
> However, users need to change runner when switch between multithreading and 
> multiprocessing mode.
> We want to add a flag (ex: --use-multiprocess) to make the switch easy 
> without changing the runner each time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7861) Make it easy to change between multi-process and multi-thread mode for Python Direct runners

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7861?focusedWorklogId=373980=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373980
 ]

ASF GitHub Bot logged work on BEAM-7861:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:41
Start Date: 18/Jan/20 00:41
Worklog Time Spent: 10m 
  Work Description: Hannah-Jiang commented on issue #10616: [BEAM-7861] 
update documentation about --direct_running_mode option with direct runner.
URL: https://github.com/apache/beam/pull/10616#issuecomment-575846663
 
 
   A fix was uploaded, PTAL.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373980)
Time Spent: 4h  (was: 3h 50m)

> Make it easy to change between multi-process and multi-thread mode for Python 
> Direct runners
> 
>
> Key: BEAM-7861
> URL: https://issues.apache.org/jira/browse/BEAM-7861
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 4h
>  Remaining Estimate: 0h
>
> BEAM-3645 makes it possible to run a map task parallel.
> However, users need to change runner when switch between multithreading and 
> multiprocessing mode.
> We want to add a flag (ex: --use-multiprocess) to make the switch easy 
> without changing the runner each time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8684) Beam Dependency Update Request: com.google.apis:google-api-services-bigquery

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8684?focusedWorklogId=373981=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373981
 ]

ASF GitHub Bot logged work on BEAM-8684:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:41
Start Date: 18/Jan/20 00:41
Worklog Time Spent: 10m 
  Work Description: aaltay commented on issue #10631: [BEAM-8684] Google 
proto and google-services library upgrades
URL: https://github.com/apache/beam/pull/10631#issuecomment-575846676
 
 
   Run Java HadoopFormatIO Performance Test
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373981)
Time Spent: 1h 40m  (was: 1.5h)

> Beam Dependency Update Request: com.google.apis:google-api-services-bigquery
> 
>
> Key: BEAM-8684
> URL: https://issues.apache.org/jira/browse/BEAM-8684
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Priority: Major
>  Time Spent: 1h 40m
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:39:07.113511 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:04:31.460554 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:09:35.656962 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:08:45.501496 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:08:46.385699 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:04:15.902178 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:07:51.708019 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-13 12:07:45.384750 
> -
> Please 

[jira] [Work logged] (BEAM-8684) Beam Dependency Update Request: com.google.apis:google-api-services-bigquery

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8684?focusedWorklogId=373983=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373983
 ]

ASF GitHub Bot logged work on BEAM-8684:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:41
Start Date: 18/Jan/20 00:41
Worklog Time Spent: 10m 
  Work Description: aaltay commented on issue #10631: [BEAM-8684] Google 
proto and google-services library upgrades
URL: https://github.com/apache/beam/pull/10631#issuecomment-575846737
 
 
   Run Dataflow ValidatesRunner
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373983)
Time Spent: 2h  (was: 1h 50m)

> Beam Dependency Update Request: com.google.apis:google-api-services-bigquery
> 
>
> Key: BEAM-8684
> URL: https://issues.apache.org/jira/browse/BEAM-8684
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Priority: Major
>  Time Spent: 2h
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:39:07.113511 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:04:31.460554 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:09:35.656962 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:08:45.501496 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:08:46.385699 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:04:15.902178 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:07:51.708019 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-13 12:07:45.384750 
> -
> Please consider upgrading 

[jira] [Work logged] (BEAM-8684) Beam Dependency Update Request: com.google.apis:google-api-services-bigquery

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8684?focusedWorklogId=373979=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373979
 ]

ASF GitHub Bot logged work on BEAM-8684:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:41
Start Date: 18/Jan/20 00:41
Worklog Time Spent: 10m 
  Work Description: aaltay commented on issue #10631: [BEAM-8684] Google 
proto and google-services library upgrades
URL: https://github.com/apache/beam/pull/10631#issuecomment-575846664
 
 
   Run Java PostCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373979)
Time Spent: 1.5h  (was: 1h 20m)

> Beam Dependency Update Request: com.google.apis:google-api-services-bigquery
> 
>
> Key: BEAM-8684
> URL: https://issues.apache.org/jira/browse/BEAM-8684
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Priority: Major
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:39:07.113511 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:04:31.460554 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:09:35.656962 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:08:45.501496 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:08:46.385699 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:04:15.902178 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:07:51.708019 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-13 12:07:45.384750 
> -
> Please consider upgrading the 

[jira] [Work logged] (BEAM-8684) Beam Dependency Update Request: com.google.apis:google-api-services-bigquery

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8684?focusedWorklogId=373982=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373982
 ]

ASF GitHub Bot logged work on BEAM-8684:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:41
Start Date: 18/Jan/20 00:41
Worklog Time Spent: 10m 
  Work Description: aaltay commented on issue #10631: [BEAM-8684] Google 
proto and google-services library upgrades
URL: https://github.com/apache/beam/pull/10631#issuecomment-575846711
 
 
   Run BigQueryIO Streaming Performance Test Java
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373982)
Time Spent: 1h 50m  (was: 1h 40m)

> Beam Dependency Update Request: com.google.apis:google-api-services-bigquery
> 
>
> Key: BEAM-8684
> URL: https://issues.apache.org/jira/browse/BEAM-8684
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Priority: Major
>  Time Spent: 1h 50m
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:39:07.113511 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:04:31.460554 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:09:35.656962 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:08:45.501496 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:08:46.385699 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:04:15.902178 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:07:51.708019 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-13 12:07:45.384750 
> -
> 

[jira] [Work logged] (BEAM-9151) Dataflow legacy worker tests are mis-configured

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9151?focusedWorklogId=373977=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373977
 ]

ASF GitHub Bot logged work on BEAM-9151:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:39
Start Date: 18/Jan/20 00:39
Worklog Time Spent: 10m 
  Work Description: boyuanzz commented on issue #10635: [BEAM-9151] Fix 
misconfigured legacy dataflow tests.
URL: https://github.com/apache/beam/pull/10635#issuecomment-575846344
 
 
   Run Java_Examples_Dataflow PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373977)
Time Spent: 20m  (was: 10m)

> Dataflow legacy worker tests are mis-configured
> ---
>
> Key: BEAM-9151
> URL: https://issues.apache.org/jira/browse/BEAM-9151
> Project: Beam
>  Issue Type: Bug
>  Components: testing
>Reporter: Boyuan Zhang
>Assignee: Boyuan Zhang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
> Please refer to the last comment of https://github.com/apache/beam/pull/8183



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7861) Make it easy to change between multi-process and multi-thread mode for Python Direct runners

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7861?focusedWorklogId=373976=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373976
 ]

ASF GitHub Bot logged work on BEAM-7861:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:38
Start Date: 18/Jan/20 00:38
Worklog Time Spent: 10m 
  Work Description: Hannah-Jiang commented on issue #10616: [BEAM-7861] 
update documentation about --direct_running_mode option with direct runner.
URL: https://github.com/apache/beam/pull/10616#issuecomment-575846174
 
 
   The `Execution Mode` section should be visible only for Python SDK. I tried 
to hide it with Java SDK, but didn't find a good way to do it while keeping 
HTML styles. Are there any recommended way? 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373976)
Time Spent: 3h 40m  (was: 3.5h)

> Make it easy to change between multi-process and multi-thread mode for Python 
> Direct runners
> 
>
> Key: BEAM-7861
> URL: https://issues.apache.org/jira/browse/BEAM-7861
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 3h 40m
>  Remaining Estimate: 0h
>
> BEAM-3645 makes it possible to run a map task parallel.
> However, users need to change runner when switch between multithreading and 
> multiprocessing mode.
> We want to add a flag (ex: --use-multiprocess) to make the switch easy 
> without changing the runner each time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8933) BigQuery IO should support read/write in Arrow format

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8933?focusedWorklogId=373975=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373975
 ]

ASF GitHub Bot logged work on BEAM-8933:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:34
Start Date: 18/Jan/20 00:34
Worklog Time Spent: 10m 
  Work Description: 11moon11 commented on issue #10369: [BEAM-8933] 
BigQueryIO Arrow for read
URL: https://github.com/apache/beam/pull/10369#issuecomment-57683
 
 
   Run Java PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373975)
Time Spent: 8.5h  (was: 8h 20m)

> BigQuery IO should support read/write in Arrow format
> -
>
> Key: BEAM-8933
> URL: https://issues.apache.org/jira/browse/BEAM-8933
> Project: Beam
>  Issue Type: Improvement
>  Components: io-java-gcp
>Reporter: Kirill Kozlov
>Assignee: Kirill Kozlov
>Priority: Major
>  Time Spent: 8.5h
>  Remaining Estimate: 0h
>
> As of right now BigQuery uses Avro format for reading and writing.
> We should add a config to BigQueryIO to specify which format to use: Arrow or 
> Avro (with Avro as default).



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8684) Beam Dependency Update Request: com.google.apis:google-api-services-bigquery

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8684?focusedWorklogId=373974=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373974
 ]

ASF GitHub Bot logged work on BEAM-8684:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:34
Start Date: 18/Jan/20 00:34
Worklog Time Spent: 10m 
  Work Description: suztomo commented on issue #10631: [BEAM-8684] Google 
proto and google-services library upgrades
URL: https://github.com/apache/beam/pull/10631#issuecomment-575845625
 
 
   @aaltay Your email client might hide the following 6 commands in my email 
(my Gmail did). Would you run them?
   
   Run Java PostCommit
   Run Java HadoopFormatIO Performance Test
   Run BigQueryIO Streaming Performance Test Java
   Run Dataflow ValidatesRunner
   Run Spark ValidatesRunner
   Run SQL Postcommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373974)
Time Spent: 1h 20m  (was: 1h 10m)

> Beam Dependency Update Request: com.google.apis:google-api-services-bigquery
> 
>
> Key: BEAM-8684
> URL: https://issues.apache.org/jira/browse/BEAM-8684
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Priority: Major
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:39:07.113511 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:04:31.460554 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:09:35.656962 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:08:45.501496 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:08:46.385699 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:04:15.902178 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:07:51.708019 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: 

[jira] [Work logged] (BEAM-8684) Beam Dependency Update Request: com.google.apis:google-api-services-bigquery

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8684?focusedWorklogId=373973=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373973
 ]

ASF GitHub Bot logged work on BEAM-8684:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:31
Start Date: 18/Jan/20 00:31
Worklog Time Spent: 10m 
  Work Description: aaltay commented on issue #10631: [BEAM-8684] Google 
proto and google-services library upgrades
URL: https://github.com/apache/beam/pull/10631#issuecomment-575845062
 
 
   Run JavaPortabilityApi PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373973)
Time Spent: 1h 10m  (was: 1h)

> Beam Dependency Update Request: com.google.apis:google-api-services-bigquery
> 
>
> Key: BEAM-8684
> URL: https://issues.apache.org/jira/browse/BEAM-8684
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Priority: Major
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:39:07.113511 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:04:31.460554 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:09:35.656962 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:08:45.501496 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:08:46.385699 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:04:15.902178 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:07:51.708019 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-13 12:07:45.384750 
> -
> Please consider 

[jira] [Work logged] (BEAM-6936) Add a Jenkins job running Java examples on Java 11 Dataflow

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-6936?focusedWorklogId=373966=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373966
 ]

ASF GitHub Bot logged work on BEAM-6936:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:15
Start Date: 18/Jan/20 00:15
Worklog Time Spent: 10m 
  Work Description: boyuanzz commented on pull request #8183: [BEAM-6936] 
Added Jenkins jobs running Java examples on Dataflow with Java 11
URL: https://github.com/apache/beam/pull/8183#discussion_r368185902
 
 

 ##
 File path: runners/google-cloud-dataflow-java/examples/build.gradle
 ##
 @@ -42,57 +42,59 @@ def dockerImageName = 
project(':beam-runners-google-cloud-dataflow-java').ext.do
 // If -PuseExecutableStage is set, the use_executable_stage_bundle_execution 
wil be enabled.
 def fnapiExperiments = project.hasProperty('useExecutableStage') ? 
'beam_fn_api,use_executable_stage_bundle_execution' : "beam_fn_api"
 
+def commonConfig = { dataflowWorkerJar, workerHarnessContainerImage = '', 
additionalOptions = [] ->
+   // return the preevaluated configuration closure
+   return {
+   testClassesDirs = 
files(project(":beam-examples-java").sourceSets.test.output.classesDirs)
+   include "**/WordCountIT.class"
+   include "**/WindowedWordCountIT.class"
+   forkEvery 1
+   maxParallelForks 4
+   
+   def preCommitBeamTestPipelineOptions = [
+   "--project=${gcpProject}",
+   "--tempRoot=${gcsTempRoot}",
+   "--runner=TestDataflowRunner",
+   "--dataflowWorkerJar=${dataflowWorkerJar}",
+   workerHarnessContainerImage.isEmpty() 
?'':"--workerHarnessContainerImage=${workerHarnessContainerImage}"
 
 Review comment:
   Fix here: https://github.com/apache/beam/pull/10635
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373966)
Time Spent: 7h 20m  (was: 7h 10m)

> Add a Jenkins job running Java examples on Java 11 Dataflow
> ---
>
> Key: BEAM-6936
> URL: https://issues.apache.org/jira/browse/BEAM-6936
> Project: Beam
>  Issue Type: Sub-task
>  Components: examples-java, testing
>Reporter: Michal Walenia
>Assignee: Michal Walenia
>Priority: Minor
> Fix For: 2.14.0
>
>  Time Spent: 7h 20m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=373965=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373965
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:15
Start Date: 18/Jan/20 00:15
Worklog Time Spent: 10m 
  Work Description: udim commented on pull request #10594: [BEAM-7746] Stop 
automatically creating staticmethods in register_urn
URL: https://github.com/apache/beam/pull/10594#discussion_r368185763
 
 

 ##
 File path: sdks/python/apache_beam/coders/coders.py
 ##
 @@ -1014,6 +1015,7 @@ def to_runner_api_parameter(self, context):
 else:
   return super(TupleCoder, self).to_runner_api_parameter(context)
 
+  @staticmethod
   @Coder.register_urn(common_urns.coders.KV.urn, None)
   def from_runner_api_parameter(unused_payload, components, unused_context):
 return TupleCoder(components)
 
 Review comment:
   Should `ParamWindowedValueCoder.from_runner_api_parameter` also get the 
`@staticmethod` decorator?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373965)
Time Spent: 53h 50m  (was: 53h 40m)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 53h 50m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9072) [SQL] Add support for Datastore source

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9072?focusedWorklogId=373964=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373964
 ]

ASF GitHub Bot logged work on BEAM-9072:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:11
Start Date: 18/Jan/20 00:11
Worklog Time Spent: 10m 
  Work Description: apilloud commented on issue #10440: [BEAM-9072] [SQL] 
DataStoreV1 IO connector
URL: https://github.com/apache/beam/pull/10440#issuecomment-575842055
 
 
   Run JavaPortabilityApi PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373964)
Time Spent: 4h 50m  (was: 4h 40m)

> [SQL] Add support for Datastore source
> --
>
> Key: BEAM-9072
> URL: https://issues.apache.org/jira/browse/BEAM-9072
> Project: Beam
>  Issue Type: New Feature
>  Components: dsl-sql
>Reporter: Kirill Kozlov
>Assignee: Kirill Kozlov
>Priority: Major
>  Time Spent: 4h 50m
>  Remaining Estimate: 0h
>
> * Create a Datastore table and table provider
>  * Conversion between Datastore and Beam data types
>  * Implement buildIOReader
>  * Implement buildIOWrite
>  * Implement getTableStatistics
> Doc: 
> [https://docs.google.com/document/d/1FxuEGewJ3GPDl0IKglfOYf1edwa2m_wryFZYRMpRNbA/edit?pli=1]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9061) New release of pyhamcrest==1.10.0 breaks portable Python precommits.

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9061?focusedWorklogId=373963=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373963
 ]

ASF GitHub Bot logged work on BEAM-9061:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:11
Start Date: 18/Jan/20 00:11
Worklog Time Spent: 10m 
  Work Description: tvalentyn commented on issue #10568: [BEAM-9061] Add 
version guards to requirements file for integration tests.
URL: https://github.com/apache/beam/pull/10568#issuecomment-575842037
 
 
   1.10.1 does, 2.0.0 does not.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373963)
Time Spent: 3h 10m  (was: 3h)

> New release of pyhamcrest==1.10.0 breaks portable Python precommits.
> 
>
> Key: BEAM-9061
> URL: https://issues.apache.org/jira/browse/BEAM-9061
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Reporter: Valentyn Tymofieiev
>Assignee: Valentyn Tymofieiev
>Priority: Major
> Fix For: Not applicable
>
>  Time Spent: 3h 10m
>  Remaining Estimate: 0h
>
> {noformat}
> 08:35:26   File "apache_beam/runners/portability/fn_api_runner_test.py", line 
> 38, in 
> 08:35:26 import hamcrest  # pylint: disable=ungrouped-imports
> 08:35:26   File 
> "/home/jenkins/jenkins-slave/workspace/beam_PreCommit_Python2_PVR_Flink_Commit/src/build/gradleenv/1866363813/local/lib/python2.7/site-packages/hamcrest/__init__.py",
>  line 2, in 
> 08:35:26 from hamcrest.library import *
> 08:35:26   File 
> "/home/jenkins/jenkins-slave/workspace/beam_PreCommit_Python2_PVR_Flink_Commit/src/build/gradleenv/1866363813/local/lib/python2.7/site-packages/hamcrest/library/__init__.py",
>  line 7, in 
> 08:35:26 from hamcrest.library.object import *
> 08:35:26   File 
> "/home/jenkins/jenkins-slave/workspace/beam_PreCommit_Python2_PVR_Flink_Commit/src/build/gradleenv/1866363813/local/lib/python2.7/site-packages/hamcrest/library/object/__init__.py",
>  line 4, in 
> 08:35:26 from .hasproperty import has_properties, has_property
> 08:35:26   File 
> "/home/jenkins/jenkins-slave/workspace/beam_PreCommit_Python2_PVR_Flink_Commit/src/build/gradleenv/1866363813/local/lib/python2.7/site-packages/hamcrest/library/object/hasproperty.py",
>  line 174
> 08:35:26 ),
> 08:35:26 ^
> {noformat}
> rootcause: https://github.com/hamcrest/PyHamcrest/issues/131



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9061) New release of pyhamcrest==1.10.0 breaks portable Python precommits.

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9061?focusedWorklogId=373961=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373961
 ]

ASF GitHub Bot logged work on BEAM-9061:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:10
Start Date: 18/Jan/20 00:10
Worklog Time Spent: 10m 
  Work Description: aaltay commented on issue #10568: [BEAM-9061] Add 
version guards to requirements file for integration tests.
URL: https://github.com/apache/beam/pull/10568#issuecomment-575841800
 
 
   Does pyhamcrest > 1.10.0 work with python 2?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373961)
Time Spent: 3h  (was: 2h 50m)

> New release of pyhamcrest==1.10.0 breaks portable Python precommits.
> 
>
> Key: BEAM-9061
> URL: https://issues.apache.org/jira/browse/BEAM-9061
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Reporter: Valentyn Tymofieiev
>Assignee: Valentyn Tymofieiev
>Priority: Major
> Fix For: Not applicable
>
>  Time Spent: 3h
>  Remaining Estimate: 0h
>
> {noformat}
> 08:35:26   File "apache_beam/runners/portability/fn_api_runner_test.py", line 
> 38, in 
> 08:35:26 import hamcrest  # pylint: disable=ungrouped-imports
> 08:35:26   File 
> "/home/jenkins/jenkins-slave/workspace/beam_PreCommit_Python2_PVR_Flink_Commit/src/build/gradleenv/1866363813/local/lib/python2.7/site-packages/hamcrest/__init__.py",
>  line 2, in 
> 08:35:26 from hamcrest.library import *
> 08:35:26   File 
> "/home/jenkins/jenkins-slave/workspace/beam_PreCommit_Python2_PVR_Flink_Commit/src/build/gradleenv/1866363813/local/lib/python2.7/site-packages/hamcrest/library/__init__.py",
>  line 7, in 
> 08:35:26 from hamcrest.library.object import *
> 08:35:26   File 
> "/home/jenkins/jenkins-slave/workspace/beam_PreCommit_Python2_PVR_Flink_Commit/src/build/gradleenv/1866363813/local/lib/python2.7/site-packages/hamcrest/library/object/__init__.py",
>  line 4, in 
> 08:35:26 from .hasproperty import has_properties, has_property
> 08:35:26   File 
> "/home/jenkins/jenkins-slave/workspace/beam_PreCommit_Python2_PVR_Flink_Commit/src/build/gradleenv/1866363813/local/lib/python2.7/site-packages/hamcrest/library/object/hasproperty.py",
>  line 174
> 08:35:26 ),
> 08:35:26 ^
> {noformat}
> rootcause: https://github.com/hamcrest/PyHamcrest/issues/131



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9151) Dataflow legacy worker tests are mis-configured

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9151?focusedWorklogId=373962=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373962
 ]

ASF GitHub Bot logged work on BEAM-9151:


Author: ASF GitHub Bot
Created on: 18/Jan/20 00:10
Start Date: 18/Jan/20 00:10
Worklog Time Spent: 10m 
  Work Description: boyuanzz commented on pull request #10635: [BEAM-9151] 
Fix misconfigured legacy dataflow tests.
URL: https://github.com/apache/beam/pull/10635
 
 
   R: @lukecwik 
   When `dataflowWorkerJar` is provided, the `workerHarnessContainerImage` 
should be set to empty explicitly, otherwise, the dataflow service will ignore 
worker jar and still pick harness container
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373962)
Remaining Estimate: 0h
Time Spent: 10m

> Dataflow legacy worker tests are mis-configured
> ---
>
> Key: BEAM-9151
> URL: https://issues.apache.org/jira/browse/BEAM-9151
> Project: Beam
>  Issue Type: Bug
>  Components: testing
>Reporter: Boyuan Zhang
>Assignee: Boyuan Zhang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 10m
>  Remaining Estimate: 0h
>
> Please refer to the last comment of https://github.com/apache/beam/pull/8183



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-9151) Dataflow legacy worker tests are mis-configured

2020-01-17 Thread Boyuan Zhang (Jira)
Boyuan Zhang created BEAM-9151:
--

 Summary: Dataflow legacy worker tests are mis-configured
 Key: BEAM-9151
 URL: https://issues.apache.org/jira/browse/BEAM-9151
 Project: Beam
  Issue Type: Bug
  Components: testing
Reporter: Boyuan Zhang
Assignee: Boyuan Zhang
 Fix For: 2.19.0


Please refer to the last comment of https://github.com/apache/beam/pull/8183



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9061) New release of pyhamcrest==1.10.0 breaks portable Python precommits.

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9061?focusedWorklogId=373957=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373957
 ]

ASF GitHub Bot logged work on BEAM-9061:


Author: ASF GitHub Bot
Created on: 17/Jan/20 23:56
Start Date: 17/Jan/20 23:56
Worklog Time Spent: 10m 
  Work Description: tvalentyn commented on issue #10619: [BEAM-9061] Add 
version guards to requirements file for integration tests.
URL: https://github.com/apache/beam/pull/10619#issuecomment-575839089
 
 
   Precommits are failing because containers for DF workers are not created 
yet, so a couple of integration tests that we have, fail.
   Given that this commit is a cherry-pick from master and only affects test 
dependencies, this should be safe to merge. 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373957)
Time Spent: 2h 50m  (was: 2h 40m)

> New release of pyhamcrest==1.10.0 breaks portable Python precommits.
> 
>
> Key: BEAM-9061
> URL: https://issues.apache.org/jira/browse/BEAM-9061
> Project: Beam
>  Issue Type: Bug
>  Components: test-failures
>Reporter: Valentyn Tymofieiev
>Assignee: Valentyn Tymofieiev
>Priority: Major
> Fix For: Not applicable
>
>  Time Spent: 2h 50m
>  Remaining Estimate: 0h
>
> {noformat}
> 08:35:26   File "apache_beam/runners/portability/fn_api_runner_test.py", line 
> 38, in 
> 08:35:26 import hamcrest  # pylint: disable=ungrouped-imports
> 08:35:26   File 
> "/home/jenkins/jenkins-slave/workspace/beam_PreCommit_Python2_PVR_Flink_Commit/src/build/gradleenv/1866363813/local/lib/python2.7/site-packages/hamcrest/__init__.py",
>  line 2, in 
> 08:35:26 from hamcrest.library import *
> 08:35:26   File 
> "/home/jenkins/jenkins-slave/workspace/beam_PreCommit_Python2_PVR_Flink_Commit/src/build/gradleenv/1866363813/local/lib/python2.7/site-packages/hamcrest/library/__init__.py",
>  line 7, in 
> 08:35:26 from hamcrest.library.object import *
> 08:35:26   File 
> "/home/jenkins/jenkins-slave/workspace/beam_PreCommit_Python2_PVR_Flink_Commit/src/build/gradleenv/1866363813/local/lib/python2.7/site-packages/hamcrest/library/object/__init__.py",
>  line 4, in 
> 08:35:26 from .hasproperty import has_properties, has_property
> 08:35:26   File 
> "/home/jenkins/jenkins-slave/workspace/beam_PreCommit_Python2_PVR_Flink_Commit/src/build/gradleenv/1866363813/local/lib/python2.7/site-packages/hamcrest/library/object/hasproperty.py",
>  line 174
> 08:35:26 ),
> 08:35:26 ^
> {noformat}
> rootcause: https://github.com/hamcrest/PyHamcrest/issues/131



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9072) [SQL] Add support for Datastore source

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9072?focusedWorklogId=373954=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373954
 ]

ASF GitHub Bot logged work on BEAM-9072:


Author: ASF GitHub Bot
Created on: 17/Jan/20 23:54
Start Date: 17/Jan/20 23:54
Worklog Time Spent: 10m 
  Work Description: apilloud commented on issue #10440: [BEAM-9072] [SQL] 
DataStoreV1 IO connector
URL: https://github.com/apache/beam/pull/10440#issuecomment-575838697
 
 
   Run JavaPortabilityApi PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373954)
Time Spent: 4h 40m  (was: 4.5h)

> [SQL] Add support for Datastore source
> --
>
> Key: BEAM-9072
> URL: https://issues.apache.org/jira/browse/BEAM-9072
> Project: Beam
>  Issue Type: New Feature
>  Components: dsl-sql
>Reporter: Kirill Kozlov
>Assignee: Kirill Kozlov
>Priority: Major
>  Time Spent: 4h 40m
>  Remaining Estimate: 0h
>
> * Create a Datastore table and table provider
>  * Conversion between Datastore and Beam data types
>  * Implement buildIOReader
>  * Implement buildIOWrite
>  * Implement getTableStatistics
> Doc: 
> [https://docs.google.com/document/d/1FxuEGewJ3GPDl0IKglfOYf1edwa2m_wryFZYRMpRNbA/edit?pli=1]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7746) Add type hints to python code

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7746?focusedWorklogId=373953=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373953
 ]

ASF GitHub Bot logged work on BEAM-7746:


Author: ASF GitHub Bot
Created on: 17/Jan/20 23:53
Start Date: 17/Jan/20 23:53
Worklog Time Spent: 10m 
  Work Description: udim commented on pull request #10590: [BEAM-7746] Fix 
a typing issue where SourceBase was assumed to have a coder attribute
URL: https://github.com/apache/beam/pull/10590
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373953)
Time Spent: 53h 40m  (was: 53.5h)

> Add type hints to python code
> -
>
> Key: BEAM-7746
> URL: https://issues.apache.org/jira/browse/BEAM-7746
> Project: Beam
>  Issue Type: New Feature
>  Components: sdk-py-core
>Reporter: Chad Dombrova
>Assignee: Chad Dombrova
>Priority: Major
>  Time Spent: 53h 40m
>  Remaining Estimate: 0h
>
> As a developer of the beam source code, I would like the code to use pep484 
> type hints so that I can clearly see what types are required, get completion 
> in my IDE, and enforce code correctness via a static analyzer like mypy.
> This may be considered a precursor to BEAM-7060
> Work has been started here:  [https://github.com/apache/beam/pull/9056]
>  
>  



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9072) [SQL] Add support for Datastore source

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9072?focusedWorklogId=373952=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373952
 ]

ASF GitHub Bot logged work on BEAM-9072:


Author: ASF GitHub Bot
Created on: 17/Jan/20 23:49
Start Date: 17/Jan/20 23:49
Worklog Time Spent: 10m 
  Work Description: 11moon11 commented on issue #10440: [BEAM-9072] [SQL] 
DataStoreV1 IO connector
URL: https://github.com/apache/beam/pull/10440#issuecomment-575837709
 
 
   JavaPortabilityApi failed with:
   ```
   Execution failed for task 
':runners:google-cloud-dataflow-java:buildAndPushDockerContainer'.
   Process 'command 'docker'' finished with non-zero exit value 1
   ```
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373952)
Time Spent: 4.5h  (was: 4h 20m)

> [SQL] Add support for Datastore source
> --
>
> Key: BEAM-9072
> URL: https://issues.apache.org/jira/browse/BEAM-9072
> Project: Beam
>  Issue Type: New Feature
>  Components: dsl-sql
>Reporter: Kirill Kozlov
>Assignee: Kirill Kozlov
>Priority: Major
>  Time Spent: 4.5h
>  Remaining Estimate: 0h
>
> * Create a Datastore table and table provider
>  * Conversion between Datastore and Beam data types
>  * Implement buildIOReader
>  * Implement buildIOWrite
>  * Implement getTableStatistics
> Doc: 
> [https://docs.google.com/document/d/1FxuEGewJ3GPDl0IKglfOYf1edwa2m_wryFZYRMpRNbA/edit?pli=1]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8684) Beam Dependency Update Request: com.google.apis:google-api-services-bigquery

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8684?focusedWorklogId=373946=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373946
 ]

ASF GitHub Bot logged work on BEAM-8684:


Author: ASF GitHub Bot
Created on: 17/Jan/20 23:32
Start Date: 17/Jan/20 23:32
Worklog Time Spent: 10m 
  Work Description: suztomo commented on issue #10631: [BEAM-8684] Google 
proto and google-services library upgrades
URL: https://github.com/apache/beam/pull/10631#issuecomment-575834545
 
 
   Java Portability API precommit failed:
   ```
   17:51:37 Execution failed for task 
':runners:google-cloud-dataflow-java:buildAndPushDockerContainer'.
   17:51:37 > Process 'command 'docker'' finished with non-zero exit value 1
   ```
   
   This should be a transient error.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373946)
Time Spent: 1h  (was: 50m)

> Beam Dependency Update Request: com.google.apis:google-api-services-bigquery
> 
>
> Key: BEAM-8684
> URL: https://issues.apache.org/jira/browse/BEAM-8684
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Priority: Major
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:39:07.113511 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:04:31.460554 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:09:35.656962 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:08:45.501496 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:08:46.385699 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:04:15.902178 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:07:51.708019 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency 

[jira] [Commented] (BEAM-9144) Beam's own Avro TimeConversion class in beam-sdk-java-core

2020-01-17 Thread Aaron Dixon (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17018400#comment-17018400
 ] 

Aaron Dixon commented on BEAM-9144:
---

[~iemejia] Thanks, I'd like to help test. I'm relatively new to Beam and very 
new to its dev/build etc. Will it be sufficient for me to pull 2.20.0-SNAPSHOTS 
from apache snapshots repo 
([https://repository.apache.org/content/repositories/snapshots/])? And I assume 
I'll need to wait til tomorrow so that this work is incorporated in a nightly?

If so, this weekend I can run my Dataflow pipeline against 2.20.0-SNAPHOTS from 
apache/snapshots repo and report back that it is happy. FYI/fwiw, my specific 
Beam dependencies are these, so I'll be testing against these 2.20.0-SNAPSHOTS:
{code:java}
[org.apache.beam/beam-sdks-java-core]
[org.apache.beam/beam-sdks-java-io-kafka]
[org.apache.beam/beam-runners-google-cloud-dataflow-java]{code}

> Beam's own Avro TimeConversion class in beam-sdk-java-core 
> ---
>
> Key: BEAM-9144
> URL: https://issues.apache.org/jira/browse/BEAM-9144
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Reporter: Tomo Suzuki
>Assignee: Tomo Suzuki
>Priority: Major
> Fix For: 2.19.0
>
> Attachments: avro-beam-dependency-graph.png
>
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> From Aaron's comment in 
> https://issues.apache.org/jira/browse/BEAM-8388?focusedCommentId=17016476=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17016476
>  .
> {quote}My org must use Avro 1.9.x (due to some Avro schema resolution issues 
> resolved in 1.9.x) so downgrading Avro is not possible for us.
>  Beam 2.16.0 is compatible with our usage of Avro 1.9.x – but upgrading to 
> 2.17.0 we are broken as 2.17.0 links to Java classes in Avro 1.8.x that are 
> not available in 1.9.x.
> {quote}
> The Java class is 
> {{org.apache.avro.data.TimeConversions.TimestampConversion}} in Avro 1.8.
>  It's renamed to {{org.apache.avro.data.JodaTimeConversions}} in Avro 1.9.
> h1. Beam Java SDK cannot upgrade Avro to 1.9
> Beam has Spark runners and Spark has not yet upgraded to Avro 1.9.
> Illustration of the dependency
> !avro-beam-dependency-graph.png|width=799,height=385!
> h1. Short-term Solution
> As illustrated above, as long as Beam Java SDK uses only the intersection of 
> Avro classes, method, and fields between Avro 1.8 and 1.9, it will provide 
> flexibility in runtime Avro versions (as it did until Beam 2.16).
> h2. Difference of the TimeConversion Classes
> Avro 1.9's TimestampConversion overrides {{getRecommendedSchema}} method. 
> Details below:
> Avro 1.8's TimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
>   }
> {code}
> Avro 1.9's JodaTimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
> @Override
> public Schema getRecommendedSchema() {
>   return 
> LogicalTypes.timestampMillis().addToSchema(Schema.create(Schema.Type.LONG));
> }
>   }
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9072) [SQL] Add support for Datastore source

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9072?focusedWorklogId=373942=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373942
 ]

ASF GitHub Bot logged work on BEAM-9072:


Author: ASF GitHub Bot
Created on: 17/Jan/20 23:26
Start Date: 17/Jan/20 23:26
Worklog Time Spent: 10m 
  Work Description: apilloud commented on issue #10440: [BEAM-9072] [SQL] 
DataStoreV1 IO connector
URL: https://github.com/apache/beam/pull/10440#issuecomment-575833263
 
 
   retest this please
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373942)
Time Spent: 4h 20m  (was: 4h 10m)

> [SQL] Add support for Datastore source
> --
>
> Key: BEAM-9072
> URL: https://issues.apache.org/jira/browse/BEAM-9072
> Project: Beam
>  Issue Type: New Feature
>  Components: dsl-sql
>Reporter: Kirill Kozlov
>Assignee: Kirill Kozlov
>Priority: Major
>  Time Spent: 4h 20m
>  Remaining Estimate: 0h
>
> * Create a Datastore table and table provider
>  * Conversion between Datastore and Beam data types
>  * Implement buildIOReader
>  * Implement buildIOWrite
>  * Implement getTableStatistics
> Doc: 
> [https://docs.google.com/document/d/1FxuEGewJ3GPDl0IKglfOYf1edwa2m_wryFZYRMpRNbA/edit?pli=1]



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7961) Add tests for all runner native transforms and some widely used composite transforms to cross-language validates runner test suite

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7961?focusedWorklogId=373939=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373939
 ]

ASF GitHub Bot logged work on BEAM-7961:


Author: ASF GitHub Bot
Created on: 17/Jan/20 23:21
Start Date: 17/Jan/20 23:21
Worklog Time Spent: 10m 
  Work Description: ihji commented on pull request #10051: [BEAM-7961] Add 
tests for all runner native transforms for XLang
URL: https://github.com/apache/beam/pull/10051#discussion_r368175531
 
 

 ##
 File path: 
.test-infra/jenkins/job_PostCommit_CrossLanguageValidatesRunner_Flink.groovy
 ##
 @@ -34,10 +34,19 @@ 
PostcommitJobBuilder.postCommitJob('beam_PostCommit_XVR_Flink',
 
   // Gradle goals for this job.
   steps {
+shell('echo *** RUN CROSS-LANGUAGE FLINK USING PYTHON 2.7 ***')
 
 Review comment:
   Followed the same convention in 
https://github.com/apache/beam/blob/master/.test-infra/jenkins/job_PostCommit_Java_Nexmark_Spark.groovy#L36
   Should we also need to modify those?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373939)
Time Spent: 15h 50m  (was: 15h 40m)

> Add tests for all runner native transforms and some widely used composite 
> transforms to cross-language validates runner test suite
> --
>
> Key: BEAM-7961
> URL: https://issues.apache.org/jira/browse/BEAM-7961
> Project: Beam
>  Issue Type: Improvement
>  Components: testing
>Reporter: Heejong Lee
>Assignee: Heejong Lee
>Priority: Major
>  Time Spent: 15h 50m
>  Remaining Estimate: 0h
>
> Add tests for all runner native transforms and some widely used composite 
> transforms to cross-language validates runner test suite



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7861) Make it easy to change between multi-process and multi-thread mode for Python Direct runners

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7861?focusedWorklogId=373938=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373938
 ]

ASF GitHub Bot logged work on BEAM-7861:


Author: ASF GitHub Bot
Created on: 17/Jan/20 23:20
Start Date: 17/Jan/20 23:20
Worklog Time Spent: 10m 
  Work Description: tvalentyn commented on issue #10616: [BEAM-7861] update 
documentation about --direct_running_mode option with direct runner.
URL: https://github.com/apache/beam/pull/10616#issuecomment-575831989
 
 
   cc: @soyrice 
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373938)
Time Spent: 3.5h  (was: 3h 20m)

> Make it easy to change between multi-process and multi-thread mode for Python 
> Direct runners
> 
>
> Key: BEAM-7861
> URL: https://issues.apache.org/jira/browse/BEAM-7861
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 3.5h
>  Remaining Estimate: 0h
>
> BEAM-3645 makes it possible to run a map task parallel.
> However, users need to change runner when switch between multithreading and 
> multiprocessing mode.
> We want to add a flag (ex: --use-multiprocess) to make the switch easy 
> without changing the runner each time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-5504) PubsubAvroTable

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-5504?focusedWorklogId=373936=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373936
 ]

ASF GitHub Bot logged work on BEAM-5504:


Author: ASF GitHub Bot
Created on: 17/Jan/20 23:16
Start Date: 17/Jan/20 23:16
Worklog Time Spent: 10m 
  Work Description: amaliujia commented on issue #10487: [BEAM-5504] 
Introduce PubsubAvroTable
URL: https://github.com/apache/beam/pull/10487#issuecomment-575831150
 
 
   Thanks @milantracy! I will take a look!
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373936)
Time Spent: 2h  (was: 1h 50m)

> PubsubAvroTable
> ---
>
> Key: BEAM-5504
> URL: https://issues.apache.org/jira/browse/BEAM-5504
> Project: Beam
>  Issue Type: New Feature
>  Components: dsl-sql
>Reporter: Rui Wang
>Assignee: Jing Chen
>Priority: Major
>  Time Spent: 2h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-6936) Add a Jenkins job running Java examples on Java 11 Dataflow

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-6936?focusedWorklogId=373935=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373935
 ]

ASF GitHub Bot logged work on BEAM-6936:


Author: ASF GitHub Bot
Created on: 17/Jan/20 23:16
Start Date: 17/Jan/20 23:16
Worklog Time Spent: 10m 
  Work Description: boyuanzz commented on pull request #8183: [BEAM-6936] 
Added Jenkins jobs running Java examples on Dataflow with Java 11
URL: https://github.com/apache/beam/pull/8183#discussion_r368174524
 
 

 ##
 File path: runners/google-cloud-dataflow-java/examples/build.gradle
 ##
 @@ -42,57 +42,59 @@ def dockerImageName = 
project(':beam-runners-google-cloud-dataflow-java').ext.do
 // If -PuseExecutableStage is set, the use_executable_stage_bundle_execution 
wil be enabled.
 def fnapiExperiments = project.hasProperty('useExecutableStage') ? 
'beam_fn_api,use_executable_stage_bundle_execution' : "beam_fn_api"
 
+def commonConfig = { dataflowWorkerJar, workerHarnessContainerImage = '', 
additionalOptions = [] ->
+   // return the preevaluated configuration closure
+   return {
+   testClassesDirs = 
files(project(":beam-examples-java").sourceSets.test.output.classesDirs)
+   include "**/WordCountIT.class"
+   include "**/WindowedWordCountIT.class"
+   forkEvery 1
+   maxParallelForks 4
+   
+   def preCommitBeamTestPipelineOptions = [
+   "--project=${gcpProject}",
+   "--tempRoot=${gcsTempRoot}",
+   "--runner=TestDataflowRunner",
+   "--dataflowWorkerJar=${dataflowWorkerJar}",
+   workerHarnessContainerImage.isEmpty() 
?'':"--workerHarnessContainerImage=${workerHarnessContainerImage}"
 
 Review comment:
   When `dataflowWorkerJar` is provided, the `workerHarnessContainerImage` 
should be set to empty explicitly, otherwise, the dataflow service will still 
try to pull harness image rather than using custom-built worker jar,
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373935)
Time Spent: 7h 10m  (was: 7h)

> Add a Jenkins job running Java examples on Java 11 Dataflow
> ---
>
> Key: BEAM-6936
> URL: https://issues.apache.org/jira/browse/BEAM-6936
> Project: Beam
>  Issue Type: Sub-task
>  Components: examples-java, testing
>Reporter: Michal Walenia
>Assignee: Michal Walenia
>Priority: Minor
> Fix For: 2.14.0
>
>  Time Spent: 7h 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Comment Edited] (BEAM-9138) beam_Release_Gradle_Build failure in Go

2020-01-17 Thread Udi Meiri (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17018395#comment-17018395
 ] 

Udi Meiri edited comment on BEAM-9138 at 1/17/20 11:15 PM:
---

Trying this temporary hack: commenting out Go projects in settings.gradle:
{code}
include ":sdks:go"  // is a dependency of other packages
//include ":sdks:go:container"
//include ":sdks:go:examples"  // goVet failed on this project
//include ":sdks:go:test"
{code}


was (Author: udim):
Trying this temporary hack: commenting out Go projects in settings.gradle:
{code}
include ":sdks:go"  // is a dependency of other packages
//include ":sdks:go:container"
//include ":sdks:go:examples"  // goVet failed on this project
//include ":sdks:go:test"
{codE}

> beam_Release_Gradle_Build failure in Go
> ---
>
> Key: BEAM-9138
> URL: https://issues.apache.org/jira/browse/BEAM-9138
> Project: Beam
>  Issue Type: Bug
>  Components: build-system, test-failures
>Reporter: Udi Meiri
>Priority: Major
>
> Phrase: "Run Release Gradle Build"
> Recent failures:
> https://builds.apache.org/job/beam_Release_Gradle_Build/24/
> https://builds.apache.org/job/beam_Release_Gradle_Build/25/
> Seems like all of these are Go-related, and it seems like the culprit might 
> be the failure in :sdks:go:resolveBuildDependencies (but I have very little 
> knowledge of these tasks).
> CC: [~markflyhigh] who wrote the job, [~lostluck] who knows Go SDK
> Please help in triaging and/or assigning to someone who can look at this.
> {code}
> 13:00:15  Execution failed for task ':beam-test-tools:goVet'.
> 13:00:15  > Build failed due to return code 1 of: 
> 13:00:15Command:
> 13:00:15 /usr/bin/go vet github.com/apache/beam/.test-infra/tools
> {code}
> {code}
> 13:00:15  Execution failed for task ':beam-test-tools:goTest'.
> 13:00:15  > There are 1 failed tests. Please see 
> /home/jenkins/jenkins-slave/workspace/beam_Release_Gradle_Build/src/.test-infra/tools/.gogradle/reports/test/index.html
>  for more details.
> {code}
> {code}
> 13:00:15  Execution failed for task ':beam-test-tools:buildLinuxAmd64'.
> 13:00:15  > Build failed due to return code 1 of: 
> 13:00:15Command:
> 13:00:15 /usr/bin/go build -o ./.gogradle/beam-test-tools-linux-amd64 
> github.com/apache/beam/.test-infra/tools
> {code}
> {code}
> 13:00:15  Execution failed for task ':sdks:go:resolveBuildDependencies'.
> 13:00:15  > Exception in resolution, message is:
> 13:00:15Cannot resolve dependency:google.golang.org/api: 
> commit='386d4e5f4f92f86e6aec85985761bba4b938a2d5', 
> urls=[https://code.googlesource.com/google-api-go-client]
> 13:00:15Resolution stack is:
> 13:00:15+- github.com/apache/beam/sdks/go
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9138) beam_Release_Gradle_Build failure in Go

2020-01-17 Thread Udi Meiri (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17018395#comment-17018395
 ] 

Udi Meiri commented on BEAM-9138:
-

Trying this temporary hack: commenting out Go projects in settings.gradle:
{code}
include ":sdks:go"  // is a dependency of other packages
//include ":sdks:go:container"
//include ":sdks:go:examples"  // goVet failed on this project
//include ":sdks:go:test"
{codE}

> beam_Release_Gradle_Build failure in Go
> ---
>
> Key: BEAM-9138
> URL: https://issues.apache.org/jira/browse/BEAM-9138
> Project: Beam
>  Issue Type: Bug
>  Components: build-system, test-failures
>Reporter: Udi Meiri
>Priority: Major
>
> Phrase: "Run Release Gradle Build"
> Recent failures:
> https://builds.apache.org/job/beam_Release_Gradle_Build/24/
> https://builds.apache.org/job/beam_Release_Gradle_Build/25/
> Seems like all of these are Go-related, and it seems like the culprit might 
> be the failure in :sdks:go:resolveBuildDependencies (but I have very little 
> knowledge of these tasks).
> CC: [~markflyhigh] who wrote the job, [~lostluck] who knows Go SDK
> Please help in triaging and/or assigning to someone who can look at this.
> {code}
> 13:00:15  Execution failed for task ':beam-test-tools:goVet'.
> 13:00:15  > Build failed due to return code 1 of: 
> 13:00:15Command:
> 13:00:15 /usr/bin/go vet github.com/apache/beam/.test-infra/tools
> {code}
> {code}
> 13:00:15  Execution failed for task ':beam-test-tools:goTest'.
> 13:00:15  > There are 1 failed tests. Please see 
> /home/jenkins/jenkins-slave/workspace/beam_Release_Gradle_Build/src/.test-infra/tools/.gogradle/reports/test/index.html
>  for more details.
> {code}
> {code}
> 13:00:15  Execution failed for task ':beam-test-tools:buildLinuxAmd64'.
> 13:00:15  > Build failed due to return code 1 of: 
> 13:00:15Command:
> 13:00:15 /usr/bin/go build -o ./.gogradle/beam-test-tools-linux-amd64 
> github.com/apache/beam/.test-infra/tools
> {code}
> {code}
> 13:00:15  Execution failed for task ':sdks:go:resolveBuildDependencies'.
> 13:00:15  > Exception in resolution, message is:
> 13:00:15Cannot resolve dependency:google.golang.org/api: 
> commit='386d4e5f4f92f86e6aec85985761bba4b938a2d5', 
> urls=[https://code.googlesource.com/google-api-go-client]
> 13:00:15Resolution stack is:
> 13:00:15+- github.com/apache/beam/sdks/go
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9138) beam_Release_Gradle_Build failure in Go

2020-01-17 Thread Udi Meiri (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9138?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17018392#comment-17018392
 ] 

Udi Meiri commented on BEAM-9138:
-

The goVet failure is actually this previous one: 
https://issues.apache.org/jira/browse/BEAM-8992

> beam_Release_Gradle_Build failure in Go
> ---
>
> Key: BEAM-9138
> URL: https://issues.apache.org/jira/browse/BEAM-9138
> Project: Beam
>  Issue Type: Bug
>  Components: build-system, test-failures
>Reporter: Udi Meiri
>Priority: Major
>
> Phrase: "Run Release Gradle Build"
> Recent failures:
> https://builds.apache.org/job/beam_Release_Gradle_Build/24/
> https://builds.apache.org/job/beam_Release_Gradle_Build/25/
> Seems like all of these are Go-related, and it seems like the culprit might 
> be the failure in :sdks:go:resolveBuildDependencies (but I have very little 
> knowledge of these tasks).
> CC: [~markflyhigh] who wrote the job, [~lostluck] who knows Go SDK
> Please help in triaging and/or assigning to someone who can look at this.
> {code}
> 13:00:15  Execution failed for task ':beam-test-tools:goVet'.
> 13:00:15  > Build failed due to return code 1 of: 
> 13:00:15Command:
> 13:00:15 /usr/bin/go vet github.com/apache/beam/.test-infra/tools
> {code}
> {code}
> 13:00:15  Execution failed for task ':beam-test-tools:goTest'.
> 13:00:15  > There are 1 failed tests. Please see 
> /home/jenkins/jenkins-slave/workspace/beam_Release_Gradle_Build/src/.test-infra/tools/.gogradle/reports/test/index.html
>  for more details.
> {code}
> {code}
> 13:00:15  Execution failed for task ':beam-test-tools:buildLinuxAmd64'.
> 13:00:15  > Build failed due to return code 1 of: 
> 13:00:15Command:
> 13:00:15 /usr/bin/go build -o ./.gogradle/beam-test-tools-linux-amd64 
> github.com/apache/beam/.test-infra/tools
> {code}
> {code}
> 13:00:15  Execution failed for task ':sdks:go:resolveBuildDependencies'.
> 13:00:15  > Exception in resolution, message is:
> 13:00:15Cannot resolve dependency:google.golang.org/api: 
> commit='386d4e5f4f92f86e6aec85985761bba4b938a2d5', 
> urls=[https://code.googlesource.com/google-api-go-client]
> 13:00:15Resolution stack is:
> 13:00:15+- github.com/apache/beam/sdks/go
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-9150) beam_PostRelease_Python_Candidate fails

2020-01-17 Thread Udi Meiri (Jira)
Udi Meiri created BEAM-9150:
---

 Summary: beam_PostRelease_Python_Candidate fails
 Key: BEAM-9150
 URL: https://issues.apache.org/jira/browse/BEAM-9150
 Project: Beam
  Issue Type: Bug
  Components: build-system
Reporter: Udi Meiri
Assignee: Yifan Zou


https://builds.apache.org/job/beam_PostRelease_Python_Candidate/171/timestamps/?time=HH:mm:ss=GMT-8=en_US

Some interesting lines:
{code}
17:48:36  
release/src/main/python-release/run_release_candidate_python_quickstart.sh: 
line 144: 20935 Killed  python -m 
apache_beam.examples.streaming_wordcount --input_topic 
projects/$PROJECT_ID/topics/$PUBSUB_TOPIC1 --output_topic 
projects/$PROJECT_ID/topics/$PUBSUB_TOPIC2 --streaming
{code}

{code}
18:05:25  

18:05:25  Running HourlyTeamScore example with DirectRunner
18:05:25  

18:05:25  
/tmp/tmp.gnpMNIFhcL/temp_virtualenv_python2.7/local/lib/python2.7/site-packages/apache_beam/__init__.py:84:
 UserWarning: You are using Apache Beam with Python 2. New releases of Apache 
Beam will soon support Python 3 only.
18:05:25'You are using Apache Beam with Python 2. '
18:05:33  INFO:root:Missing pipeline option (runner). Executing pipeline using 
the default runner: DirectRunner.
18:05:33  INFO:root:Setting socket default timeout to 60 seconds.
18:05:33  INFO:root:socket default timeout is 60.0 seconds.
18:05:33  INFO:oauth2client.transport:Attempting refresh to obtain initial 
access_token
18:05:34  
/tmp/tmp.gnpMNIFhcL/temp_virtualenv_python2.7/local/lib/python2.7/site-packages/apache_beam/io/gcp/bigquery.py:1220:
 BeamDeprecationWarning: options is deprecated since First stable release. 
References to .options will not be supported
18:05:34experiments = p.options.view_as(DebugOptions).experiments or []
18:05:37  INFO:root:  
18:05:37  INFO:root:  
18:05:37  INFO:root:  
18:05:37  INFO:root:  
18:05:37  INFO:root:  
18:05:37  INFO:root:  
18:05:37  INFO:root:  
18:05:37  INFO:root:  
18:05:37  INFO:root:  
18:05:37  INFO:root:  
18:05:37  INFO:root:  
18:05:37  INFO:root:  
18:05:37  INFO:root:Creating state cache with size 100
18:05:37  INFO:root:Created Worker handler 
 for environment urn: "beam:env:embedded_python:v1"
18:05:37  
18:05:37  INFO:root:Running 
((ref_AppliedPTransform_ReadInputText/Read/_SDFBoundedSourceWrapper/Impulse_5)+(ReadInputText/Read/_SDFBoundedSourceWrapper/ParDo(SDFBoundedSourceDoFn)/PairWithRestriction))+((ReadInputText/Read/_SDFBoundedSourceWrapper/ParDo(SDFBoundedSourceDoFn)/SplitAndSizeRestriction)+(ref_PCollection_PCollection_1_split/Write))
18:05:38  INFO:root:Running 
(ref_PCollection_PCollection_1_split/Read)+ReadInputText/Read/_SDFBoundedSourceWrapper/ParDo(SDFBoundedSourceDoFn)/Process)+((ref_AppliedPTransform_HourlyTeamScore/ParseGameEventFn_8)+((ref_AppliedPTransform_HourlyTeamScore/FilterStartTime_9)+((ref_AppliedPTransform_HourlyTeamScore/FilterEndTime_10)+((ref_AppliedPTransform_HourlyTeamScore/AddEventTimestamps_11)+((ref_AppliedPTransform_HourlyTeamScore/FixedWindowsTeam_12)+(ref_AppliedPTransform_HourlyTeamScore/ExtractAndSumScore/Map()_14)))+(HourlyTeamScore/ExtractAndSumScore/CombinePerKey(sum)/Precombine))+(HourlyTeamScore/ExtractAndSumScore/CombinePerKey(sum)/Group/Write))
18:05:39  INFO:root:Running 
((HourlyTeamScore/ExtractAndSumScore/CombinePerKey(sum)/Group/Read)+(HourlyTeamScore/ExtractAndSumScore/CombinePerKey(sum)/Merge))+((HourlyTeamScore/ExtractAndSumScore/CombinePerKey(sum)/ExtractOutputs)+((ref_AppliedPTransform_TeamScoresDict_22)+((ref_AppliedPTransform_WriteTeamScoreSums/ConvertToRow_24)+((ref_AppliedPTransform_WriteTeamScoreSums/WriteToBigQuery/_StreamToBigQuery/AppendDestination_27)+((ref_AppliedPTransform_WriteTeamScoreSums/WriteToBigQuery/_StreamToBigQuery/AddInsertIds_28)+((ref_AppliedPTransform_WriteTeamScoreSums/WriteToBigQuery/_StreamToBigQuery/CommitInsertIds/AddRandomKeys_30)+((ref_AppliedPTransform_WriteTeamScoreSums/WriteToBigQuery/_StreamToBigQuery/CommitInsertIds/ReshufflePerKey/Map(reify_timestamps)_32)+(WriteTeamScoreSums/WriteToBigQuery/_StreamToBigQuery/CommitInsertIds/ReshufflePerKey/GroupByKey/Write
18:05:39  INFO:root:Running 

[jira] [Work logged] (BEAM-5086) Beam Dependency Update Request: org.apache.kudu

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-5086?focusedWorklogId=373932=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373932
 ]

ASF GitHub Bot logged work on BEAM-5086:


Author: ASF GitHub Bot
Created on: 17/Jan/20 22:36
Start Date: 17/Jan/20 22:36
Worklog Time Spent: 10m 
  Work Description: lukecwik commented on issue #10561: [BEAM-5086] Update 
kudu-client to version 1.11.1
URL: https://github.com/apache/beam/pull/10561#issuecomment-575821994
 
 
   If the linkage issues are with the client, it would be good to explicitly 
resolve them since upgrades may cause issues even if no new errors are being 
reported since the new version may use different code paths.
   
   In the end its a judgement call always when errors are still being reported 
that can be done based upon the number of unit/integration/manual testing that 
has been performed vs how popular something is and how many people might be 
broken by the change.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373932)
Time Spent: 1h 20m  (was: 1h 10m)

> Beam Dependency Update Request: org.apache.kudu
> ---
>
> Key: BEAM-5086
> URL: https://issues.apache.org/jira/browse/BEAM-5086
> Project: Beam
>  Issue Type: Bug
>  Components: dependencies, io-java-kudu
>Reporter: Beam JIRA Bot
>Priority: Major
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> 2018-08-06 12:13:43.606551
> Please review and upgrade the org.apache.kudu to the latest version 
> None 
>  
> cc: 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8684) Beam Dependency Update Request: com.google.apis:google-api-services-bigquery

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8684?focusedWorklogId=373928=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373928
 ]

ASF GitHub Bot logged work on BEAM-8684:


Author: ASF GitHub Bot
Created on: 17/Jan/20 22:34
Start Date: 17/Jan/20 22:34
Worklog Time Spent: 10m 
  Work Description: suztomo commented on issue #10631: [BEAM-8684] Google 
proto and google-services library upgrades
URL: https://github.com/apache/beam/pull/10631#issuecomment-575821457
 
 
   R: @lukecwik 
   
   Ready for review. There's no difference in linkage checks.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373928)
Time Spent: 50m  (was: 40m)

> Beam Dependency Update Request: com.google.apis:google-api-services-bigquery
> 
>
> Key: BEAM-8684
> URL: https://issues.apache.org/jira/browse/BEAM-8684
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Priority: Major
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:39:07.113511 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:04:31.460554 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:09:35.656962 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:08:45.501496 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:08:46.385699 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:04:15.902178 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:07:51.708019 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-13 12:07:45.384750 
> 

[jira] [Work logged] (BEAM-8684) Beam Dependency Update Request: com.google.apis:google-api-services-bigquery

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8684?focusedWorklogId=373926=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373926
 ]

ASF GitHub Bot logged work on BEAM-8684:


Author: ASF GitHub Bot
Created on: 17/Jan/20 22:33
Start Date: 17/Jan/20 22:33
Worklog Time Spent: 10m 
  Work Description: suztomo commented on pull request #10631: [BEAM-8684] 
Google proto and google-services library upgrades
URL: https://github.com/apache/beam/pull/10631#discussion_r368163674
 
 

 ##
 File path: 
buildSrc/src/main/groovy/org/apache/beam/gradle/BeamModulePlugin.groovy
 ##
 @@ -365,11 +365,10 @@ class BeamModulePlugin implements Plugin {
 def cassandra_driver_version = "3.8.0"
 def classgraph_version = "4.8.56"
 def gax_version = "1.52.0"
-def generated_grpc_beta_version = "0.44.0"
 def generated_grpc_ga_version = "1.83.0"
-def generated_grpc_dc_beta_version = "0.27.0-alpha"
+def generated_grpc_dc_beta_version = "0.29.0-alpha"
 
 Review comment:
   This is not the latest but 0.30.0-alpha requires code change. Stopping at 
0.29.0-alpha for simplicity.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373926)
Time Spent: 40m  (was: 0.5h)

> Beam Dependency Update Request: com.google.apis:google-api-services-bigquery
> 
>
> Key: BEAM-8684
> URL: https://issues.apache.org/jira/browse/BEAM-8684
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Priority: Major
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:39:07.113511 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:04:31.460554 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:09:35.656962 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:08:45.501496 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:08:46.385699 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:04:15.902178 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 

[jira] [Commented] (BEAM-9144) Beam's own Avro TimeConversion class in beam-sdk-java-core

2020-01-17 Thread Jira


[ 
https://issues.apache.org/jira/browse/BEAM-9144?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17018367#comment-17018367
 ] 

Ismaël Mejía commented on BEAM-9144:


The fix was merged in master, so it would be great if you can test it with the 
most recent SNAPSHOT version 2.20.0-SNAPSHOT [~atdixon] The fix should be 
published tomorrow. I also cherry picked it hoping it gets into 2.19.0 since 
2.18.0 vote is almost done at this point.

> Beam's own Avro TimeConversion class in beam-sdk-java-core 
> ---
>
> Key: BEAM-9144
> URL: https://issues.apache.org/jira/browse/BEAM-9144
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Reporter: Tomo Suzuki
>Assignee: Tomo Suzuki
>Priority: Major
> Fix For: 2.19.0
>
> Attachments: avro-beam-dependency-graph.png
>
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> From Aaron's comment in 
> https://issues.apache.org/jira/browse/BEAM-8388?focusedCommentId=17016476=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17016476
>  .
> {quote}My org must use Avro 1.9.x (due to some Avro schema resolution issues 
> resolved in 1.9.x) so downgrading Avro is not possible for us.
>  Beam 2.16.0 is compatible with our usage of Avro 1.9.x – but upgrading to 
> 2.17.0 we are broken as 2.17.0 links to Java classes in Avro 1.8.x that are 
> not available in 1.9.x.
> {quote}
> The Java class is 
> {{org.apache.avro.data.TimeConversions.TimestampConversion}} in Avro 1.8.
>  It's renamed to {{org.apache.avro.data.JodaTimeConversions}} in Avro 1.9.
> h1. Beam Java SDK cannot upgrade Avro to 1.9
> Beam has Spark runners and Spark has not yet upgraded to Avro 1.9.
> Illustration of the dependency
> !avro-beam-dependency-graph.png|width=799,height=385!
> h1. Short-term Solution
> As illustrated above, as long as Beam Java SDK uses only the intersection of 
> Avro classes, method, and fields between Avro 1.8 and 1.9, it will provide 
> flexibility in runtime Avro versions (as it did until Beam 2.16).
> h2. Difference of the TimeConversion Classes
> Avro 1.9's TimestampConversion overrides {{getRecommendedSchema}} method. 
> Details below:
> Avro 1.8's TimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
>   }
> {code}
> Avro 1.9's JodaTimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
> @Override
> public Schema getRecommendedSchema() {
>   return 
> LogicalTypes.timestampMillis().addToSchema(Schema.create(Schema.Type.LONG));
> }
>   }
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9144) Beam's own Avro TimeConversion class in beam-sdk-java-core

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9144?focusedWorklogId=373925=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373925
 ]

ASF GitHub Bot logged work on BEAM-9144:


Author: ASF GitHub Bot
Created on: 17/Jan/20 22:29
Start Date: 17/Jan/20 22:29
Worklog Time Spent: 10m 
  Work Description: iemejia commented on pull request #10632: 
[release-2.19.0][BEAM-9144] Beam's own Avro TimeConversion class in 
beam-sdk-java-core
URL: https://github.com/apache/beam/pull/10632
 
 
   A cherry pick that fixes an issue reported by an incompatibility with Avro 
versions reported by an user on dev@
   
   R: @boyuanzz 
   CC: @suztomo 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373925)
Time Spent: 1.5h  (was: 1h 20m)

> Beam's own Avro TimeConversion class in beam-sdk-java-core 
> ---
>
> Key: BEAM-9144
> URL: https://issues.apache.org/jira/browse/BEAM-9144
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Reporter: Tomo Suzuki
>Assignee: Tomo Suzuki
>Priority: Major
> Fix For: 2.19.0
>
> Attachments: avro-beam-dependency-graph.png
>
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> From Aaron's comment in 
> https://issues.apache.org/jira/browse/BEAM-8388?focusedCommentId=17016476=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17016476
>  .
> {quote}My org must use Avro 1.9.x (due to some Avro schema resolution issues 
> resolved in 1.9.x) so downgrading Avro is not possible for us.
>  Beam 2.16.0 is compatible with our usage of Avro 1.9.x – but upgrading to 
> 2.17.0 we are broken as 2.17.0 links to Java classes in Avro 1.8.x that are 
> not available in 1.9.x.
> {quote}
> The Java class is 
> {{org.apache.avro.data.TimeConversions.TimestampConversion}} in Avro 1.8.
>  It's renamed to {{org.apache.avro.data.JodaTimeConversions}} in Avro 1.9.
> h1. Beam Java SDK cannot upgrade Avro to 1.9
> Beam has Spark runners and Spark has not yet upgraded to Avro 1.9.
> Illustration of the dependency
> !avro-beam-dependency-graph.png|width=799,height=385!
> h1. Short-term Solution
> As illustrated above, as long as Beam Java SDK uses only the intersection of 
> Avro classes, method, and fields between Avro 1.8 and 1.9, it will provide 
> flexibility in runtime Avro versions (as it did until Beam 2.16).
> h2. Difference of the TimeConversion Classes
> Avro 1.9's TimestampConversion overrides {{getRecommendedSchema}} method. 
> Details below:
> Avro 1.8's TimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
>   }
> {code}
> Avro 1.9's JodaTimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
> @Override
> public Schema getRecommendedSchema() {
>   return 
> LogicalTypes.timestampMillis().addToSchema(Schema.create(Schema.Type.LONG));
> }
>   }
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9140) Update to ZetaSQL 2020.01.1

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9140?focusedWorklogId=373924=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373924
 ]

ASF GitHub Bot logged work on BEAM-9140:


Author: ASF GitHub Bot
Created on: 17/Jan/20 22:27
Start Date: 17/Jan/20 22:27
Worklog Time Spent: 10m 
  Work Description: apilloud commented on pull request #10620: [BEAM-9140] 
Upgrade to ZetaSQL 2020.01.1
URL: https://github.com/apache/beam/pull/10620#discussion_r368162123
 
 

 ##
 File path: build.gradle
 ##
 @@ -33,6 +33,12 @@ plugins {
   id "org.sonarqube" version "2.7"
 }
 
+repositories {
+  maven {
+url "https://oss.sonatype.org/content/repositories/comgooglezetasql-1013;
 
 Review comment:
   Don't tell anyone, running with Apache Beam is the current release 
verification step for ZetaSQL. Once this PR is passing, I'll publish the 
ZetaSQL release, delete this line and send it out for review.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373924)
Time Spent: 1h 10m  (was: 1h)

> Update to ZetaSQL 2020.01.1
> ---
>
> Key: BEAM-9140
> URL: https://issues.apache.org/jira/browse/BEAM-9140
> Project: Beam
>  Issue Type: Improvement
>  Components: dsl-sql-zetasql
>Reporter: Andrew Pilloud
>Assignee: Andrew Pilloud
>Priority: Major
> Fix For: 2.20.0
>
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> I hear ZetaSQL 2020.01.1 will be coming out in the next few hours. We should 
> upgrade.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8684) Beam Dependency Update Request: com.google.apis:google-api-services-bigquery

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8684?focusedWorklogId=373923=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373923
 ]

ASF GitHub Bot logged work on BEAM-8684:


Author: ASF GitHub Bot
Created on: 17/Jan/20 22:25
Start Date: 17/Jan/20 22:25
Worklog Time Spent: 10m 
  Work Description: suztomo commented on issue #10631: [BEAM-8684] [WIP] 
Google proto and google-services library upgrades
URL: https://github.com/apache/beam/pull/10631#issuecomment-575819172
 
 
   I just added them. Now running diffs.
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373923)
Time Spent: 0.5h  (was: 20m)

> Beam Dependency Update Request: com.google.apis:google-api-services-bigquery
> 
>
> Key: BEAM-8684
> URL: https://issues.apache.org/jira/browse/BEAM-8684
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Priority: Major
>  Time Spent: 0.5h
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:39:07.113511 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:04:31.460554 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:09:35.656962 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:08:45.501496 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:08:46.385699 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:04:15.902178 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:07:51.708019 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-13 12:07:45.384750 
> -
> Please 

[jira] [Work logged] (BEAM-8684) Beam Dependency Update Request: com.google.apis:google-api-services-bigquery

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8684?focusedWorklogId=373922=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373922
 ]

ASF GitHub Bot logged work on BEAM-8684:


Author: ASF GitHub Bot
Created on: 17/Jan/20 22:24
Start Date: 17/Jan/20 22:24
Worklog Time Spent: 10m 
  Work Description: lukecwik commented on issue #10631: [BEAM-8684] [WIP] 
Google proto and google-services library upgrades
URL: https://github.com/apache/beam/pull/10631#issuecomment-575818958
 
 
   What about the before linkage checker results?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373922)
Time Spent: 20m  (was: 10m)

> Beam Dependency Update Request: com.google.apis:google-api-services-bigquery
> 
>
> Key: BEAM-8684
> URL: https://issues.apache.org/jira/browse/BEAM-8684
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Priority: Major
>  Time Spent: 20m
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:39:07.113511 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:04:31.460554 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:09:35.656962 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:08:45.501496 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:08:46.385699 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:04:15.902178 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:07:51.708019 
> -
> Please consider upgrading the dependency 
> com.google.apis:google-api-services-bigquery. 
> The current version is v2-rev20181221-1.28.0. The latest version is 
> v2-rev20190917-1.30.3 
> cc: [~chamikara], 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-13 12:07:45.384750 
> -
> 

[jira] [Work logged] (BEAM-9144) Beam's own Avro TimeConversion class in beam-sdk-java-core

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9144?focusedWorklogId=373920=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373920
 ]

ASF GitHub Bot logged work on BEAM-9144:


Author: ASF GitHub Bot
Created on: 17/Jan/20 22:23
Start Date: 17/Jan/20 22:23
Worklog Time Spent: 10m 
  Work Description: iemejia commented on pull request #10628: [BEAM-9144] 
Beam's own Avro TimeConversion class in beam-sdk-java-core
URL: https://github.com/apache/beam/pull/10628
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373920)
Time Spent: 1h 20m  (was: 1h 10m)

> Beam's own Avro TimeConversion class in beam-sdk-java-core 
> ---
>
> Key: BEAM-9144
> URL: https://issues.apache.org/jira/browse/BEAM-9144
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Reporter: Tomo Suzuki
>Assignee: Tomo Suzuki
>Priority: Major
> Fix For: 2.19.0
>
> Attachments: avro-beam-dependency-graph.png
>
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> From Aaron's comment in 
> https://issues.apache.org/jira/browse/BEAM-8388?focusedCommentId=17016476=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17016476
>  .
> {quote}My org must use Avro 1.9.x (due to some Avro schema resolution issues 
> resolved in 1.9.x) so downgrading Avro is not possible for us.
>  Beam 2.16.0 is compatible with our usage of Avro 1.9.x – but upgrading to 
> 2.17.0 we are broken as 2.17.0 links to Java classes in Avro 1.8.x that are 
> not available in 1.9.x.
> {quote}
> The Java class is 
> {{org.apache.avro.data.TimeConversions.TimestampConversion}} in Avro 1.8.
>  It's renamed to {{org.apache.avro.data.JodaTimeConversions}} in Avro 1.9.
> h1. Beam Java SDK cannot upgrade Avro to 1.9
> Beam has Spark runners and Spark has not yet upgraded to Avro 1.9.
> Illustration of the dependency
> !avro-beam-dependency-graph.png|width=799,height=385!
> h1. Short-term Solution
> As illustrated above, as long as Beam Java SDK uses only the intersection of 
> Avro classes, method, and fields between Avro 1.8 and 1.9, it will provide 
> flexibility in runtime Avro versions (as it did until Beam 2.16).
> h2. Difference of the TimeConversion Classes
> Avro 1.9's TimestampConversion overrides {{getRecommendedSchema}} method. 
> Details below:
> Avro 1.8's TimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
>   }
> {code}
> Avro 1.9's JodaTimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
> @Override
> public Schema getRecommendedSchema() {
>   return 
> LogicalTypes.timestampMillis().addToSchema(Schema.create(Schema.Type.LONG));
> }
>   }
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9140) Update to ZetaSQL 2020.01.1

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9140?focusedWorklogId=373918=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373918
 ]

ASF GitHub Bot logged work on BEAM-9140:


Author: ASF GitHub Bot
Created on: 17/Jan/20 22:21
Start Date: 17/Jan/20 22:21
Worklog Time Spent: 10m 
  Work Description: lukecwik commented on pull request #10620: [BEAM-9140] 
Upgrade to ZetaSQL 2020.01.1
URL: https://github.com/apache/beam/pull/10620#discussion_r368160408
 
 

 ##
 File path: build.gradle
 ##
 @@ -33,6 +33,12 @@ plugins {
   id "org.sonarqube" version "2.7"
 }
 
+repositories {
+  maven {
+url "https://oss.sonatype.org/content/repositories/comgooglezetasql-1013;
 
 Review comment:
   This is an unreleased repository. How do our clients get this dependency in 
a maven project without this?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373918)
Time Spent: 1h  (was: 50m)

> Update to ZetaSQL 2020.01.1
> ---
>
> Key: BEAM-9140
> URL: https://issues.apache.org/jira/browse/BEAM-9140
> Project: Beam
>  Issue Type: Improvement
>  Components: dsl-sql-zetasql
>Reporter: Andrew Pilloud
>Assignee: Andrew Pilloud
>Priority: Major
> Fix For: 2.20.0
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> I hear ZetaSQL 2020.01.1 will be coming out in the next few hours. We should 
> upgrade.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-9137) PR10338 breaks beam_PostCommit_Py_ValCont

2020-01-17 Thread Boyuan Zhang (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-9137?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17018354#comment-17018354
 ] 

Boyuan Zhang commented on BEAM-9137:


Thanks for your confirmation and update! Turns out this test target only fails 
on Py2 and even the last successful commit also failed on the same error:

Error processing instruction -77. Original traceback is Traceback (most recent 
call last): File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/sdk_worker.py",
 line 161, in _execute response = task() File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/sdk_worker.py",
 line 217, in  lambda: self.create_worker().do_instruction(request), 
request) File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/sdk_worker.py",
 line 346, in do_instruction request.instruction_id) File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/sdk_worker.py",
 line 374, in process_bundle instruction_id, 
request.process_bundle_descriptor_id) File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/sdk_worker.py",
 line 282, in get self.data_channel_factory) File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 717, in __init__ self.ops = 
self.create_execution_tree(self.process_bundle_descriptor) File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 766, in create_execution_tree descriptor.transforms, 
key=topological_height, reverse=True)]) File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 680, in wrapper result = cache[args] = func(*args) File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 748, in get_operation in 
descriptor.transforms[transform_id].outputs.items() File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 747, in  for tag, pcoll_id File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 680, in wrapper result = cache[args] = func(*args) File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 748, in get_operation in 
descriptor.transforms[transform_id].outputs.items() File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 747, in  for tag, pcoll_id File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 680, in wrapper result = cache[args] = func(*args) File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 748, in get_operation in 
descriptor.transforms[transform_id].outputs.items() File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 747, in  for tag, pcoll_id File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 680, in wrapper result = cache[args] = func(*args) File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 751, in get_operation transform_id, transform_consumers) File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 1053, in create_operation return creator(self, transform_id, 
transform_proto, payload, consumers) File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 1168, in create output_coder = 
factory.get_only_input_coder(transform_proto) File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 1099, in get_only_input_coder return 
only_element(list(self.get_input_coders(transform_proto).values())) File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 1094, in get_input_coders for tag, pcoll_id in 
transform_proto.inputs.items() File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 1094, in  for tag, pcoll_id in transform_proto.inputs.items() 
File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 1069, in get_windowed_coder coder = 
self.get_coder(self.descriptor.pcollections[pcoll_id].coder_id) File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/bundle_processor.py",
 line 1065, in get_coder json.loads(coder_proto.spec.payload.decode('utf-8'))) 
File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/operation_specs.py",
 line 331, in get_coder_from_spec get_coder_from_spec(c) for c in 
coder_spec['component_encodings']] File 
"/usr/local/lib/python2.7/site-packages/apache_beam/runners/worker/operation_specs.py",
 line 349, in get_coder_from_spec 
get_coder_from_spec(coder_spec['component_encodings'][0])) File 

[jira] [Work logged] (BEAM-9140) Update to ZetaSQL 2020.01.1

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9140?focusedWorklogId=373901=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373901
 ]

ASF GitHub Bot logged work on BEAM-9140:


Author: ASF GitHub Bot
Created on: 17/Jan/20 22:01
Start Date: 17/Jan/20 22:01
Worklog Time Spent: 10m 
  Work Description: apilloud commented on issue #10620: [BEAM-9140] Upgrade 
to ZetaSQL 2020.01.1
URL: https://github.com/apache/beam/pull/10620#issuecomment-575812048
 
 
   Run Java PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373901)
Time Spent: 50m  (was: 40m)

> Update to ZetaSQL 2020.01.1
> ---
>
> Key: BEAM-9140
> URL: https://issues.apache.org/jira/browse/BEAM-9140
> Project: Beam
>  Issue Type: Improvement
>  Components: dsl-sql-zetasql
>Reporter: Andrew Pilloud
>Assignee: Andrew Pilloud
>Priority: Major
> Fix For: 2.20.0
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> I hear ZetaSQL 2020.01.1 will be coming out in the next few hours. We should 
> upgrade.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8889) Make GcsUtil use GoogleCloudStorage

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8889?focusedWorklogId=373899=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373899
 ]

ASF GitHub Bot logged work on BEAM-8889:


Author: ASF GitHub Bot
Created on: 17/Jan/20 22:00
Start Date: 17/Jan/20 22:00
Worklog Time Spent: 10m 
  Work Description: chamikaramj commented on issue #10617: [BEAM-8889] 
adding gRPC connectivity to Beam/GCS connector
URL: https://github.com/apache/beam/pull/10617#issuecomment-575811548
 
 
   R: @veblush
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373899)
Remaining Estimate: 167h 40m  (was: 167h 50m)
Time Spent: 20m  (was: 10m)

> Make GcsUtil use GoogleCloudStorage
> ---
>
> Key: BEAM-8889
> URL: https://issues.apache.org/jira/browse/BEAM-8889
> Project: Beam
>  Issue Type: Improvement
>  Components: io-java-gcp
>Affects Versions: 2.16.0
>Reporter: Esun Kim
>Assignee: VASU NORI
>Priority: Major
>  Labels: gcs
>   Original Estimate: 168h
>  Time Spent: 20m
>  Remaining Estimate: 167h 40m
>
> [GcsUtil|https://github.com/apache/beam/blob/master/sdks/java/extensions/google-cloud-platform-core/src/main/java/org/apache/beam/sdk/extensions/gcp/util/GcsUtil.java]
>  is a primary class to access Google Cloud Storage on Apache Beam. Current 
> implementation directly creates GoogleCloudStorageReadChannel and 
> GoogleCloudStorageWriteChannel by itself to read and write GCS data rather 
> than using 
> [GoogleCloudStorage|https://github.com/GoogleCloudPlatform/bigdata-interop/blob/master/gcsio/src/main/java/com/google/cloud/hadoop/gcsio/GoogleCloudStorage.java]
>  which is an abstract class providing basic IO capability which eventually 
> creates channel objects. This request is about updating GcsUtil to use 
> GoogleCloudStorage to create read and write channel, which is expected 
> flexible because it can easily pick up the new change; e.g. new channel 
> implementation using new protocol without code change.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8889) Make GcsUtil use GoogleCloudStorage

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8889?focusedWorklogId=373897=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373897
 ]

ASF GitHub Bot logged work on BEAM-8889:


Author: ASF GitHub Bot
Created on: 17/Jan/20 21:59
Start Date: 17/Jan/20 21:59
Worklog Time Spent: 10m 
  Work Description: chamikaramj commented on issue #10617: [BEAM-8889] 
adding gRPC connectivity to Beam/GCS connector
URL: https://github.com/apache/beam/pull/10617#issuecomment-575811548
 
 
   R: @vebl...@google.com
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373897)
Remaining Estimate: 167h 50m  (was: 168h)
Time Spent: 10m

> Make GcsUtil use GoogleCloudStorage
> ---
>
> Key: BEAM-8889
> URL: https://issues.apache.org/jira/browse/BEAM-8889
> Project: Beam
>  Issue Type: Improvement
>  Components: io-java-gcp
>Affects Versions: 2.16.0
>Reporter: Esun Kim
>Assignee: VASU NORI
>Priority: Major
>  Labels: gcs
>   Original Estimate: 168h
>  Time Spent: 10m
>  Remaining Estimate: 167h 50m
>
> [GcsUtil|https://github.com/apache/beam/blob/master/sdks/java/extensions/google-cloud-platform-core/src/main/java/org/apache/beam/sdk/extensions/gcp/util/GcsUtil.java]
>  is a primary class to access Google Cloud Storage on Apache Beam. Current 
> implementation directly creates GoogleCloudStorageReadChannel and 
> GoogleCloudStorageWriteChannel by itself to read and write GCS data rather 
> than using 
> [GoogleCloudStorage|https://github.com/GoogleCloudPlatform/bigdata-interop/blob/master/gcsio/src/main/java/com/google/cloud/hadoop/gcsio/GoogleCloudStorage.java]
>  which is an abstract class providing basic IO capability which eventually 
> creates channel objects. This request is about updating GcsUtil to use 
> GoogleCloudStorage to create read and write channel, which is expected 
> flexible because it can easily pick up the new change; e.g. new channel 
> implementation using new protocol without code change.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7861) Make it easy to change between multi-process and multi-thread mode for Python Direct runners

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7861?focusedWorklogId=373896=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373896
 ]

ASF GitHub Bot logged work on BEAM-7861:


Author: ASF GitHub Bot
Created on: 17/Jan/20 21:57
Start Date: 17/Jan/20 21:57
Worklog Time Spent: 10m 
  Work Description: tvalentyn commented on issue #10616: [BEAM-7861] update 
documentation about --direct_running_mode option with direct runner.
URL: https://github.com/apache/beam/pull/10616#issuecomment-575810811
 
 
   Looking at 
http://apache-beam-website-pull-requests.storage.googleapis.com/10616/documentation/runners/direct/index.html,
 we will need to fix formatting.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373896)
Time Spent: 3h 20m  (was: 3h 10m)

> Make it easy to change between multi-process and multi-thread mode for Python 
> Direct runners
> 
>
> Key: BEAM-7861
> URL: https://issues.apache.org/jira/browse/BEAM-7861
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 3h 20m
>  Remaining Estimate: 0h
>
> BEAM-3645 makes it possible to run a map task parallel.
> However, users need to change runner when switch between multithreading and 
> multiprocessing mode.
> We want to add a flag (ex: --use-multiprocess) to make the switch easy 
> without changing the runner each time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Created] (BEAM-9149) Support ZetaSQL positional parameters

2020-01-17 Thread Kyle Weaver (Jira)
Kyle Weaver created BEAM-9149:
-

 Summary: Support ZetaSQL positional parameters
 Key: BEAM-9149
 URL: https://issues.apache.org/jira/browse/BEAM-9149
 Project: Beam
  Issue Type: New Feature
  Components: dsl-sql-zetasql
Reporter: Kyle Weaver
Assignee: Kyle Weaver


While they are not yet exposed to the end user, ZetaSQL query parameters are 
currently being passed internally. However, the existing code assumes that all 
parameters are named parameters, not positional parameters. To support 
positional parameters, we will need to make at least the following changes:

1) Set mode to PARAMETER_POSITIONAL and use addPositionalQueryParameter instead 
of addQueryParameter in SqlAnalyzer:
https://github.com/apache/beam/blob/671b02ac5f1be87a591de8f5f456d0e5a199d771/sdks/java/extensions/sql/zetasql/src/main/java/org/apache/beam/sdk/extensions/sql/zetasql/SqlAnalyzer.java#L119

2) Code currently assumes that resolved parameters are named. While even 
positional parameters must be named when they are used as inputs, after they 
are resolved their names are removed. Thus this check will deref a null pointer 
and must be fixed:
https://github.com/apache/beam/blob/8915d6e95c405aeee0f29152545d3210e8e09f1f/sdks/java/extensions/sql/zetasql/src/main/java/org/apache/beam/sdk/extensions/sql/zetasql/translation/ExpressionConverter.java#L1004



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9149) Support ZetaSQL positional parameters

2020-01-17 Thread Kyle Weaver (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9149?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Kyle Weaver updated BEAM-9149:
--
Status: Open  (was: Triage Needed)

> Support ZetaSQL positional parameters
> -
>
> Key: BEAM-9149
> URL: https://issues.apache.org/jira/browse/BEAM-9149
> Project: Beam
>  Issue Type: New Feature
>  Components: dsl-sql-zetasql
>Reporter: Kyle Weaver
>Assignee: Kyle Weaver
>Priority: Major
>
> While they are not yet exposed to the end user, ZetaSQL query parameters are 
> currently being passed internally. However, the existing code assumes that 
> all parameters are named parameters, not positional parameters. To support 
> positional parameters, we will need to make at least the following changes:
> 1) Set mode to PARAMETER_POSITIONAL and use addPositionalQueryParameter 
> instead of addQueryParameter in SqlAnalyzer:
> https://github.com/apache/beam/blob/671b02ac5f1be87a591de8f5f456d0e5a199d771/sdks/java/extensions/sql/zetasql/src/main/java/org/apache/beam/sdk/extensions/sql/zetasql/SqlAnalyzer.java#L119
> 2) Code currently assumes that resolved parameters are named. While even 
> positional parameters must be named when they are used as inputs, after they 
> are resolved their names are removed. Thus this check will deref a null 
> pointer and must be fixed:
> https://github.com/apache/beam/blob/8915d6e95c405aeee0f29152545d3210e8e09f1f/sdks/java/extensions/sql/zetasql/src/main/java/org/apache/beam/sdk/extensions/sql/zetasql/translation/ExpressionConverter.java#L1004



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9144) Beam's own Avro TimeConversion class in beam-sdk-java-core

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9144?focusedWorklogId=373879=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373879
 ]

ASF GitHub Bot logged work on BEAM-9144:


Author: ASF GitHub Bot
Created on: 17/Jan/20 21:20
Start Date: 17/Jan/20 21:20
Worklog Time Spent: 10m 
  Work Description: iemejia commented on issue #10628: [BEAM-9144] Beam's 
own Avro TimeConversion class in beam-sdk-java-core
URL: https://github.com/apache/beam/pull/10628#issuecomment-575799224
 
 
   Run Java PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373879)
Time Spent: 1h 10m  (was: 1h)

> Beam's own Avro TimeConversion class in beam-sdk-java-core 
> ---
>
> Key: BEAM-9144
> URL: https://issues.apache.org/jira/browse/BEAM-9144
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Reporter: Tomo Suzuki
>Assignee: Tomo Suzuki
>Priority: Major
> Fix For: 2.19.0
>
> Attachments: avro-beam-dependency-graph.png
>
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>
> From Aaron's comment in 
> https://issues.apache.org/jira/browse/BEAM-8388?focusedCommentId=17016476=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17016476
>  .
> {quote}My org must use Avro 1.9.x (due to some Avro schema resolution issues 
> resolved in 1.9.x) so downgrading Avro is not possible for us.
>  Beam 2.16.0 is compatible with our usage of Avro 1.9.x – but upgrading to 
> 2.17.0 we are broken as 2.17.0 links to Java classes in Avro 1.8.x that are 
> not available in 1.9.x.
> {quote}
> The Java class is 
> {{org.apache.avro.data.TimeConversions.TimestampConversion}} in Avro 1.8.
>  It's renamed to {{org.apache.avro.data.JodaTimeConversions}} in Avro 1.9.
> h1. Beam Java SDK cannot upgrade Avro to 1.9
> Beam has Spark runners and Spark has not yet upgraded to Avro 1.9.
> Illustration of the dependency
> !avro-beam-dependency-graph.png|width=799,height=385!
> h1. Short-term Solution
> As illustrated above, as long as Beam Java SDK uses only the intersection of 
> Avro classes, method, and fields between Avro 1.8 and 1.9, it will provide 
> flexibility in runtime Avro versions (as it did until Beam 2.16).
> h2. Difference of the TimeConversion Classes
> Avro 1.9's TimestampConversion overrides {{getRecommendedSchema}} method. 
> Details below:
> Avro 1.8's TimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
>   }
> {code}
> Avro 1.9's JodaTimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
> @Override
> public Schema getRecommendedSchema() {
>   return 
> LogicalTypes.timestampMillis().addToSchema(Schema.create(Schema.Type.LONG));
> }
>   }
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9063) Migrate docker images to apache namespace.

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9063?focusedWorklogId=373871=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373871
 ]

ASF GitHub Bot logged work on BEAM-9063:


Author: ASF GitHub Bot
Created on: 17/Jan/20 21:02
Start Date: 17/Jan/20 21:02
Worklog Time Spent: 10m 
  Work Description: ibzib commented on pull request #10612: [NOT READY TO 
MERGE][BEAM-9063] migrate docker images to apache
URL: https://github.com/apache/beam/pull/10612#discussion_r368134576
 
 

 ##
 File path: runners/spark/job-server/container/build.gradle
 ##
 @@ -54,9 +54,10 @@ task copyDockerfileDependencies(type: Copy) {
 }
 
 docker {
-  name containerImageName(name: 'spark-job-server',
+  name containerImageName(name: ${project.docker_image_default_repo_prefix} + 
'job-server',
 
 Review comment:
   Looks like we lost `spark` here.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373871)
Time Spent: 1h  (was: 50m)

> Migrate docker images to apache namespace.
> --
>
> Key: BEAM-9063
> URL: https://issues.apache.org/jira/browse/BEAM-9063
> Project: Beam
>  Issue Type: Task
>  Components: beam-community
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: Not applicable
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> https://hub.docker.com/u/apache



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9063) Migrate docker images to apache namespace.

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9063?focusedWorklogId=373870=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373870
 ]

ASF GitHub Bot logged work on BEAM-9063:


Author: ASF GitHub Bot
Created on: 17/Jan/20 21:02
Start Date: 17/Jan/20 21:02
Worklog Time Spent: 10m 
  Work Description: ibzib commented on pull request #10612: [NOT READY TO 
MERGE][BEAM-9063] migrate docker images to apache
URL: https://github.com/apache/beam/pull/10612#discussion_r368133471
 
 

 ##
 File path: gradle.properties
 ##
 @@ -27,3 +27,6 @@ version=2.20.0-SNAPSHOT
 sdk_version=2.20.0.dev
 
 javaVersion=1.8
+
+docker_image_default_repo_root=apache
+docker_image_default_repo_prefix=beam-
 
 Review comment:
   Elsewhere, we use underscores in our docker image names (e.g. `java_sdk`). 
We should use an underscore here to keep it consistent.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373870)
Time Spent: 50m  (was: 40m)

> Migrate docker images to apache namespace.
> --
>
> Key: BEAM-9063
> URL: https://issues.apache.org/jira/browse/BEAM-9063
> Project: Beam
>  Issue Type: Task
>  Components: beam-community
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: Not applicable
>
>  Time Spent: 50m
>  Remaining Estimate: 0h
>
> https://hub.docker.com/u/apache



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7246) Create a Spanner IO for Python

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7246?focusedWorklogId=373868=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373868
 ]

ASF GitHub Bot logged work on BEAM-7246:


Author: ASF GitHub Bot
Created on: 17/Jan/20 20:55
Start Date: 17/Jan/20 20:55
Worklog Time Spent: 10m 
  Work Description: chamikaramj commented on pull request #9606: 
[BEAM-7246] Add Google Spanner IO Read on Python SDK
URL: https://github.com/apache/beam/pull/9606
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373868)
Time Spent: 12.5h  (was: 12h 20m)

> Create a Spanner IO for Python
> --
>
> Key: BEAM-7246
> URL: https://issues.apache.org/jira/browse/BEAM-7246
> Project: Beam
>  Issue Type: Bug
>  Components: io-py-gcp
>Reporter: Reuven Lax
>Assignee: Shehzaad Nakhoda
>Priority: Major
>  Time Spent: 12.5h
>  Remaining Estimate: 0h
>
> Add I/O support for Google Cloud Spanner for the Python SDK (Batch Only).
> Testing in this work item will be in the form of DirectRunner tests and 
> manual testing.
> Integration and performance tests are a separate work item (not included 
> here).
> See https://beam.apache.org/documentation/io/built-in/. The goal is to add 
> Google Clound Spanner to the Database column for the Python/Batch row.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7246) Create a Spanner IO for Python

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7246?focusedWorklogId=373869=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373869
 ]

ASF GitHub Bot logged work on BEAM-7246:


Author: ASF GitHub Bot
Created on: 17/Jan/20 20:55
Start Date: 17/Jan/20 20:55
Worklog Time Spent: 10m 
  Work Description: chamikaramj commented on issue #9606: [BEAM-7246] Add 
Google Spanner IO Read on Python SDK
URL: https://github.com/apache/beam/pull/9606#issuecomment-575791689
 
 
   Thank you. 
   Let's get integration tests in so that we can move this out of experimental 
:)
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373869)
Time Spent: 12h 40m  (was: 12.5h)

> Create a Spanner IO for Python
> --
>
> Key: BEAM-7246
> URL: https://issues.apache.org/jira/browse/BEAM-7246
> Project: Beam
>  Issue Type: Bug
>  Components: io-py-gcp
>Reporter: Reuven Lax
>Assignee: Shehzaad Nakhoda
>Priority: Major
>  Time Spent: 12h 40m
>  Remaining Estimate: 0h
>
> Add I/O support for Google Cloud Spanner for the Python SDK (Batch Only).
> Testing in this work item will be in the form of DirectRunner tests and 
> manual testing.
> Integration and performance tests are a separate work item (not included 
> here).
> See https://beam.apache.org/documentation/io/built-in/. The goal is to add 
> Google Clound Spanner to the Database column for the Python/Batch row.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Commented] (BEAM-8889) Make GcsUtil use GoogleCloudStorage

2020-01-17 Thread VASU NORI (Jira)


[ 
https://issues.apache.org/jira/browse/BEAM-8889?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel=17018305#comment-17018305
 ] 

VASU NORI commented on BEAM-8889:
-

PR in progress [https://github.com/apache/beam/pull/10617]

> Make GcsUtil use GoogleCloudStorage
> ---
>
> Key: BEAM-8889
> URL: https://issues.apache.org/jira/browse/BEAM-8889
> Project: Beam
>  Issue Type: Improvement
>  Components: io-java-gcp
>Affects Versions: 2.16.0
>Reporter: Esun Kim
>Assignee: VASU NORI
>Priority: Major
>  Labels: gcs
>   Original Estimate: 168h
>  Remaining Estimate: 168h
>
> [GcsUtil|https://github.com/apache/beam/blob/master/sdks/java/extensions/google-cloud-platform-core/src/main/java/org/apache/beam/sdk/extensions/gcp/util/GcsUtil.java]
>  is a primary class to access Google Cloud Storage on Apache Beam. Current 
> implementation directly creates GoogleCloudStorageReadChannel and 
> GoogleCloudStorageWriteChannel by itself to read and write GCS data rather 
> than using 
> [GoogleCloudStorage|https://github.com/GoogleCloudPlatform/bigdata-interop/blob/master/gcsio/src/main/java/com/google/cloud/hadoop/gcsio/GoogleCloudStorage.java]
>  which is an abstract class providing basic IO capability which eventually 
> creates channel objects. This request is about updating GcsUtil to use 
> GoogleCloudStorage to create read and write channel, which is expected 
> flexible because it can easily pick up the new change; e.g. new channel 
> implementation using new protocol without code change.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7246) Create a Spanner IO for Python

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7246?focusedWorklogId=373866=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373866
 ]

ASF GitHub Bot logged work on BEAM-7246:


Author: ASF GitHub Bot
Created on: 17/Jan/20 20:36
Start Date: 17/Jan/20 20:36
Worklog Time Spent: 10m 
  Work Description: shehzaadn-vd commented on issue #9606: [BEAM-7246] Add 
Google Spanner IO Read on Python SDK
URL: https://github.com/apache/beam/pull/9606#issuecomment-575785491
 
 
   Thanks @chamikaramj for your support! @aaltay looks like the tests are 
passing. Would you be able to merge this?
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373866)
Time Spent: 12h 20m  (was: 12h 10m)

> Create a Spanner IO for Python
> --
>
> Key: BEAM-7246
> URL: https://issues.apache.org/jira/browse/BEAM-7246
> Project: Beam
>  Issue Type: Bug
>  Components: io-py-gcp
>Reporter: Reuven Lax
>Assignee: Shehzaad Nakhoda
>Priority: Major
>  Time Spent: 12h 20m
>  Remaining Estimate: 0h
>
> Add I/O support for Google Cloud Spanner for the Python SDK (Batch Only).
> Testing in this work item will be in the form of DirectRunner tests and 
> manual testing.
> Integration and performance tests are a separate work item (not included 
> here).
> See https://beam.apache.org/documentation/io/built-in/. The goal is to add 
> Google Clound Spanner to the Database column for the Python/Batch row.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8684) Beam Dependency Update Request: com.google.apis:google-api-services-bigquery

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8684?focusedWorklogId=373852=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373852
 ]

ASF GitHub Bot logged work on BEAM-8684:


Author: ASF GitHub Bot
Created on: 17/Jan/20 20:22
Start Date: 17/Jan/20 20:22
Worklog Time Spent: 10m 
  Work Description: suztomo commented on pull request #10631: [BEAM-8684] 
Google proto and google-services library upgrades
URL: https://github.com/apache/beam/pull/10631
 
 
   https://issues.apache.org/jira/browse/BEAM-8684
   
   Google proto and google-services library upgrades.
   
   
   
   Thank you for your contribution! Follow this checklist to help us 
incorporate your contribution quickly and easily:
   
- [ ] [**Choose 
reviewer(s)**](https://beam.apache.org/contribute/#make-your-change) and 
mention them in a comment (`R: @username`).
- [ ] Format the pull request title like `[BEAM-XXX] Fixes bug in 
ApproximateQuantiles`, where you replace `BEAM-XXX` with the appropriate JIRA 
issue, if applicable. This will automatically link the pull request to the 
issue.
- [ ] If this contribution is large, please file an Apache [Individual 
Contributor License Agreement](https://www.apache.org/licenses/icla.pdf).
   
   See the [Contributor Guide](https://beam.apache.org/contribute) for more 
tips on [how to make review process 
smoother](https://beam.apache.org/contribute/#make-reviewers-job-easier).
   
   Post-Commit Tests Status (on master branch)
   

   
   Lang | SDK | Apex | Dataflow | Flink | Gearpump | Samza | Spark
   --- | --- | --- | --- | --- | --- | --- | ---
   Go | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Flink/lastCompletedBuild/)
 | --- | --- | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Go_VR_Spark/lastCompletedBuild/)
   Java | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Apex/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Dataflow/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Flink/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Flink_Streaming/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Gearpump/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Samza/lastCompletedBuild/)
 | [![Build 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_Spark/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_PVR_Spark_Batch/lastCompletedBuild/)[![Build
 
Status](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/badge/icon)](https://builds.apache.org/job/beam_PostCommit_Java_ValidatesRunner_SparkStructuredStreaming/lastCompletedBuild/)
   Python | [![Build 

[jira] [Work logged] (BEAM-9140) Update to ZetaSQL 2020.01.1

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9140?focusedWorklogId=373851=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373851
 ]

ASF GitHub Bot logged work on BEAM-9140:


Author: ASF GitHub Bot
Created on: 17/Jan/20 20:22
Start Date: 17/Jan/20 20:22
Worklog Time Spent: 10m 
  Work Description: apilloud commented on issue #10620: [BEAM-9140] Upgrade 
to ZetaSQL 2020.01.1
URL: https://github.com/apache/beam/pull/10620#issuecomment-575780988
 
 
   Run Java PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373851)
Time Spent: 40m  (was: 0.5h)

> Update to ZetaSQL 2020.01.1
> ---
>
> Key: BEAM-9140
> URL: https://issues.apache.org/jira/browse/BEAM-9140
> Project: Beam
>  Issue Type: Improvement
>  Components: dsl-sql-zetasql
>Reporter: Andrew Pilloud
>Assignee: Andrew Pilloud
>Priority: Major
> Fix For: 2.20.0
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> I hear ZetaSQL 2020.01.1 will be coming out in the next few hours. We should 
> upgrade.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8626) Implement status api handler in python sdk harness

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8626?focusedWorklogId=373835=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373835
 ]

ASF GitHub Bot logged work on BEAM-8626:


Author: ASF GitHub Bot
Created on: 17/Jan/20 19:50
Start Date: 17/Jan/20 19:50
Worklog Time Spent: 10m 
  Work Description: y1chi commented on pull request #10598: [BEAM-8626] 
Implement status fn api handler in python sdk
URL: https://github.com/apache/beam/pull/10598#discussion_r368108678
 
 

 ##
 File path: sdks/python/apache_beam/runners/worker/worker_status.py
 ##
 @@ -0,0 +1,139 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributor license agreements.  See the NOTICE file distributed with
+# this work for additional information regarding copyright ownership.
+# The ASF licenses this file to You under the Apache License, Version 2.0
+# (the "License"); you may not use this file except in compliance with
+# the License.  You may obtain a copy of the License at
+#
+#http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+#
+"""Worker status api handler for reporting SDK harness debug info."""
+
+from __future__ import absolute_import
+from __future__ import division
+
+import queue
+import sys
+import threading
+import traceback
+from collections import defaultdict
+
+import grpc
+
+from apache_beam.portability.api import beam_fn_api_pb2
+from apache_beam.portability.api import beam_fn_api_pb2_grpc
+from apache_beam.runners.worker.channel_factory import GRPCChannelFactory
+from apache_beam.runners.worker.worker_id_interceptor import 
WorkerIdInterceptor
+
+
+def thread_dump():
+  # deduplicate threads with same stack trace
+  stack_traces = defaultdict(list)
+  frames = sys._current_frames()  # pylint: disable=protected-access
+
+  for t in threading.enumerate():
+stack_trace = ''.join(traceback.format_stack(frames[t.ident]))
+thread_ident_name = (t.ident, t.name)
+stack_traces[stack_trace].append(thread_ident_name)
+
+  all_traces = ['=' * 10 + 'THREAD DUMP' + '=' * 10]
+  for stack, identity in stack_traces.items():
+ident, name = identity[0]
+trace = '--- Thread #%s name: %s %s---\n' % (
+ident, name, 'and other %d threads' %
+(len(identity) - 1) if len(identity) > 1 else '')
+if len(identity) > 1:
+  trace += 'threads: %s\n' % identity
+trace += stack
+all_traces.append(trace)
+  all_traces.append('=' * 30)
+  return '\n'.join(x.encode('utf-8') for x in all_traces)
+
+
+def active_processing_bundles_state(bundle_process_cache):
+  active_bundles = ['=' * 10 + 'ACTIVE PROCESSING BUNDLES' + '=' * 10]
+  if not bundle_process_cache.active_bundle_processors:
+active_bundles.append("No active processing bundles.")
+  else:
+cache = []
+for instruction in list(
+bundle_process_cache.active_bundle_processors.keys()):
+  processor = bundle_process_cache.lookup(instruction)
+  if processor:
+info = processor.state_sampler.get_info()
+cache.append((instruction,
+  processor.process_bundle_descriptor.id,
+  info.tracked_thread, info.time_since_transition))
+# reverse sort active bundle by time since last transition, keep top 10.
+cache.sort(key=lambda x: x[-1], reverse=True)
+for s in cache[:10]:
+  state = '--- instruction %s ---\n' % s[0]
+  state += 'ProcessBundleDescriptorId: %s\n' % s[1]
+  state += "tracked thread: %s\n" % s[2]
+  state += "time since transition: %.2f seconds\n" % (s[3] / 1e9)
+  active_bundles.append(state)
+
+  active_bundles.append('=' * 30)
+  return '\n'.join(x.encode('utf-8') for x in active_bundles)
+
+
+DONE = object()
+
+
+class FnApiWorkerStatusHandler(object):
+  def __init__(self, status_address, bundle_process_cache=None):
+self._alive = True
+self._bundle_process_cache = bundle_process_cache
+ch = GRPCChannelFactory.insecure_channel(status_address)
+grpc.channel_ready_future(ch).result(timeout=60)
+self._status_channel = grpc.intercept_channel(ch, WorkerIdInterceptor())
+self._status_stub = beam_fn_api_pb2_grpc.BeamFnWorkerStatusStub(
+self._status_channel)
+self._responses = queue.Queue()
+self._server = threading.Thread(target=lambda: self._serve(),
+name='fn_api_status_handler')
+self._server.daemon = True
+self._server.start()
+
+  def _get_responses(self):
+while True:
+  response = self._responses.get()
+  if response is DONE:
+self._alive = 

[jira] [Work logged] (BEAM-8626) Implement status api handler in python sdk harness

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8626?focusedWorklogId=373834=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373834
 ]

ASF GitHub Bot logged work on BEAM-8626:


Author: ASF GitHub Bot
Created on: 17/Jan/20 19:50
Start Date: 17/Jan/20 19:50
Worklog Time Spent: 10m 
  Work Description: y1chi commented on pull request #10598: [BEAM-8626] 
Implement status fn api handler in python sdk
URL: https://github.com/apache/beam/pull/10598#discussion_r368108652
 
 

 ##
 File path: sdks/python/apache_beam/runners/worker/worker_status.py
 ##
 @@ -0,0 +1,139 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributor license agreements.  See the NOTICE file distributed with
+# this work for additional information regarding copyright ownership.
+# The ASF licenses this file to You under the Apache License, Version 2.0
+# (the "License"); you may not use this file except in compliance with
+# the License.  You may obtain a copy of the License at
+#
+#http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+#
+"""Worker status api handler for reporting SDK harness debug info."""
+
+from __future__ import absolute_import
+from __future__ import division
+
+import queue
+import sys
+import threading
+import traceback
+from collections import defaultdict
+
+import grpc
+
+from apache_beam.portability.api import beam_fn_api_pb2
+from apache_beam.portability.api import beam_fn_api_pb2_grpc
+from apache_beam.runners.worker.channel_factory import GRPCChannelFactory
+from apache_beam.runners.worker.worker_id_interceptor import 
WorkerIdInterceptor
+
+
+def thread_dump():
+  # deduplicate threads with same stack trace
+  stack_traces = defaultdict(list)
+  frames = sys._current_frames()  # pylint: disable=protected-access
+
+  for t in threading.enumerate():
+stack_trace = ''.join(traceback.format_stack(frames[t.ident]))
+thread_ident_name = (t.ident, t.name)
+stack_traces[stack_trace].append(thread_ident_name)
+
+  all_traces = ['=' * 10 + 'THREAD DUMP' + '=' * 10]
+  for stack, identity in stack_traces.items():
+ident, name = identity[0]
+trace = '--- Thread #%s name: %s %s---\n' % (
+ident, name, 'and other %d threads' %
+(len(identity) - 1) if len(identity) > 1 else '')
+if len(identity) > 1:
+  trace += 'threads: %s\n' % identity
+trace += stack
+all_traces.append(trace)
+  all_traces.append('=' * 30)
+  return '\n'.join(x.encode('utf-8') for x in all_traces)
+
+
+def active_processing_bundles_state(bundle_process_cache):
+  active_bundles = ['=' * 10 + 'ACTIVE PROCESSING BUNDLES' + '=' * 10]
+  if not bundle_process_cache.active_bundle_processors:
+active_bundles.append("No active processing bundles.")
+  else:
+cache = []
+for instruction in list(
+bundle_process_cache.active_bundle_processors.keys()):
+  processor = bundle_process_cache.lookup(instruction)
+  if processor:
+info = processor.state_sampler.get_info()
+cache.append((instruction,
+  processor.process_bundle_descriptor.id,
+  info.tracked_thread, info.time_since_transition))
+# reverse sort active bundle by time since last transition, keep top 10.
+cache.sort(key=lambda x: x[-1], reverse=True)
+for s in cache[:10]:
+  state = '--- instruction %s ---\n' % s[0]
+  state += 'ProcessBundleDescriptorId: %s\n' % s[1]
+  state += "tracked thread: %s\n" % s[2]
+  state += "time since transition: %.2f seconds\n" % (s[3] / 1e9)
+  active_bundles.append(state)
+
+  active_bundles.append('=' * 30)
+  return '\n'.join(x.encode('utf-8') for x in active_bundles)
+
+
+DONE = object()
+
+
+class FnApiWorkerStatusHandler(object):
+  def __init__(self, status_address, bundle_process_cache=None):
+self._alive = True
+self._bundle_process_cache = bundle_process_cache
+ch = GRPCChannelFactory.insecure_channel(status_address)
+grpc.channel_ready_future(ch).result(timeout=60)
+self._status_channel = grpc.intercept_channel(ch, WorkerIdInterceptor())
+self._status_stub = beam_fn_api_pb2_grpc.BeamFnWorkerStatusStub(
+self._status_channel)
+self._responses = queue.Queue()
+self._server = threading.Thread(target=lambda: self._serve(),
+name='fn_api_status_handler')
+self._server.daemon = True
+self._server.start()
+
+  def _get_responses(self):
+while True:
+  response = self._responses.get()
+  if response is DONE:
+self._alive = 

[jira] [Work logged] (BEAM-8626) Implement status api handler in python sdk harness

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8626?focusedWorklogId=373828=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373828
 ]

ASF GitHub Bot logged work on BEAM-8626:


Author: ASF GitHub Bot
Created on: 17/Jan/20 19:45
Start Date: 17/Jan/20 19:45
Worklog Time Spent: 10m 
  Work Description: y1chi commented on pull request #10598: [BEAM-8626] 
Implement status fn api handler in python sdk
URL: https://github.com/apache/beam/pull/10598#discussion_r368106706
 
 

 ##
 File path: sdks/python/apache_beam/runners/worker/worker_status.py
 ##
 @@ -0,0 +1,139 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributor license agreements.  See the NOTICE file distributed with
+# this work for additional information regarding copyright ownership.
+# The ASF licenses this file to You under the Apache License, Version 2.0
+# (the "License"); you may not use this file except in compliance with
+# the License.  You may obtain a copy of the License at
+#
+#http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+#
+"""Worker status api handler for reporting SDK harness debug info."""
+
+from __future__ import absolute_import
+from __future__ import division
+
+import queue
+import sys
+import threading
+import traceback
+from collections import defaultdict
+
+import grpc
+
+from apache_beam.portability.api import beam_fn_api_pb2
+from apache_beam.portability.api import beam_fn_api_pb2_grpc
+from apache_beam.runners.worker.channel_factory import GRPCChannelFactory
+from apache_beam.runners.worker.worker_id_interceptor import 
WorkerIdInterceptor
+
+
+def thread_dump():
+  # deduplicate threads with same stack trace
+  stack_traces = defaultdict(list)
+  frames = sys._current_frames()  # pylint: disable=protected-access
+
+  for t in threading.enumerate():
+stack_trace = ''.join(traceback.format_stack(frames[t.ident]))
+thread_ident_name = (t.ident, t.name)
+stack_traces[stack_trace].append(thread_ident_name)
+
+  all_traces = ['=' * 10 + 'THREAD DUMP' + '=' * 10]
+  for stack, identity in stack_traces.items():
+ident, name = identity[0]
+trace = '--- Thread #%s name: %s %s---\n' % (
+ident, name, 'and other %d threads' %
+(len(identity) - 1) if len(identity) > 1 else '')
+if len(identity) > 1:
+  trace += 'threads: %s\n' % identity
+trace += stack
+all_traces.append(trace)
+  all_traces.append('=' * 30)
+  return '\n'.join(x.encode('utf-8') for x in all_traces)
+
+
+def active_processing_bundles_state(bundle_process_cache):
+  active_bundles = ['=' * 10 + 'ACTIVE PROCESSING BUNDLES' + '=' * 10]
+  if not bundle_process_cache.active_bundle_processors:
+active_bundles.append("No active processing bundles.")
+  else:
+cache = []
+for instruction in list(
+bundle_process_cache.active_bundle_processors.keys()):
+  processor = bundle_process_cache.lookup(instruction)
+  if processor:
+info = processor.state_sampler.get_info()
+cache.append((instruction,
+  processor.process_bundle_descriptor.id,
+  info.tracked_thread, info.time_since_transition))
+# reverse sort active bundle by time since last transition, keep top 10.
+cache.sort(key=lambda x: x[-1], reverse=True)
+for s in cache[:10]:
+  state = '--- instruction %s ---\n' % s[0]
+  state += 'ProcessBundleDescriptorId: %s\n' % s[1]
+  state += "tracked thread: %s\n" % s[2]
+  state += "time since transition: %.2f seconds\n" % (s[3] / 1e9)
+  active_bundles.append(state)
+
+  active_bundles.append('=' * 30)
+  return '\n'.join(x.encode('utf-8') for x in active_bundles)
+
+
+DONE = object()
+
+
+class FnApiWorkerStatusHandler(object):
+  def __init__(self, status_address, bundle_process_cache=None):
+self._alive = True
+self._bundle_process_cache = bundle_process_cache
+ch = GRPCChannelFactory.insecure_channel(status_address)
+grpc.channel_ready_future(ch).result(timeout=60)
+self._status_channel = grpc.intercept_channel(ch, WorkerIdInterceptor())
+self._status_stub = beam_fn_api_pb2_grpc.BeamFnWorkerStatusStub(
+self._status_channel)
+self._responses = queue.Queue()
+self._server = threading.Thread(target=lambda: self._serve(),
+name='fn_api_status_handler')
+self._server.daemon = True
+self._server.start()
+
+  def _get_responses(self):
+while True:
+  response = self._responses.get()
+  if response is DONE:
+self._alive = 

[jira] [Work logged] (BEAM-8626) Implement status api handler in python sdk harness

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8626?focusedWorklogId=373825=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373825
 ]

ASF GitHub Bot logged work on BEAM-8626:


Author: ASF GitHub Bot
Created on: 17/Jan/20 19:43
Start Date: 17/Jan/20 19:43
Worklog Time Spent: 10m 
  Work Description: y1chi commented on pull request #10598: [BEAM-8626] 
Implement status fn api handler in python sdk
URL: https://github.com/apache/beam/pull/10598#discussion_r368105639
 
 

 ##
 File path: sdks/python/apache_beam/runners/worker/sdk_worker.py
 ##
 @@ -110,6 +112,15 @@ def __init__(self,
 data_channel_factory=self._data_channel_factory,
 fns=self._fns)
 
+if status_address:
 
 Review comment:
   I need to do the actual initialization inside sdk_worker since I want to 
pass the active bundle cache in sdk worker in order to report the dangling 
operation.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373825)
Time Spent: 1h 20m  (was: 1h 10m)

> Implement status api handler in python sdk harness
> --
>
> Key: BEAM-8626
> URL: https://issues.apache.org/jira/browse/BEAM-8626
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-harness
>Reporter: Yichi Zhang
>Assignee: Yichi Zhang
>Priority: Major
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8626) Implement status api handler in python sdk harness

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8626?focusedWorklogId=373823=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373823
 ]

ASF GitHub Bot logged work on BEAM-8626:


Author: ASF GitHub Bot
Created on: 17/Jan/20 19:41
Start Date: 17/Jan/20 19:41
Worklog Time Spent: 10m 
  Work Description: y1chi commented on pull request #10598: [BEAM-8626] 
Implement status fn api handler in python sdk
URL: https://github.com/apache/beam/pull/10598#discussion_r368104854
 
 

 ##
 File path: sdks/python/apache_beam/runners/worker/sdk_worker.py
 ##
 @@ -73,6 +74,7 @@ class SdkHarness(object):
 
   def __init__(self,
control_address,  # type: str
+   status_address=None,  # type: Optional[str, unicode]
 
 Review comment:
   done.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373823)
Time Spent: 1h 10m  (was: 1h)

> Implement status api handler in python sdk harness
> --
>
> Key: BEAM-8626
> URL: https://issues.apache.org/jira/browse/BEAM-8626
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-harness
>Reporter: Yichi Zhang
>Assignee: Yichi Zhang
>Priority: Major
>  Time Spent: 1h 10m
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7246) Create a Spanner IO for Python

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7246?focusedWorklogId=373821=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373821
 ]

ASF GitHub Bot logged work on BEAM-7246:


Author: ASF GitHub Bot
Created on: 17/Jan/20 19:38
Start Date: 17/Jan/20 19:38
Worklog Time Spent: 10m 
  Work Description: chamikaramj commented on issue #9606: [BEAM-7246] Add 
Google Spanner IO Read on Python SDK
URL: https://github.com/apache/beam/pull/9606#issuecomment-575766434
 
 
   LGTM. Thanks.
   
   We can get this in when tests pass.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373821)
Time Spent: 12h 10m  (was: 12h)

> Create a Spanner IO for Python
> --
>
> Key: BEAM-7246
> URL: https://issues.apache.org/jira/browse/BEAM-7246
> Project: Beam
>  Issue Type: Bug
>  Components: io-py-gcp
>Reporter: Reuven Lax
>Assignee: Shehzaad Nakhoda
>Priority: Major
>  Time Spent: 12h 10m
>  Remaining Estimate: 0h
>
> Add I/O support for Google Cloud Spanner for the Python SDK (Batch Only).
> Testing in this work item will be in the form of DirectRunner tests and 
> manual testing.
> Integration and performance tests are a separate work item (not included 
> here).
> See https://beam.apache.org/documentation/io/built-in/. The goal is to add 
> Google Clound Spanner to the Database column for the Python/Batch row.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-8626) Implement status api handler in python sdk harness

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8626?focusedWorklogId=373820=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373820
 ]

ASF GitHub Bot logged work on BEAM-8626:


Author: ASF GitHub Bot
Created on: 17/Jan/20 19:36
Start Date: 17/Jan/20 19:36
Worklog Time Spent: 10m 
  Work Description: y1chi commented on pull request #10598: [BEAM-8626] 
Implement status fn api handler in python sdk
URL: https://github.com/apache/beam/pull/10598#discussion_r368102764
 
 

 ##
 File path: sdks/python/apache_beam/runners/worker/worker_status.py
 ##
 @@ -0,0 +1,139 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributor license agreements.  See the NOTICE file distributed with
+# this work for additional information regarding copyright ownership.
+# The ASF licenses this file to You under the Apache License, Version 2.0
+# (the "License"); you may not use this file except in compliance with
+# the License.  You may obtain a copy of the License at
+#
+#http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+#
+"""Worker status api handler for reporting SDK harness debug info."""
+
+from __future__ import absolute_import
+from __future__ import division
+
+import queue
+import sys
+import threading
+import traceback
+from collections import defaultdict
+
+import grpc
+
+from apache_beam.portability.api import beam_fn_api_pb2
+from apache_beam.portability.api import beam_fn_api_pb2_grpc
+from apache_beam.runners.worker.channel_factory import GRPCChannelFactory
+from apache_beam.runners.worker.worker_id_interceptor import 
WorkerIdInterceptor
+
+
+def thread_dump():
 
 Review comment:
   I made few changes to the thread dump format. I'll reuse the function, I 
think eventually we probably won't need the status http server.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373820)
Time Spent: 1h  (was: 50m)

> Implement status api handler in python sdk harness
> --
>
> Key: BEAM-8626
> URL: https://issues.apache.org/jira/browse/BEAM-8626
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-harness
>Reporter: Yichi Zhang
>Assignee: Yichi Zhang
>Priority: Major
>  Time Spent: 1h
>  Remaining Estimate: 0h
>




--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-9148) test flakiness: BigQueryQueryToTableIT.test_big_query_standard_sql

2020-01-17 Thread Udi Meiri (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Udi Meiri reassigned BEAM-9148:
---

Assignee: Boyuan Zhang

> test flakiness: BigQueryQueryToTableIT.test_big_query_standard_sql
> --
>
> Key: BEAM-9148
> URL: https://issues.apache.org/jira/browse/BEAM-9148
> Project: Beam
>  Issue Type: Bug
>  Components: io-py-gcp, sdk-py-core, test-failures
>Reporter: Udi Meiri
>Assignee: Boyuan Zhang
>Priority: Major
>
> There might be other flaky test cases from the same class, but I'm focusing 
> on test_big_query_standard_sql here.
> {code}
> 19:39:12  
> ==
> 19:39:12  FAIL: test_big_query_standard_sql 
> (apache_beam.io.gcp.big_query_query_to_table_it_test.BigQueryQueryToTableIT)
> 19:39:12  
> --
> 19:39:12  Traceback (most recent call last):
> 19:39:12File 
> "/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37/src/sdks/python/apache_beam/io/gcp/big_query_query_to_table_it_test.py",
>  line 172, in test_big_query_standard_sql
> 19:39:12  big_query_query_to_table_pipeline.run_bq_pipeline(options)
> 19:39:12File 
> "/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37/src/sdks/python/apache_beam/io/gcp/big_query_query_to_table_pipeline.py",
>  line 84, in run_bq_pipeline
> 19:39:12  result = p.run()
> 19:39:12File 
> "/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37/src/sdks/python/apache_beam/testing/test_pipeline.py",
>  line 112, in run
> 19:39:12  else test_runner_api))
> 19:39:12File 
> "/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37/src/sdks/python/apache_beam/pipeline.py",
>  line 461, in run
> 19:39:12  self._options).run(False)
> 19:39:12File 
> "/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37/src/sdks/python/apache_beam/pipeline.py",
>  line 474, in run
> 19:39:12  return self.runner.run_pipeline(self, self._options)
> 19:39:12File 
> "/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37/src/sdks/python/apache_beam/runners/direct/test_direct_runner.py",
>  line 53, in run_pipeline
> 19:39:12  hc_assert_that(self.result, pickler.loads(on_success_matcher))
> 19:39:12  AssertionError: 
> 19:39:12  Expected: (Test pipeline expected terminated in state: DONE and 
> Expected checksum is 158a8ea1c254fcf40d4ed3e7c0242c3ea0a29e72)
> 19:39:12   but: Expected checksum is 
> 158a8ea1c254fcf40d4ed3e7c0242c3ea0a29e72 Actual checksum is 
> da39a3ee5e6b4b0d3255bfef95601890afd80709
> 19:39:12  
> 19:39:12   >> begin captured logging << 
> 
> 19:39:12  root: DEBUG: Unhandled type_constraint: Union[]
> 19:39:12  root: DEBUG: Unhandled type_constraint: Union[]
> 19:39:12  apache_beam.runners.direct.direct_runner: INFO: Running pipeline 
> with DirectRunner.
> 19:39:12  apache_beam.io.gcp.bigquery_tools: DEBUG: Query SELECT * FROM 
> (SELECT "apple" as fruit) UNION ALL (SELECT "orange" as fruit) does not 
> reference any tables.
> 19:39:12  apache_beam.io.gcp.bigquery_tools: WARNING: Dataset 
> apache-beam-testing:temp_dataset_90f5797bdb5f4137af750399f91a8e66 does not 
> exist so we will create it as temporary with location=None
> 19:39:12  apache_beam.io.gcp.bigquery: DEBUG: Creating or getting table 
>  19:39:12   datasetId: 'python_query_to_table_15792323245106'
> 19:39:12   projectId: 'apache-beam-testing'
> 19:39:12   tableId: 'output_table'> with schema {'fields': [{'name': 'fruit', 
> 'type': 'STRING', 'mode': 'NULLABLE'}]}.
> 19:39:12  apache_beam.io.gcp.bigquery_tools: DEBUG: Created the table with id 
> output_table
> 19:39:12  apache_beam.io.gcp.bigquery_tools: INFO: Created table 
> apache-beam-testing.python_query_to_table_15792323245106.output_table with 
> schema  19:39:12   fields: [ 19:39:12   fields: []
> 19:39:12   mode: 'NULLABLE'
> 19:39:12   name: 'fruit'
> 19:39:12   type: 'STRING'>]>. Result:  19:39:12   creationTime: 1579232328576
> 19:39:12   etag: 'WYysl6UIvc8IWMmTiiKhbg=='
> 19:39:12   id: 
> 'apache-beam-testing:python_query_to_table_15792323245106.output_table'
> 19:39:12   kind: 'bigquery#table'
> 19:39:12   lastModifiedTime: 1579232328629
> 19:39:12   location: 'US'
> 19:39:12   numBytes: 0
> 19:39:12   numLongTermBytes: 0
> 19:39:12   numRows: 0
> 19:39:12   schema:  19:39:12   fields: [ 19:39:12   fields: []
> 19:39:12   mode: 'NULLABLE'
> 19:39:12   name: 'fruit'
> 19:39:12   type: 'STRING'>]>
> 19:39:12   selfLink: 
> 'https://www.googleapis.com/bigquery/v2/projects/apache-beam-testing/datasets/python_query_to_table_15792323245106/tables/output_table'
> 19:39:12   tableReference:  19:39:12   datasetId: 

[jira] [Work logged] (BEAM-7246) Create a Spanner IO for Python

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7246?focusedWorklogId=373814=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373814
 ]

ASF GitHub Bot logged work on BEAM-7246:


Author: ASF GitHub Bot
Created on: 17/Jan/20 19:29
Start Date: 17/Jan/20 19:29
Worklog Time Spent: 10m 
  Work Description: chamikaramj commented on issue #9606: [BEAM-7246] Add 
Google Spanner IO Read on Python SDK
URL: https://github.com/apache/beam/pull/9606#issuecomment-575763166
 
 
   Retest this please
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373814)
Time Spent: 11h 50m  (was: 11h 40m)

> Create a Spanner IO for Python
> --
>
> Key: BEAM-7246
> URL: https://issues.apache.org/jira/browse/BEAM-7246
> Project: Beam
>  Issue Type: Bug
>  Components: io-py-gcp
>Reporter: Reuven Lax
>Assignee: Shehzaad Nakhoda
>Priority: Major
>  Time Spent: 11h 50m
>  Remaining Estimate: 0h
>
> Add I/O support for Google Cloud Spanner for the Python SDK (Batch Only).
> Testing in this work item will be in the form of DirectRunner tests and 
> manual testing.
> Integration and performance tests are a separate work item (not included 
> here).
> See https://beam.apache.org/documentation/io/built-in/. The goal is to add 
> Google Clound Spanner to the Database column for the Python/Batch row.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7246) Create a Spanner IO for Python

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7246?focusedWorklogId=373815=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373815
 ]

ASF GitHub Bot logged work on BEAM-7246:


Author: ASF GitHub Bot
Created on: 17/Jan/20 19:29
Start Date: 17/Jan/20 19:29
Worklog Time Spent: 10m 
  Work Description: chamikaramj commented on issue #9606: [BEAM-7246] Add 
Google Spanner IO Read on Python SDK
URL: https://github.com/apache/beam/pull/9606#issuecomment-575763362
 
 
   Retest this please
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373815)
Time Spent: 12h  (was: 11h 50m)

> Create a Spanner IO for Python
> --
>
> Key: BEAM-7246
> URL: https://issues.apache.org/jira/browse/BEAM-7246
> Project: Beam
>  Issue Type: Bug
>  Components: io-py-gcp
>Reporter: Reuven Lax
>Assignee: Shehzaad Nakhoda
>Priority: Major
>  Time Spent: 12h
>  Remaining Estimate: 0h
>
> Add I/O support for Google Cloud Spanner for the Python SDK (Batch Only).
> Testing in this work item will be in the form of DirectRunner tests and 
> manual testing.
> Integration and performance tests are a separate work item (not included 
> here).
> See https://beam.apache.org/documentation/io/built-in/. The goal is to add 
> Google Clound Spanner to the Database column for the Python/Batch row.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9122) Add uses_keyed_state step property to python dataflow runner

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9122?focusedWorklogId=373808=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373808
 ]

ASF GitHub Bot logged work on BEAM-9122:


Author: ASF GitHub Bot
Created on: 17/Jan/20 19:24
Start Date: 17/Jan/20 19:24
Worklog Time Spent: 10m 
  Work Description: y1chi commented on pull request #10596: [BEAM-9122] Add 
uses_keyed_state step property in python dataflow run…
URL: https://github.com/apache/beam/pull/10596#discussion_r368097986
 
 

 ##
 File path: sdks/python/apache_beam/runners/dataflow/dataflow_runner.py
 ##
 @@ -950,6 +950,10 @@ def run_ParDo(self, transform_node, options):
   step.add_property(PropertyNames.RESTRICTION_ENCODING,
 self._get_cloud_encoding(restriction_coder))
 
+from apache_beam.runners.common import DoFnSignature
+if DoFnSignature(transform.dofn).is_stateful_dofn():
 
 Review comment:
   I see, done.
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373808)
Time Spent: 1.5h  (was: 1h 20m)

> Add uses_keyed_state step property to python dataflow runner
> 
>
> Key: BEAM-9122
> URL: https://issues.apache.org/jira/browse/BEAM-9122
> Project: Beam
>  Issue Type: Improvement
>  Components: sdk-py-core
>Reporter: Yichi Zhang
>Assignee: Yichi Zhang
>Priority: Major
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> Add additional step property to dataflow job property when a DoFn is stateful 
> in python sdk. So that the backend runner can recognize stateful steps.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7861) Make it easy to change between multi-process and multi-thread mode for Python Direct runners

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7861?focusedWorklogId=373798=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373798
 ]

ASF GitHub Bot logged work on BEAM-7861:


Author: ASF GitHub Bot
Created on: 17/Jan/20 19:13
Start Date: 17/Jan/20 19:13
Worklog Time Spent: 10m 
  Work Description: tvalentyn commented on issue #10616: [BEAM-7861] update 
documentation
URL: https://github.com/apache/beam/pull/10616#issuecomment-575757817
 
 
   Run Website_Stage_GCS PreCommit
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373798)
Time Spent: 3h 10m  (was: 3h)

> Make it easy to change between multi-process and multi-thread mode for Python 
> Direct runners
> 
>
> Key: BEAM-7861
> URL: https://issues.apache.org/jira/browse/BEAM-7861
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 3h 10m
>  Remaining Estimate: 0h
>
> BEAM-3645 makes it possible to run a map task parallel.
> However, users need to change runner when switch between multithreading and 
> multiprocessing mode.
> We want to add a flag (ex: --use-multiprocess) to make the switch easy 
> without changing the runner each time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Closed] (BEAM-8685) Beam Dependency Update Request: com.google.auth:google-auth-library-oauth2-http

2020-01-17 Thread Tomo Suzuki (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-8685?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Tomo Suzuki closed BEAM-8685.
-
Fix Version/s: 2.20.0
   Resolution: Fixed

Now it's 0.19 in master

> Beam Dependency Update Request: 
> com.google.auth:google-auth-library-oauth2-http
> ---
>
> Key: BEAM-8685
> URL: https://issues.apache.org/jira/browse/BEAM-8685
> Project: Beam
>  Issue Type: Sub-task
>  Components: dependencies
>Reporter: Beam JIRA Bot
>Assignee: Tomo Suzuki
>Priority: Major
> Fix For: 2.20.0
>
>  Time Spent: 3h 40m
>  Remaining Estimate: 0h
>
>  - 2019-11-15 19:39:27.324449 
> -
> Please consider upgrading the dependency 
> com.google.auth:google-auth-library-oauth2-http. 
> The current version is 0.12.0. The latest version is 0.18.0 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-11-19 21:05:03.844285 
> -
> Please consider upgrading the dependency 
> com.google.auth:google-auth-library-oauth2-http. 
> The current version is 0.12.0. The latest version is 0.18.0 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-02 12:10:30.864371 
> -
> Please consider upgrading the dependency 
> com.google.auth:google-auth-library-oauth2-http. 
> The current version is 0.12.0. The latest version is 0.18.0 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-09 12:09:38.646889 
> -
> Please consider upgrading the dependency 
> com.google.auth:google-auth-library-oauth2-http. 
> The current version is 0.12.0. The latest version is 0.18.0 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-23 12:09:39.967215 
> -
> Please consider upgrading the dependency 
> com.google.auth:google-auth-library-oauth2-http. 
> The current version is 0.12.0. The latest version is 0.19.0 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2019-12-30 14:05:10.534268 
> -
> Please consider upgrading the dependency 
> com.google.auth:google-auth-library-oauth2-http. 
> The current version is 0.12.0. The latest version is 0.19.0 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 
>  - 2020-01-06 12:08:45.451960 
> -
> Please consider upgrading the dependency 
> com.google.auth:google-auth-library-oauth2-http. 
> The current version is 0.12.0. The latest version is 0.19.0 
> cc: 
>  Please refer to [Beam Dependency Guide 
> |https://beam.apache.org/contribute/dependencies/]for more information. 
> Do Not Modify The Description Above. 



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-7861) Make it easy to change between multi-process and multi-thread mode for Python Direct runners

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-7861?focusedWorklogId=373796=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373796
 ]

ASF GitHub Bot logged work on BEAM-7861:


Author: ASF GitHub Bot
Created on: 17/Jan/20 19:10
Start Date: 17/Jan/20 19:10
Worklog Time Spent: 10m 
  Work Description: tvalentyn commented on issue #10616: [BEAM-7861] update 
documentation
URL: https://github.com/apache/beam/pull/10616#issuecomment-575756723
 
 
   retest this please
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373796)
Time Spent: 3h  (was: 2h 50m)

> Make it easy to change between multi-process and multi-thread mode for Python 
> Direct runners
> 
>
> Key: BEAM-7861
> URL: https://issues.apache.org/jira/browse/BEAM-7861
> Project: Beam
>  Issue Type: Sub-task
>  Components: sdk-py-core
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: 2.19.0
>
>  Time Spent: 3h
>  Remaining Estimate: 0h
>
> BEAM-3645 makes it possible to run a map task parallel.
> However, users need to change runner when switch between multithreading and 
> multiprocessing mode.
> We want to add a flag (ex: --use-multiprocess) to make the switch easy 
> without changing the runner each time.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Closed] (BEAM-9130) sdks:python:test-suites:direct:py2:hdfsIntegrationTest is failing with ImportError: No module named google.protobuf.message

2020-01-17 Thread Udi Meiri (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Udi Meiri closed BEAM-9130.
---
Fix Version/s: Not applicable
   Resolution: Fixed

> sdks:python:test-suites:direct:py2:hdfsIntegrationTest is failing with 
> ImportError: No module named google.protobuf.message
> ---
>
> Key: BEAM-9130
> URL: https://issues.apache.org/jira/browse/BEAM-9130
> Project: Beam
>  Issue Type: Improvement
>  Components: test-failures
>Reporter: Valentyn Tymofieiev
>Assignee: Udi Meiri
>Priority: Major
>  Labels: currently-failing
> Fix For: Not applicable
>
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> From logs:
> {noformat}
> 16:33:50File "/usr/local/lib/python2.7/multiprocessing/process.py", line 
> 267, in _bootstrap
> 16:33:50  self.run()
> 16:33:50File "/usr/local/lib/python2.7/multiprocessing/process.py", line 
> 114, in run
> 16:33:50  self._target(*self._args, **self._kwargs)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 357, in 
> _install_grpcio_tools_and_generate_proto_files
> 16:33:50  generate_proto_files(force=force)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 324, in 
> generate_proto_files
> 16:33:50  generate_urn_files(log, out_dir)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 65, in 
> generate_urn_files
> 16:33:50  import google.protobuf.message as message
> 16:33:50  ImportError: No module named google.protobuf.message
> 16:33:50  Traceback (most recent call last):
> 16:33:50File "setup.py", line 305, in 
> 16:33:50  'mypy': generate_protos_first(mypy),
> 16:33:50File 
> "/usr/local/lib/python2.7/site-packages/setuptools/__init__.py", line 145, in 
> setup
> 16:33:50  return distutils.core.setup(**attrs)
> 16:33:50File "/usr/local/lib/python2.7/distutils/core.py", line 151, in 
> setup
> 16:33:50  dist.run_commands()
> 16:33:50File "/usr/local/lib/python2.7/distutils/dist.py", line 953, in 
> run_commands
> 16:33:50  self.run_command(cmd)
> 16:33:50File "/usr/local/lib/python2.7/distutils/dist.py", line 972, in 
> run_command
> 16:33:50  cmd_obj.run()
> 16:33:50File 
> "/usr/local/lib/python2.7/site-packages/setuptools/command/sdist.py", line 
> 44, in run
> 16:33:50  self.run_command('egg_info')
> 16:33:50File "/usr/local/lib/python2.7/distutils/cmd.py", line 326, in 
> run_command
> 16:33:50  self.distribution.run_command(command)
> 16:33:50File "/usr/local/lib/python2.7/distutils/dist.py", line 972, in 
> run_command
> 16:33:50  cmd_obj.run()
> 16:33:50File "setup.py", line 229, in run
> 16:33:50  gen_protos.generate_proto_files(log=log)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 291, in 
> generate_proto_files
> 16:33:50  raise ValueError("Proto generation failed (see log for 
> details).")
> 16:33:50  ValueError: Proto generation failed (see log for 
> details
> {noformat}
> {noformat}
> import google.protobuf.message as message
> ImportError: No module named google.protobuf.message
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Assigned] (BEAM-9130) sdks:python:test-suites:direct:py2:hdfsIntegrationTest is failing with ImportError: No module named google.protobuf.message

2020-01-17 Thread Udi Meiri (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9130?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Udi Meiri reassigned BEAM-9130:
---

Assignee: Udi Meiri

> sdks:python:test-suites:direct:py2:hdfsIntegrationTest is failing with 
> ImportError: No module named google.protobuf.message
> ---
>
> Key: BEAM-9130
> URL: https://issues.apache.org/jira/browse/BEAM-9130
> Project: Beam
>  Issue Type: Improvement
>  Components: test-failures
>Reporter: Valentyn Tymofieiev
>Assignee: Udi Meiri
>Priority: Major
>  Labels: currently-failing
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> From logs:
> {noformat}
> 16:33:50File "/usr/local/lib/python2.7/multiprocessing/process.py", line 
> 267, in _bootstrap
> 16:33:50  self.run()
> 16:33:50File "/usr/local/lib/python2.7/multiprocessing/process.py", line 
> 114, in run
> 16:33:50  self._target(*self._args, **self._kwargs)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 357, in 
> _install_grpcio_tools_and_generate_proto_files
> 16:33:50  generate_proto_files(force=force)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 324, in 
> generate_proto_files
> 16:33:50  generate_urn_files(log, out_dir)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 65, in 
> generate_urn_files
> 16:33:50  import google.protobuf.message as message
> 16:33:50  ImportError: No module named google.protobuf.message
> 16:33:50  Traceback (most recent call last):
> 16:33:50File "setup.py", line 305, in 
> 16:33:50  'mypy': generate_protos_first(mypy),
> 16:33:50File 
> "/usr/local/lib/python2.7/site-packages/setuptools/__init__.py", line 145, in 
> setup
> 16:33:50  return distutils.core.setup(**attrs)
> 16:33:50File "/usr/local/lib/python2.7/distutils/core.py", line 151, in 
> setup
> 16:33:50  dist.run_commands()
> 16:33:50File "/usr/local/lib/python2.7/distutils/dist.py", line 953, in 
> run_commands
> 16:33:50  self.run_command(cmd)
> 16:33:50File "/usr/local/lib/python2.7/distutils/dist.py", line 972, in 
> run_command
> 16:33:50  cmd_obj.run()
> 16:33:50File 
> "/usr/local/lib/python2.7/site-packages/setuptools/command/sdist.py", line 
> 44, in run
> 16:33:50  self.run_command('egg_info')
> 16:33:50File "/usr/local/lib/python2.7/distutils/cmd.py", line 326, in 
> run_command
> 16:33:50  self.distribution.run_command(command)
> 16:33:50File "/usr/local/lib/python2.7/distutils/dist.py", line 972, in 
> run_command
> 16:33:50  cmd_obj.run()
> 16:33:50File "setup.py", line 229, in run
> 16:33:50  gen_protos.generate_proto_files(log=log)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 291, in 
> generate_proto_files
> 16:33:50  raise ValueError("Proto generation failed (see log for 
> details).")
> 16:33:50  ValueError: Proto generation failed (see log for 
> details
> {noformat}
> {noformat}
> import google.protobuf.message as message
> ImportError: No module named google.protobuf.message
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9130) sdks:python:test-suites:direct:py2:hdfsIntegrationTest is failing with ImportError: No module named google.protobuf.message

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9130?focusedWorklogId=373791=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373791
 ]

ASF GitHub Bot logged work on BEAM-9130:


Author: ASF GitHub Bot
Created on: 17/Jan/20 18:58
Start Date: 17/Jan/20 18:58
Worklog Time Spent: 10m 
  Work Description: udim commented on pull request #10629: [BEAM-9130] 
Migrate HDFS IT to use tox env.
URL: https://github.com/apache/beam/pull/10629
 
 
   
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373791)
Time Spent: 1.5h  (was: 1h 20m)

> sdks:python:test-suites:direct:py2:hdfsIntegrationTest is failing with 
> ImportError: No module named google.protobuf.message
> ---
>
> Key: BEAM-9130
> URL: https://issues.apache.org/jira/browse/BEAM-9130
> Project: Beam
>  Issue Type: Improvement
>  Components: test-failures
>Reporter: Valentyn Tymofieiev
>Priority: Major
>  Labels: currently-failing
>  Time Spent: 1.5h
>  Remaining Estimate: 0h
>
> From logs:
> {noformat}
> 16:33:50File "/usr/local/lib/python2.7/multiprocessing/process.py", line 
> 267, in _bootstrap
> 16:33:50  self.run()
> 16:33:50File "/usr/local/lib/python2.7/multiprocessing/process.py", line 
> 114, in run
> 16:33:50  self._target(*self._args, **self._kwargs)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 357, in 
> _install_grpcio_tools_and_generate_proto_files
> 16:33:50  generate_proto_files(force=force)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 324, in 
> generate_proto_files
> 16:33:50  generate_urn_files(log, out_dir)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 65, in 
> generate_urn_files
> 16:33:50  import google.protobuf.message as message
> 16:33:50  ImportError: No module named google.protobuf.message
> 16:33:50  Traceback (most recent call last):
> 16:33:50File "setup.py", line 305, in 
> 16:33:50  'mypy': generate_protos_first(mypy),
> 16:33:50File 
> "/usr/local/lib/python2.7/site-packages/setuptools/__init__.py", line 145, in 
> setup
> 16:33:50  return distutils.core.setup(**attrs)
> 16:33:50File "/usr/local/lib/python2.7/distutils/core.py", line 151, in 
> setup
> 16:33:50  dist.run_commands()
> 16:33:50File "/usr/local/lib/python2.7/distutils/dist.py", line 953, in 
> run_commands
> 16:33:50  self.run_command(cmd)
> 16:33:50File "/usr/local/lib/python2.7/distutils/dist.py", line 972, in 
> run_command
> 16:33:50  cmd_obj.run()
> 16:33:50File 
> "/usr/local/lib/python2.7/site-packages/setuptools/command/sdist.py", line 
> 44, in run
> 16:33:50  self.run_command('egg_info')
> 16:33:50File "/usr/local/lib/python2.7/distutils/cmd.py", line 326, in 
> run_command
> 16:33:50  self.distribution.run_command(command)
> 16:33:50File "/usr/local/lib/python2.7/distutils/dist.py", line 972, in 
> run_command
> 16:33:50  cmd_obj.run()
> 16:33:50File "setup.py", line 229, in run
> 16:33:50  gen_protos.generate_proto_files(log=log)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 291, in 
> generate_proto_files
> 16:33:50  raise ValueError("Proto generation failed (see log for 
> details).")
> 16:33:50  ValueError: Proto generation failed (see log for 
> details
> {noformat}
> {noformat}
> import google.protobuf.message as message
> ImportError: No module named google.protobuf.message
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9144) Beam's own Avro TimeConversion class in beam-sdk-java-core

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9144?focusedWorklogId=373789=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373789
 ]

ASF GitHub Bot logged work on BEAM-9144:


Author: ASF GitHub Bot
Created on: 17/Jan/20 18:57
Start Date: 17/Jan/20 18:57
Worklog Time Spent: 10m 
  Work Description: suztomo commented on issue #10628: [BEAM-9144] Beam's 
own Avro TimeConversion class in beam-sdk-java-core
URL: https://github.com/apache/beam/pull/10628#issuecomment-575752067
 
 
   Java PreCommit failed:
   ```
   12:51:00 Execution failed for task ':sdks:java:io:rabbitmq:test'.
   12:51:00 > Process 'Gradle Test Executor 101' finished with non-zero exit 
value 143
   ```
   https://builds.apache.org/job/beam_PreCommit_Java_Commit/9677/console
   
   I believe it's transient (It worked in my local)
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373789)
Time Spent: 1h  (was: 50m)

> Beam's own Avro TimeConversion class in beam-sdk-java-core 
> ---
>
> Key: BEAM-9144
> URL: https://issues.apache.org/jira/browse/BEAM-9144
> Project: Beam
>  Issue Type: Bug
>  Components: sdk-java-core
>Reporter: Tomo Suzuki
>Assignee: Tomo Suzuki
>Priority: Major
> Fix For: 2.19.0
>
> Attachments: avro-beam-dependency-graph.png
>
>  Time Spent: 1h
>  Remaining Estimate: 0h
>
> From Aaron's comment in 
> https://issues.apache.org/jira/browse/BEAM-8388?focusedCommentId=17016476=com.atlassian.jira.plugin.system.issuetabpanels%3Acomment-tabpanel#comment-17016476
>  .
> {quote}My org must use Avro 1.9.x (due to some Avro schema resolution issues 
> resolved in 1.9.x) so downgrading Avro is not possible for us.
>  Beam 2.16.0 is compatible with our usage of Avro 1.9.x – but upgrading to 
> 2.17.0 we are broken as 2.17.0 links to Java classes in Avro 1.8.x that are 
> not available in 1.9.x.
> {quote}
> The Java class is 
> {{org.apache.avro.data.TimeConversions.TimestampConversion}} in Avro 1.8.
>  It's renamed to {{org.apache.avro.data.JodaTimeConversions}} in Avro 1.9.
> h1. Beam Java SDK cannot upgrade Avro to 1.9
> Beam has Spark runners and Spark has not yet upgraded to Avro 1.9.
> Illustration of the dependency
> !avro-beam-dependency-graph.png|width=799,height=385!
> h1. Short-term Solution
> As illustrated above, as long as Beam Java SDK uses only the intersection of 
> Avro classes, method, and fields between Avro 1.8 and 1.9, it will provide 
> flexibility in runtime Avro versions (as it did until Beam 2.16).
> h2. Difference of the TimeConversion Classes
> Avro 1.9's TimestampConversion overrides {{getRecommendedSchema}} method. 
> Details below:
> Avro 1.8's TimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
>   }
> {code}
> Avro 1.9's JodaTimeConversions.TimestampConversion:
> {code:java}
>   public static class TimestampConversion extends Conversion {
> @Override
> public Class getConvertedType() {
>   return DateTime.class;
> }
> @Override
> public String getLogicalTypeName() {
>   return "timestamp-millis";
> }
> @Override
> public DateTime fromLong(Long millisFromEpoch, Schema schema, LogicalType 
> type) {
>   return new DateTime(millisFromEpoch, DateTimeZone.UTC);
> }
> @Override
> public Long toLong(DateTime timestamp, Schema schema, LogicalType type) {
>   return timestamp.getMillis();
> }
> @Override
> public Schema getRecommendedSchema() {
>   return 
> LogicalTypes.timestampMillis().addToSchema(Schema.create(Schema.Type.LONG));
> }
>   }
> {code}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9130) sdks:python:test-suites:direct:py2:hdfsIntegrationTest is failing with ImportError: No module named google.protobuf.message

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9130?focusedWorklogId=373790=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373790
 ]

ASF GitHub Bot logged work on BEAM-9130:


Author: ASF GitHub Bot
Created on: 17/Jan/20 18:57
Start Date: 17/Jan/20 18:57
Worklog Time Spent: 10m 
  Work Description: udim commented on issue #10629: [BEAM-9130] Migrate 
HDFS IT to use tox env.
URL: https://github.com/apache/beam/pull/10629#issuecomment-575752267
 
 
   Opened https://issues.apache.org/jira/browse/BEAM-9148 for flaky postcommit 
test (test_big_query_standard_sql).
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373790)
Time Spent: 1h 20m  (was: 1h 10m)

> sdks:python:test-suites:direct:py2:hdfsIntegrationTest is failing with 
> ImportError: No module named google.protobuf.message
> ---
>
> Key: BEAM-9130
> URL: https://issues.apache.org/jira/browse/BEAM-9130
> Project: Beam
>  Issue Type: Improvement
>  Components: test-failures
>Reporter: Valentyn Tymofieiev
>Priority: Major
>  Labels: currently-failing
>  Time Spent: 1h 20m
>  Remaining Estimate: 0h
>
> From logs:
> {noformat}
> 16:33:50File "/usr/local/lib/python2.7/multiprocessing/process.py", line 
> 267, in _bootstrap
> 16:33:50  self.run()
> 16:33:50File "/usr/local/lib/python2.7/multiprocessing/process.py", line 
> 114, in run
> 16:33:50  self._target(*self._args, **self._kwargs)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 357, in 
> _install_grpcio_tools_and_generate_proto_files
> 16:33:50  generate_proto_files(force=force)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 324, in 
> generate_proto_files
> 16:33:50  generate_urn_files(log, out_dir)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 65, in 
> generate_urn_files
> 16:33:50  import google.protobuf.message as message
> 16:33:50  ImportError: No module named google.protobuf.message
> 16:33:50  Traceback (most recent call last):
> 16:33:50File "setup.py", line 305, in 
> 16:33:50  'mypy': generate_protos_first(mypy),
> 16:33:50File 
> "/usr/local/lib/python2.7/site-packages/setuptools/__init__.py", line 145, in 
> setup
> 16:33:50  return distutils.core.setup(**attrs)
> 16:33:50File "/usr/local/lib/python2.7/distutils/core.py", line 151, in 
> setup
> 16:33:50  dist.run_commands()
> 16:33:50File "/usr/local/lib/python2.7/distutils/dist.py", line 953, in 
> run_commands
> 16:33:50  self.run_command(cmd)
> 16:33:50File "/usr/local/lib/python2.7/distutils/dist.py", line 972, in 
> run_command
> 16:33:50  cmd_obj.run()
> 16:33:50File 
> "/usr/local/lib/python2.7/site-packages/setuptools/command/sdist.py", line 
> 44, in run
> 16:33:50  self.run_command('egg_info')
> 16:33:50File "/usr/local/lib/python2.7/distutils/cmd.py", line 326, in 
> run_command
> 16:33:50  self.distribution.run_command(command)
> 16:33:50File "/usr/local/lib/python2.7/distutils/dist.py", line 972, in 
> run_command
> 16:33:50  cmd_obj.run()
> 16:33:50File "setup.py", line 229, in run
> 16:33:50  gen_protos.generate_proto_files(log=log)
> 16:33:50File "/app/sdks/python/gen_protos.py", line 291, in 
> generate_proto_files
> 16:33:50  raise ValueError("Proto generation failed (see log for 
> details).")
> 16:33:50  ValueError: Proto generation failed (see log for 
> details
> {noformat}
> {noformat}
> import google.protobuf.message as message
> ImportError: No module named google.protobuf.message
> {noformat}



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Work logged] (BEAM-9063) Migrate docker images to apache namespace.

2020-01-17 Thread ASF GitHub Bot (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9063?focusedWorklogId=373787=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-373787
 ]

ASF GitHub Bot logged work on BEAM-9063:


Author: ASF GitHub Bot
Created on: 17/Jan/20 18:56
Start Date: 17/Jan/20 18:56
Worklog Time Spent: 10m 
  Work Description: ibzib commented on issue #10612: [NOT READY TO 
MERGE][BEAM-9063] migrate docker images to apache
URL: https://github.com/apache/beam/pull/10612#issuecomment-575751659
 
 
   retest this please
 

This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
---

Worklog Id: (was: 373787)
Time Spent: 40m  (was: 0.5h)

> Migrate docker images to apache namespace.
> --
>
> Key: BEAM-9063
> URL: https://issues.apache.org/jira/browse/BEAM-9063
> Project: Beam
>  Issue Type: Task
>  Components: beam-community
>Reporter: Hannah Jiang
>Assignee: Hannah Jiang
>Priority: Major
> Fix For: Not applicable
>
>  Time Spent: 40m
>  Remaining Estimate: 0h
>
> https://hub.docker.com/u/apache



--
This message was sent by Atlassian Jira
(v8.3.4#803005)


[jira] [Updated] (BEAM-9148) test flakiness: BigQueryQueryToTableIT.test_big_query_standard_sql

2020-01-17 Thread Udi Meiri (Jira)


 [ 
https://issues.apache.org/jira/browse/BEAM-9148?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel
 ]

Udi Meiri updated BEAM-9148:

Description: 
There might be other flaky test cases from the same class, but I'm focusing on 
test_big_query_standard_sql here.

{code}
19:39:12  ==
19:39:12  FAIL: test_big_query_standard_sql 
(apache_beam.io.gcp.big_query_query_to_table_it_test.BigQueryQueryToTableIT)
19:39:12  --
19:39:12  Traceback (most recent call last):
19:39:12File 
"/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37/src/sdks/python/apache_beam/io/gcp/big_query_query_to_table_it_test.py",
 line 172, in test_big_query_standard_sql
19:39:12  big_query_query_to_table_pipeline.run_bq_pipeline(options)
19:39:12File 
"/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37/src/sdks/python/apache_beam/io/gcp/big_query_query_to_table_pipeline.py",
 line 84, in run_bq_pipeline
19:39:12  result = p.run()
19:39:12File 
"/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37/src/sdks/python/apache_beam/testing/test_pipeline.py",
 line 112, in run
19:39:12  else test_runner_api))
19:39:12File 
"/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37/src/sdks/python/apache_beam/pipeline.py",
 line 461, in run
19:39:12  self._options).run(False)
19:39:12File 
"/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37/src/sdks/python/apache_beam/pipeline.py",
 line 474, in run
19:39:12  return self.runner.run_pipeline(self, self._options)
19:39:12File 
"/home/jenkins/jenkins-slave/workspace/beam_PostCommit_Python37/src/sdks/python/apache_beam/runners/direct/test_direct_runner.py",
 line 53, in run_pipeline
19:39:12  hc_assert_that(self.result, pickler.loads(on_success_matcher))
19:39:12  AssertionError: 
19:39:12  Expected: (Test pipeline expected terminated in state: DONE and 
Expected checksum is 158a8ea1c254fcf40d4ed3e7c0242c3ea0a29e72)
19:39:12   but: Expected checksum is 
158a8ea1c254fcf40d4ed3e7c0242c3ea0a29e72 Actual checksum is 
da39a3ee5e6b4b0d3255bfef95601890afd80709
19:39:12  
19:39:12   >> begin captured logging << 
19:39:12  root: DEBUG: Unhandled type_constraint: Union[]
19:39:12  root: DEBUG: Unhandled type_constraint: Union[]
19:39:12  apache_beam.runners.direct.direct_runner: INFO: Running pipeline with 
DirectRunner.
19:39:12  apache_beam.io.gcp.bigquery_tools: DEBUG: Query SELECT * FROM (SELECT 
"apple" as fruit) UNION ALL (SELECT "orange" as fruit) does not reference any 
tables.
19:39:12  apache_beam.io.gcp.bigquery_tools: WARNING: Dataset 
apache-beam-testing:temp_dataset_90f5797bdb5f4137af750399f91a8e66 does not 
exist so we will create it as temporary with location=None
19:39:12  apache_beam.io.gcp.bigquery: DEBUG: Creating or getting table 
 with schema {'fields': [{'name': 'fruit', 
'type': 'STRING', 'mode': 'NULLABLE'}]}.
19:39:12  apache_beam.io.gcp.bigquery_tools: DEBUG: Created the table with id 
output_table
19:39:12  apache_beam.io.gcp.bigquery_tools: INFO: Created table 
apache-beam-testing.python_query_to_table_15792323245106.output_table with 
schema ]>. Result: ]>
19:39:12   selfLink: 
'https://www.googleapis.com/bigquery/v2/projects/apache-beam-testing/datasets/python_query_to_table_15792323245106/tables/output_table'
19:39:12   tableReference: 
19:39:12   type: 'TABLE'>.
19:39:12  apache_beam.io.gcp.bigquery: DEBUG: Attempting to flush to all 
destinations. Total buffered: 2
19:39:12  apache_beam.io.gcp.bigquery: DEBUG: Flushing data to 
apache-beam-testing:python_query_to_table_15792323245106.output_table. Total 2 
rows.
19:39:12  apache_beam.io.gcp.tests.bigquery_matcher: INFO: Attempting to 
perform query SELECT fruit from 
`python_query_to_table_15792323245106.output_table`; to BQ
19:39:12  google.auth.transport._http_client: DEBUG: Making request: GET 
http://169.254.169.254
19:39:12  google.auth.transport._http_client: DEBUG: Making request: GET 
http://metadata.google.internal/computeMetadata/v1/project/project-id
19:39:12  urllib3.util.retry: DEBUG: Converted retries value: 3 -> 
Retry(total=3, connect=None, read=None, redirect=None, status=None)
19:39:12  google.auth.transport.requests: DEBUG: Making request: GET 
http://metadata.google.internal/computeMetadata/v1/instance/service-accounts/default/?recursive=true
19:39:12  urllib3.connectionpool: DEBUG: Starting new HTTP connection (1): 
metadata.google.internal:80
19:39:12  urllib3.connectionpool: DEBUG: http://metadata.google.internal:80 
"GET /computeMetadata/v1/instance/service-accounts/default/?recursive=true 
HTTP/1.1" 200 144
19:39:12  google.auth.transport.requests: DEBUG: Making request: GET 

  1   2   3   >