(beam) branch users/damccorm/jobName deleted (was 8f8c12d4e7f)

2024-10-04 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/jobName
in repository https://gitbox.apache.org/repos/asf/beam.git


 was 8f8c12d4e7f Improve wheels job name

The revisions that were on this branch are still contained in
other references; therefore, this change does not discard any commits
from the repository.



(beam) branch master updated: Improve wheels job name (#32644)

2024-10-04 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new decc997ea57 Improve wheels job name (#32644)
decc997ea57 is described below

commit decc997ea57a61449062d1d7ac47d8ac53b04d9d
Author: Danny McCormick 
AuthorDate: Fri Oct 4 14:27:36 2024 -0400

Improve wheels job name (#32644)
---
 .github/workflows/build_wheels.yml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/.github/workflows/build_wheels.yml 
b/.github/workflows/build_wheels.yml
index 25030231a5e..d1e99f2bd57 100644
--- a/.github/workflows/build_wheels.yml
+++ b/.github/workflows/build_wheels.yml
@@ -211,7 +211,7 @@ jobs:
 run: gsutil cp -r -a public-read source/* ${{ env.GCP_PATH }}
 
   build_wheels:
-name: Build python wheels on ${{matrix.arch}} for ${{ matrix.os_python.os 
}}
+name: Build python ${{matrix.py_version}} wheels on 
${{matrix.os_python.arch}} for ${{ matrix.os_python.os }}
 needs:
   - check_env_variables
   - build_source



(beam) branch master updated: Update groupbykey.py (#32359)

2024-10-03 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new 6561abf7beb Update groupbykey.py (#32359)
6561abf7beb is described below

commit 6561abf7beb3d19e4ff4fa9726c4351b9019c1ab
Author: Christoph Grotz 
AuthorDate: Thu Oct 3 17:37:56 2024 +0200

Update groupbykey.py (#32359)

The original example was not actually counting the produce but grouping the 
produce per season. Maybe it's better to rename the variables to reflect this, 
in order to not confuse the reader.
---
 .../examples/snippets/transforms/aggregation/groupbykey.py| 8 
 1 file changed, 4 insertions(+), 4 deletions(-)

diff --git 
a/sdks/python/apache_beam/examples/snippets/transforms/aggregation/groupbykey.py
 
b/sdks/python/apache_beam/examples/snippets/transforms/aggregation/groupbykey.py
index aca39f6a219..a2d32b564a3 100644
--- 
a/sdks/python/apache_beam/examples/snippets/transforms/aggregation/groupbykey.py
+++ 
b/sdks/python/apache_beam/examples/snippets/transforms/aggregation/groupbykey.py
@@ -40,9 +40,9 @@ def groupbykey(test=None):
   import apache_beam as beam
 
   with beam.Pipeline() as pipeline:
-produce_counts = (
+produce_per_season = (
 pipeline
-| 'Create produce counts' >> beam.Create([
+| 'Create produce list' >> beam.Create([
 ('spring', '🍓'),
 ('spring', '🥕'),
 ('spring', '🍆'),
@@ -54,12 +54,12 @@ def groupbykey(test=None):
 ('fall', '🍅'),
 ('winter', '🍆'),
 ])
-| 'Group counts per produce' >> beam.GroupByKey()
+| 'Group produce per season' >> beam.GroupByKey()
 | beam.MapTuple(lambda k, vs: (k, sorted(vs)))  # sort and format
 | beam.Map(print))
 # [END groupbykey]
 if test:
-  test(produce_counts)
+  test(produce_per_season)
 
 
 if __name__ == '__main__':



(beam) 01/01: Improve wheels job name

2024-10-03 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch users/damccorm/jobName
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 8f8c12d4e7f24f9d959ee8dbdead66afcf389dd3
Author: Danny McCormick 
AuthorDate: Thu Oct 3 11:14:30 2024 -0400

Improve wheels job name
---
 .github/workflows/build_wheels.yml | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/.github/workflows/build_wheels.yml 
b/.github/workflows/build_wheels.yml
index 25030231a5e..d1e99f2bd57 100644
--- a/.github/workflows/build_wheels.yml
+++ b/.github/workflows/build_wheels.yml
@@ -211,7 +211,7 @@ jobs:
 run: gsutil cp -r -a public-read source/* ${{ env.GCP_PATH }}
 
   build_wheels:
-name: Build python wheels on ${{matrix.arch}} for ${{ matrix.os_python.os 
}}
+name: Build python ${{matrix.py_version}} wheels on 
${{matrix.os_python.arch}} for ${{ matrix.os_python.os }}
 needs:
   - check_env_variables
   - build_source



(beam) branch users/damccorm/jobName created (now 8f8c12d4e7f)

2024-10-03 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/jobName
in repository https://gitbox.apache.org/repos/asf/beam.git


  at 8f8c12d4e7f Improve wheels job name

This branch includes the following new commits:

 new 8f8c12d4e7f Improve wheels job name

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




(beam) branch master updated: Rag opensearch usecase with Beam's MLTransform (#32018)

2024-10-03 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new 1d7b0bcde9e Rag opensearch usecase with Beam's MLTransform (#32018)
1d7b0bcde9e is described below

commit 1d7b0bcde9e5f6954f00845a93827d89532ac2c3
Author: Ayush Pandey <32012449+itsayushpan...@users.noreply.github.com>
AuthorDate: Thu Oct 3 09:10:12 2024 -0400

Rag opensearch usecase with Beam's MLTransform (#32018)

* Adding insertion and enrichment pipeline

* Enhanced Data Schema

* Added Apache Licensed to the notebook

* Adding Chunking Strategy

* removed unused imports

* Modified insertion logic in redis for incorporating chunking strategy

* refacted redis code

* code review changes

* Added chunking code in notebook

* Added code review changes

* Code review changes: using chunking strategy as enum

* Added Code Review Changes

* Code review changes

* Added code review changes

* Added Code Review Changes

* Code review changes

* Ingestion and Enrichment pipeline for OpenSearch Vector DB

* Added logic for reading password from .env file

* Added opensearch vector notebook

* Update credentials.env

* Added code review changes

* Added Description in opensearch notebook

* Added description in opensearch notebook

* Code review changes
---
 .../beam-ml/rag_usecase/opensearch_connector.py|  372 +
 .../beam-ml/rag_usecase/opensearch_enrichment.py   |  134 ++
 .../rag_usecase/opensearch_rag_pipeline.ipynb  | 1719 
 3 files changed, 2225 insertions(+)

diff --git a/examples/notebooks/beam-ml/rag_usecase/opensearch_connector.py 
b/examples/notebooks/beam-ml/rag_usecase/opensearch_connector.py
new file mode 100644
index 000..fc83c8d443c
--- /dev/null
+++ b/examples/notebooks/beam-ml/rag_usecase/opensearch_connector.py
@@ -0,0 +1,372 @@
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributor license agreements.  See the NOTICE file distributed with
+# this work for additional information regarding copyright ownership.
+# The ASF licenses this file to You under the Apache License, Version 2.0
+# (the "License"); you may not use this file except in compliance with
+# the License.  You may obtain a copy of the License at
+#
+#http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+#
+
+from __future__ import absolute_import
+
+import apache_beam as beam
+
+from apache_beam.transforms import DoFn
+from apache_beam.transforms import PTransform
+from apache_beam.transforms import Reshuffle
+
+from typing import Optional
+from opensearchpy import OpenSearch
+
+import os
+from dotenv import load_dotenv
+
+load_dotenv()
+
+# Set the logging level to reduce verbose information
+import logging
+
+logging.root.setLevel(logging.INFO)
+logger = logging.getLogger(__name__)
+
+__all__ = ['InsertDocInOpenSearch', 'InsertEmbeddingInOpenSearch']
+
+"""This module implements IO classes to read document in Opensearch.
+
+
+Insert Doc in OpenSearch:
+-
+:class:`InsertDocInOpenSearch` is a ``PTransform`` that writes key and values 
to a
+configured sink, and the write is conducted through a Opensearch pipeline.
+
+The ptransform works by getting the first and second elements from the input,
+this means that inputs like `[k,v]` or `(k,v)` are valid.
+
+Example usage::
+
+  pipeline | InsertDocInOpenSearch(host='localhost',
+  port=6379,
+  username='admin',
+  password='admin'
+  batch_size=100)
+
+
+No backward compatibility guarantees. Everything in this module is 
experimental.
+"""
+
+
+class InsertDocInOpenSearch(PTransform):
+"""InsertDocInOpensearch is a ``PTransform`` that writes a ``PCollection`` 
of
+key, value tuple or 2-element array into a Opensearch server.
+"""
+
+def __init__(self,
+ host: str,
+ port: int,
+ username: Optional[str],
+ password: Optional[str],
+ batch_size: int = 100
+ ):
+"""
+Args:
+host (str): The opensearch host
+port (int): The opensearch port
+us

(beam) branch master updated: Call out breaking assert_that change more explicitly (#32624)

2024-10-02 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new d9ffc23ae13 Call out breaking assert_that change more explicitly 
(#32624)
d9ffc23ae13 is described below

commit d9ffc23ae13742788d1c38c5109ef518a24d89cb
Author: Danny McCormick 
AuthorDate: Wed Oct 2 09:52:50 2024 -0400

Call out breaking assert_that change more explicitly (#32624)
---
 CHANGES.md  | 1 +
 sdks/python/apache_beam/testing/util.py | 7 ++-
 2 files changed, 7 insertions(+), 1 deletion(-)

diff --git a/CHANGES.md b/CHANGES.md
index d92639d626b..3d224157e59 100644
--- a/CHANGES.md
+++ b/CHANGES.md
@@ -75,6 +75,7 @@
 
 ## Breaking Changes
 
+* In Python, assert_that now throws if it is not in a pipeline context instead 
of silently succeeding ([#30771](https://github.com/apache/beam/pull/30771))
 * In Python and YAML, ReadFromJson now override the dtype from None to
   an explicit False.  Most notably, string values like `"123"` are preserved
   as strings rather than silently coerced (and possibly truncated) to numeric
diff --git a/sdks/python/apache_beam/testing/util.py 
b/sdks/python/apache_beam/testing/util.py
index f7fabde43d4..8532d1c1f97 100644
--- a/sdks/python/apache_beam/testing/util.py
+++ b/sdks/python/apache_beam/testing/util.py
@@ -266,7 +266,12 @@ def assert_that(
 # The pipeline was already run. The user most likely called assert_that
 # after the pipeleline context.
 raise RuntimeError(
-'assert_that must be used within a beam.Pipeline context')
+'assert_that must be used within a beam.Pipeline context. ' +
+'Prior to Beam 2.60.0, asserts outside of the context of a pipeline ' +
+'were silently ignored, starting with Beam 2.60.0 this is no longer ' +
+'allowed. To fix, move your assert_that call into your pipeline ' +
+'context so that it is added before the pipeline is run. For more ' +
+'information, see https://github.com/apache/beam/pull/30771')
 
   # Usually, the uniqueness of the label is left to the pipeline
   # writer to guarantee. Since we're in a testing context, we'll



(beam) branch users/damccorm/breaking deleted (was 05489829fc4)

2024-10-02 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/breaking
in repository https://gitbox.apache.org/repos/asf/beam.git


 was 05489829fc4 Call out breaking assert_that change more explicitly

The revisions that were on this branch are still contained in
other references; therefore, this change does not discard any commits
from the repository.



(beam) 01/01: Call out breaking assert_that change more explicitly

2024-10-02 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch users/damccorm/breaking
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 05489829fc4c5a7f8f0b103551f9c9b20365e89f
Author: Danny McCormick 
AuthorDate: Wed Oct 2 09:00:28 2024 -0400

Call out breaking assert_that change more explicitly
---
 CHANGES.md  | 1 +
 sdks/python/apache_beam/testing/util.py | 7 ++-
 2 files changed, 7 insertions(+), 1 deletion(-)

diff --git a/CHANGES.md b/CHANGES.md
index d92639d626b..3d224157e59 100644
--- a/CHANGES.md
+++ b/CHANGES.md
@@ -75,6 +75,7 @@
 
 ## Breaking Changes
 
+* In Python, assert_that now throws if it is not in a pipeline context instead 
of silently succeeding ([#30771](https://github.com/apache/beam/pull/30771))
 * In Python and YAML, ReadFromJson now override the dtype from None to
   an explicit False.  Most notably, string values like `"123"` are preserved
   as strings rather than silently coerced (and possibly truncated) to numeric
diff --git a/sdks/python/apache_beam/testing/util.py 
b/sdks/python/apache_beam/testing/util.py
index f7fabde43d4..8532d1c1f97 100644
--- a/sdks/python/apache_beam/testing/util.py
+++ b/sdks/python/apache_beam/testing/util.py
@@ -266,7 +266,12 @@ def assert_that(
 # The pipeline was already run. The user most likely called assert_that
 # after the pipeleline context.
 raise RuntimeError(
-'assert_that must be used within a beam.Pipeline context')
+'assert_that must be used within a beam.Pipeline context. ' +
+'Prior to Beam 2.60.0, asserts outside of the context of a pipeline ' +
+'were silently ignored, starting with Beam 2.60.0 this is no longer ' +
+'allowed. To fix, move your assert_that call into your pipeline ' +
+'context so that it is added before the pipeline is run. For more ' +
+'information, see https://github.com/apache/beam/pull/30771')
 
   # Usually, the uniqueness of the label is left to the pipeline
   # writer to guarantee. Since we're in a testing context, we'll



(beam) branch users/damccorm/breaking created (now 05489829fc4)

2024-10-02 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/breaking
in repository https://gitbox.apache.org/repos/asf/beam.git


  at 05489829fc4 Call out breaking assert_that change more explicitly

This branch includes the following new commits:

 new 05489829fc4 Call out breaking assert_that change more explicitly

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




(beam) branch users/damccorm/bq_fix deleted (was bc0dde5fc0c)

2024-09-30 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_fix
in repository https://gitbox.apache.org/repos/asf/beam.git


 was bc0dde5fc0c feedback

The revisions that were on this branch are still contained in
other references; therefore, this change does not discard any commits
from the repository.



(beam) branch master updated: Force BQIO to output elements in the correct row (#32584)

2024-09-30 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new e640b2578fb Force BQIO to output elements in the correct row (#32584)
e640b2578fb is described below

commit e640b2578fb1d31ac5823fc95cceff346624d4b3
Author: Danny McCormick 
AuthorDate: Mon Sep 30 12:43:17 2024 -0400

Force BQIO to output elements in the correct row (#32584)

* Fix bqio

* import fix

* syntax

* feedback
---
 .github/trigger_files/beam_PostCommit_Python.json  |  2 +-
 ...tCommit_Python_ValidatesContainer_Dataflow.json |  3 +-
 sdks/python/apache_beam/io/gcp/bigquery.py | 59 +-
 .../apache_beam/io/gcp/bigquery_write_it_test.py   |  1 +
 4 files changed, 38 insertions(+), 27 deletions(-)

diff --git a/.github/trigger_files/beam_PostCommit_Python.json 
b/.github/trigger_files/beam_PostCommit_Python.json
index 30ee463ad4e..1eb60f6e495 100644
--- a/.github/trigger_files/beam_PostCommit_Python.json
+++ b/.github/trigger_files/beam_PostCommit_Python.json
@@ -1,5 +1,5 @@
 {
   "comment": "Modify this file in a trivial way to cause this test suite to 
run.",
-  "modification": 2
+  "modification": 3
 }
 
diff --git 
a/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json 
b/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json
index d6c608f6dab..4897480d69a 100644
--- 
a/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json
+++ 
b/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json
@@ -1,3 +1,4 @@
 {
-"comment": "Modify this file in a trivial way to cause this test suite to 
run"
+"comment": "Modify this file in a trivial way to cause this test suite to 
run",
+"modification": 1
 }
\ No newline at end of file
diff --git a/sdks/python/apache_beam/io/gcp/bigquery.py 
b/sdks/python/apache_beam/io/gcp/bigquery.py
index b897df2d32a..2cb64742f26 100644
--- a/sdks/python/apache_beam/io/gcp/bigquery.py
+++ b/sdks/python/apache_beam/io/gcp/bigquery.py
@@ -418,7 +418,6 @@ from apache_beam.transforms.external import 
SchemaAwareExternalTransform
 from apache_beam.transforms.sideinputs import SIDE_INPUT_PREFIX
 from apache_beam.transforms.sideinputs import get_sideinput_index
 from apache_beam.transforms.util import ReshufflePerKey
-from apache_beam.transforms.window import GlobalWindows
 from apache_beam.typehints.row_type import RowTypeConstraint
 from apache_beam.typehints.schemas import schema_from_element_type
 from apache_beam.utils import retry
@@ -1581,7 +1580,8 @@ class BigQueryWriteFn(DoFn):
 additional_create_parameters=self.additional_bq_parameters)
 _KNOWN_TABLES.add(str_table_reference)
 
-  def process(self, element, *schema_side_inputs):
+  def process(
+  self, element, window_value=DoFn.WindowedValueParam, 
*schema_side_inputs):
 destination = bigquery_tools.get_hashable_destination(element[0])
 
 if callable(self.schema):
@@ -1608,12 +1608,11 @@ class BigQueryWriteFn(DoFn):
 return [
 pvalue.TaggedOutput(
 BigQueryWriteFn.FAILED_ROWS_WITH_ERRORS,
-GlobalWindows.windowed_value(
+window_value.with_value(
 (destination, row_and_insert_id[0], error))),
 pvalue.TaggedOutput(
 BigQueryWriteFn.FAILED_ROWS,
-GlobalWindows.windowed_value(
-(destination, row_and_insert_id[0])))
+window_value.with_value((destination, row_and_insert_id[0])))
 ]
 
   # Flush current batch first if adding this row will exceed our limits
@@ -1624,11 +1623,11 @@ class BigQueryWriteFn(DoFn):
 flushed_batch = self._flush_batch(destination)
 # After flushing our existing batch, we now buffer the current row
 # for the next flush
-self._rows_buffer[destination].append(row_and_insert_id)
+self._rows_buffer[destination].append((row_and_insert_id, 
window_value))
 self._destination_buffer_byte_size[destination] = row_byte_size
 return flushed_batch
 
-  self._rows_buffer[destination].append(row_and_insert_id)
+  self._rows_buffer[destination].append((row_and_insert_id, window_value))
   self._destination_buffer_byte_size[destination] += row_byte_size
   self._total_buffered_rows += 1
   if self._total_buffered_rows >= self._max_buffered_rows:
@@ -1636,7 +1635,8 @@ class BigQueryWriteFn(DoFn):
 else:
   # The input is already batched per destination, flush the rows now.
   batched_rows = element[1]
-  self._rows_buffer[destination].extend(batched_rows)
+  for r in batched_rows:
+self._rows_buffer[destination].appen

(beam) branch users/damccorm/bq_fix updated (c19dc93ad00 -> bc0dde5fc0c)

2024-09-30 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_fix
in repository https://gitbox.apache.org/repos/asf/beam.git


from c19dc93ad00 syntax
 add bc0dde5fc0c feedback

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/io/gcp/bigquery.py | 8 ++--
 1 file changed, 6 insertions(+), 2 deletions(-)



(beam) branch users/damccorm/bq_fix updated (e41c2eccb9f -> c19dc93ad00)

2024-09-27 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_fix
in repository https://gitbox.apache.org/repos/asf/beam.git


from e41c2eccb9f import fix
 add c19dc93ad00 syntax

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/io/gcp/bigquery.py | 36 +-
 1 file changed, 16 insertions(+), 20 deletions(-)



(beam) branch users/damccorm/bq_fix updated (62bd70e852a -> e41c2eccb9f)

2024-09-27 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_fix
in repository https://gitbox.apache.org/repos/asf/beam.git


from 62bd70e852a Fix bqio
 add e41c2eccb9f import fix

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/io/gcp/bigquery.py | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)



(beam) 01/01: Fix bqio

2024-09-27 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch users/damccorm/bq_fix
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 62bd70e852a99c3b5c878ba3e598148cc172746c
Author: Danny McCormick 
AuthorDate: Fri Sep 27 15:59:57 2024 -0400

Fix bqio
---
 .github/trigger_files/beam_PostCommit_Python.json  |  2 +-
 ...tCommit_Python_ValidatesContainer_Dataflow.json |  3 +-
 sdks/python/apache_beam/io/gcp/bigquery.py | 41 +-
 .../apache_beam/io/gcp/bigquery_write_it_test.py   |  1 +
 4 files changed, 29 insertions(+), 18 deletions(-)

diff --git a/.github/trigger_files/beam_PostCommit_Python.json 
b/.github/trigger_files/beam_PostCommit_Python.json
index 30ee463ad4e..1eb60f6e495 100644
--- a/.github/trigger_files/beam_PostCommit_Python.json
+++ b/.github/trigger_files/beam_PostCommit_Python.json
@@ -1,5 +1,5 @@
 {
   "comment": "Modify this file in a trivial way to cause this test suite to 
run.",
-  "modification": 2
+  "modification": 3
 }
 
diff --git 
a/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json 
b/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json
index d6c608f6dab..4897480d69a 100644
--- 
a/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json
+++ 
b/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json
@@ -1,3 +1,4 @@
 {
-"comment": "Modify this file in a trivial way to cause this test suite to 
run"
+"comment": "Modify this file in a trivial way to cause this test suite to 
run",
+"modification": 1
 }
\ No newline at end of file
diff --git a/sdks/python/apache_beam/io/gcp/bigquery.py 
b/sdks/python/apache_beam/io/gcp/bigquery.py
index b897df2d32a..9ecf9e635c9 100644
--- a/sdks/python/apache_beam/io/gcp/bigquery.py
+++ b/sdks/python/apache_beam/io/gcp/bigquery.py
@@ -1581,7 +1581,11 @@ class BigQueryWriteFn(DoFn):
 additional_create_parameters=self.additional_bq_parameters)
 _KNOWN_TABLES.add(str_table_reference)
 
-  def process(self, element, *schema_side_inputs):
+  def process(
+  self,
+  element,
+  window_value=beam.DoFn.WindowedValueParam,
+  *schema_side_inputs):
 destination = bigquery_tools.get_hashable_destination(element[0])
 
 if callable(self.schema):
@@ -1608,12 +1612,11 @@ class BigQueryWriteFn(DoFn):
 return [
 pvalue.TaggedOutput(
 BigQueryWriteFn.FAILED_ROWS_WITH_ERRORS,
-GlobalWindows.windowed_value(
+window_value.with_value(
 (destination, row_and_insert_id[0], error))),
 pvalue.TaggedOutput(
 BigQueryWriteFn.FAILED_ROWS,
-GlobalWindows.windowed_value(
-(destination, row_and_insert_id[0])))
+window_value.with_value((destination, row_and_insert_id[0])))
 ]
 
   # Flush current batch first if adding this row will exceed our limits
@@ -1624,11 +1627,11 @@ class BigQueryWriteFn(DoFn):
 flushed_batch = self._flush_batch(destination)
 # After flushing our existing batch, we now buffer the current row
 # for the next flush
-self._rows_buffer[destination].append(row_and_insert_id)
+self._rows_buffer[destination].append((row_and_insert_id, 
window_value))
 self._destination_buffer_byte_size[destination] = row_byte_size
 return flushed_batch
 
-  self._rows_buffer[destination].append(row_and_insert_id)
+  self._rows_buffer[destination].append((row_and_insert_id, window_value))
   self._destination_buffer_byte_size[destination] += row_byte_size
   self._total_buffered_rows += 1
   if self._total_buffered_rows >= self._max_buffered_rows:
@@ -1636,7 +1639,8 @@ class BigQueryWriteFn(DoFn):
 else:
   # The input is already batched per destination, flush the rows now.
   batched_rows = element[1]
-  self._rows_buffer[destination].extend(batched_rows)
+  for r in batched_rows:
+self._rows_buffer[destination].append((r, window_value))
   return self._flush_batch(destination)
 
   def finish_bundle(self):
@@ -1659,7 +1663,7 @@ class BigQueryWriteFn(DoFn):
   def _flush_batch(self, destination):
 
 # Flush the current batch of rows to BigQuery.
-rows_and_insert_ids = self._rows_buffer[destination]
+rows_and_insert_ids_with_windows = self._rows_buffer[destination]
 table_reference = bigquery_tools.parse_table_reference(destination)
 if table_reference.projectId is None:
   table_reference.projectId = vp.RuntimeValueProvider.get_value(
@@ -1668,9 +1672,11 @@ class BigQueryWriteFn(DoFn):
 _LOGGER.debug(
 'Flushing data to %s. Total %s rows.',
 destination,
-len(rows_and_insert_ids))
-  

(beam) branch users/damccorm/bq_fix created (now 62bd70e852a)

2024-09-27 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_fix
in repository https://gitbox.apache.org/repos/asf/beam.git


  at 62bd70e852a Fix bqio

This branch includes the following new commits:

 new 62bd70e852a Fix bqio

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




(beam) branch users/damccorm/bq_writes deleted (was 6f8f9a9be16)

2024-09-27 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_writes
in repository https://gitbox.apache.org/repos/asf/beam.git


 was 6f8f9a9be16 fmt

The revisions that were on this branch are still contained in
other references; therefore, this change does not discard any commits
from the repository.



(beam) branch users/damccorm/bq_writes created (now d674a8ed417)

2024-09-27 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_writes
in repository https://gitbox.apache.org/repos/asf/beam.git


  at d674a8ed417 Always write to BQ from global window

This branch includes the following new commits:

 new d674a8ed417 Always write to BQ from global window

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




(beam) branch users/damccorm/bq_writes updated (d674a8ed417 -> 6f8f9a9be16)

2024-09-27 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_writes
in repository https://gitbox.apache.org/repos/asf/beam.git


from d674a8ed417 Always write to BQ from global window
 add 6f8f9a9be16 fmt

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/io/gcp/bigquery_write_it_test.py | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)



(beam) 01/01: Always write to BQ from global window

2024-09-27 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch users/damccorm/bq_writes
in repository https://gitbox.apache.org/repos/asf/beam.git

commit d674a8ed41764f6d37f8a6f1981c8dea8c2530ae
Author: Danny McCormick 
AuthorDate: Fri Sep 27 13:12:56 2024 -0400

Always write to BQ from global window
---
 .github/trigger_files/beam_PostCommit_Python.json | 2 +-
 .../beam_PostCommit_Python_ValidatesContainer_Dataflow.json   | 3 ++-
 sdks/python/apache_beam/io/gcp/bigquery.py| 3 +++
 sdks/python/apache_beam/io/gcp/bigquery_write_it_test.py  | 4 ++--
 4 files changed, 8 insertions(+), 4 deletions(-)

diff --git a/.github/trigger_files/beam_PostCommit_Python.json 
b/.github/trigger_files/beam_PostCommit_Python.json
index 30ee463ad4e..1eb60f6e495 100644
--- a/.github/trigger_files/beam_PostCommit_Python.json
+++ b/.github/trigger_files/beam_PostCommit_Python.json
@@ -1,5 +1,5 @@
 {
   "comment": "Modify this file in a trivial way to cause this test suite to 
run.",
-  "modification": 2
+  "modification": 3
 }
 
diff --git 
a/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json 
b/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json
index d6c608f6dab..4897480d69a 100644
--- 
a/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json
+++ 
b/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json
@@ -1,3 +1,4 @@
 {
-"comment": "Modify this file in a trivial way to cause this test suite to 
run"
+"comment": "Modify this file in a trivial way to cause this test suite to 
run",
+"modification": 1
 }
\ No newline at end of file
diff --git a/sdks/python/apache_beam/io/gcp/bigquery.py 
b/sdks/python/apache_beam/io/gcp/bigquery.py
index b897df2d32a..a5f84dc441e 100644
--- a/sdks/python/apache_beam/io/gcp/bigquery.py
+++ b/sdks/python/apache_beam/io/gcp/bigquery.py
@@ -1864,6 +1864,9 @@ class _StreamToBigQuery(PTransform):
 return (
 tagged_data
 | 'FromHashableTableRef' >> beam.Map(_restore_table_ref)
+# Use global window for writes since we're outputting back into the
+# global window.
+| 'Window into Global Window' >> beam.WindowInto(GlobalWindows())
 | 'StreamInsertRows' >> ParDo(
 bigquery_write_fn, *self.schema_side_inputs).with_outputs(
 BigQueryWriteFn.FAILED_ROWS,
diff --git a/sdks/python/apache_beam/io/gcp/bigquery_write_it_test.py 
b/sdks/python/apache_beam/io/gcp/bigquery_write_it_test.py
index b0140793cf7..bb5c36a3e9b 100644
--- a/sdks/python/apache_beam/io/gcp/bigquery_write_it_test.py
+++ b/sdks/python/apache_beam/io/gcp/bigquery_write_it_test.py
@@ -470,7 +470,7 @@ class BigQueryWriteIntegrationTests(unittest.TestCase):
 input_data = [{
 'number': 1,
 'str': 'some_string',
-}]
+}]*500
 
 table_schema = {
 "fields": [{
@@ -483,7 +483,7 @@ class BigQueryWriteIntegrationTests(unittest.TestCase):
 bq_result_errors = [({
 'number': 1,
 'str': 'some_string',
-}, "Not Found")]
+}, "Not Found")]*500
 
 args = self.test_pipeline.get_full_options_as_args()
 



(beam) branch users/damccorm/bq_sampling updated (4e0ad877043 -> 82a05fee9fe)

2024-09-27 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_sampling
in repository https://gitbox.apache.org/repos/asf/beam.git


from 4e0ad877043 return type
 add 82a05fee9fe lint

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/utils/windowed_value_test.py | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)



(beam) branch users/damccorm/bq_sampling updated (67c8349a024 -> 4e0ad877043)

2024-09-26 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_sampling
in repository https://gitbox.apache.org/repos/asf/beam.git


from 67c8349a024 fix to_global_window
 add 4e0ad877043 return type

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/coders/coder_impl.py | 4 +++-
 1 file changed, 3 insertions(+), 1 deletion(-)



(beam) branch users/damccorm/bq_sampling updated (674d3caba0a -> 67c8349a024)

2024-09-26 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_sampling
in repository https://gitbox.apache.org/repos/asf/beam.git


from 674d3caba0a ssmall fix
 add 67c8349a024 fix to_global_window

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/transforms/window.py | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)



(beam) branch users/damccorm/bq_sampling updated (a67a9d85189 -> e76d4c65a05)

2024-09-26 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_sampling
in repository https://gitbox.apache.org/repos/asf/beam.git


from a67a9d85189 Add hash function
 add e76d4c65a05 use member functions instead of inheritance

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/coders/coder_impl.py | 21 +--
 sdks/python/apache_beam/transforms/window.py | 30 +++-
 2 files changed, 27 insertions(+), 24 deletions(-)



(beam) branch users/damccorm/bq_sampling updated (e76d4c65a05 -> 674d3caba0a)

2024-09-26 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_sampling
in repository https://gitbox.apache.org/repos/asf/beam.git


from e76d4c65a05 use member functions instead of inheritance
 add 674d3caba0a ssmall fix

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/transforms/window.py | 10 ++
 1 file changed, 6 insertions(+), 4 deletions(-)



(beam) branch users/damccorm/bq_sampling updated (8e76e1aee1b -> a67a9d85189)

2024-09-26 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_sampling
in repository https://gitbox.apache.org/repos/asf/beam.git


from 8e76e1aee1b revert setup changes
 add a67a9d85189 Add hash function

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/coders/coder_impl.py |  2 +-
 sdks/python/apache_beam/transforms/window.py | 17 +
 2 files changed, 10 insertions(+), 9 deletions(-)



(beam) branch users/damccorm/bq_sampling updated (948eb33fa8a -> 280691bec5a)

2024-09-26 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_sampling
in repository https://gitbox.apache.org/repos/asf/beam.git


from 948eb33fa8a Simpler repro of problem
 add 280691bec5a Fix coder issues

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/transforms/window.py | 4 ++--
 sdks/python/apache_beam/utils/windowed_value_test.py | 9 +
 sdks/python/setup.py | 5 +++--
 3 files changed, 10 insertions(+), 8 deletions(-)



(beam) branch users/damccorm/bq_sampling updated (280691bec5a -> fed480e2983)

2024-09-26 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_sampling
in repository https://gitbox.apache.org/repos/asf/beam.git


from 280691bec5a Fix coder issues
 add fed480e2983 clean up equals

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/transforms/window.py | 13 +++--
 1 file changed, 11 insertions(+), 2 deletions(-)



(beam) branch users/damccorm/bq_sampling updated (e2705dde452 -> 8e76e1aee1b)

2024-09-26 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_sampling
in repository https://gitbox.apache.org/repos/asf/beam.git


from e2705dde452 fully translate back
 add 8e76e1aee1b revert setup changes

No new revisions were added by this update.

Summary of changes:
 sdks/python/setup.py | 5 ++---
 1 file changed, 2 insertions(+), 3 deletions(-)



(beam) branch users/damccorm/bq_sampling updated (fed480e2983 -> e2705dde452)

2024-09-26 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_sampling
in repository https://gitbox.apache.org/repos/asf/beam.git


from fed480e2983 clean up equals
 add e2705dde452 fully translate back

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/coders/coder_impl.py | 7 +++
 1 file changed, 7 insertions(+)



(beam) branch users/damccorm/bq_sampling updated (65e382594e3 -> 948eb33fa8a)

2024-09-26 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_sampling
in repository https://gitbox.apache.org/repos/asf/beam.git


from 65e382594e3 Got the repro
 add 948eb33fa8a Simpler repro of problem

No new revisions were added by this update.

Summary of changes:
 .github/trigger_files/beam_PostCommit_Python.json|  2 +-
 ...am_PostCommit_Python_ValidatesContainer_Dataflow.json |  3 +--
 sdks/python/apache_beam/io/gcp/bigquery_write_it_test.py |  1 -
 sdks/python/apache_beam/utils/windowed_value_test.py | 16 
 4 files changed, 18 insertions(+), 4 deletions(-)



(beam) branch users/damccorm/bq_sampling updated (3a472e03401 -> 65e382594e3)

2024-09-26 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_sampling
in repository https://gitbox.apache.org/repos/asf/beam.git


from 3a472e03401 More suites
 add 65e382594e3 Got the repro

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/io/gcp/bigquery_write_it_test.py | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)



(beam) branch users/damccorm/bq_sampling updated: More suites

2024-09-26 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch users/damccorm/bq_sampling
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/users/damccorm/bq_sampling by 
this push:
 new 3a472e03401 More suites
3a472e03401 is described below

commit 3a472e0340149954ef9c56e2657f6fd873a72549
Author: Danny McCormick 
AuthorDate: Thu Sep 26 10:13:35 2024 -0400

More suites
---
 .../beam_PostCommit_Python_ValidatesContainer_Dataflow.json| 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)

diff --git 
a/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json 
b/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json
index d6c608f6dab..a88b73a2acd 100644
--- 
a/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json
+++ 
b/.github/trigger_files/beam_PostCommit_Python_ValidatesContainer_Dataflow.json
@@ -1,3 +1,4 @@
 {
-"comment": "Modify this file in a trivial way to cause this test suite to 
run"
+"comment": "Modify this file in a trivial way to cause this test suite to 
run",
+"modification": "1"
 }
\ No newline at end of file



(beam) branch users/damccorm/bq_sampling created (now 3d3168773b2)

2024-09-26 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/bq_sampling
in repository https://gitbox.apache.org/repos/asf/beam.git


  at 3d3168773b2 Try to repro failure

This branch includes the following new commits:

 new 3d3168773b2 Try to repro failure

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




(beam) 01/01: Try to repro failure

2024-09-26 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch users/damccorm/bq_sampling
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 3d3168773b29edab2a7de19facdd0219fdc98c75
Author: Danny McCormick 
AuthorDate: Thu Sep 26 10:12:13 2024 -0400

Try to repro failure
---
 .github/trigger_files/beam_PostCommit_Python.json| 2 +-
 sdks/python/apache_beam/io/gcp/bigquery_write_it_test.py | 4 ++--
 2 files changed, 3 insertions(+), 3 deletions(-)

diff --git a/.github/trigger_files/beam_PostCommit_Python.json 
b/.github/trigger_files/beam_PostCommit_Python.json
index 2934a91b84b..30ee463ad4e 100644
--- a/.github/trigger_files/beam_PostCommit_Python.json
+++ b/.github/trigger_files/beam_PostCommit_Python.json
@@ -1,5 +1,5 @@
 {
   "comment": "Modify this file in a trivial way to cause this test suite to 
run.",
-  "modification": 1
+  "modification": 2
 }
 
diff --git a/sdks/python/apache_beam/io/gcp/bigquery_write_it_test.py 
b/sdks/python/apache_beam/io/gcp/bigquery_write_it_test.py
index b0140793cf7..bb5c36a3e9b 100644
--- a/sdks/python/apache_beam/io/gcp/bigquery_write_it_test.py
+++ b/sdks/python/apache_beam/io/gcp/bigquery_write_it_test.py
@@ -470,7 +470,7 @@ class BigQueryWriteIntegrationTests(unittest.TestCase):
 input_data = [{
 'number': 1,
 'str': 'some_string',
-}]
+}]*500
 
 table_schema = {
 "fields": [{
@@ -483,7 +483,7 @@ class BigQueryWriteIntegrationTests(unittest.TestCase):
 bq_result_errors = [({
 'number': 1,
 'str': 'some_string',
-}, "Not Found")]
+}, "Not Found")]*500
 
 args = self.test_pipeline.get_full_options_as_args()
 



(beam) branch damccorm-patch-1 deleted (was 6a46b0de0db)

2024-09-24 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch damccorm-patch-1
in repository https://gitbox.apache.org/repos/asf/beam.git


 was 6a46b0de0db Take version from default

The revisions that were on this branch are still contained in
other references; therefore, this change does not discard any commits
from the repository.



(beam) branch master updated (2d5e72c615d -> 5d8b0418cef)

2024-09-24 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


from 2d5e72c615d Bump go.mongodb.org/mongo-driver from 1.16.1 to 1.17.0 in 
/sdks (#32497)
 add 5d8b0418cef Update build.gradle to use 3.12 for metrics report (#32547)

No new revisions were added by this update.

Summary of changes:
 .test-infra/jenkins/build.gradle | 3 ---
 1 file changed, 3 deletions(-)



(beam) branch master updated: Vllm model handler (#32410)

2024-09-24 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new 97cb452e4b9 Vllm model handler (#32410)
97cb452e4b9 is described below

commit 97cb452e4b92603ea0a28a225c6ff1d60200840e
Author: Danny McCormick 
AuthorDate: Tue Sep 24 14:00:43 2024 -0400

Vllm model handler (#32410)

* Vllm first pass [wip]

* Example for integration tests wip

* Still wip

* Test changes

* Dockerfile improvements

* Remove bad change

* Clean up test args

* clean up invocation

* string fix

* string fix

* clean up

* lint

* Get tests working with 5xx driver

* cleanup

* Fixes, everything is now working

* Batching

* lint

* Feedback + CHANGES.md
---
 .github/trigger_files/beam_PostCommit_Python.json  |   4 +-
 CHANGES.md |  15 +-
 build.gradle.kts   |   1 +
 .../apache_beam/examples/inference/README.md   |  80 ++
 .../examples/inference/vllm_text_completion.py | 162 +++
 .../ml/inference/test_resources/vllm.dockerfile|  47 
 .../apache_beam/ml/inference/vllm_inference.py | 312 +
 sdks/python/setup.py   |   1 +
 sdks/python/test-suites/dataflow/common.gradle |  39 +++
 9 files changed, 646 insertions(+), 15 deletions(-)

diff --git a/.github/trigger_files/beam_PostCommit_Python.json 
b/.github/trigger_files/beam_PostCommit_Python.json
index d01a47e7265..30ee463ad4e 100644
--- a/.github/trigger_files/beam_PostCommit_Python.json
+++ b/.github/trigger_files/beam_PostCommit_Python.json
@@ -1,5 +1,5 @@
 {
-  "comment": "modify this file in a trivial way to cause this test suite to 
run.",
-  "modification": 1
+  "comment": "Modify this file in a trivial way to cause this test suite to 
run.",
+  "modification": 2
 }
 
diff --git a/CHANGES.md b/CHANGES.md
index d58ceffeb41..c123a8e1a4d 100644
--- a/CHANGES.md
+++ b/CHANGES.md
@@ -57,18 +57,13 @@
 
 ## Highlights
 
-* New highly anticipated feature X added to Python SDK 
([#X](https://github.com/apache/beam/issues/X)).
-* New highly anticipated feature Y added to Java SDK 
([#Y](https://github.com/apache/beam/issues/Y)).
-
-## I/Os
-
-* Support for X source added (Java/Python) 
([#X](https://github.com/apache/beam/issues/X)).
+* Added support for using vLLM in the RunInference transform (Python) 
([#32528](https://github.com/apache/beam/issues/32528))
 
 ## New Features / Improvements
 
 * Dataflow worker can install packages from Google Artifact Registry Python 
repositories (Python) ([#32123](https://github.com/apache/beam/issues/32123)).
 * Added support for Zstd codec in SerializableAvroCodecFactory (Java) 
([#32349](https://github.com/apache/beam/issues/32349))
-* X feature added (Java/Python) 
([#X](https://github.com/apache/beam/issues/X)).
+* Added support for using vLLM in the RunInference transform (Python) 
([#32528](https://github.com/apache/beam/issues/32528))
 
 ## Breaking Changes
 
@@ -77,11 +72,9 @@
   as strings rather than silently coerced (and possibly truncated) to numeric
   values.  To retain the old behavior, pass `dtype=True` (or any other value
   accepted by `pandas.read_json`).
-* X behavior was changed ([#X](https://github.com/apache/beam/issues/X)).
 
 ## Deprecations
 
-* X behavior is deprecated and will be removed in X versions 
([#X](https://github.com/apache/beam/issues/X)).
 * Python 3.8 is reaching EOL and support is being removed in Beam 2.61.0. The 
2.60.0 release will warn users
 when running on 3.8. ([#31192](https://github.com/apache/beam/issues/31192))
 
@@ -92,10 +85,6 @@ when running on 3.8. 
([#31192](https://github.com/apache/beam/issues/31192))
 ## Security Fixes
 * Fixed (CVE--)[https://www.cve.org/CVERecord?id=CVE--] 
(Java/Python/Go) ([#X](https://github.com/apache/beam/issues/X)).
 
-## Known Issues
-
-* ([#X](https://github.com/apache/beam/issues/X)).
-
 # [2.59.0] - 2024-09-11
 
 ## Highlights
diff --git a/build.gradle.kts b/build.gradle.kts
index d74cae3267e..38b58b6979e 100644
--- a/build.gradle.kts
+++ b/build.gradle.kts
@@ -542,6 +542,7 @@ tasks.register("python312PostCommit") {
   dependsOn(":sdks:python:test-suites:direct:py312:postCommitIT")
   dependsOn(":sdks:python:test-suites:direct:py312:hdfsIntegrationTest")
   dependsOn(":sdks:python:test-suites:portable:py312:postCommitPy312")
+  
dependsOn(":sdks:python:test-suites:dataflow:py312:inferencePostCommitITPy312")
 }
 
 tasks.register("portablePythonPreCommit") {
diff --git a/sdks/python/apache_beam/examples/inference/README.md 
b/sdks/pyth

(beam) branch damccorm-patch-1 updated (81541f0a0e9 -> 6a46b0de0db)

2024-09-24 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch damccorm-patch-1
in repository https://gitbox.apache.org/repos/asf/beam.git


from 81541f0a0e9 Update build.gradle to use 3.12
 add 6a46b0de0db Take version from default

No new revisions were added by this update.

Summary of changes:
 .test-infra/jenkins/build.gradle | 3 ---
 1 file changed, 3 deletions(-)



(beam) 01/01: Update build.gradle to use 3.12

2024-09-24 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch damccorm-patch-1
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 81541f0a0e9f336a6c99db8271d14004de3b7006
Author: Danny McCormick 
AuthorDate: Tue Sep 24 12:37:52 2024 -0400

Update build.gradle to use 3.12
---
 .test-infra/jenkins/build.gradle | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/.test-infra/jenkins/build.gradle b/.test-infra/jenkins/build.gradle
index 37c9c4d8d6a..9f4acf8b791 100644
--- a/.test-infra/jenkins/build.gradle
+++ b/.test-infra/jenkins/build.gradle
@@ -23,7 +23,7 @@ applyGroovyNature()
 applyPythonNature()
 
 // TODO(https://github.com/apache/beam/issues/20209): Don't hardcode this 
version, take the value provided by Python nature.
-pythonVersion = '3.8'
+pythonVersion = '3.12'
 
 task generateMetricsReport {
   dependsOn setupVirtualenv



(beam) branch damccorm-patch-1 created (now 81541f0a0e9)

2024-09-24 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch damccorm-patch-1
in repository https://gitbox.apache.org/repos/asf/beam.git


  at 81541f0a0e9 Update build.gradle to use 3.12

This branch includes the following new commits:

 new 81541f0a0e9 Update build.gradle to use 3.12

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




(beam) branch users/damccorm/assert deleted (was 149f355cc84)

2024-09-22 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/assert
in repository https://gitbox.apache.org/repos/asf/beam.git


 was 149f355cc84 Fix elements

The revisions that were on this branch are still contained in
other references; therefore, this change does not discard any commits
from the repository.



(beam) branch master updated: Add schrodinger logo (#32525)

2024-09-22 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new 6adb1453890 Add schrodinger logo (#32525)
6adb1453890 is described below

commit 6adb1453890e804d7dd62be38e3ba367ceff2eb0
Author: Hai Joey Tran 
AuthorDate: Sun Sep 22 16:09:38 2024 -0400

Add schrodinger logo (#32525)

* Add schrodinger card

* move png
---
 .../www/site/content/en/case-studies/schrodinger.md  |  19 +++
 .../static/images/logos/powered-by/schrodinger.png   | Bin 0 -> 145837 bytes
 2 files changed, 19 insertions(+)

diff --git a/website/www/site/content/en/case-studies/schrodinger.md 
b/website/www/site/content/en/case-studies/schrodinger.md
new file mode 100644
index 000..178f87a768d
--- /dev/null
+++ b/website/www/site/content/en/case-studies/schrodinger.md
@@ -0,0 +1,19 @@
+---
+title:  "Schrodinger"
+icon: /images/logos/powered-by/schrodinger.png
+hasNav: false
+hasLink: "https://www.schrodinger.com/";
+---
+
diff --git a/website/www/site/static/images/logos/powered-by/schrodinger.png 
b/website/www/site/static/images/logos/powered-by/schrodinger.png
new file mode 100644
index 000..83a95c752a9
Binary files /dev/null and 
b/website/www/site/static/images/logos/powered-by/schrodinger.png differ



(beam) branch users/damccorm/assert updated (50dd0487724 -> 149f355cc84)

2024-09-22 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/assert
in repository https://gitbox.apache.org/repos/asf/beam.git


from 50dd0487724 Update beam_PostCommit_Python.json
 add 149f355cc84 Fix elements

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/ml/gcp/visionml_test_it.py | 3 ++-
 1 file changed, 2 insertions(+), 1 deletion(-)



(beam) branch users/damccorm/assert updated (855bd54bba1 -> 50dd0487724)

2024-09-21 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/assert
in repository https://gitbox.apache.org/repos/asf/beam.git


from 855bd54bba1 Move assert into pipeline
 add 50dd0487724 Update beam_PostCommit_Python.json

No new revisions were added by this update.

Summary of changes:
 .github/trigger_files/beam_PostCommit_Python.json | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)



(beam) branch users/damccorm/assert created (now 855bd54bba1)

2024-09-21 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/assert
in repository https://gitbox.apache.org/repos/asf/beam.git


  at 855bd54bba1 Move assert into pipeline

This branch includes the following new commits:

 new 855bd54bba1 Move assert into pipeline

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




(beam) 01/01: Move assert into pipeline

2024-09-21 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch users/damccorm/assert
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 855bd54bba17be7bcdde19914cef863819f21cb7
Author: Danny McCormick 
AuthorDate: Sat Sep 21 11:53:31 2024 -0400

Move assert into pipeline
---
 sdks/python/apache_beam/ml/gcp/visionml_test_it.py | 22 +++---
 1 file changed, 11 insertions(+), 11 deletions(-)

diff --git a/sdks/python/apache_beam/ml/gcp/visionml_test_it.py 
b/sdks/python/apache_beam/ml/gcp/visionml_test_it.py
index ea3fc9768ff..cbd59307c24 100644
--- a/sdks/python/apache_beam/ml/gcp/visionml_test_it.py
+++ b/sdks/python/apache_beam/ml/gcp/visionml_test_it.py
@@ -64,17 +64,17 @@ class VisionMlTestIT(unittest.TestCase):
   context_side_input=beam.pvalue.AsDict(contexts))
   | beam.ParDo(extract))
 
-assert_that(
-output,
-equal_to([
-'WAITING?\nPLEASE\nTURN OFF\nYOUR\nENGINE',
-'WAITING?',
-'PLEASE',
-'TURN',
-'OFF',
-'YOUR',
-'ENGINE'
-]))
+  assert_that(
+  output,
+  equal_to([
+  'WAITING?\nPLEASE\nTURN OFF\nYOUR\nENGINE',
+  'WAITING?',
+  'PLEASE',
+  'TURN',
+  'OFF',
+  'YOUR',
+  'ENGINE'
+  ]))
 
 
 if __name__ == '__main__':



(beam) branch master updated (7474e6a7427 -> 6a095456aa3)

2024-09-20 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


from 7474e6a7427 Rollback Bigtable throttling counter (#32442)
 add 6a095456aa3 [Python] Add a couple quality-of-life improvemenets to 
`testing.util.assert_that` (#30771)

No new revisions were added by this update.

Summary of changes:
 .../transforms/aggregation/groupby_expr.py |   5 +-
 .../aggregation/groupby_global_aggregate.py|   5 +-
 .../aggregation/groupby_simple_aggregate.py|   5 +-
 .../transforms/aggregation/groupby_test.py | 125 +
 sdks/python/apache_beam/testing/util.py|  17 +++
 sdks/python/apache_beam/testing/util_test.py   |  13 +++
 sdks/python/apache_beam/transforms/trigger_test.py |  11 +-
 sdks/python/apache_beam/transforms/util_test.py|   8 +-
 8 files changed, 121 insertions(+), 68 deletions(-)



(beam) branch liferoad-patch-2 deleted (was 4a00cd48b11)

2024-09-19 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch liferoad-patch-2
in repository https://gitbox.apache.org/repos/asf/beam.git


 was 4a00cd48b11 Update run_inference_huggingface.ipynb

The revisions that were on this branch are still contained in
other references; therefore, this change does not discard any commits
from the repository.



(beam) branch master updated: Publish Java Expansion Service SNAPSHOT containers (#32491)

2024-09-19 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new bb96ac0456c Publish Java Expansion Service SNAPSHOT containers (#32491)
bb96ac0456c is described below

commit bb96ac0456cb002908e1e723cc5677039e34f7ce
Author: Chamikara Jayalath 
AuthorDate: Thu Sep 19 07:02:32 2024 -0700

Publish Java Expansion Service SNAPSHOT containers (#32491)
---
 .github/workflows/beam_Publish_Beam_SDK_Snapshots.yml | 1 +
 1 file changed, 1 insertion(+)

diff --git a/.github/workflows/beam_Publish_Beam_SDK_Snapshots.yml 
b/.github/workflows/beam_Publish_Beam_SDK_Snapshots.yml
index 72de1f15229..61ef31a0023 100644
--- a/.github/workflows/beam_Publish_Beam_SDK_Snapshots.yml
+++ b/.github/workflows/beam_Publish_Beam_SDK_Snapshots.yml
@@ -71,6 +71,7 @@ jobs:
   - "python:container:py310"
   - "python:container:py311"
   - "python:container:py312"
+  - "java:expansion-service:container"
 steps:
   - uses: actions/checkout@v4
   - name: Setup repository



(beam) branch master updated (c4e2e950531 -> 85581a3c02f)

2024-09-17 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


from c4e2e950531 Bump github.com/aws/aws-sdk-go-v2/service/s3 in /sdks 
(#32397)
 add 85581a3c02f Update run_inference_huggingface.ipynb (#32488)

No new revisions were added by this update.

Summary of changes:
 examples/notebooks/beam-ml/run_inference_huggingface.ipynb | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)



(beam) branch master updated: [yaml] Add examples for Spanner IO in YAML (#32288)

2024-09-16 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new 9f8a4b2d177 [yaml] Add examples for Spanner IO in YAML (#32288)
9f8a4b2d177 is described below

commit 9f8a4b2d177c6308da112411c976b8ed1425e412
Author: Reeba Qureshi <64488642+reeba...@users.noreply.github.com>
AuthorDate: Tue Sep 17 02:12:51 2024 +0600

[yaml] Add examples for Spanner IO in YAML (#32288)

* Add example for spanner read

* Add example for spanner write

* move spanner examples

* minor update

* minor changes

1. Add good element in spanner write example to pass checks.
2. Remove spanner examples from examples_test.py for the time being.

* add license
---
 .../apache_beam/yaml/examples/io/spanner_read.yaml | 80 ++
 .../yaml/examples/io/spanner_write.yaml| 53 ++
 2 files changed, 133 insertions(+)

diff --git a/sdks/python/apache_beam/yaml/examples/io/spanner_read.yaml 
b/sdks/python/apache_beam/yaml/examples/io/spanner_read.yaml
new file mode 100644
index 000..c86d42c1e0c
--- /dev/null
+++ b/sdks/python/apache_beam/yaml/examples/io/spanner_read.yaml
@@ -0,0 +1,80 @@
+# coding=utf-8
+#
+# Licensed to the Apache Software Foundation (ASF) under one or more
+# contributor license agreements.  See the NOTICE file distributed with
+# this work for additional information regarding copyright ownership.
+# The ASF licenses this file to You under the Apache License, Version 2.0
+# (the "License"); you may not use this file except in compliance with
+# the License.  You may obtain a copy of the License at
+#
+#http://www.apache.org/licenses/LICENSE-2.0
+#
+# Unless required by applicable law or agreed to in writing, software
+# distributed under the License is distributed on an "AS IS" BASIS,
+# WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
+# See the License for the specific language governing permissions and
+# limitations under the License.
+
+pipeline:
+  transforms:
+
+  # Reading data from a Spanner database. The table used here has the 
following columns:
+  # shipment_id (String), customer_id (String), shipment_date (String), 
shipment_cost (Float64), customer_name (String), customer_email (String)
+  # ReadFromSpanner transform is called using project_id, instance_id, 
database_id and a query
+  # A table with a list of columns can also be specified instead of a query
+- type: ReadFromSpanner
+  name: ReadShipments
+  config:
+project_id: 'apache-beam-testing'
+instance_id: 'shipment-test'
+database_id: 'shipment'
+query: 'SELECT * FROM shipments'
+
+  # Filtering the data based on a specific condition
+  # Here, the condition is used to keep only the rows where the customer_id is 
'C1'
+- type: Filter
+  name: FilterShipments
+  input: ReadShipments
+  config:
+language: python
+keep: "customer_id == 'C1'"
+
+  # Mapping the data fields and applying transformations
+  # A new field 'shipment_cost_category' is added with a custom transformation
+  # A callable is defined to categorize shipment cost
+- type: MapToFields
+  name: MapFieldsForSpanner
+  input: FilterShipments
+  config:
+language: python
+fields:
+  shipment_id: shipment_id
+  customer_id: customer_id
+  shipment_date: shipment_date
+  shipment_cost: shipment_cost
+  customer_name: customer_name
+  customer_email: customer_email
+  shipment_cost_category:
+callable: |
+  def categorize_cost(row):
+  cost = float(row[3])
+  if cost < 50:
+  return 'Low Cost'
+  elif cost < 200:
+  return 'Medium Cost'
+  else:
+  return 'High Cost'
+  
+  # Writing the transformed data to a CSV file  
+- type: WriteToCsv
+  name: WriteBig
+  input: MapFieldsForSpanner
+  config:
+path: shipments.csv
+
+
+  # On executing the above pipeline, a new CSV file is created with the 
following records
+
+# Expected:
+#  Row(shipment_id='S1', customer_id='C1', shipment_date='2023-05-01', 
shipment_cost=150.0, customer_name='Alice', customer_email='al...@example.com', 
shipment_cost_category='Medium Cost')
+#  Row(shipment_id='S3', customer_id='C1', shipment_date='2023-05-10', 
shipment_cost=20.0, customer_name='Alice', customer_email='al...@example.com', 
sh

(beam) branch users/damccorm/beamsummit deleted (was 45036736e0a)

2024-09-13 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/beamsummit
in repository https://gitbox.apache.org/repos/asf/beam.git


 was 45036736e0a Remove beam summit banner

The revisions that were on this branch are still contained in
other references; therefore, this change does not discard any commits
from the repository.



(beam) branch master updated: Bump spanner max bound back to <4 (#32411)

2024-09-12 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new ed84acb2967 Bump spanner max bound back to <4 (#32411)
ed84acb2967 is described below

commit ed84acb2967cf4ffa30a41bd53398ed770271f47
Author: Danny McCormick 
AuthorDate: Thu Sep 12 13:41:15 2024 -0400

Bump spanner max bound back to <4 (#32411)
---
 sdks/python/setup.py | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/sdks/python/setup.py b/sdks/python/setup.py
index dd03c590ce3..f9eb9cd8173 100644
--- a/sdks/python/setup.py
+++ b/sdks/python/setup.py
@@ -441,7 +441,7 @@ if __name__ == '__main__':
   'google-cloud-bigquery-storage>=2.6.3,<3',
   'google-cloud-core>=2.0.0,<3',
   'google-cloud-bigtable>=2.19.0,<3',
-  'google-cloud-spanner>=3.0.0,<3.48',
+  'google-cloud-spanner>=3.0.0,<4',
   # GCP Packages required by ML functionality
   'google-cloud-dlp>=3.0.0,<4',
   'google-cloud-language>=2.0,<3',



(beam-starter-python) branch main updated: Bump apache-beam from 2.58.1 to 2.59.0 (#36)

2024-09-12 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch main
in repository https://gitbox.apache.org/repos/asf/beam-starter-python.git


The following commit(s) were added to refs/heads/main by this push:
 new cc1f2b7  Bump apache-beam from 2.58.1 to 2.59.0 (#36)
cc1f2b7 is described below

commit cc1f2b787b5182b0762309c5483f915fd39c8932
Author: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
AuthorDate: Thu Sep 12 09:34:34 2024 -0400

Bump apache-beam from 2.58.1 to 2.59.0 (#36)

Bumps [apache-beam](https://github.com/apache/beam) from 2.58.1 to 2.59.0.
- [Release notes](https://github.com/apache/beam/releases)
- [Changelog](https://github.com/apache/beam/blob/master/CHANGES.md)
- [Commits](https://github.com/apache/beam/compare/v2.58.1...v2.59.0)

---
updated-dependencies:
- dependency-name: apache-beam
  dependency-type: direct:production
  update-type: version-update:semver-minor
...

Signed-off-by: dependabot[bot] 
Co-authored-by: dependabot[bot] 
<49699333+dependabot[bot]@users.noreply.github.com>
---
 requirements.txt | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/requirements.txt b/requirements.txt
index c7f74c3..f6bbd90 100644
--- a/requirements.txt
+++ b/requirements.txt
@@ -1 +1 @@
-apache-beam==2.58.1
+apache-beam==2.59.0



(beam) branch master updated: Temporarily fix pip check failure on xgboost and grpcio (#32432)

2024-09-12 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new 17718a97eaf Temporarily fix pip check failure on xgboost and grpcio 
(#32432)
17718a97eaf is described below

commit 17718a97eafe46796d0eaf0ac38f134a3adbe2bf
Author: Shunping Huang 
AuthorDate: Thu Sep 12 09:34:01 2024 -0400

Temporarily fix pip check failure on xgboost and grpcio (#32432)

* Pin pip version to 24.1 so pip check wont't fail

* Trigger precomit python ml tests to verify the fix.

* Pin pip version in pyproject.toml

* Try to enforce pip version in tox.

* Disable xgboost tests by removing xgboost dependency.

* Add tox env for macos

* Revert some unnecessary changes
---
 .github/trigger_files/beam_PreCommit_Python_ML.json |  0
 .github/workflows/python_tests.yml  |  8 ++--
 sdks/python/setup.py|  5 -
 sdks/python/tox.ini | 15 +++
 4 files changed, 25 insertions(+), 3 deletions(-)

diff --git a/.github/trigger_files/beam_PreCommit_Python_ML.json 
b/.github/trigger_files/beam_PreCommit_Python_ML.json
new file mode 100644
index 000..e69de29bb2d
diff --git a/.github/workflows/python_tests.yml 
b/.github/workflows/python_tests.yml
index 3ef9a5fe26b..e7461be2507 100644
--- a/.github/workflows/python_tests.yml
+++ b/.github/workflows/python_tests.yml
@@ -113,10 +113,14 @@ jobs:
   python-version: ${{ matrix.params.py_ver }}
   - name: Install tox
 run: pip install tox
-  - name: Run tests basic unix
-if: startsWith(matrix.os, 'ubuntu') || startsWith(matrix.os, 'macos')
+  - name: Run tests basic linux
+if: startsWith(matrix.os, 'ubuntu')
 working-directory: ./sdks/python
 run: tox -c tox.ini run -e ${{ matrix.params.tox_env }}
+  - name: Run tests basic macos
+if: startsWith(matrix.os, 'macos')
+working-directory: ./sdks/python
+run: tox -c tox.ini run -e ${{ matrix.params.tox_env }}-macos
   - name: Run tests basic windows
 if: startsWith(matrix.os, 'windows')
 working-directory: ./sdks/python
diff --git a/sdks/python/setup.py b/sdks/python/setup.py
index 7bcff2bacfd..dd03c590ce3 100644
--- a/sdks/python/setup.py
+++ b/sdks/python/setup.py
@@ -498,7 +498,10 @@ if __name__ == '__main__':
   'tf2onnx',
   'torch',
   'transformers',
-  'xgboost<2.0',  # https://github.com/apache/beam/issues/31252
+  # Comment out xgboost as it is breaking presubmit python ml
+  # tests due to tag check introduced since pip 24.2
+  # https://github.com/apache/beam/issues/31285
+  # 'xgboost<2.0',  # https://github.com/apache/beam/issues/31252
   ],
   'aws': ['boto3>=1.9,<2'],
   'azure': [
diff --git a/sdks/python/tox.ini b/sdks/python/tox.ini
index aa0200f7500..da4cde009ce 100644
--- a/sdks/python/tox.ini
+++ b/sdks/python/tox.ini
@@ -68,6 +68,21 @@ commands_post =
 commands = false {envname} is misconfigured
 
 [testenv:py{38,39,310,311,312}]
+commands_pre =
+  python --version
+  pip --version
+  pip check
+  bash {toxinidir}/scripts/run_tox_cleanup.sh
+commands =
+  python apache_beam/examples/complete/autocomplete_test.py
+  bash {toxinidir}/scripts/run_pytest.sh {envname} "{posargs}"
+
+[testenv:py{38,39,310,311,312}-macos]
+commands_pre =
+  python --version
+  pip --version
+  # pip check
+  bash {toxinidir}/scripts/run_tox_cleanup.sh
 commands =
   python apache_beam/examples/complete/autocomplete_test.py
   bash {toxinidir}/scripts/run_pytest.sh {envname} "{posargs}"



(beam) branch master updated: Dont run flaky test on windows (#32419)

2024-09-10 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new ebcb2dbd160 Dont run flaky test on windows (#32419)
ebcb2dbd160 is described below

commit ebcb2dbd160cd729ccc683d3130b0380bbae6efb
Author: Danny McCormick 
AuthorDate: Tue Sep 10 14:55:23 2024 -0400

Dont run flaky test on windows (#32419)

* Revert "Try deflaking test timing (#32351)"

This reverts commit 7d6f6fb55bc9ffed3468828487132b039daacd3a.

* Dont run flaky test on windows
---
 sdks/python/apache_beam/ml/inference/base_test.py | 13 +++--
 1 file changed, 7 insertions(+), 6 deletions(-)

diff --git a/sdks/python/apache_beam/ml/inference/base_test.py 
b/sdks/python/apache_beam/ml/inference/base_test.py
index 76772132784..31f02c9c61c 100644
--- a/sdks/python/apache_beam/ml/inference/base_test.py
+++ b/sdks/python/apache_beam/ml/inference/base_test.py
@@ -878,21 +878,22 @@ class RunInferenceBaseTest(unittest.TestCase):
   bad_without_error, equal_to(expected_bad), label='assert:failures')
 
   @unittest.skipIf(
-  sys.version_info < (3, 11),
+  sys.platform == "win32" or sys.version_info < (3, 11),
   "This test relies on the __del__ lifecycle method, but __del__ does " +
-  "not get invoked in the same way on older versions of Python, " +
-  "breaking this test. See " +
+  "not get invoked in the same way on older versions of Python or on " +
+  "windows, breaking this test. See " +
   "github.com/python/cpython/issues/87950#issuecomment-1807570983 " +
   "for example.")
   def test_run_inference_timeout_does_garbage_collection(self):
 with tempfile.TemporaryDirectory() as tmp_dirname:
   tmp_path = os.path.join(tmp_dirname, 'tmp_filename')
+  expected_file_contents = 'Deleted FakeSlowModel'
   with TestPipeline() as pipeline:
 # Start with bad example which gets timed out.
 # Then provide plenty of time for GC to happen.
-examples = [20] + [1] * 15 + [20, 20, 20]
+examples = [20] + [1] * 15
 expected_good = [1] * 15
-expected_bad = [20, 20, 20, 20]
+expected_bad = [20]
 pcoll = pipeline | 'start' >> beam.Create(examples)
 main, other = pcoll | base.RunInference(
 FakeSlowModelHandler(
@@ -909,7 +910,7 @@ class RunInferenceBaseTest(unittest.TestCase):
 
   with open(tmp_path) as f:
 s = f.read()
-self.assertNotEqual(s, '')
+self.assertEqual(s, expected_file_contents)
 
   def test_run_inference_impl_inference_args(self):
 with TestPipeline() as pipeline:



(beam) 02/02: Dont run flaky test on windows

2024-09-10 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch users/damccorm/test_fix
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 15fe088cbaba64cca3ac8c2036576bbf4f9e5cc6
Author: Danny McCormick 
AuthorDate: Tue Sep 10 13:04:23 2024 -0400

Dont run flaky test on windows
---
 sdks/python/apache_beam/ml/inference/base_test.py | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/sdks/python/apache_beam/ml/inference/base_test.py 
b/sdks/python/apache_beam/ml/inference/base_test.py
index 359a372bc5b..31f02c9c61c 100644
--- a/sdks/python/apache_beam/ml/inference/base_test.py
+++ b/sdks/python/apache_beam/ml/inference/base_test.py
@@ -878,10 +878,10 @@ class RunInferenceBaseTest(unittest.TestCase):
   bad_without_error, equal_to(expected_bad), label='assert:failures')
 
   @unittest.skipIf(
-  sys.version_info < (3, 11),
+  sys.platform == "win32" or sys.version_info < (3, 11),
   "This test relies on the __del__ lifecycle method, but __del__ does " +
-  "not get invoked in the same way on older versions of Python, " +
-  "breaking this test. See " +
+  "not get invoked in the same way on older versions of Python or on " +
+  "windows, breaking this test. See " +
   "github.com/python/cpython/issues/87950#issuecomment-1807570983 " +
   "for example.")
   def test_run_inference_timeout_does_garbage_collection(self):



(beam) branch users/damccorm/spanner created (now 3719c1e7eea)

2024-09-09 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/spanner
in repository https://gitbox.apache.org/repos/asf/beam.git


  at 3719c1e7eea Bump spanner max bound back to <4

This branch includes the following new commits:

 new 3719c1e7eea Bump spanner max bound back to <4

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




(beam) branch master updated: [yaml] Adding Spanner IO Providers for Beam YAML (#31987)

2024-09-09 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new 173dd486842 [yaml] Adding Spanner IO Providers for Beam YAML (#31987)
173dd486842 is described below

commit 173dd486842cec312a975530a9867b54f9f44cf4
Author: Reeba Qureshi <64488642+reeba...@users.noreply.github.com>
AuthorDate: Mon Sep 9 19:58:04 2024 +0530

[yaml] Adding Spanner IO Providers for Beam YAML (#31987)

* Add Spanner IO providers to YAML SDK

* add handling logic for more datatypes

* delete examples

* minor changes

* minor change

* add integration test

* add docs

* minor change

* minor changes

1. Removed serialiazability from ErrorHandling.java
2. Removed double map definitions from MutationUtils.java
3. Added checkNotNull in spanner write provider
4. Modified some variables in spanner wrapper
5. Change instance id in integration tests

* Update spanner_wrapper.py

import retry

* minor changes

1. replace checknotnull with checkargument in spanner read provider
2. use the correct table  name (tmp_table) in integration test

* minor changes

1. Added serializable to error handling
2. Corrected validation methods in spanner read
3. Added retry import and removed default project name in spanner wrapper
4. Corrected instance and database names in spanner integration test
5. Corrected table name in query

* formatting

* Update SpannerWriteSchemaTransformProvider.java

* correct lint failures

* correct lint failures

* correct lint failures

* Update SpannerReadSchemaTransformProvider.java

* correct lint failures

* Update SpannerWriteSchemaTransformProvider.java

* spanner version update
---
 .../transforms/providers/ErrorHandling.java|   3 +-
 .../beam/sdk/io/gcp/spanner/MutationUtils.java |  60 ++
 .../SpannerReadSchemaTransformProvider.java| 235 +
 .../SpannerWriteSchemaTransformProvider.java   | 160 +++---
 sdks/python/apache_beam/io/gcp/spanner_wrapper.py  |  76 +++
 sdks/python/apache_beam/yaml/integration_tests.py  |  17 ++
 sdks/python/apache_beam/yaml/standard_io.yaml  |  27 +++
 sdks/python/apache_beam/yaml/tests/spanner.yaml|  95 +
 sdks/python/setup.py   |   2 +-
 9 files changed, 645 insertions(+), 30 deletions(-)

diff --git 
a/sdks/java/core/src/main/java/org/apache/beam/sdk/schemas/transforms/providers/ErrorHandling.java
 
b/sdks/java/core/src/main/java/org/apache/beam/sdk/schemas/transforms/providers/ErrorHandling.java
index 7fa29708c9f..053521dbfb3 100644
--- 
a/sdks/java/core/src/main/java/org/apache/beam/sdk/schemas/transforms/providers/ErrorHandling.java
+++ 
b/sdks/java/core/src/main/java/org/apache/beam/sdk/schemas/transforms/providers/ErrorHandling.java
@@ -18,13 +18,14 @@
 package org.apache.beam.sdk.schemas.transforms.providers;
 
 import com.google.auto.value.AutoValue;
+import java.io.Serializable;
 import javax.annotation.Nullable;
 import org.apache.beam.sdk.schemas.Schema;
 import org.apache.beam.sdk.schemas.annotations.SchemaFieldDescription;
 import org.apache.beam.sdk.values.Row;
 
 @AutoValue
-public abstract class ErrorHandling {
+public abstract class ErrorHandling implements Serializable {
   @SchemaFieldDescription("The name of the output PCollection containing 
failed writes.")
   public abstract String getOutput();
 
diff --git 
a/sdks/java/io/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/spanner/MutationUtils.java
 
b/sdks/java/io/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/spanner/MutationUtils.java
index c0654b2cb05..5a106a34b0c 100644
--- 
a/sdks/java/io/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/spanner/MutationUtils.java
+++ 
b/sdks/java/io/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/spanner/MutationUtils.java
@@ -26,8 +26,11 @@ import com.google.cloud.ByteArray;
 import com.google.cloud.Timestamp;
 import com.google.cloud.spanner.Key;
 import com.google.cloud.spanner.Mutation;
+import com.google.cloud.spanner.Value;
 import java.math.BigDecimal;
+import java.util.HashMap;
 import java.util.List;
+import java.util.Map;
 import java.util.stream.StreamSupport;
 import org.apache.beam.sdk.schemas.Schema;
 import org.apache.beam.sdk.transforms.SerializableFunction;
@@ -351,4 +354,61 @@ final class MutationUtils {
 beamIterableType.getTypeName()));
 }
   }
+
+  public static Row createRowFromMutation(Schema schema, Mutation mutation) {
+Map mutationHashMap = new HashMap<>();
+mutation
+.asMap()
+.forEach(
+

(beam) branch damccorm-patch-1 deleted (was fb6db657d8a)

2024-09-05 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch damccorm-patch-1
in repository https://gitbox.apache.org/repos/asf/beam.git


 was fb6db657d8a Merge branch 'master' of https://github.com/apache/beam 
into damccorm-patch-1

The revisions that were on this branch are still contained in
other references; therefore, this change does not discard any commits
from the repository.



(beam) branch master updated (696978018ed -> 3b4ecd7a987)

2024-09-05 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


from 696978018ed Bump actions/download-artifact from 4.1.7 to 4.1.8 (#32396)
 add 3b4ecd7a987 Bump commons-cli:commons-cli from 1.8.0 to 1.9.0 (#32350)

No new revisions were added by this update.

Summary of changes:
 release/build.gradle.kts | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)



(beam) 01/01: Undo part of artifact action upgrade to fix workflow

2024-09-05 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch users/damccorm/wheels_fix
in repository https://gitbox.apache.org/repos/asf/beam.git

commit f52a535ef334397aa1061fa6b30d1381272971d1
Author: Danny McCormick 
AuthorDate: Thu Sep 5 09:29:12 2024 -0700

Undo part of artifact action upgrade to fix workflow
---
 .github/workflows/build_wheels.yml | 12 
 1 file changed, 4 insertions(+), 8 deletions(-)

diff --git a/.github/workflows/build_wheels.yml 
b/.github/workflows/build_wheels.yml
index 3b718b0..f97f4de50ae 100644
--- a/.github/workflows/build_wheels.yml
+++ b/.github/workflows/build_wheels.yml
@@ -127,13 +127,11 @@ jobs:
 # https://github.com/pypa/setuptools/issues/4300 changed naming. Match 
both old and new names.
 run: mv $(ls | grep "apache-beam-\|apache_beam-") apache-beam-source
   - name: Upload source as artifact
-# Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
 uses: actions/upload-artifact@v4
 with:
   name: source
   path: sdks/python/apache-beam-source
   - name: Upload compressed sources as artifacts
-# Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
 uses: actions/upload-artifact@v4
 with:
   name: source_zip
@@ -172,14 +170,12 @@ jobs:
 run: mv $(ls | grep "apache-beam-\|apache_beam-") apache-beam-source-rc
   - name: Upload RC source as artifact
 if: steps.is_rc.outputs.is_rc == 1
-# Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
 uses: actions/upload-artifact@v4
 with:
   name: source_rc${{ steps.get_rc_version.outputs.RC_NUM }}
   path: sdks/python/apache-beam-source-rc
   - name: Upload compressed RC sources as artifacts
 if: steps.is_rc.outputs.is_rc == 1
-# Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
 uses: actions/upload-artifact@v4
 with:
   name: source_zip_rc${{ steps.get_rc_version.outputs.RC_NUM }}
@@ -283,8 +279,8 @@ jobs:
   shell: bash
 - name: Upload wheels as artifacts
   if: ${{ contains(matrix.os_python.python, matrix.py_version) }}
-  # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-  uses: actions/upload-artifact@v4
+  # Pinned to v3 because of 
https://github.com/actions/upload-artifact?tab=readme-ov-file#breaking-changes
+  uses: actions/upload-artifact@v3
   with:
 name: wheelhouse-${{ matrix.os_python.os }}${{ (matrix.os_python.arch 
== 'aarch64' && '-aarch64') || '' }}
 path: apache-beam-source/wheelhouse/
@@ -333,8 +329,8 @@ jobs:
 arch: aarch64
 steps:
 - name: Download wheels from artifacts
-  # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-  uses: actions/download-artifact@v4.1.8
+  # Pinned to v3 because of 
https://github.com/actions/upload-artifact?tab=readme-ov-file#breaking-changes
+  uses: actions/download-artifact@v3
   with:
 name: wheelhouse-${{ matrix.os }}${{ (matrix.arch == 'aarch64' && 
'-aarch64') || '' }}
 path: wheelhouse/



(beam) branch master updated: Bump actions/download-artifact from 4.1.7 to 4.1.8 (#32396)

2024-09-05 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new 696978018ed Bump actions/download-artifact from 4.1.7 to 4.1.8 (#32396)
696978018ed is described below

commit 696978018eddf32d8ed57b5c00829360f1e7476c
Author: dependabot[bot] <49699333+dependabot[bot]@users.noreply.github.com>
AuthorDate: Thu Sep 5 08:15:21 2024 -0700

Bump actions/download-artifact from 4.1.7 to 4.1.8 (#32396)

Bumps 
[actions/download-artifact](https://github.com/actions/download-artifact) from 
4.1.7 to 4.1.8.
- [Release notes](https://github.com/actions/download-artifact/releases)
- 
[Commits](https://github.com/actions/download-artifact/compare/v4.1.7...v4.1.8)

---
updated-dependencies:
- dependency-name: actions/download-artifact
  dependency-type: direct:production
  update-type: version-update:semver-patch
...

Signed-off-by: dependabot[bot] 
Co-authored-by: dependabot[bot] 
<49699333+dependabot[bot]@users.noreply.github.com>
---
 .github/workflows/build_wheels.yml | 8 
 .github/workflows/python_tests.yml | 2 +-
 2 files changed, 5 insertions(+), 5 deletions(-)

diff --git a/.github/workflows/build_wheels.yml 
b/.github/workflows/build_wheels.yml
index ebac61eec83..3b718b0 100644
--- a/.github/workflows/build_wheels.yml
+++ b/.github/workflows/build_wheels.yml
@@ -207,7 +207,7 @@ jobs:
 steps:
   - name: Download compressed sources from artifacts
 # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-uses: actions/download-artifact@v4.1.7
+uses: actions/download-artifact@v4.1.8
 with:
   name: source_zip
   path: source/
@@ -238,14 +238,14 @@ jobs:
 steps:
 - name: Download python source distribution from artifacts
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-  uses: actions/download-artifact@v4.1.7
+  uses: actions/download-artifact@v4.1.8
   with:
 name: source
 path: apache-beam-source
 - name: Download Python SDK RC source distribution from artifacts
   if: ${{ needs.build_source.outputs.is_rc == 1 }}
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-  uses: actions/download-artifact@v4.1.7
+  uses: actions/download-artifact@v4.1.8
   with:
 name: source_rc${{ needs.build_source.outputs.rc_num }}
 path: apache-beam-source-rc
@@ -334,7 +334,7 @@ jobs:
 steps:
 - name: Download wheels from artifacts
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-  uses: actions/download-artifact@v4.1.7
+  uses: actions/download-artifact@v4.1.8
   with:
 name: wheelhouse-${{ matrix.os }}${{ (matrix.arch == 'aarch64' && 
'-aarch64') || '' }}
 path: wheelhouse/
diff --git a/.github/workflows/python_tests.yml 
b/.github/workflows/python_tests.yml
index 7a086e1ebea..3ef9a5fe26b 100644
--- a/.github/workflows/python_tests.yml
+++ b/.github/workflows/python_tests.yml
@@ -172,7 +172,7 @@ jobs:
   python-version: ${{ matrix.python }}
   go-version: default
   - name: Download source from artifacts
-uses: actions/download-artifact@v4.1.7
+uses: actions/download-artifact@v4.1.8
 with:
   name: python_sdk_source
   path: apache-beam-source



(beam) branch master updated: Upgrade artifact actions to v4 (#32391)

2024-09-03 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new 34a7c4f4198 Upgrade artifact actions to v4 (#32391)
34a7c4f4198 is described below

commit 34a7c4f41984a3789cb6fb64973b951e698c15f1
Author: Danny McCormick 
AuthorDate: Tue Sep 3 17:33:39 2024 -0700

Upgrade artifact actions to v4 (#32391)

* Upgrade artifact actions to v4

* Fix typo
---
 .github/workflows/build_wheels.yml | 20 ++--
 .github/workflows/python_tests.yml |  2 +-
 2 files changed, 11 insertions(+), 11 deletions(-)

diff --git a/.github/workflows/build_wheels.yml 
b/.github/workflows/build_wheels.yml
index 97a350c3fcc..ebac61eec83 100644
--- a/.github/workflows/build_wheels.yml
+++ b/.github/workflows/build_wheels.yml
@@ -128,13 +128,13 @@ jobs:
 run: mv $(ls | grep "apache-beam-\|apache_beam-") apache-beam-source
   - name: Upload source as artifact
 # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-uses: actions/upload-artifact@v3
+uses: actions/upload-artifact@v4
 with:
   name: source
   path: sdks/python/apache-beam-source
   - name: Upload compressed sources as artifacts
 # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-uses: actions/upload-artifact@v3
+uses: actions/upload-artifact@v4
 with:
   name: source_zip
   path: sdks/python/dist
@@ -173,14 +173,14 @@ jobs:
   - name: Upload RC source as artifact
 if: steps.is_rc.outputs.is_rc == 1
 # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-uses: actions/upload-artifact@v3
+uses: actions/upload-artifact@v4
 with:
   name: source_rc${{ steps.get_rc_version.outputs.RC_NUM }}
   path: sdks/python/apache-beam-source-rc
   - name: Upload compressed RC sources as artifacts
 if: steps.is_rc.outputs.is_rc == 1
 # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-uses: actions/upload-artifact@v3
+uses: actions/upload-artifact@v4
 with:
   name: source_zip_rc${{ steps.get_rc_version.outputs.RC_NUM }}
   path: sdks/python/dist
@@ -207,7 +207,7 @@ jobs:
 steps:
   - name: Download compressed sources from artifacts
 # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-uses: actions/download-artifact@v3
+uses: actions/download-artifact@v4.1.7
 with:
   name: source_zip
   path: source/
@@ -238,14 +238,14 @@ jobs:
 steps:
 - name: Download python source distribution from artifacts
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-  uses: actions/download-artifact@v3
+  uses: actions/download-artifact@v4.1.7
   with:
 name: source
 path: apache-beam-source
 - name: Download Python SDK RC source distribution from artifacts
   if: ${{ needs.build_source.outputs.is_rc == 1 }}
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-  uses: actions/download-artifact@v3
+  uses: actions/download-artifact@v4.1.7
   with:
 name: source_rc${{ needs.build_source.outputs.rc_num }}
 path: apache-beam-source-rc
@@ -284,7 +284,7 @@ jobs:
 - name: Upload wheels as artifacts
   if: ${{ contains(matrix.os_python.python, matrix.py_version) }}
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-  uses: actions/upload-artifact@v3
+  uses: actions/upload-artifact@v4
   with:
 name: wheelhouse-${{ matrix.os_python.os }}${{ (matrix.os_python.arch 
== 'aarch64' && '-aarch64') || '' }}
 path: apache-beam-source/wheelhouse/
@@ -310,7 +310,7 @@ jobs:
 - name: Upload RC wheels as artifacts
   if: ${{ needs.build_source.outputs.is_rc == 1 }}
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-  uses: actions/upload-artifact@v3
+  uses: actions/upload-artifact@v4
   with:
 name: wheelhouse-rc${{ needs.build_source.outputs.rc_num }}-${{ 
matrix.os_python.os }}${{ (matrix.arch == 'aarch64' && '-aarch64') || '' }}
 path: apache-beam-source-rc/wheelhouse/
@@ -334,7 +334,7 @@ jobs:
 steps:
 - name: Download wheels from artifacts
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-  uses: actions/download-artifact@v3
+  uses: actions/download-artifact@v4.1.7
   with:
 name: wheelhouse-${{ matrix.os }}${{ (matrix.arch

(beam) branch users/damccorm/artifact_versions deleted (was 57874a0fda4)

2024-09-03 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/artifact_versions
in repository https://gitbox.apache.org/repos/asf/beam.git


 was 57874a0fda4 Fix typo

The revisions that were on this branch are still contained in
other references; therefore, this change does not discard any commits
from the repository.



(beam) branch users/damccorm/artifact_versions updated (3327296894e -> 57874a0fda4)

2024-09-03 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/artifact_versions
in repository https://gitbox.apache.org/repos/asf/beam.git


from 3327296894e Upgrade artifact actions to v4
 add 57874a0fda4 Fix typo

No new revisions were added by this update.

Summary of changes:
 .github/workflows/build_wheels.yml | 8 
 1 file changed, 4 insertions(+), 4 deletions(-)



(beam) 01/01: Upgrade artifact actions to v4

2024-09-03 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch users/damccorm/artifact_versions
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 3327296894edd3eeecbdc5bed554317d15c70993
Author: Danny McCormick 
AuthorDate: Tue Sep 3 16:49:11 2024 -0700

Upgrade artifact actions to v4
---
 .github/workflows/build_wheels.yml | 20 ++--
 .github/workflows/python_tests.yml |  2 +-
 2 files changed, 11 insertions(+), 11 deletions(-)

diff --git a/.github/workflows/build_wheels.yml 
b/.github/workflows/build_wheels.yml
index 97a350c3fcc..d956e2702e9 100644
--- a/.github/workflows/build_wheels.yml
+++ b/.github/workflows/build_wheels.yml
@@ -128,13 +128,13 @@ jobs:
 run: mv $(ls | grep "apache-beam-\|apache_beam-") apache-beam-source
   - name: Upload source as artifact
 # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-uses: actions/upload-artifact@v3
+uses: actions/upload-artifact@v4
 with:
   name: source
   path: sdks/python/apache-beam-source
   - name: Upload compressed sources as artifacts
 # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-uses: actions/upload-artifact@v3
+uses: actions/upload-artifact@v4
 with:
   name: source_zip
   path: sdks/python/dist
@@ -173,14 +173,14 @@ jobs:
   - name: Upload RC source as artifact
 if: steps.is_rc.outputs.is_rc == 1
 # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-uses: actions/upload-artifact@v3
+uses: actions/upload-artifact@v4
 with:
   name: source_rc${{ steps.get_rc_version.outputs.RC_NUM }}
   path: sdks/python/apache-beam-source-rc
   - name: Upload compressed RC sources as artifacts
 if: steps.is_rc.outputs.is_rc == 1
 # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-uses: actions/upload-artifact@v3
+uses: actions/upload-artifact@v4
 with:
   name: source_zip_rc${{ steps.get_rc_version.outputs.RC_NUM }}
   path: sdks/python/dist
@@ -207,7 +207,7 @@ jobs:
 steps:
   - name: Download compressed sources from artifacts
 # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-uses: actions/download-artifact@v3
+uses: actions/download-artifact@v4.1.7.1.7
 with:
   name: source_zip
   path: source/
@@ -238,14 +238,14 @@ jobs:
 steps:
 - name: Download python source distribution from artifacts
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-  uses: actions/download-artifact@v3
+  uses: actions/download-artifact@v4.1.7.1.7
   with:
 name: source
 path: apache-beam-source
 - name: Download Python SDK RC source distribution from artifacts
   if: ${{ needs.build_source.outputs.is_rc == 1 }}
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-  uses: actions/download-artifact@v3
+  uses: actions/download-artifact@v4.1.7.1.7
   with:
 name: source_rc${{ needs.build_source.outputs.rc_num }}
 path: apache-beam-source-rc
@@ -284,7 +284,7 @@ jobs:
 - name: Upload wheels as artifacts
   if: ${{ contains(matrix.os_python.python, matrix.py_version) }}
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-  uses: actions/upload-artifact@v3
+  uses: actions/upload-artifact@v4
   with:
 name: wheelhouse-${{ matrix.os_python.os }}${{ (matrix.os_python.arch 
== 'aarch64' && '-aarch64') || '' }}
 path: apache-beam-source/wheelhouse/
@@ -310,7 +310,7 @@ jobs:
 - name: Upload RC wheels as artifacts
   if: ${{ needs.build_source.outputs.is_rc == 1 }}
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-  uses: actions/upload-artifact@v3
+  uses: actions/upload-artifact@v4
   with:
 name: wheelhouse-rc${{ needs.build_source.outputs.rc_num }}-${{ 
matrix.os_python.os }}${{ (matrix.arch == 'aarch64' && '-aarch64') || '' }}
 path: apache-beam-source-rc/wheelhouse/
@@ -334,7 +334,7 @@ jobs:
 steps:
 - name: Download wheels from artifacts
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
-  uses: actions/download-artifact@v3
+  uses: actions/download-artifact@v4.1.7.1.7
   with:
 name: wheelhouse-${{ matrix.os }}${{ (matrix.arch == 'aarch64' && 
'-aarch64') || '' }}
 path: wheelhouse/
diff --git a/.github/workflows/python_tests.yml 
b/.github/workflows/python_tests.yml
index

(beam) branch users/damccorm/artifact_versions created (now 3327296894e)

2024-09-03 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/artifact_versions
in repository https://gitbox.apache.org/repos/asf/beam.git


  at 3327296894e Upgrade artifact actions to v4

This branch includes the following new commits:

 new 3327296894e Upgrade artifact actions to v4

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




(beam) branch users/damccorm/ri_test deleted (was 1b1bdd8b176)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/ri_test
in repository https://gitbox.apache.org/repos/asf/beam.git


 was 1b1bdd8b176 Merge branch 'master' of https://github.com/apache/beam 
into users/damccorm/ri_test

The revisions that were on this branch are still contained in
other references; therefore, this change does not discard any commits
from the repository.



(beam) branch master updated (55678b2edcf -> 7d6f6fb55bc)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


from 55678b2edcf Pause delete images in public AR (#32354)
 add 7d6f6fb55bc Try deflaking test timing (#32351)

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/ml/inference/base_test.py | 7 +++
 1 file changed, 3 insertions(+), 4 deletions(-)



(beam) branch damccorm-patch-1 updated (6060a7213d6 -> fb6db657d8a)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch damccorm-patch-1
in repository https://gitbox.apache.org/repos/asf/beam.git


from 6060a7213d6 Bump commons-cli:commons-cli from 1.8.0 to 1.9.0
 add 923fde0f070 Fix pr building of wheels (#32353)
 add fb6db657d8a Merge branch 'master' of https://github.com/apache/beam 
into damccorm-patch-1

No new revisions were added by this update.

Summary of changes:
 .github/workflows/build_wheels.yml | 2 ++
 1 file changed, 2 insertions(+)



(beam) branch users/damccorm/ri_test updated (cbc792fbc89 -> 1b1bdd8b176)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/ri_test
in repository https://gitbox.apache.org/repos/asf/beam.git


from cbc792fbc89 Simpler technique
 add 923fde0f070 Fix pr building of wheels (#32353)
 add 1b1bdd8b176 Merge branch 'master' of https://github.com/apache/beam 
into users/damccorm/ri_test

No new revisions were added by this update.

Summary of changes:
 .github/workflows/build_wheels.yml | 2 ++
 1 file changed, 2 insertions(+)



(beam) branch damccorm-patch-2 deleted (was 9c39d23f3d3)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch damccorm-patch-2
in repository https://gitbox.apache.org/repos/asf/beam.git


 was 9c39d23f3d3 Fix pr building of wheels

The revisions that were on this branch are still contained in
other references; therefore, this change does not discard any commits
from the repository.



(beam) branch master updated (399112b2fab -> 923fde0f070)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


from 399112b2fab Fix BigtableIO.write() client sharing (#32340)
 add 923fde0f070 Fix pr building of wheels (#32353)

No new revisions were added by this update.

Summary of changes:
 .github/workflows/build_wheels.yml | 2 ++
 1 file changed, 2 insertions(+)



(beam) 01/01: Fix pr building of wheels

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch damccorm-patch-2
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 9c39d23f3d34d40e7397ad6ab0cb16867df56093
Author: Danny McCormick 
AuthorDate: Wed Aug 28 20:32:12 2024 +0100

Fix pr building of wheels
---
 .github/workflows/build_wheels.yml | 2 ++
 1 file changed, 2 insertions(+)

diff --git a/.github/workflows/build_wheels.yml 
b/.github/workflows/build_wheels.yml
index 69ffec573fc..97a350c3fcc 100644
--- a/.github/workflows/build_wheels.yml
+++ b/.github/workflows/build_wheels.yml
@@ -274,6 +274,7 @@ jobs:
   if: startsWith(matrix.os_python.os, 'macos')
   run: brew install coreutils
 - name: Add checksums
+  if: ${{ contains(matrix.os_python.python, matrix.py_version) }}
   working-directory: apache-beam-source/wheelhouse/
   run: |
 for file in *.whl; do
@@ -281,6 +282,7 @@ jobs:
 done
   shell: bash
 - name: Upload wheels as artifacts
+  if: ${{ contains(matrix.os_python.python, matrix.py_version) }}
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
   uses: actions/upload-artifact@v3
   with:



(beam) branch damccorm-patch-2 created (now 9c39d23f3d3)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch damccorm-patch-2
in repository https://gitbox.apache.org/repos/asf/beam.git


  at 9c39d23f3d3 Fix pr building of wheels

This branch includes the following new commits:

 new 9c39d23f3d3 Fix pr building of wheels

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




(beam) branch users/damccorm/ri_test updated (fc2e1bbfbf3 -> cbc792fbc89)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/ri_test
in repository https://gitbox.apache.org/repos/asf/beam.git


from fc2e1bbfbf3 Add another potential timeout
 add cbc792fbc89 Simpler technique

No new revisions were added by this update.

Summary of changes:
 sdks/python/apache_beam/ml/inference/base_test.py | 9 -
 1 file changed, 4 insertions(+), 5 deletions(-)



(beam) branch users/damccorm/ri_test updated: Add another potential timeout

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch users/damccorm/ri_test
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/users/damccorm/ri_test by this 
push:
 new fc2e1bbfbf3 Add another potential timeout
fc2e1bbfbf3 is described below

commit fc2e1bbfbf38f3d7f9ef074517cd16ed8c36a2ac
Author: Danny McCormick 
AuthorDate: Wed Aug 28 20:15:39 2024 +0100

Add another potential timeout
---
 sdks/python/apache_beam/ml/inference/base_test.py | 6 +++---
 1 file changed, 3 insertions(+), 3 deletions(-)

diff --git a/sdks/python/apache_beam/ml/inference/base_test.py 
b/sdks/python/apache_beam/ml/inference/base_test.py
index 00f6563d5f1..cbe12443b31 100644
--- a/sdks/python/apache_beam/ml/inference/base_test.py
+++ b/sdks/python/apache_beam/ml/inference/base_test.py
@@ -891,9 +891,9 @@ class RunInferenceBaseTest(unittest.TestCase):
   with TestPipeline() as pipeline:
 # Start with bad example which gets timed out.
 # Then provide plenty of time for GC to happen.
-examples = [20] + [1] * 60
-expected_good = [1] * 60
-expected_bad = [20]
+examples = [20] + [1] * 30 + [20]
+expected_good = [1] * 30
+expected_bad = [20, 20]
 pcoll = pipeline | 'start' >> beam.Create(examples)
 main, other = pcoll | base.RunInference(
 FakeSlowModelHandler(



(beam) branch users/damccorm/ri_test created (now 2fc6e1cb06a)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/ri_test
in repository https://gitbox.apache.org/repos/asf/beam.git


  at 2fc6e1cb06a Try deflaking test timing

This branch includes the following new commits:

 new 2fc6e1cb06a Try deflaking test timing

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




(beam) 01/01: Try deflaking test timing

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch users/damccorm/ri_test
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 2fc6e1cb06a4c8e95b037e8556a0c3e6682fe2c1
Author: Danny McCormick 
AuthorDate: Wed Aug 28 20:14:14 2024 +0100

Try deflaking test timing
---
 sdks/python/apache_beam/ml/inference/base_test.py | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/sdks/python/apache_beam/ml/inference/base_test.py 
b/sdks/python/apache_beam/ml/inference/base_test.py
index 359a372bc5b..00f6563d5f1 100644
--- a/sdks/python/apache_beam/ml/inference/base_test.py
+++ b/sdks/python/apache_beam/ml/inference/base_test.py
@@ -891,8 +891,8 @@ class RunInferenceBaseTest(unittest.TestCase):
   with TestPipeline() as pipeline:
 # Start with bad example which gets timed out.
 # Then provide plenty of time for GC to happen.
-examples = [20] + [1] * 15
-expected_good = [1] * 15
+examples = [20] + [1] * 60
+expected_good = [1] * 60
 expected_bad = [20]
 pcoll = pipeline | 'start' >> beam.Create(examples)
 main, other = pcoll | base.RunInference(



(beam) branch damccorm-patch-1 created (now 6060a7213d6)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch damccorm-patch-1
in repository https://gitbox.apache.org/repos/asf/beam.git


  at 6060a7213d6 Bump commons-cli:commons-cli from 1.8.0 to 1.9.0

This branch includes the following new commits:

 new 6060a7213d6 Bump commons-cli:commons-cli from 1.8.0 to 1.9.0

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




(beam) 01/01: Bump commons-cli:commons-cli from 1.8.0 to 1.9.0

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch damccorm-patch-1
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 6060a7213d63e343bc31d29e0bf6eecf39228dfa
Author: Danny McCormick 
AuthorDate: Wed Aug 28 20:09:07 2024 +0100

Bump commons-cli:commons-cli from 1.8.0 to 1.9.0
---
 release/build.gradle.kts | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/release/build.gradle.kts b/release/build.gradle.kts
index 3b55542f939..ca1c152c9eb 100644
--- a/release/build.gradle.kts
+++ b/release/build.gradle.kts
@@ -29,8 +29,8 @@ val library = project.extensions.extraProperties["library"] 
as Map

(beam) branch master updated: Fix remote execution test flakiness in tearDown (#32328)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new a6cef9210b9 Fix remote execution test flakiness in tearDown (#32328)
a6cef9210b9 is described below

commit a6cef9210b92e2006c29c2f6fe90545a7abe2c8d
Author: Yi Hu 
AuthorDate: Wed Aug 28 14:36:29 2024 -0400

Fix remote execution test flakiness in tearDown (#32328)
---
 .../beam/runners/fnexecution/control/RemoteExecutionTest.java  | 10 ++
 1 file changed, 6 insertions(+), 4 deletions(-)

diff --git 
a/runners/java-fn-execution/src/test/java/org/apache/beam/runners/fnexecution/control/RemoteExecutionTest.java
 
b/runners/java-fn-execution/src/test/java/org/apache/beam/runners/fnexecution/control/RemoteExecutionTest.java
index 9bdd089c0cd..874748d7b97 100644
--- 
a/runners/java-fn-execution/src/test/java/org/apache/beam/runners/fnexecution/control/RemoteExecutionTest.java
+++ 
b/runners/java-fn-execution/src/test/java/org/apache/beam/runners/fnexecution/control/RemoteExecutionTest.java
@@ -264,10 +264,12 @@ public class RemoteExecutionTest implements Serializable {
 try {
   sdkHarnessExecutorFuture.get();
 } catch (ExecutionException e) {
-  if (e.getCause() instanceof RuntimeException
-  && e.getCause().getCause() instanceof InterruptedException) {
-// expected
-  } else {
+  Throwable ex = e.getCause();
+  while (ex instanceof RuntimeException) {
+ex = ex.getCause();
+  }
+  // InterruptedException at call tree is expected
+  if (!(ex instanceof InterruptedException)) {
 throw e;
   }
 }



(beam) branch master updated: Filter out old actions runs from dashboard (#32347)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new a85b0a636dc Filter out old actions runs from dashboard (#32347)
a85b0a636dc is described below

commit a85b0a636dc54483e20f981e3ee5b2b3369578c6
Author: Danny McCormick 
AuthorDate: Wed Aug 28 17:26:58 2024 +0100

Filter out old actions runs from dashboard (#32347)
---
 .test-infra/metrics/sync/github/github_runs_prefetcher/code/main.py | 3 +++
 1 file changed, 3 insertions(+)

diff --git 
a/.test-infra/metrics/sync/github/github_runs_prefetcher/code/main.py 
b/.test-infra/metrics/sync/github/github_runs_prefetcher/code/main.py
index e8fe60b4a7d..fd4a5e713fe 100644
--- a/.test-infra/metrics/sync/github/github_runs_prefetcher/code/main.py
+++ b/.test-infra/metrics/sync/github/github_runs_prefetcher/code/main.py
@@ -307,6 +307,8 @@ async def fetch_workflow_runs():
 number_of_entries_per_page = 100  # The number of results per page (max 
100)
 params = {"branch": "master", "page": page, "per_page": 
number_of_entries_per_page}
 concurrent_requests = 30  # Number of requests to send simultaneously
+start = datetime.now() - timedelta(days=90)
+earliest_run_creation_date = start.strftime('%Y-%m-%d')
 semaphore = asyncio.Semaphore(concurrent_requests)
 
 print("Start fetching recent workflow runs")
@@ -338,6 +340,7 @@ async def fetch_workflow_runs():
 "page": page,
 "per_page": number_of_entries_per_page,
 "exclude_pull_requests": "true",
+"created": f'>={earliest_run_creation_date}',
 }
 workflow_run_tasks.append(fetch(runs_url, semaphore, params, 
headers))
 page += 1



(beam) branch users/damccorm/grafanaSync deleted (was 20d27727ba4)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/grafanaSync
in repository https://gitbox.apache.org/repos/asf/beam.git


 was 20d27727ba4 Filter out old actions runs from dashboard

The revisions that were on this branch are still contained in
other references; therefore, this change does not discard any commits
from the repository.



(beam) branch dependabot/gradle/commons-cli-commons-cli-1.9.0 updated (d02ac289fb0 -> e7e18aad2b1)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch 
dependabot/gradle/commons-cli-commons-cli-1.9.0
in repository https://gitbox.apache.org/repos/asf/beam.git


from d02ac289fb0 Bump commons-cli:commons-cli from 1.8.0 to 1.9.0
 add e7e18aad2b1 Fix pr building of wheels (#32348)

No new revisions were added by this update.

Summary of changes:
 .github/workflows/build_wheels.yml | 6 --
 1 file changed, 4 insertions(+), 2 deletions(-)



(beam) branch damccorm-patch-1 deleted (was 278904a32eb)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch damccorm-patch-1
in repository https://gitbox.apache.org/repos/asf/beam.git


 was 278904a32eb Fix pr building of wheels

The revisions that were on this branch are still contained in
other references; therefore, this change does not discard any commits
from the repository.



(beam) 01/01: Fix pr building of wheels

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch damccorm-patch-1
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 278904a32eb60849890708e75aca9e34df06ce88
Author: Danny McCormick 
AuthorDate: Wed Aug 28 15:08:42 2024 +0100

Fix pr building of wheels
---
 .github/workflows/build_wheels.yml | 6 --
 1 file changed, 4 insertions(+), 2 deletions(-)

diff --git a/.github/workflows/build_wheels.yml 
b/.github/workflows/build_wheels.yml
index 69ffec573fc..ccea89a4c20 100644
--- a/.github/workflows/build_wheels.yml
+++ b/.github/workflows/build_wheels.yml
@@ -274,6 +274,7 @@ jobs:
   if: startsWith(matrix.os_python.os, 'macos')
   run: brew install coreutils
 - name: Add checksums
+  if: ${{ contains(matrix.os_python.python, matrix.py_version) }}
   working-directory: apache-beam-source/wheelhouse/
   run: |
 for file in *.whl; do
@@ -281,6 +282,7 @@ jobs:
 done
   shell: bash
 - name: Upload wheels as artifacts
+  if: ${{ contains(matrix.os_python.python, matrix.py_version) }}
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
   uses: actions/upload-artifact@v3
   with:
@@ -298,7 +300,7 @@ jobs:
   run: cibuildwheel --print-build-identifiers && cibuildwheel --output-dir 
wheelhouse
   shell: bash
 - name: Add RC checksums
-  if: ${{ needs.build_source.outputs.is_rc == 1 }}
+  if: ${{ needs.build_source.outputs.is_rc == 1 && 
contains(matrix.os_python.python, matrix.py_version) }}
   working-directory: apache-beam-source-rc/wheelhouse/
   run: |
 for file in *.whl; do
@@ -306,7 +308,7 @@ jobs:
 done
   shell: bash
 - name: Upload RC wheels as artifacts
-  if: ${{ needs.build_source.outputs.is_rc == 1 }}
+  if: ${{ needs.build_source.outputs.is_rc == 1 && 
contains(matrix.os_python.python, matrix.py_version) }}
   # Pinned to v3 because of 
https://github.com/actions/download-artifact/issues/249
   uses: actions/upload-artifact@v3
   with:



(beam) branch damccorm-patch-1 created (now 278904a32eb)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch damccorm-patch-1
in repository https://gitbox.apache.org/repos/asf/beam.git


  at 278904a32eb Fix pr building of wheels

This branch includes the following new commits:

 new 278904a32eb Fix pr building of wheels

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




(beam) branch users/damccorm/grafanaSync created (now 20d27727ba4)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/grafanaSync
in repository https://gitbox.apache.org/repos/asf/beam.git


  at 20d27727ba4 Filter out old actions runs from dashboard

This branch includes the following new commits:

 new 20d27727ba4 Filter out old actions runs from dashboard

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




(beam) 01/01: Filter out old actions runs from dashboard

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch users/damccorm/grafanaSync
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 20d27727ba46fd4b920e7519f03f0d44ce015f3f
Author: Danny McCormick 
AuthorDate: Wed Aug 28 14:34:55 2024 +0100

Filter out old actions runs from dashboard
---
 .test-infra/metrics/sync/github/github_runs_prefetcher/code/main.py | 3 +++
 1 file changed, 3 insertions(+)

diff --git 
a/.test-infra/metrics/sync/github/github_runs_prefetcher/code/main.py 
b/.test-infra/metrics/sync/github/github_runs_prefetcher/code/main.py
index e8fe60b4a7d..fd4a5e713fe 100644
--- a/.test-infra/metrics/sync/github/github_runs_prefetcher/code/main.py
+++ b/.test-infra/metrics/sync/github/github_runs_prefetcher/code/main.py
@@ -307,6 +307,8 @@ async def fetch_workflow_runs():
 number_of_entries_per_page = 100  # The number of results per page (max 
100)
 params = {"branch": "master", "page": page, "per_page": 
number_of_entries_per_page}
 concurrent_requests = 30  # Number of requests to send simultaneously
+start = datetime.now() - timedelta(days=90)
+earliest_run_creation_date = start.strftime('%Y-%m-%d')
 semaphore = asyncio.Semaphore(concurrent_requests)
 
 print("Start fetching recent workflow runs")
@@ -338,6 +340,7 @@ async def fetch_workflow_runs():
 "page": page,
 "per_page": number_of_entries_per_page,
 "exclude_pull_requests": "true",
+"created": f'>={earliest_run_creation_date}',
 }
 workflow_run_tasks.append(fetch(runs_url, semaphore, params, 
headers))
 page += 1



(beam) branch users/damccorm/pulsar deleted (was b6be53bb5b0)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/pulsar
in repository https://gitbox.apache.org/repos/asf/beam.git


 was b6be53bb5b0 spotless

The revisions that were on this branch are still contained in
other references; therefore, this change does not discard any commits
from the repository.



(beam) branch master updated: Add quality warnings to pulsar (#32346)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch master
in repository https://gitbox.apache.org/repos/asf/beam.git


The following commit(s) were added to refs/heads/master by this push:
 new 8cc80ff44c5 Add quality warnings to pulsar (#32346)
8cc80ff44c5 is described below

commit 8cc80ff44c56125d59e010b4299582ecef6b40ae
Author: Danny McCormick 
AuthorDate: Wed Aug 28 14:10:37 2024 +0100

Add quality warnings to pulsar (#32346)

* Add quality warnings to pulsar

* spotless
---
 .../main/java/org/apache/beam/sdk/io/pulsar/PulsarIO.java | 15 +++
 .../org/apache/beam/sdk/io/pulsar/ReadFromPulsarDoFn.java |  5 +
 .../org/apache/beam/sdk/io/pulsar/WriteToPulsarDoFn.java  |  5 +
 .../java/org/apache/beam/sdk/io/pulsar/package-info.java  |  6 +-
 4 files changed, 30 insertions(+), 1 deletion(-)

diff --git 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/PulsarIO.java 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/PulsarIO.java
index 6d0f0a08a01..aaff08a96d3 100644
--- 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/PulsarIO.java
+++ 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/PulsarIO.java
@@ -31,12 +31,22 @@ import org.apache.pulsar.client.api.PulsarClient;
 import org.checkerframework.checker.nullness.qual.Nullable;
 import org.joda.time.Instant;
 
+/**
+ * Class for reading and writing from Apache Pulsar. Support is currently 
incomplete, and there may
+ * be bugs; see https://github.com/apache/beam/issues/31078 for more info, and 
comment in that issue
+ * if you run into issues with this IO.
+ */
 @SuppressWarnings({"rawtypes", "nullness"})
 public class PulsarIO {
 
   /** Static class, prevent instantiation. */
   private PulsarIO() {}
 
+  /**
+   * Read from Apache Pulsar. Support is currently incomplete, and there may 
be bugs; see
+   * https://github.com/apache/beam/issues/31078 for more info, and comment in 
that issue if you run
+   * into issues with this IO.
+   */
   public static Read read() {
 return new AutoValue_PulsarIO_Read.Builder()
 .setPulsarClient(PulsarIOUtils.PULSAR_CLIENT_SERIALIZABLE_FUNCTION)
@@ -144,6 +154,11 @@ public class PulsarIO {
 }
   }
 
+  /**
+   * Write to Apache Pulsar. Support is currently incomplete, and there may be 
bugs; see
+   * https://github.com/apache/beam/issues/31078 for more info, and comment in 
that issue if you run
+   * into issues with this IO.
+   */
   public static Write write() {
 return new AutoValue_PulsarIO_Write.Builder().build();
   }
diff --git 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/ReadFromPulsarDoFn.java
 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/ReadFromPulsarDoFn.java
index 97a8dcd6e2b..a4e0a78e634 100644
--- 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/ReadFromPulsarDoFn.java
+++ 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/ReadFromPulsarDoFn.java
@@ -43,6 +43,11 @@ import org.joda.time.Instant;
 import org.slf4j.Logger;
 import org.slf4j.LoggerFactory;
 
+/**
+ * Transform for reading from Apache Pulsar. Support is currently incomplete, 
and there may be bugs;
+ * see https://github.com/apache/beam/issues/31078 for more info, and comment 
in that issue if you
+ * run into issues with this IO.
+ */
 @DoFn.UnboundedPerElement
 @SuppressWarnings({"rawtypes", "nullness"})
 public class ReadFromPulsarDoFn extends DoFn {
diff --git 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/WriteToPulsarDoFn.java
 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/WriteToPulsarDoFn.java
index 9659940e02b..375e8ce92a3 100644
--- 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/WriteToPulsarDoFn.java
+++ 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/WriteToPulsarDoFn.java
@@ -23,6 +23,11 @@ import org.apache.pulsar.client.api.Producer;
 import org.apache.pulsar.client.api.PulsarClient;
 import org.apache.pulsar.client.api.PulsarClientException;
 
+/**
+ * Transform for writing to Apache Pulsar. Support is currently incomplete, 
and there may be bugs;
+ * see https://github.com/apache/beam/issues/31078 for more info, and comment 
in that issue if you
+ * run into issues with this IO.
+ */
 @DoFn.UnboundedPerElement
 @SuppressWarnings({"rawtypes", "nullness"})
 public class WriteToPulsarDoFn extends DoFn {
diff --git 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/package-info.java
 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/package-info.java
index 8b7ab101a72..ffa15257fe5 100644
--- 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/package-info.java
+++ 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/package-info.java
@@ -15,5 +15,9 @@
  * See the

(beam) branch users/damccorm/pulsar updated (523f999d31f -> b6be53bb5b0)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/pulsar
in repository https://gitbox.apache.org/repos/asf/beam.git


from 523f999d31f Add quality warnings to pulsar
 add b6be53bb5b0 spotless

No new revisions were added by this update.

Summary of changes:
 .../org/apache/beam/sdk/io/pulsar/PulsarIO.java| 30 +++---
 .../beam/sdk/io/pulsar/ReadFromPulsarDoFn.java | 10 
 .../beam/sdk/io/pulsar/WriteToPulsarDoFn.java  | 10 
 .../apache/beam/sdk/io/pulsar/package-info.java| 10 
 4 files changed, 30 insertions(+), 30 deletions(-)



(beam) branch users/damccorm/pulsar created (now 523f999d31f)

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/pulsar
in repository https://gitbox.apache.org/repos/asf/beam.git


  at 523f999d31f Add quality warnings to pulsar

This branch includes the following new commits:

 new 523f999d31f Add quality warnings to pulsar

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




(beam) 01/01: Add quality warnings to pulsar

2024-08-28 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch users/damccorm/pulsar
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 523f999d31f240c6e9418b53a756e62b252ccfea
Author: Danny McCormick 
AuthorDate: Wed Aug 28 13:50:00 2024 +0100

Add quality warnings to pulsar
---
 .../main/java/org/apache/beam/sdk/io/pulsar/PulsarIO.java | 15 +++
 .../org/apache/beam/sdk/io/pulsar/ReadFromPulsarDoFn.java |  5 +
 .../org/apache/beam/sdk/io/pulsar/WriteToPulsarDoFn.java  |  5 +
 .../java/org/apache/beam/sdk/io/pulsar/package-info.java  |  6 +-
 4 files changed, 30 insertions(+), 1 deletion(-)

diff --git 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/PulsarIO.java 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/PulsarIO.java
index 6d0f0a08a01..e5cde0aec01 100644
--- 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/PulsarIO.java
+++ 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/PulsarIO.java
@@ -31,12 +31,22 @@ import org.apache.pulsar.client.api.PulsarClient;
 import org.checkerframework.checker.nullness.qual.Nullable;
 import org.joda.time.Instant;
 
+/** Class for reading and writing from Apache Pulsar. Support is
+ * currently incomplete, and there may be bugs; see
+ * https://github.com/apache/beam/issues/31078 for more info, and comment
+ * in that issue if you run into issues with this IO.
+*/
 @SuppressWarnings({"rawtypes", "nullness"})
 public class PulsarIO {
 
   /** Static class, prevent instantiation. */
   private PulsarIO() {}
 
+  /** Read from Apache Pulsar. Support is currently incomplete, and there may
+   * be bugs; see
+   * https://github.com/apache/beam/issues/31078 for more info, and comment
+   * in that issue if you run into issues with this IO.
+  */
   public static Read read() {
 return new AutoValue_PulsarIO_Read.Builder()
 .setPulsarClient(PulsarIOUtils.PULSAR_CLIENT_SERIALIZABLE_FUNCTION)
@@ -144,6 +154,11 @@ public class PulsarIO {
 }
   }
 
+  /** Write to Apache Pulsar. Support is currently incomplete, and there may
+   * be bugs; see
+   * https://github.com/apache/beam/issues/31078 for more info, and comment
+   * in that issue if you run into issues with this IO.
+  */
   public static Write write() {
 return new AutoValue_PulsarIO_Write.Builder().build();
   }
diff --git 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/ReadFromPulsarDoFn.java
 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/ReadFromPulsarDoFn.java
index 97a8dcd6e2b..14ca4b4d5b6 100644
--- 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/ReadFromPulsarDoFn.java
+++ 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/ReadFromPulsarDoFn.java
@@ -43,6 +43,11 @@ import org.joda.time.Instant;
 import org.slf4j.Logger;
 import org.slf4j.LoggerFactory;
 
+/** Transform for reading from Apache Pulsar. Support is
+ * currently incomplete, and there may be bugs; see
+ * https://github.com/apache/beam/issues/31078 for more info, and comment
+ * in that issue if you run into issues with this IO.
+*/
 @DoFn.UnboundedPerElement
 @SuppressWarnings({"rawtypes", "nullness"})
 public class ReadFromPulsarDoFn extends DoFn {
diff --git 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/WriteToPulsarDoFn.java
 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/WriteToPulsarDoFn.java
index 9659940e02b..3199bf792bd 100644
--- 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/WriteToPulsarDoFn.java
+++ 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/WriteToPulsarDoFn.java
@@ -23,6 +23,11 @@ import org.apache.pulsar.client.api.Producer;
 import org.apache.pulsar.client.api.PulsarClient;
 import org.apache.pulsar.client.api.PulsarClientException;
 
+/** Transform for writing to Apache Pulsar. Support is
+ * currently incomplete, and there may be bugs; see
+ * https://github.com/apache/beam/issues/31078 for more info, and comment
+ * in that issue if you run into issues with this IO.
+*/
 @DoFn.UnboundedPerElement
 @SuppressWarnings({"rawtypes", "nullness"})
 public class WriteToPulsarDoFn extends DoFn {
diff --git 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/package-info.java
 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/package-info.java
index 8b7ab101a72..27effe2e077 100644
--- 
a/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/package-info.java
+++ 
b/sdks/java/io/pulsar/src/main/java/org/apache/beam/sdk/io/pulsar/package-info.java
@@ -15,5 +15,9 @@
  * See the License for the specific language governing permissions and
  * limitations under the License.
  */
-/** Transforms for reading and writing from Apache Pulsar. */
+/** Transforms for reading and writing from Apache Pulsar. Support is
+ 

(beam) 01/01: Remove beam summit banner

2024-08-27 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a commit to branch users/damccorm/beamsummit
in repository https://gitbox.apache.org/repos/asf/beam.git

commit 45036736e0a2e4cf8562392be6bb0dcffe99bae1
Author: Danny McCormick 
AuthorDate: Tue Aug 27 18:00:15 2024 +0100

Remove beam summit banner
---
 website/www/site/layouts/partials/header.html | 6 --
 1 file changed, 6 deletions(-)

diff --git a/website/www/site/layouts/partials/header.html 
b/website/www/site/layouts/partials/header.html
index a01ce5de084..139ae9bc885 100644
--- a/website/www/site/layouts/partials/header.html
+++ b/website/www/site/layouts/partials/header.html
@@ -207,12 +207,6 @@
 
 
   
-
-  https://beamsummit.org/register/";>
-
-
-  
-
 
   https://tour.beam.apache.org";>
 



(beam) branch users/damccorm/beamsummit created (now 45036736e0a)

2024-08-27 Thread damccorm
This is an automated email from the ASF dual-hosted git repository.

damccorm pushed a change to branch users/damccorm/beamsummit
in repository https://gitbox.apache.org/repos/asf/beam.git


  at 45036736e0a Remove beam summit banner

This branch includes the following new commits:

 new 45036736e0a Remove beam summit banner

The 1 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.




  1   2   3   4   5   6   7   8   9   10   >