[GitHub] [beam] ifat-afek commented on issue #21569: Beam slowness compared to flink-native

2023-01-22 Thread via GitHub
ifat-afek commented on issue #21569: URL: https://github.com/apache/beam/issues/21569#issuecomment-1399916871 Thanks! I already tried fasterCopy and the performance improvement was not significant. -- This is an automated message from the Apache Git Service. To respond to the message,

[GitHub] [beam] jihad-akl commented on issue #25114: [Bug]: ReadFromKafka not forwarding in streaming mode version 2.44.0

2023-01-22 Thread via GitHub
jihad-akl commented on issue #25114: URL: https://github.com/apache/beam/issues/25114#issuecomment-1399879220 True, so how can use the apache beam pipeline in streaming mode if it only gather data and not send them to the next step? The print received does not trigger every time I receive

[GitHub] [beam] Abacn commented on pull request #25115: Fix Python PreCommit dependency compatibility test not run or redundant setup

2023-01-22 Thread via GitHub
Abacn commented on PR #25115: URL: https://github.com/apache/beam/pull/25115#issuecomment-1399710162 Python_Dataframes precommit now has Task like `:sdks:python:test-suites:tox:py38:testPy38pandas-15` running: https://ci-beam.apache.org/job/beam_PreCommit_Python_Dataframes_Commit/225/consol

[GitHub] [beam] Abacn opened a new pull request, #25115: Fix Python PreCommit dependency compatibility test not run or redundant setup

2023-01-22 Thread via GitHub
Abacn opened a new pull request, #25115: URL: https://github.com/apache/beam/pull/25115 Followup of #24866 It is found that dataframe compatibility matrix tests are actually not exercised after splitting due to a typo. Also, sharded precommits set up full version of pyarrow an

[GitHub] [beam] Abacn commented on pull request #23785: [Test only][Do not merge] Draft PR for testing remote runs

2023-01-22 Thread via GitHub
Abacn commented on PR #23785: URL: https://github.com/apache/beam/pull/23785#issuecomment-1399672718 Run Load Tests Python ParDo Flink Batch -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the sp

[GitHub] [beam] Abacn commented on pull request #23785: [Test only][Do not merge] Draft PR for testing remote runs

2023-01-22 Thread via GitHub
Abacn commented on PR #23785: URL: https://github.com/apache/beam/pull/23785#issuecomment-1399666275 run seed job -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubsc

[GitHub] [beam] Abacn commented on issue #21569: Beam slowness compared to flink-native

2023-01-22 Thread via GitHub
Abacn commented on issue #21569: URL: https://github.com/apache/beam/issues/21569#issuecomment-1399657621 For someone finding this issue relevant to them, adding a pipeline option `--fasterCopy=true` may help. context: #13240 -- This is an automated message from the Apache Git Service. To

[GitHub] [beam] Abacn commented on issue #25114: [Bug]: ReadFromKafka not forwarding in streaming mode version 2.44.0

2023-01-22 Thread via GitHub
Abacn commented on issue #25114: URL: https://github.com/apache/beam/issues/25114#issuecomment-1399639396 streaming by definition will not end; despite python directly runner is not for production and do not have full support for streaming. This is most likely working as intended -- This

[GitHub] [beam] Abacn commented on pull request #25111: Bump timeout for default transient expansion service

2023-01-22 Thread via GitHub
Abacn commented on PR #25111: URL: https://github.com/apache/beam/pull/25111#issuecomment-1399631751 3 runs of previously flaky test all succeeded: https://ci-beam.apache.org/job/beam_PostCommit_XVR_Direct_PR/ (run 138,139,140), ready for review -- This is an automated message from t

[GitHub] [beam] reuvenlax commented on pull request #25094: Externalizing the StreamWriter parameters for StorageWrites

2023-01-22 Thread via GitHub
reuvenlax commented on PR #25094: URL: https://github.com/apache/beam/pull/25094#issuecomment-1399628448 Run Java_GCP_IO_Direct PreCommit -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the speci

[GitHub] [beam] prodriguezdefino commented on pull request #25113: #25112 No longer use GetTable to implement CREATE_IF_NEEDED to avoid low quotas

2023-01-22 Thread via GitHub
prodriguezdefino commented on PR #25113: URL: https://github.com/apache/beam/pull/25113#issuecomment-1399623014 LGTM -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsu

[GitHub] [beam] reuvenlax commented on pull request #25113: #25112 No longer use GetTable to implement CREATE_IF_NEEDED to avoid low quotas

2023-01-22 Thread via GitHub
reuvenlax commented on PR #25113: URL: https://github.com/apache/beam/pull/25113#issuecomment-1399618157 The static map is in CreateTableHelpers.java -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go

[GitHub] [beam] prodriguezdefino commented on pull request #25113: #25112 No longer use GetTable to implement CREATE_IF_NEEDED to avoid low quotas

2023-01-22 Thread via GitHub
prodriguezdefino commented on PR #25113: URL: https://github.com/apache/beam/pull/25113#issuecomment-1399614846 > One thing to note: CreateTableHelpers keeps a process-wide static cache of tables known to exist, so it was never done on every bundle. The problem was. that when using at-least

[GitHub] [beam] reuvenlax commented on pull request #25113: #25112 No longer use GetTable to implement CREATE_IF_NEEDED to avoid low quotas

2023-01-22 Thread via GitHub
reuvenlax commented on PR #25113: URL: https://github.com/apache/beam/pull/25113#issuecomment-1399613804 One thing to note: CreateTableHelpers keeps a process-wide static cache of tables known to exist, so it was never done on every bundle. The problem was. that when using at-least-once we

[GitHub] [beam] prodriguezdefino commented on a diff in pull request #25113: #25112 No longer use GetTable to implement CREATE_IF_NEEDED to avoid low quotas

2023-01-22 Thread via GitHub
prodriguezdefino commented on code in PR #25113: URL: https://github.com/apache/beam/pull/25113#discussion_r1083545037 ## sdks/java/io/google-cloud-platform/src/main/java/org/apache/beam/sdk/io/gcp/bigquery/CreateTableHelpers.java: ## @@ -46,8 +48,28 @@ public class CreateTableH

[GitHub] [beam] Abacn commented on pull request #25111: Bump timeout for default transient expansion service

2023-01-22 Thread via GitHub
Abacn commented on PR #25111: URL: https://github.com/apache/beam/pull/25111#issuecomment-1399612351 Run XVR_Direct PostCommit -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment

[GitHub] [beam] reuvenlax commented on pull request #25113: #25112 No longer use GetTable to implement CREATE_IF_NEEDED to avoid low quotas

2023-01-22 Thread via GitHub
reuvenlax commented on PR #25113: URL: https://github.com/apache/beam/pull/25113#issuecomment-1399568660 Run Java PreCommit -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment.

[GitHub] [beam] reuvenlax commented on pull request #25113: #25112 No longer use GetTable to implement CREATE_IF_NEEDED to avoid low quotas

2023-01-22 Thread via GitHub
reuvenlax commented on PR #25113: URL: https://github.com/apache/beam/pull/25113#issuecomment-1399568608 Run Spotless PreCommit -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific commen

[GitHub] [beam] Abacn commented on pull request #25111: Bump timeout for default transient expansion service

2023-01-22 Thread via GitHub
Abacn commented on PR #25111: URL: https://github.com/apache/beam/pull/25111#issuecomment-1399505198 Run XVR_Direct PostCommit -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment

[GitHub] [beam] jihad-akl opened a new issue, #25114: [Bug]:

2023-01-22 Thread via GitHub
jihad-akl opened a new issue, #25114: URL: https://github.com/apache/beam/issues/25114 ### What happened? ReadFromKafka not forwarding in streaming mode. beam_options = PipelineOptions(streaming = True) pipeline = beam.Pipeline(options=beam_options) messages = (