[ 
https://issues.apache.org/jira/browse/BEAM-9901?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17102041#comment-17102041
 ] 

Ismaël Mejía commented on BEAM-9901:
------------------------------------

Thanks. It would be great to get an intern to advance this subject, even if
the work does not end up totally complete; count on me for extra advice/reviews
if needed. I suggest ignoring the generator part and focusing on advancing the
writing and execution of the queries. Java's generator can be used at first to
export data into both Pubsub and GCS. Also, running the queries on the
generated data would give us a more standard baseline for comparing the
performance of similar queries between portable and non-portable runners.



> Beam python nexmark benchmark suite
> -----------------------------------
>
>                 Key: BEAM-9901
>                 URL: https://issues.apache.org/jira/browse/BEAM-9901
>             Project: Beam
>          Issue Type: Task
>          Components: benchmarking-py, testing-nexmark
>            Reporter: Yichi Zhang
>            Priority: Major
>             Fix For: Not applicable
>
>
> Nexmark is a suite of queries (pipelines) used to measure performance and
> non-regression in Beam. Currently it exists in the Java SDK:
> [https://github.com/apache/beam/tree/master/sdks/java/testing/nexmark/src/main/java/org/apache/beam/sdk/nexmark]
> In this project we would like to create a Nexmark benchmark suite in the
> Python SDK equivalent to what Beam has for Java. This would allow us to
> determine the performance impact of pull requests on Python pipelines.



--
This message was sent by Atlassian Jira
(v8.3.4#803005)
