[ 
https://issues.apache.org/jira/browse/BEAM-6351?focusedWorklogId=184888&page=com.atlassian.jira.plugin.system.issuetabpanels:worklog-tabpanel#worklog-184888
 ]

ASF GitHub Bot logged work on BEAM-6351:
----------------------------------------

                Author: ASF GitHub Bot
            Created on: 14/Jan/19 16:35
            Start Date: 14/Jan/19 16:35
    Worklog Time Spent: 10m 
      Work Description: apilloud commented on pull request #7497: [BEAM-6351] 
Divide separate job for "smoke" load test suites
URL: https://github.com/apache/beam/pull/7497#discussion_r247564649
 
 

 ##########
 File path: .test-infra/jenkins/job_LoadTests_Java.groovy
 ##########
 @@ -69,6 +53,53 @@ for (testConfiguration in testsConfigurations) {
     ) {
         description(testConfiguration.jobDescription)
         commonJobProperties.setTopLevelMainJobProperties(delegate)
-        loadTestsBuilder.buildTest(delegate, testConfiguration.jobDescription, 
testConfiguration.runner, testConfiguration.jobProperties, 
testConfiguration.itClass)
+        loadTestsBuilder.loadTest(delegate, testConfiguration.jobDescription, 
testConfiguration.runner, testConfiguration.jobProperties, 
testConfiguration.itClass)
+    }
+}
+
+def smokeTestConfigurations = [
+        [
+                title        : 'GroupByKey load test Direct',
+                itClass      : 
'org.apache.beam.sdk.loadtests.GroupByKeyLoadTest',
+                runner       : CommonTestProperties.Runner.DIRECT,
+                jobProperties: [
+                        publishToBigQuery: true,
+                        bigQueryDataset  : 'load_test_SMOKE',
+                        bigQueryTable    : 'direct_gbk',
+                        sourceOptions    : 
'{"numRecords":100000,"splitPointFrequencyRecords":1}',
+                        stepOptions      : 
'{"outputRecordsPerInputRecord":1,"preservesInputKeyDistribution":true}',
+                        fanout           : 10,
+                        iterations       : 1,
+                ]
+        ],
+        [
+                title        : 'GroupByKey load test Dataflow',
 
 Review comment:
   I think adding smoke tests for other runners is probably useful. Most have 
local runners so the test should be fairly quick. Dataflow doesn't have this 
option and takes a few minutes to start up, so tests less than a few minutes 
will be mostly startup time.
 
----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Issue Time Tracking
-------------------

            Worklog Id:     (was: 184888)
            Time Spent: 10m
    Remaining Estimate: 0h

> OutOfMemoryError on DirectRunner while running load tests 
> ----------------------------------------------------------
>
>                 Key: BEAM-6351
>                 URL: https://issues.apache.org/jira/browse/BEAM-6351
>             Project: Beam
>          Issue Type: Bug
>          Components: testing
>            Reporter: Kasia Kucharczyk
>            Assignee: Lukasz Gajowy
>            Priority: Major
>          Time Spent: 10m
>  Remaining Estimate: 0h
>
> The GroupByKey Java load test with 1000000000 number of records is failing on 
> DirectRunner with OutOfMemory Error. Then  the test is aborted after timeout.
> This is [example failing run of the 
> job|https://builds.apache.org/job/beam_Java_LoadTests_GroupByKey_Direct_Small_PR/2/].
> The stacktrace is following:
> {code:java}
> Exception in thread "main" java.lang.RuntimeException: 
> java.lang.OutOfMemoryError: GC overhead limit exceeded
> 18:02:15      at 
> org.apache.beam.runners.direct.DirectRunner.run(DirectRunner.java:204)
> 18:02:15      at 
> org.apache.beam.runners.direct.DirectRunner.run(DirectRunner.java:64)
> 18:02:15      at org.apache.beam.sdk.Pipeline.run(Pipeline.java:313)
> 18:02:15      at org.apache.beam.sdk.Pipeline.run(Pipeline.java:299)
> 18:02:15      at org.apache.beam.sdk.loadtests.LoadTest.run(LoadTest.java:75)
> 18:02:15      at 
> org.apache.beam.sdk.loadtests.GroupByKeyLoadTest.run(GroupByKeyLoadTest.java:58)
> 18:02:15      at 
> org.apache.beam.sdk.loadtests.GroupByKeyLoadTest.main(GroupByKeyLoadTest.java:130)
> 18:02:15 Caused by: java.lang.OutOfMemoryError: GC overhead limit exceeded
> 18:02:15      at 
> org.apache.beam.sdk.util.CoderUtils.decodeFromByteArray(CoderUtils.java:97)
> 18:02:15      at 
> org.apache.beam.sdk.util.CoderUtils.decodeFromByteArray(CoderUtils.java:92)
> 18:02:15      at 
> org.apache.beam.sdk.util.MutationDetectors$CodedValueMutationDetector.<init>(MutationDetectors.java:117)
> 18:02:15      at 
> org.apache.beam.sdk.util.MutationDetectors.forValueWithCoder(MutationDetectors.java:44)
> 18:02:15      at 
> org.apache.beam.runners.direct.ImmutabilityCheckingBundleFactory$ImmutabilityEnforcingBundle.add(ImmutabilityCheckingBundleFactory.java:112)
> 18:02:15      at 
> org.apache.beam.runners.direct.BoundedReadEvaluatorFactory$BoundedReadEvaluator.processElement(BoundedReadEvaluatorFactory.java:151)
> 18:02:15      at 
> org.apache.beam.runners.direct.DirectTransformExecutor.processElements(DirectTransformExecutor.java:160)
> 18:02:15      at 
> org.apache.beam.runners.direct.DirectTransformExecutor.run(DirectTransformExecutor.java:124)
> 18:02:15      at 
> java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511)
> 18:02:15      at java.util.concurrent.FutureTask.run(FutureTask.java:266)
> 18:02:15      at 
> java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149)
> 18:02:15      at 
> java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624)
> 18:02:15      at java.lang.Thread.run(Thread.java:748)
> 19:29:58 Build timed out (after 100 minutes). Marking the build as aborted.
> {code}
> The command to run this test is:
> {code:java}
> gradlew --info  
> -PloadTest.mainClass=org.apache.beam.sdk.loadtests.GroupByKeyLoadTest 
> -Prunner=:beam-runners-direct-java '-PloadTest.args=--publishToBigQuery=false 
> --sourceOptions={"numRecords":1000000000,"splitPointFrequencyRecords":1,"keySizeBytes":1,"valueSizeBytes":9,"numHotKeys":0,"hotKeyFraction":0,"seed":123456,"bundleSizeDistribution":{"type":"const","const":42},"forceNumInitialBundles":100,"progressShape":"LINEAR","initializeDelayDistribution":{"type":"const","const":42}}
>  
> --stepOptions={"outputRecordsPerInputRecord":1,"preservesInputKeyDistribution":true,"perBundleDelay":10000,"perBundleDelayType":"MIXED","cpuUtilizationInMixedDelay":0.5}
>  --fanout=10 --iterations=1 --runner=DirectRunner' 
> :beam-sdks-java-load-tests:run
> {code}



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

Reply via email to