[GitHub] [flink] pnowojski commented on a change in pull request #8409: [FLINK-12478] Decompose monolithic run-loops in StreamTask implementa…

GitBox Mon, 20 May 2019 05:00:56 -0700

pnowojski commented on a change in pull request #8409: [FLINK-12478] Decompose 
monolithic run-loops in StreamTask implementa…
URL: https://github.com/apache/flink/pull/8409#discussion_r285551963


 ##########
 File path: 
flink-streaming-java/src/main/java/org/apache/flink/streaming/runtime/tasks/OneInputStreamTask.java
 ##########
 @@ -98,13 +96,8 @@ public void init() throws Exception {
        }
 
        @Override
-       protected void run() throws Exception {
-               // cache processor reference on the stack, to make the code 
more JIT friendly
-               final StreamInputProcessor<IN> inputProcessor = 
this.inputProcessor;
-
-               while (running && inputProcessor.processInput()) {
-                       // all the work happens in the "processInput" method
-               }
+       protected boolean performDefaultAction() throws Exception {
 
 Review comment:
   As I discussed offline with both of you, I think we should mostly focus on 
the higher level benchmarks (as implemented by Stefan). Lower level benchmarks 
might be valuable if there are various of special cases that we want to test 
and we already know that the thing they are covering can be/is a bottleneck 
(network, state backend accesses).
   
   If we can not measure the performance regression in those higher level 
benchmarks, I would say it will be impossible for it to be visible by a final 
user on even higher level (cluster). Lower level benchmarks can either measure 
non visible change or measure a change, that is not even there (JVM might not 
be able to apply the same optimisation with larger context, or vice versa, when 
running full code it might be able to optimise something that it wasn't able to 
optimise during unit style benchmarks).
   
   Regrading the potential improvement here. @StefanRRichter please open a pull 
request with those two newly added benchmarks. I would like first to measure 
couple of results on the master branch for those benchmarks to get a good 
baseline, before merging this PR.

----------------------------------------------------------------
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.
 
For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


With regards,
Apache Git Services

[GitHub] [flink] pnowojski commented on a change in pull request #8409: [FLINK-12478] Decompose monolithic run-loops in StreamTask implementa…

Reply via email to