lindong28 commented on code in PR #212:
URL: https://github.com/apache/flink-ml/pull/212#discussion_r1115332664


##########
flink-ml-benchmark/src/main/java/org/apache/flink/ml/benchmark/datagenerator/common/DenseVectorArrayGenerator.java:
##########
@@ -42,7 +42,7 @@ protected RowGenerator[] getRowGenerators() {
         return new RowGenerator[] {
             new RowGenerator(getNumValues(), getSeed()) {
                 @Override
-                protected Row nextRow() {
+                protected Row getRow() {

Review Comment:
   I think we need to make this optimization since 2 sec still a big overhead. 
And previously I have seen bigger difference (more than 3 sec) after we make 
this optimization.
   
   We expect Flink ML's performance to be close to Spark ML. That means we 
expect Flink ML's total time for this benchmark to be less than 6 sec after we 
finish all optimizations. Then 2 sec is going to take a big percentage of the 
total time.
   
   Can you explain why this PR will introduce stability issues? Did you observe 
additional stability issues with this PR?



-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: issues-unsubscr...@flink.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org

Reply via email to