[GitHub] spark issue #23236: [SPARK-26275][PYTHON][ML] Increases timeout for Streamin...

BryanCutler Wed, 05 Dec 2018 11:40:07 -0800

Github user BryanCutler commented on the issue:

    https://github.com/apache/spark/pull/23236
  
    Seems ok to me, but there are a few silly things with this test that might 
help also
    
    * why is the `stepSize` so low at 0.01? I think it would be fine at 0.1, 
but even conservatively at 0.03 would still help converge a little faster
    
    * why is it comparing the current error with the second error value 
(errors[1] - errors[-1])? Maybe because errors[1] is bigger, but should just be 
`errors[0]` and make the values more sensible, like maybe check that error is > 
0.2
    
    * having 2 `if` statements to check basically the same condition is a bit 
redundant, just `assert(len(errors) < len(predict_batches)` before the last 
return
    
    * nit: `true_predicted = []` on line 350 isn't used
    
    If you don't feel comfortable changing any of these, just increasing the 
timeout is probably fine



---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

[GitHub] spark issue #23236: [SPARK-26275][PYTHON][ML] Increases timeout for Streamin...

Reply via email to