GitHub user smurakozi opened a pull request:

    https://github.com/apache/spark/pull/20235

    [Spark-22887][ML][TESTS][WIP] ML test for StructuredStreaming: spark.ml.fpm

    ## What changes were proposed in this pull request?
    
    Converting FPGrowth tests to also check code with structured streaming, 
using the  ML testing infrastructure implemented in SPARK-22882. 
    
    Note: this is a WIP, test with Array[Byte] is not yet working due to some 
datatype issues (Array[Byte] vs Binary).
    
    ## How was this patch tested?
    N/A

You can merge this pull request into a Git repository by running:

    $ git pull https://github.com/smurakozi/spark SPARK-22887

Alternatively you can review and apply these changes as the patch at:

    https://github.com/apache/spark/pull/20235.patch

To close this pull request, make a commit to your master/trunk branch
with (at least) the following in the commit message:

    This closes #20235
    
----
commit 331129556003bcf6e4bab6559e80e46ac0858706
Author: Sandor Murakozi <smurakozi@...>
Date:   2018-01-05T12:41:53Z

    test 'FPGrowthModel setMinConfidence should affect rules generation and 
transform' is converted to use testTransformer

commit 93aff2c999eee4a88f7f4a3c32d6c7b601a918ac
Author: Sandor Murakozi <smurakozi@...>
Date:   2018-01-08T13:14:38Z

    Test 'FPGrowth fit and transform with different data types' works with 
streaming, except for Byte

commit 8b0b00070a21bd47537a7c3ad580e2af38a481bd
Author: Sandor Murakozi <smurakozi@...>
Date:   2018-01-11T11:28:46Z

    All tests use testTransformer.
    Test with Array[Byte] is missing.

commit af61845ab6acfa82c4411bce3ab4a20afebd0aa3
Author: Sandor Murakozi <smurakozi@...>
Date:   2018-01-11T11:49:27Z

    Unintentional changes in 93aff2c999 are reverted

----


---

---------------------------------------------------------------------
To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org
For additional commands, e-mail: reviews-h...@spark.apache.org

Reply via email to