shahar603 opened a new pull request #30374:
URL: https://github.com/apache/spark/pull/30374


   ### What changes were proposed in this pull request?
   
   Added an optional `initialModel` parameter to the `GaussianMixture` class.
   
   ### Why are the changes needed?
   
   To allow non-random initialization in ML's Gaussian Mixture. The 
algorithm's results depend on the initialization, and the existing random 
initialization may be limiting in some cases. This feature already exists in 
MLlib.
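   
   For reference, a minimal sketch of how the existing MLlib (RDD-based) API 
accepts an initial model. The object name, the helper `fitFromInitialModel`, 
the starting parameters, and the sample points are illustrative assumptions, 
not code from this PR:

```scala
import org.apache.spark.mllib.clustering.{GaussianMixture, GaussianMixtureModel}
import org.apache.spark.mllib.linalg.{Matrices, Vector, Vectors}
import org.apache.spark.mllib.stat.distribution.MultivariateGaussian
import org.apache.spark.rdd.RDD
import org.apache.spark.sql.SparkSession

object InitialModelSketch {
  // Hypothetical helper: runs EM for a 2-component mixture, starting from
  // hand-picked parameters instead of the default random initialization.
  def fitFromInitialModel(data: RDD[Vector]): GaussianMixtureModel = {
    // Assumed starting point: equal weights, identity covariances,
    // means placed near the two expected clusters.
    val initial = new GaussianMixtureModel(
      Array(0.5, 0.5),
      Array(
        new MultivariateGaussian(Vectors.dense(0.0, 0.0), Matrices.eye(2)),
        new MultivariateGaussian(Vectors.dense(5.0, 5.0), Matrices.eye(2))))

    new GaussianMixture()
      .setK(2)                  // set k before the initial model so the counts match
      .setInitialModel(initial)
      .run(data)
  }

  def main(args: Array[String]): Unit = {
    val spark = SparkSession.builder()
      .master("local[2]")
      .appName("InitialModelSketch")
      .getOrCreate()
    try {
      // Illustrative data: two loose clusters around (0, 0) and (5, 5).
      val points = Seq(
        Vectors.dense(-0.5, 0.2), Vectors.dense(0.3, -0.4), Vectors.dense(0.1, 0.6),
        Vectors.dense(-0.2, -0.3), Vectors.dense(0.4, 0.5),
        Vectors.dense(4.6, 5.3), Vectors.dense(5.2, 4.7), Vectors.dense(5.4, 5.5),
        Vectors.dense(4.8, 4.9), Vectors.dense(5.1, 5.2))
      val data = spark.sparkContext.parallelize(points).cache()

      val model = fitFromInitialModel(data)
      model.weights.zip(model.gaussians).foreach { case (w, g) =>
        println(s"weight=$w mu=${g.mu}")
      }
    } finally {
      spark.stop()
    }
  }
}
```

   In MLlib, `setInitialModel` expects the initial model's component count to 
match `k`, which is why `setK` is called first in this sketch.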
   
   ### Does this PR introduce _any_ user-facing change?
   
   * A new method: `setInitialModel` in the `GaussianMixture` class
   
   
   ### How was this patch tested?
   
   Tests were added to `GaussianMixtureSuite.scala`.
   They verify:
   * That an initial model can be set
   * That `GaussianMixture` fits the data starting from the initial model's 
parameters
   * That `GaussianMixture` validates that `K` equals the initial model's `K`
   

