This is an automated email from the ASF dual-hosted git repository. holden pushed a commit to branch master in repository https://gitbox.apache.org/repos/asf/spark.git
The following commit(s) were added to refs/heads/master by this push: new 91caf0b [DOCS] MINOR Complement the document of stringOrderType for StringIndexer in PySpark 91caf0b is described below commit 91caf0bfce4706a264fcfe222fa500354ce69ff1 Author: Liang-Chi Hsieh <vii...@gmail.com> AuthorDate: Thu Feb 21 08:36:48 2019 -0800 [DOCS] MINOR Complement the document of stringOrderType for StringIndexer in PySpark ## What changes were proposed in this pull request? We revised the behavior of the param `stringOrderType` of `StringIndexer` in case of equal frequency when under frequencyDesc/Asc. This isn't reflected in PySpark's document. We should do it. ## How was this patch tested? Only document change. Closes #23849 from viirya/py-stringindexer-doc. Authored-by: Liang-Chi Hsieh <vii...@gmail.com> Signed-off-by: Holden Karau <hol...@pigscanfly.ca> --- python/pyspark/ml/feature.py | 5 ++++- 1 file changed, 4 insertions(+), 1 deletion(-) diff --git a/python/pyspark/ml/feature.py b/python/pyspark/ml/feature.py index 0d1e9bd..8583046 100755 --- a/python/pyspark/ml/feature.py +++ b/python/pyspark/ml/feature.py @@ -2299,7 +2299,10 @@ class _StringIndexerParams(JavaParams, HasHandleInvalid, HasInputCol, HasOutputC stringOrderType = Param(Params._dummy(), "stringOrderType", "How to order labels of string column. The first label after " + "ordering is assigned an index of 0. Supported options: " + - "frequencyDesc, frequencyAsc, alphabetDesc, alphabetAsc.", + "frequencyDesc, frequencyAsc, alphabetDesc, alphabetAsc. " + + "Default is frequencyDesc. In case of equal frequency when " + + "under frequencyDesc/Asc, the strings are further sorted " + + "alphabetically", typeConverter=TypeConverters.toString) handleInvalid = Param(Params._dummy(), "handleInvalid", "how to handle invalid data (unseen " + --------------------------------------------------------------------- To unsubscribe, e-mail: commits-unsubscr...@spark.apache.org For additional commands, e-mail: commits-h...@spark.apache.org