Github user xubo245 commented on a diff in the pull request: https://github.com/apache/spark/pull/20456#discussion_r165643385 --- Diff: python/pyspark/sql/dataframe.py --- @@ -667,6 +667,55 @@ def repartition(self, numPartitions, *cols): else: raise TypeError("numPartitions should be an int or Column") + @since("2.3.0") + def repartitionByRange(self, numPartitions, *cols, **kwargs): + """ + Returns a new :class:`DataFrame` partitioned by the given partitioning expressions. The + resulting DataFrame is range partitioned. + + ``numPartitions`` can be an int to specify the target number of partitions or a Column. + If it is a Column, it will be used as the first partitioning column. If not specified, + the default number of partitions is used. --- End diff -- ok, done, please review
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org