Github user cloud-fan commented on a diff in the pull request: https://github.com/apache/spark/pull/18659#discussion_r138273168 --- Diff: python/pyspark/sql/functions.py --- @@ -2112,7 +2113,7 @@ def wrapper(*args): @since(1.3) -def udf(f=None, returnType=StringType()): +def udf(f=None, returnType=StringType(), vectorized=False): --- End diff -- I think `@pandas_udf(DoubleType())` is better than `@udf(DoubleType(), vectorized=True)`, which is more concise.
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org