Github user viirya commented on a diff in the pull request: https://github.com/apache/spark/pull/18732#discussion_r141780866 --- Diff: python/pyspark/sql/functions.py --- @@ -2206,6 +2207,10 @@ def pandas_udf(f=None, returnType=StringType()): | 8| JOHN DOE| 22| +----------+--------------+------------+ """ + import pandas as pd + if isinstance(returnType, pd.Series): + returnType = from_pandas_dtypes(returnType) --- End diff -- I think we don't need this. Use consistent way to express the return type should be better.
--- --------------------------------------------------------------------- To unsubscribe, e-mail: reviews-unsubscr...@spark.apache.org For additional commands, e-mail: reviews-h...@spark.apache.org