[jira] [Commented] (SPARK-27234) Continuous Streaming does not support python UDFs

Jungtaek Lim (JIRA) Thu, 21 Mar 2019 20:28:06 -0700


    [ https://issues.apache.org/jira/browse/SPARK-27234?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=16798677#comment-16798677 ]


Jungtaek Lim commented on SPARK-27234:
--------------------------------------

Could you please provide a reproducer for the vanilla Apache version of Spark? 
Is it still broken if you remove "makeReply" from the code?

If it's specific to "makeReply", you may have to contact Azure / Databricks 
support.
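
For reference, a vanilla reproducer along the lines requested above might look
like the sketch below. It assumes a local PySpark 2.4 installation. The names
constant_label and run_repro, the local[2] master, the application name, and
the 30-second timeout are illustrative choices and not part of the original
report. The "makeReply" call is simply dropped; everything else follows the
quoted repro.

```python
def constant_label(p):
    """Plain-Python body of the UDF: ignores its input and returns "foo"."""
    return "foo"


def run_repro(timeout_seconds=30):
    """Run the reported streaming query on vanilla PySpark, without makeReply."""
    # Imported lazily so this file can be loaded without PySpark installed.
    from pyspark.sql import SparkSession
    from pyspark.sql.functions import col, udf

    spark = (
        SparkSession.builder
        .master("local[2]")             # assumption: local mode, two cores
        .appName("SPARK-27234-repro")   # hypothetical application name
        .getOrCreate()
    )
    try:
        # Same UDF as in the report; the return type defaults to StringType.
        get_p_eng = udf(constant_label)
        df = (
            spark.readStream
            .format("rate")
            .load()
            .withColumn("p_eng", get_p_eng(col("value")))
        )
        # Console sink with a continuous trigger, as in the report.
        query = (
            df.writeStream
            .format("console")
            .trigger(continuous="1 second")
            .start()
        )
        # Let the query run briefly, then shut it down.
        query.awaitTermination(timeout_seconds)
        query.stop()
    finally:
        spark.stop()
```

Calling run_repro() from a script or the pyspark shell should show whether the
failure also occurs without "makeReply". Keeping the UDF body as a plain
function means it can be checked on its own, without a Spark session.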

> Continuous Streaming does not support python UDFs
> -------------------------------------------------
>
>                 Key: SPARK-27234
>                 URL: https://issues.apache.org/jira/browse/SPARK-27234
>             Project: Spark
>          Issue Type: Bug
>          Components: Structured Streaming
>    Affects Versions: 2.4.0
>         Environment: Azure Databricks 5.1
>            Reporter: Mark Hamilton
>            Priority: Major
>
> Here's a repro:
> from pyspark.sql.functions import col, udf
>
> # Python UDF that returns a constant string for every row
> get_p_eng = udf(lambda p: "foo")
> df = spark \
>  .readStream \
>  .format("rate") \
>  .load() \
>  .withColumn("p_eng", get_p_eng(col("value"))) \
>  .makeReply("p_eng")
> df.isStreaming # Returns True for DataFrames that have streaming sources
> df.printSchema()
> # Write to the console sink using a continuous trigger
> df.writeStream.format("console").trigger(continuous="1 second").start()



--
This message was sent by Atlassian JIRA
(v7.6.3#76005)

---------------------------------------------------------------------
To unsubscribe, e-mail: issues-unsubscr...@spark.apache.org
For additional commands, e-mail: issues-h...@spark.apache.org
