>>>>
>>>> Regards,
>>>> Dian
>>>>
>>>> On 16. Apr 2021, at 20:24, Fabian Paul wrote:
>>>>
>>> Hi Yik San,
>>>
>>> I think the usage of vectorized udfs highly depends on your input and
>>> output formats. For your example my first impression would say that parsing
>>> a JSON string is always a rather expensive operation and the vectorization
>>> has not much impact on that.
>>>
>>> I am ccing Dian Fu who is more familiar with pyflink
>>>
>>> Best,
>>> Fabian
>>>
>>> On 16. Apr 2021, at 11:04, Yik San Chan <evan.chanyik...@gmail.com> wrote:
>>>
The question is cross-posted on Stack Overflow
https://stackoverflow.com/questions/67122265/pyflink-udf-when-to-use-vectorized-vs-scalar
Is there a simple set of rules to follow when deciding between vectorized
vs scalar PyFlink UDF?
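
To make the two styles concrete, here is a minimal sketch in plain Python. It does not use the PyFlink API: `scalar_style`, `vectorized_style`, `parse_value` and the `{"id", "value"}` payload are hypothetical stand-ins, and a plain list stands in for the `pandas.Series` a vectorized UDF would receive. It only illustrates the point raised in the replies quoted above, that JSON parsing costs one `json.loads` call per row in either style.

```python
import json

parse_calls = 0  # counts how many times json.loads runs


def parse_value(payload: str) -> int:
    """Parse one JSON string and return its "value" field (hypothetical schema)."""
    global parse_calls
    parse_calls += 1
    return json.loads(payload)["value"]


def scalar_style(payload: str) -> int:
    """Scalar style: invoked once per row with a single string."""
    return parse_value(payload)


def vectorized_style(batch: list) -> list:
    """Vectorized style: invoked once per batch (a list stands in for a pandas.Series)."""
    # json.loads has no batch form, so the work is still one parse per element.
    return [parse_value(payload) for payload in batch]


rows = [json.dumps({"id": i, "value": i * 2}) for i in range(1000)]

parse_calls = 0
scalar_out = [scalar_style(p) for p in rows]
scalar_calls = parse_calls

parse_calls = 0
batch_out = vectorized_style(rows)
batch_calls = parse_calls

print(scalar_out == batch_out)    # True
print(scalar_calls, batch_calls)  # 1000 1000
```

Both paths perform the same number of parses, so batching by itself removes none of the parsing work. Whether the reduced per-call and serialization overhead of a real vectorized UDF still pays off for a given job is something to measure rather than assume.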
According to [docs](
https://ci.apache.org/projects/flink