Re: Dataframe.fillna from 1.3.0

Olivier Girardot Wed, 22 Apr 2015 03:05:46 -0700

I think I found the Coalesce you were talking about, but this is a catalyst
class that I think is not available from pyspark


Regards,

Olivier.

Le mer. 22 avr. 2015 à 11:56, Olivier Girardot <
o.girar...@lateral-thoughts.com> a écrit :

> Where should this *coalesce* come from ? Is it related to the partition
> manipulation coalesce method ?
> Thanks !
>
> Le lun. 20 avr. 2015 à 22:48, Reynold Xin <r...@databricks.com> a écrit :
>
>> Ah ic. You can do something like
>>
>>
>> df.select(coalesce(df("a"), lit(0.0)))
>>
>> On Mon, Apr 20, 2015 at 1:44 PM, Olivier Girardot <
>> o.girar...@lateral-thoughts.com> wrote:
>>
>>> From PySpark it seems to me that the fillna is relying on Java/Scala
>>> code, that's why I was wondering.
>>> Thank you for answering :)
>>>
>>> Le lun. 20 avr. 2015 à 22:22, Reynold Xin <r...@databricks.com> a
>>> écrit :
>>>
>>>> You can just create fillna function based on the 1.3.1 implementation
>>>> of fillna, no?
>>>>
>>>>
>>>> On Mon, Apr 20, 2015 at 2:48 AM, Olivier Girardot <
>>>> o.girar...@lateral-thoughts.com> wrote:
>>>>
>>>>> a UDF might be a good idea no ?
>>>>>
>>>>> Le lun. 20 avr. 2015 à 11:17, Olivier Girardot <
>>>>> o.girar...@lateral-thoughts.com> a écrit :
>>>>>
>>>>> > Hi everyone,
>>>>> > let's assume I'm stuck in 1.3.0, how can I benefit from the *fillna*
>>>>> API
>>>>> > in PySpark, is there any efficient alternative to mapping the records
>>>>> > myself ?
>>>>> >
>>>>> > Regards,
>>>>> >
>>>>> > Olivier.
>>>>> >
>>>>>
>>>>
>>>>
>>

Re: Dataframe.fillna from 1.3.0

Reply via email to